Solving for Innovation

Stay Ahead of the Curve: Key Insights from Forrester on Streaming Data Platforms

Streaming data is the pulse of an enterprise. It represents customers navigating websites to find products, supply chains that are about to break, and pricing that starts to soar or plummet. Streaming data represents the real-time reality of what is happening to the business, internally and externally. That’s why enterprises must leverage streaming data to attain blazing-fast reflexes to conquer intraday business challenges and opportunities. All data starts as streaming data because streaming data is simply data that has just been born in an application. The question is, how can enterprises leverage that data? Streaming data platforms enable enterprises to capture, analyze, and act in real time on streaming data that originates in any application without refactoring its existing architecture.

As a result of these trends, streaming data platform customers should look for providers that offer:

  • A breadth of core capabilities. The three core streaming data capabilities are movement, analytics, and processing. Movement is connecting to enterprise data sources in real time and making that data available to any applications that need it. Analytics is gleaning insights from streaming data whether simple event triggers, aggregate queries, or advanced analytics, including machine learning. Processing is performing any number of arbitrary transformations, joins, and enrichments on streaming data in real time. It’s common for enterprises to already have one or more of these core capabilities. For example, many enterprises have adopted open-source Apache Kafka (on-premises or in cloud) to move and deliver streaming data, but lack a robust streaming analytics capability. Thus, enterprise buyers should either look to vendor solutions to augment their existing core streaming capabilities or look to a new, unified streaming data platform to replace one-off implementations.
  • Development tools that increase productivity and support fast change. Raw streaming data technology is insufficient for enterprises that wish to implement a steady stream of new use cases. They need tools to design, develop, and deploy additional use cases and manage the platform. Fortunately, tooling has become a major focus of vendor innovation now and on roadmaps. All vendors offer a code-based interface and many now offer tools such as visual streaming pipeline builders, SQL consoles, and other tools aimed at both developer productivity and enabling access to the platform for less technical developers. Enterprise buyers should match their existing developer maturity to the tools that the vendor offers, but also anticipate how additional roles may use the platform, including IT ops and citizen developers.
  • Efficient and fault-tolerant scalability. The nature of streaming data is such that it can roar like Niagara Falls and then suddenly, without warning, gurgle like a Zen fountain. Streaming data platforms must be able to handle both of these situations: scale up to handle huge volumes of data, and scale down to avoid wasting compute, storage, and network resources. Enterprise buyers should anticipate and incorporate current and future requirements for scalability in terms of throughput, latency, efficiency (resource utilization), and fault tolerance into their buying criteria.
Evaluation Summary

The Forrester Wave™ evaluation highlights Leaders, Strong Performers, Contenders, and Challengers. It’s an assessment of the top vendors in the market; it doesn’t represent the entire vendor landscape. You’ll find more information about this market in our report The Streaming Data Platforms Landscape, Q3 2023.

We intend this evaluation to be a starting point only and encourage clients to view product evaluations and adapt criteria weightings using the Excel-based vendor comparison tool (see Figures 1 and 2). Click the link at the beginning of this report on Forrester.com to download the tool.

Figure 1
Forrester Wave™: Streaming Data Platforms, Q4 2023

Figure 2
Forrester Wave™: Streaming Data Platforms Scorecard, Q4 2023

Vendor Offerings

Forrester evaluated the offerings listed below (see Figure 3).

Figure 3

Vendor Profiles

Our analysis uncovered the following strengths and weaknesses of individual vendors.

Leaders

  • Confluent is a streaming force to be reckoned with.Confluent started in 2014 as the preeminent Kafka company started by the founders of the 2011 open-source project that originated in LinkedIn. Fast forward to today, Confluent has grown rapidly and expanded its portfolio to offer a full complement of streaming data platform capabilities including, via a recent acquisition, Apache Flink to gain more stream-processing capabilities. Its superior vision is to transform streaming from an enterprise architecture nice-to-have to a must-have. Confluent’s key strength lies in its offering of cloud-native versions of Kafka that also adds developer tools to make creating streaming applications easier to develop and manage. Confluent’s strategy is to provide a balanced mix of cloud-agnostic, open-source solution and value-added capabilities that make it hard to beat.

    The core strengths of Confluent’s product offering are in data movement, data connectors, developer tooling, fault tolerance, and solid capabilities in analytics. Confluent can improve by integrating the Apache Flink engine from its recent acquisition. Reference customers appreciate that Confluent Cloud is portable across cloud service providers and is centered on a significantly improved version of open-source Apache Kafka. Confluent is a good fit for customers who want a solution that is based on Apache Kafka, is cloud portable, and provides ample capabilities for both streaming analytics and processing.
  • Microsoft makes world-class streaming data easy to use at cloud scale. Microsoft’s streaming platform uses Azure Event Hubs to receive streaming data from millions of devices. That real-time data in turn can be directed toward numerous other Azure services including Azure Stream Analytics to analyze and take actions on streaming data in real time. Microsoft’s strategic strength lies in hiding the gnarly complexity of cloud-scale streaming data behind no-code development tools or SQL, making world-class streaming data accessible to all levels of developers.

    Microsoft has strengths in streaming analytics, developer tools, low latency, and fault tolerance. Microsoft can improve the platform by adding more native data connectors. Reference customers appreciate how quickly developers can learn to develop streaming data applications, the seamless integration with other Azure services, and scalability. Microsoft is a good fit for existing Azure customers as well as new customers who wish to quickly build an enormously scalable IoT application or any other cloud-scale application.
  • Google offers a streaming data platform at Google scale. Google’s streaming platform is centered on its Cloud Dataflow service that seamlessly integrates with many other services such as Cloud Pub/Sub, BigQuery, Vertex AI, and Looker. Developers can use Cloud Dataflow to create both streaming and batch applications using the same programming paradigm: SQL or APIs. In addition to horizontal autoscaling, Google also uniquely offers vertical autoscaling to optimize the underlying resources based on the characteristics of the workload. Google consistently invests in Dataflow to seamlessly and more efficiently work with sister cloud services including real-time AI.

    Google has strengths in all core streaming data capabilities (movement, analytics, processing), efficiency via autoscaling, fault tolerance, and scalability. Google can improve the platform by enhancing no-code developer tools and adding more data connectors. Reference customers appreciate the integration with other Google Cloud services, autoscaling for throughput, and cloud pricing. Google is a good fit for existing Google Cloud customers and for new customers who want precise autoscaling that minimizes the cost of high volume yet spikey streaming workloads.

Strong Performers

  • Hazelcast foretells the future of streaming but needs to invest in more tooling. Hazelcast offers a breath of fresh air in the streaming data platform market with the Hazelcast Platform and Hazelcast Viridian Cloud. Unlike other streaming data platforms, Hazelcast’s strategic innovation lies in also providing an extremely low-latency transactional platform. Creating durable transactions in the same platform as streaming analytics and processing enables enterprises to take instant actions on streaming data without leaving the platform. The result: the lowest possible latency. Hazelcast has done yeoman’s work in bringing these capabilities together in one platform.

    Hazelcast has strengths in data enrichment, low latency, and throughput. Hazelcast can improve by enhancing data movement, data processing, and developer tools. Reference customers appreciate the ability to develop real-time applications with low latency and high fault tolerance. Hazelcast is an ideal fit for enterprises that need to build real-time applications that must be tightly integrated with systems of record because of their transactional capabilities.
  • Oracle excels at streaming but must fend off startups and cloud service providers. The majority of the world’s enterprise data lands in databases and increasingly is needed instantly for other applications. Oracle offers Oracle GoldenGate streaming data platform to capture this data, copy it, and process it for use in one or more destinations. Oracle’s strategy is to leverage its enormous customer base for databases and business applications to deploy Oracle GoldenGate to enterprises wishing to capture and use streaming data. Oracle’s superior vision is to make GoldenGate a bridge not only for Oracle’s own vast portfolio of data platforms, but also to an enterprise’s larger platform ecosystem.

    Oracle has strengths in data connections, data enrichment, data movement, streaming analytics, and developer tools. Oracle can improve the platform by improving extensibility. Reference customers appreciate Oracle’s change data capture capabilities that are linked seamlessly to other data movement capabilities and analytics. Oracle is a good fit for customers that wish to build real-time applications that originate in Oracle databases, other databases, and Kafka.
  • Cloudera harmonizes streaming data but must expand its vision to applications. Cloudera offers a single platform that unifies an enterprise’s analytical and operational data estate including streaming data. For streaming data, Cloudera offers Cloudera Data Platform (CDP) that includes streaming data movement, analytics, and processing. Cloudera’s strategy is to leverage open-source projects including Apache NiFi, Apache Flink, and dozens of other open-source projects to create a unified set of data management tools and most importantly the tools necessary to develop and manage streaming flows. Also, key to Cloudera’s strategy is that it supports multiple cloud service providers. Cloudera’s primary focus for CDP is to get existing customers to use it. CDP has solid capabilities that are differentiated from other data lake/data management platforms, but it can become more competitive in this market by sharpening its vision, innovation, and roadmap.

    Cloudera has strengths in core streaming capabilities (movement, analytics, and processing), data connections, developer tools, extensibility, and platform management. Cloudera can improve the platform by improving autoscaling to reduce latency and stream data enrichment. Reference customers appreciate the hybrid architecture that allows a complete suite of data management capabilities to run across clouds and/or on-premises infrastructure. Cloudera is a good fit for customers who wish to have the breadth and depth of data management services of the big cloud service providers without the infrastructure lock-in.
  • AWS excels at choices for its streaming platform but needs clearer real-time vision. Amazon Web Services (AWS) offers customers the choice to mix and match different AWS streaming data services. First is Amazon Kinesis Data Firehose and Amazon Kinesis Data Streams. Second are open-source-inspired services, Amazon Managed Service for Apache Flink and Amazon Managed Streaming for Apache Kafka (aka Amazon MSK). The Amazon Kinesis services are optimized for the AWS platform and especially integration with other AWS services. The underlying stream-processing engine for Kinesis is an AWS version of Apache Flink. Customers that prefer open source can use the AWS-optimized open-source versions of Apache Kafka and Apache Flink. AWS needs a clearer vision of how streaming data platforms are essential architectural components of modern applications and innovation to distinguish it from the other cloud competitors.

    AWS has strengths in data movement, processing, deployment, efficiency, fault tolerance, and throughput. AWS can improve the platform by improving developer tooling. Reference customers appreciate the integration with other AWS services and scalability. AWS is a good fit for existing AWS customers and for new customers that need a cloud-scalable, streaming data platform in conjunction with other AWS services to build a real-time application.
  • Spotfire offers excellent no-code tools but should add a processing engine. Spotfire Streaming is a highly capable streaming analytics platform that is tightly integrated with the TIBCO Spotfire business intelligence platform. This combination allows customers to build world-class visualizations and dashboards that are fed with real-time analytics. Spotfire also offers a portfolio of other integration and data management services that enterprises can use to build event-driven applications. Spotfire could become much more competitive if it adds a scalable streaming processing engine such as Apache Flink.

    Spotfire has strengths in streaming analytics, developer tools, extensibility, and platform management. Spotfire can improve its platform by enhancing data movement and processing capabilities. Reference customers appreciate the easy-to-use pipeline tooling and integration with Spotfire. Spotfire is a good fit for existing Spotfire customers that need to create streaming applications and for new customers that want to build real-time dashboards with world-class visualizations.
  • SAS offers top-notch streaming analytics but must boost data processing. SAS Event Stream Processing is offered as part of SAS’s larger SAS Viya platform and is thus seamlessly integrated will all other Viya capabilities, including world-class analytics, AI, and business intelligence. SAS’s strategy for streaming data is to offer the greatest number of analytics that can be executed in real-time data streams. No other vendor comes close to SAS in offering advanced analytics that can be executed in real time including real-time machine learning techniques.

    SAS offers strengths in streaming analytics, developer and analytics tools, and low latency. SAS can improve the platform by adding more options for data movement, processing, and deployment management. Reference customers appreciate SAS’s comprehensive analytics capabilities, including real-time machine learning and integration with other SAS analytical tools. SAS is a good fit for customers that need extremely low latency combined with complex analytics such as industrial, manufacturing, and financial use cases.

Contenders

  • IBM offers world-class streaming capabilities but got lost in Cloud Pak. IBM offers IBM Streams in IBM Cloud Pak for Data which includes dozens of various data-related services. IBM Streams shines in the breadth and depth of real-time analytics it offers. However, IBM doesn’t have a strong strategy specifically for the streaming data platform market. Customers scrolling through IBM Cloud Pak for Data services may see IBM Streams as a niche capability rather than an essential platform for the real-time enterprise.

    IBM has strengths in data connectors, streaming analytics, developer tools, and low latency. IBM can improve the platform by more tightly integrating data movement and data processing capabilities. IBM is a good fit for enterprises that have a preponderance of data in IBM Cloud Pak for Data services. IBM declined to participate in the full Forrester Wave evaluation process.
  • Ververica drives Apache Flink but must expand market awareness. Ververica was originally formed by the founders of the Apache Flink open-source project. Apache Flink has been so successful that many of the other vendors in this evaluation use the open-source version for stream processing. Ververica offers its own supercharged version of Apache Flink named the Ververica Cloud. Ververica is owned by Alibaba and certainly is used in the Alibaba Cloud, but it is also run as a separate company to market Ververica to enterprises. The company’s strategy is to continue to drive the Apache Flink open-source community while also offering the most performant version of Apache Flink available. The company needs to significantly ramp up sales and marketing to be more competitive. Ververica must better articulate its vision for Apache Flink in the enterprise and innovate much more on tooling for developers and other enterprise roles.

    Ververica has strengths in stream processing, streaming analytics, and throughput. Ververica can improve the platform by integrating more options for data movement and more developer tooling. Reference customers appreciate the enhanced performance and hyper scalability added on top of Apache Flink. Ververica is a good fit for customers who wish to use an enhanced version of Apache Flink for hyperscale real-time applications.
  • Aiven offers a multicloud solution but needs to add developer tooling. Aiven bills itself as “the trusted open source data platform for everyone.” To wit, Aiven offers a bevy of open-source databases and streaming services that run on all major cloud service providers and private clouds. Aiven’s streaming data platform is comprised primarily of Aiven for Apache Kafka and Aiven for Apache Flink. Aiven’s strategy is to offer customers very portable, cost-effective, and managed versions of the most popular open-source data platforms. Aiven needs to articulate a broader real-time vision beyond just offering open-source Kafka and Flink. It could improve its roadmap by adding developer tooling and/or more integration with third-party tools.

    Aiven’s strength is in offering a full portfolio of fault-tolerant, open-source data projects, including Aiven for Apache Kafka, Aiven for Apache Flink, and more. Aiven can improve its streaming platform by adding development tools that make it easier for customers to build applications above and beyond coding interfaces. Reference customers appreciate the low cost and flexibility of running key open-source projects on any cloud. Aiven is a good fit for enterprises that have standardized on open source but want to simplify infrastructure management and have the flexibility in deployment to be portable to run on any public cloud or cloud infrastructure.
  • Software AG focuses on real-time integration and IoT; ignores broader streaming market. Software AG streaming data platform is comprised of a combination of streaming analytics embedded in an IoT platform and its recent acquisition of StreamSets. When it comes to streaming data platforms, Software AG focuses its capabilities on industrial IoT and continuous integration use cases.

    Software AG has strengths in data connections and streaming analytics. Software AG can improve its platform by improving autoscaling and developer tooling. Software AG is a good fit for customers that already use other Software AG integration tools and/or customers that wish to implement an industrial IoT application. Software AG declined to participate in the full Forrester Wave evaluation process.

Challengers

  • Red Hat offers open-source streaming capabilities but must add more value. Red Hat offers AMQ Streams, which is based on Apache Kafka running on Red Hat’s OpenShift cloud platform. Having an easier to manage version of Apache Kafka is but an opening wager for a streaming data platform. Enterprises expect more, including streaming analytics and a stream processing engine such as Apache Flink. To be a true player in the market, Red Hat must have a strategy to offer robust streaming analytics and stream-processing capabilities. Apache Flink would be a natural platform for Red Hat to add since it is open source and many of the other vendors base their streaming data platform on it.

    Red Hat has strengths in offering streaming data capabilities based on Apache Kafka and other open-source projects. Red Hat can improve its platform by improving additional core capabilities for streaming analytics and stream processing. Reference customers appreciate the ease with which they can deploy and manage open source with their larger OpenShift environment. Red Hat is a good fit for customers that wish to primarily use an Apache Kafka service within Red Hat OpenShift.
Evaluation Overview

We grouped our evaluation criteria into three high-level categories:

  • Current offering. Each vendor’s position on the vertical axis of the Forrester Wave graphic indicates the strength of its current offering. Key criteria for these solutions include data, core, development, and scale.
  • Strategy. Placement on the horizontal axis indicates the strength of the vendors’ strategies. We evaluated vision, innovation, roadmap, and partner ecosystem.
  • Market presence. Represented by the size of the markers on the graphic, our market presence scores reflect each vendor’s revenue and number of customers.

Vendor Inclusion Criteria

Each of the vendors we included in this assessment has:

  • Streaming data platforms as identified by Forrester. Vendors included in this evaluation were determined by Forrester to offer a streaming data platform as defined above and as represented in the Landscape report on this market.
  • Comprehensive, differentiated streaming data platform. The vendors included in the evaluation offer a platform that provides one or more streaming data core capabilities: movement, analysis, and processing.
  • A standalone streaming data platform. We included only solutions that are marketed as streaming data platforms and participate in the market by competing directly with other streaming data platforms. Streaming data platforms that Forrester deemed are technologically embedded into any particular application; business intelligence platform; data prep; extract, transform, load (ETL); databases; or middleware stacks weren’t included in this evaluation.
  • Minimum product revenue requirement. Vendors included in this evaluation have greater than $10 million in revenue from the streaming data platform product.
Supplemental Material

Online Resource

We publish all our Forrester Wave scores and weightings in an Excel file that provides detailed product evaluations and customizable rankings; download this tool by clicking the link at the beginning of this report on Forrester.com. We intend these scores and default weightings to serve only as a starting point and encourage readers to adapt the weightings to fit their individual needs.

The Forrester Wave Methodology

A Forrester Wave is a guide for buyers considering their purchasing options in a technology marketplace. To offer an equitable process for all participants, Forrester follows The Forrester Wave™ Methodology to evaluate participating vendors.

In our review, we conduct primary research to develop a list of vendors to consider for the evaluation. From that initial pool of vendors, we narrow our final list based on the inclusion criteria. We then gather details of product and strategy through a detailed questionnaire, demos/briefings, and customer reference surveys/interviews. We use those inputs, along with the analyst’s experience and expertise in the marketplace, to score vendors, using a relative rating system that compares each vendor against the others in the evaluation.

We include the Forrester Wave publishing date (quarter and year) clearly in the title of each Forrester Wave report. We evaluated the vendors participating in this Forrester Wave using materials they provided to us by September 6, 2023 and did not allow additional information after that point. We encourage readers to evaluate how the market and vendor offerings change over time.

In accordance with our vendor review policy, Forrester asks vendors to review our findings prior to publishing to check for accuracy. Vendors marked as nonparticipating vendors in the Forrester Wave graphic met our defined inclusion criteria but declined to participate in or contributed only partially to the evaluation. We score these vendors in accordance with our vendor participation policy and publish their positioning along with those of the participating vendors.

Integrity Policy

We conduct all our research, including Forrester Wave evaluations, in accordance with the integrity policy posted on our website.

Stay Ahead of the Curve: Key Insights from Forrester on Streaming Data PlatformsMastering a Data Deluge: Transform Your Strategy with Google Cloud's BigQueryUnlock the Power of Your Data with Databricks on Google CloudHow Boa Vista Broke New Ground in Data Management and Product Development with Google CloudSHAREit Group leverages Google’s Data Cloud to maximize cloud data analyticsJourney to Optimization: Squarespace Transforms Analytics with Google CloudBuild a modern, distributed Data Mesh with Google CloudPowering Google Cloud with GeminiMLB leverages AI to provide the ultimate baseball experienceHK Express: Ensuring high customer satisfaction with Vertex AITrendyol: Creating trend-setting ecommerce experiences with near real-time insights on LookerGemini in Looker brings intelligent AI-powered BI to everyone
Solving for Innovation Home Page