Apache Spark

Shift Left Architecture for AI and Analytics with Confluent and Databricks

Confluent and Databricks enable a modern data architecture that unifies real-time streaming and lakehouse analytics. By combining shift-left principles with…

4 weeks ago

Confluent Data Streaming Platform vs. Databricks Data Intelligence Platform for Data Integration and Processing

This blog explores how Confluent and Databricks address data integration and processing in modern architectures. Confluent provides real-time, event-driven pipelines…

4 weeks ago

Fraud Detection in Mobility Services (Ride-Hailing, Food Delivery) with Data Streaming using Apache Kafka and Flink

Mobility services like Uber, Grab, and FREE NOW (Lyft) rely on real-time data to power seamless trips, deliveries, and payments.…

1 month ago

The Data Streaming Landscape 2025

Data streaming is a new software category. It has grown from niche adoption to becoming a fundamental part of modern…

6 months ago

Top Trends for Data Streaming with Apache Kafka and Flink in 2025

Apache Kafka and Apache Flink are leading open-source frameworks for data streaming that serve as the foundation for cloud services,…

6 months ago

The Data Streaming Landscape 2024

The research company Forrester defines data streaming platforms as a new software category in a new Forrester Wave. Apache Kafka…

1 year ago

The Data Streaming Landscape 2023

Data streaming is a new software category to process data in motion. Apache Kafka is the de facto standard used…

2 years ago

Case Studies: Cloud-native Data Streaming for Data Warehouse Modernization

The concepts and architectures of a data warehouse, a data lake, and data streaming are complementary to solving business problems.…

3 years ago

Machine Learning Trends of 2018 combined with the Apache Kafka Ecosystem

At OOP 2018 conference in Munich, I presented an updated version of my talk about building scalable, mission-critical microservices with…

7 years ago

Apache Kafka Streams + Machine Learning (Spark, TensorFlow, H2O.ai)

Apache Kafka Streams to build Real Time Streaming Microservices. Apply Machine Learning / Deep Learning using Spark, TensorFlow, H2O.ai, etc.…

8 years ago