#apache-kafka tag

The Schema Proliferation Problem in Kafka and Flink Pipelines: How to Solve It

One-to-one event-to-schema mapping scales poorly, causing fragmented queries, maintenance overhead, and schema drift.

Event schemas with 80–95% structural overlap can be consolidated using discriminator enum fields into fewer tables and simpler consumer queries.

Nullable attribute blocks support backward-compatible schema evolution when adding new event variants.

A layered adapter design separates transformation logic from framework integration, easing consolidation implementation and testing in Apache Flink pipelines.

Schema design aligned to consumer access patterns simplifies queries and reduces long-term maintenance overhead.

Scala

fromInfoQ

3 weeks ago

Confluent Moves Schema IDs to Kafka Headers to Simplify Schema Governance

Confluent's new schema management approach stores schema IDs in message headers, simplifying data governance and enhancing schema evolution in Kafka.

fromInfoWorld

1 month ago

How Apache Kafka flexed to support queues

Apache Kafka has cemented itself as the de facto platform for event streaming, often referred to as the 'universal data substrate' due to its extensive ecosystem that enables connectivity and processing capabilities.

Scala

fromInfoQ

2 months ago

QCon London 2026: Introducing Tansu.io -- Rethinking Kafka for Lean Operations

Tansu is an open-source, stateless messaging broker that replaces Kafka's complex architecture with a simpler, durable storage model.

fromInfoQ

3 months ago

Panel: Modern Data Architectures

I wrote a book for O'Reilly on scaling machine learning with Spark specifically. My second book is coming out on how to improve high-performance Spark, the second edition. Started my career in the machine learning space 15 years ago, moved into data infrastructure, batch processing, and a year and a half ago I moved into the data streaming space, which I think it's what's going to help us pave the future in the data.

Data science

Software development

fromInfoQ

3 months ago

[Video Podcast] Building Resilient Event-Driven Microservices in Financial Systems with Muzeeb Mohammad

Event-driven architectures using Kafka enable decoupling backend workflows, improving scalability and SLAs for complex multi-system processes like account opening.

fromInfoWorld

5 months ago

IBM to buy Confluent to extend its data and automation portfolio

Confluent connects data sources and cleans up data. It built its service on Apache Kafka, an open-source distributed event streaming platform, sparing its customers the hassle of buying and managing their own server clusters in return for a monthly fee per cluster, plus additional fees for data stored and data moved in or out. IBM expects the deal, which it valued at $11 billion, to close by the middle of next year.

Artificial intelligence

fromInfoQ

5 months ago

Grab Adds Real-Time Data Quality Monitoring to Its Platform

This engine takes topic data schemas, metadata, and test rules as inputs to create a set of FlinkSQL-based test definitions. A Flink job then executes these tests, consuming messages from production Kafka topics and forwarding any errors to Grab's observability platform. FlinkSQL was selected because its ability to represent stream data as dynamic tables allowed the team to automatically generate data filters for rules that could be efficiently implemented.

Software development

fromInfoQ

6 months ago

Grafana Labs Releases Mimir 3.0 with Redesigned Architecture for Enhanced Performance and Reliabilit

The main feature of the 3.0 release is a new decoupled architecture. This change fixes a key limitation found in earlier versions. In earlier versions of Mimir, the ingester component handled both reading and writing. This setup meant that heavy query loads could hurt ingestion performance. The new design adds Apache Kafka as an asynchronous buffer between ingestion and query tasks. This allows each path to scale on its own and removes the cross-path dependencies that affected system stability before.

Software development

#apache-kafka#apache-kafka

The Schema Proliferation Problem in Kafka and Flink Pipelines: How to Solve It

Confluent Moves Schema IDs to Kafka Headers to Simplify Schema Governance

How Apache Kafka flexed to support queues

QCon London 2026: Introducing Tansu.io -- Rethinking Kafka for Lean Operations

Panel: Modern Data Architectures

[Video Podcast] Building Resilient Event-Driven Microservices in Financial Systems with Muzeeb Mohammad

IBM to buy Confluent to extend its data and automation portfolio

Grab Adds Real-Time Data Quality Monitoring to Its Platform

Grafana Labs Releases Mimir 3.0 with Redesigned Architecture for Enhanced Performance and Reliabilit

Harnessing Real-Time Data: Integration of MySQL, Debezium, Kafka, Scala

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Beyond Kafka: How LinkedIn Built Northguard & Xinfra to Scale Event Streaming for the Next Decade!

Confluent introduces Streaming Agents for real-time AI agents

#apache-kafka
#apache-kafka