On the one hand, I've seen people (including myself) try to hack job-queue-like semantics onto Kafka many a time, and it always hits issues once redelivery or backoff comes up. So it's nice to see them considering making this a first-class citizen of Kafka.
On the other hand, Kafka isn't the only player in the queue game nowadays. If you need message queue and job queue semantics combined (which you likely do), just use Pulsar.
I think the most likely use case, the one making me happy they're working on this, is reducing infra spend by not needing a separate tool/guarantees/storage for queues alongside whatever Kafka is more made for.
I'm just hoping librdkafka gets top-tier support for this feature in a timely manner.
RabbitMQ has implemented Streams and "Super Streams":
> Super streams are a way to scale out by partitioning a large stream into smaller streams. They integrate with single active consumer to preserve message order within a partition. Super streams are available starting with RabbitMQ 3.11.
One way to think about it is who does the routing. My understanding of RabbitMQ from 10 years ago was that RMQ pushes to connected consumers, and only to one at a time? You'd need a fanout setup where consumers do more of the work. And lower throughput overall.
With Kafka it "just" keeps appending to a dumb (but huge) circular buffer. But you can have multiple consumers read off this buffer, and they can start at any point. The downside is consumers have to maintain their own offsets (in some storage), but there is now a big decoupling between producer and consumer. This contributes a large part to the high throughput too (and consumers can go at their own pace; of course, if they are too slow they can fall off the log).
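A toy in-memory sketch of that decoupling (purely illustrative, not the real Kafka API): the broker only appends, and each consumer independently tracks how far it has read.

```python
# Toy model of a Kafka-style append-only log: the broker only appends,
# and each consumer independently tracks its own read position.
class Log:
    def __init__(self):
        self.records = []

    def append(self, record):
        self.records.append(record)

    def read(self, offset, max_records=10):
        # Consumers pull from whatever offset they choose.
        return self.records[offset:offset + max_records]

log = Log()
for i in range(5):
    log.append(f"event-{i}")

# Two independent consumers at different positions; neither affects the other.
slow_offset, fast_offset = 2, 5
print(log.read(slow_offset))   # slow consumer still sees event-2..event-4
print(log.read(fast_offset))   # fast consumer is caught up: []
```

Nothing is removed on read, which is exactly why a slow consumer costs the broker nothing extra until retention kicks in.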
> Downside is customers have to maintain their own offsets (in some storage)
Minor correction: you can maintain the offsets yourself if you want, but usually it's not necessary because Kafka can do it for you.
The abstraction Kafka provides is that, for each consumer group and for each (topic, partition) tuple, your consumer is guaranteed not to receive messages before the last offset at which you called commit(). Internally, the committed offsets are stored in a special Kafka topic of their own.
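A sketch of that contract as an in-memory model (the function names here are mine, not Kafka's API; real Kafka keeps this bookkeeping in the internal __consumer_offsets topic). The key point is that committed offsets are keyed by (group, topic, partition), so two groups reading the same partition don't interfere:

```python
# Hypothetical model of committed-offset bookkeeping, keyed the way
# Kafka keys it: (consumer group, topic, partition) -> committed offset.
committed = {}

def commit(group, topic, partition, offset):
    committed[(group, topic, partition)] = offset

def position(group, topic, partition):
    # A consumer (re)joining the group resumes from the last committed
    # offset; a brand-new group starts from the beginning (offset 0 here).
    return committed.get((group, topic, partition), 0)

commit("billing", "orders", 0, 42)
commit("audit", "orders", 0, 7)      # a second group, independent position

print(position("billing", "orders", 0))   # 42
print(position("audit", "orders", 0))     # 7
print(position("new-group", "orders", 0)) # 0
```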
Ah, is this offset maintained by the Kafka client? (I included that as "client" as well.) I thought the Kafka topic itself did not maintain any client-specific offsets (how could it?). Unless this was added in recent years. Interesting tho!
They are very different, but some people use them for similar things.
Kafka is a stream, RabbitMQ is a queue. Without getting into the details, RabbitMQ is designed to add things to a queue and pop them off when consumed. Kafka is designed to stream everything to a continuous log, and anyone can tune in where appropriate.
Kafka events are replayable, and represent the final state we're aiming for. You might get two exactly identical events that tell you what the state should look like. And there are batches of events. It's good for batch processing: "we're getting a new portion of data to train an AI model".
RabbitMQ messages are supposed to be processed/consumed/acked only once. Your app most probably won't ever get two exactly identical messages, unless you misconfigured/misused RabbitMQ. It's good for classic message processing: "user clicked something, run the job of informing subscribers that a new post has been created" (because you can't send 1000 messages from a web worker thread).
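The two deliveries above can be contrasted in a few lines of plain Python (a sketch of the semantics, not of either broker's actual API):

```python
from collections import deque

# Queue semantics (RabbitMQ-style): once a message is delivered and
# acked, it is removed from the queue and cannot be replayed.
queue = deque(["job-1", "job-2"])
msg = queue.popleft()          # deliver; process; ack removes it for good
print(list(queue))             # only job-2 remains; job-1 is gone

# Log semantics (Kafka-style): consuming is just reading at an offset;
# the record stays in the log, so replay is trivial.
log = ["event-1", "event-2"]
read_once = log[0]
read_again = log[0]            # same record, read twice, still there
print(read_once == read_again)
```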
The lack of Queue like support as a first class citizen in Kafka has kept a few things on the sidelines for me.
Native capacity to do queueing is exciting, especially the concept of share groups to allow potentially different types of queues and shares in the future.
It might not be appropriate, but it's one step closer to eating more workflow engines for lunch.
i'm having a hard time understanding what scenario requires a queue semantic instead of a stream one.
Is it the ability to parallelize consuming to a super large amount, without having to set up partitions?
I'm planning on using kafka for a job queue, and i like the idea that i can add ordering definition if i need to, and that i can keep the jobs in the queue for auditing later on if i need to. What am i missing ?
Without being limited by partitions. In Kafka your unit of parallelism is partitions but what happens when you don't care at all (or much) about ordering and just want to add or remove consumers to match your current load? Queue semantics.
In Kafka the number of partitions can go up, but not down. And even when you do that, the existing messages don't get split up to fill the new partition, so you can't burn down a backlog by adding more partitions or more consumers -- ope.
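The partition ceiling is easy to see with a simplified round-robin assignment sketch (real consumer-group assignment strategies differ, but the invariant is the same: each partition goes to exactly one consumer in the group):

```python
def assign(partitions, consumers):
    # Each partition is owned by exactly one consumer in the group;
    # consumers beyond the partition count are left with nothing.
    assignment = {c: [] for c in consumers}
    for i, p in enumerate(partitions):
        assignment[consumers[i % len(consumers)]].append(p)
    return assignment

parts = [0, 1, 2]
a = assign(parts, ["c1", "c2", "c3", "c4"])
print(a)
# c4 sits idle: adding a 4th consumer to a 3-partition topic buys nothing.
# With queue semantics (share groups), all four could pull work.
```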