Kafka vs ZeroMQ

Overview

Kafka

Stacks24.2K

Followers22.3K

Votes607

GitHub Stars31.2K

Forks14.8K

ZeroMQ

Stacks258

Followers586

Votes71

GitHub Stars10.6K

Forks2.5K

Kafka vs ZeroMQ: What are the differences?

Introduction

Kafka and ZeroMQ are both widely used messaging systems in the field of distributed computing. While they share similarities in their ability to handle high volumes of data and provide reliable message delivery, there are several key differences between the two platforms.

Scalability: Kafka is designed to handle high-throughput workloads and boasts exceptional scalability. It achieves this by allowing multiple consumers to read messages from multiple partitions simultaneously. On the other hand, ZeroMQ is geared towards smaller-scale deployments and does not offer the same level of scalability as Kafka.
Persistence: Kafka is built with data persistence as a core feature. It stores all messages on disk, allowing for reliable message replay and fault tolerance. ZeroMQ, however, prioritizes low-latency message delivery and does not provide built-in persistence. Messages are typically lost if not immediately consumed.
Message Ordering: Kafka ensures strict ordering of messages within a partition, guaranteeing that messages are processed in the same order they are received. ZeroMQ, on the other hand, does not have the same level of ordering guarantees. It can deliver messages out of order, making it more suitable for scenarios where strict ordering is not a requirement.
Data Distribution: Kafka uses a publish-subscribe model and allows for one-to-many distribution of data. This makes it highly suitable for scenarios where data needs to be replicated or processed by multiple subscribers. In contrast, ZeroMQ follows a one-to-one or one-to-many pattern, where each message is sent to a specific destination. It does not provide built-in replication or distribution capabilities.
Complexity: Kafka is a more complex messaging system compared to ZeroMQ. It has a sophisticated architecture with features like brokers, partitions, and consumer groups, which require careful setup and configuration. ZeroMQ, on the other hand, is lightweight and easy to deploy. It can be used as a simple messaging library without the need for additional infrastructure components.
Community Support: Kafka has a large and active community behind it, as it is developed and maintained by Confluent, a company founded by the creators of Kafka. This means there is extensive documentation, resources, and community support available. ZeroMQ, while still supported by a community, may not have the same level of resources and support as Kafka.

In summary, Kafka and ZeroMQ have distinct differences in terms of scalability, persistence, message ordering, data distribution, complexity, and community support. These factors should be considered when choosing between the two platforms for messaging in distributed systems.

Share your Stack

Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.

View Docs

CLI (Node.js)

Manual

Advice on Kafka, ZeroMQ

Tarun

Senior Software Developer at Okta

Dec 4, 2021

Review

We have faced the same question some time ago. Before I begin, DO NOT use Redis as a message broker. It is fast and easy to set up in the beginning but it does not scale. It is not made to be reliable in scale and that is mentioned in the official docs. This analysis of our problems with Redis may help you.

We have used Kafka and RabbitMQ both in scale. We concluded that RabbitMQ is a really good general purpose message broker (for our case) and Kafka is really fast but limited in features. That’s the trade off that we understood from using it. In-fact I blogged about the trade offs between Kafka and RabbitMQ to document it. I hope it helps you in choosing the best pub-sub layer for your use case.

153k views153k

Comments

viradiya

Apr 12, 2020

Needs adviceon

AngularJS

ASP.NET Core

MSSQL

We are going to develop a microservices-based application. It consists of AngularJS, ASP.NET Core, and MSSQL.

We have 3 types of microservices. Emailservice, Filemanagementservice, Filevalidationservice

I am a beginner in microservices. But I have read about RabbitMQ, but come to know that there are Redis and Kafka also in the market. So, I want to know which is best.

933k views933k

Comments

Kirill

GO/C developer at Duckling Sales

Feb 16, 2021

Decided

Maybe not an obvious comparison with Kafka, since Kafka is pretty different from rabbitmq. But for small service, Rabbit as a pubsub platform is super easy to use and pretty powerful. Kafka as an alternative was the original choice, but its really a kind of overkill for a small-medium service. Especially if you are not planning to use k8s, since pure docker deployment can be a pain because of networking setup. Google PubSub was another alternative, its actually pretty cheap, but I never tested it since Rabbit was matching really good for mailing/notification services.

266k views266k

Comments

Detailed Comparison

Kafka	ZeroMQ
Kafka is a distributed, partitioned, replicated commit log service. It provides the functionality of a messaging system, but with a unique design.	The 0MQ lightweight messaging kernel is a library which extends the standard socket interfaces with features traditionally provided by specialised messaging middleware products. 0MQ sockets provide an abstraction of asynchronous message queues, multiple messaging patterns, message filtering (subscriptions), seamless access to multiple transport protocols and more.
Written at LinkedIn in Scala;Used by LinkedIn to offload processing of all page and other views;Defaults to using persistence, uses OS disk cache for hot data (has higher throughput then any of the above having persistence enabled);Supports both on-line as off-line processing	Connect your code in any language, on any platform.;Carries messages across inproc, IPC, TCP, TPIC, multicast.;Smart patterns like pub-sub, push-pull, and router-dealer.;High-speed asynchronous I/O engines, in a tiny library.;Backed by a large and active open source community.;Supports every modern language and platform.;Build any architecture: centralized, distributed, small, or large.;Free software with full commercial support.
Statistics
GitHub Stars 31.2K	GitHub Stars 10.6K
GitHub Forks 14.8K	GitHub Forks 2.5K
Stacks 24.2K	Stacks 258
Followers 22.3K	Followers 586
Votes 607	Votes 71
Pros & Cons
Pros 126 High-throughput 119 Distributed 92 Scalable 86 High-Performance 66 Durable Cons 32 Non-Java clients are second-class citizens 29 Needs Zookeeper 9 Operational difficulties 5 Terrible Packaging	Pros 23 Fast 20 Lightweight 11 Transport agnostic 7 No broker required 4 Low level APIs are in C Cons 5 No message durability 3 Not a very reliable system - message delivery wise 1 M x N problem with M producers and N consumers

What are some alternatives to Kafka, ZeroMQ?

RabbitMQ

RabbitMQ gives your applications a common platform to send and receive messages, and your messages a safe place to live until received.

Celery

Celery is an asynchronous task queue/job queue based on distributed message passing. It is focused on real-time operation, but supports scheduling as well.

Amazon SQS

Transmit any volume of data, at any level of throughput, without losing messages or requiring other services to be always available. With SQS, you can offload the administrative burden of operating and scaling a highly available messaging cluster, while paying a low price for only what you use.

NSQ

NSQ is a realtime distributed messaging platform designed to operate at scale, handling billions of messages per day. It promotes distributed and decentralized topologies without single points of failure, enabling fault tolerance and high availability coupled with a reliable message delivery guarantee. See features & guarantees.

ActiveMQ

Apache ActiveMQ is fast, supports many Cross Language Clients and Protocols, comes with easy to use Enterprise Integration Patterns and many advanced features while fully supporting JMS 1.1 and J2EE 1.4. Apache ActiveMQ is released under the Apache 2.0 License.

Apache NiFi

An easy to use, powerful, and reliable system to process and distribute data. It supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic.

Gearman

Gearman allows you to do work in parallel, to load balance processing, and to call functions between languages. It can be used in a variety of applications, from high-availability web sites to the transport of database replication events.

Memphis

Highly scalable and effortless data streaming platform. Made to enable developers and data teams to collaborate and build real-time and streaming apps fast.

IronMQ

An easy-to-use highly available message queuing service. Built for distributed cloud applications with critical messaging needs. Provides on-demand message queuing with advanced features and cloud-optimized performance.

Apache Pulsar

Apache Pulsar is a distributed messaging solution developed and released to open source at Yahoo. Pulsar supports both pub-sub messaging and queuing in a platform designed for performance, scalability, and ease of development and operation.

Related Comparisons

Kafka vs ZeroMQ: What are the differences?

Introduction

Scalability: Kafka is designed to handle high-throughput workloads and boasts exceptional scalability. It achieves this by allowing multiple consumers to read messages from multiple partitions simultaneously. On the other hand, ZeroMQ is geared towards smaller-scale deployments and does not offer the same level of scalability as Kafka.
Persistence: Kafka is built with data persistence as a core feature. It stores all messages on disk, allowing for reliable message replay and fault tolerance. ZeroMQ, however, prioritizes low-latency message delivery and does not provide built-in persistence. Messages are typically lost if not immediately consumed.
Message Ordering: Kafka ensures strict ordering of messages within a partition, guaranteeing that messages are processed in the same order they are received. ZeroMQ, on the other hand, does not have the same level of ordering guarantees. It can deliver messages out of order, making it more suitable for scenarios where strict ordering is not a requirement.
Data Distribution: Kafka uses a publish-subscribe model and allows for one-to-many distribution of data. This makes it highly suitable for scenarios where data needs to be replicated or processed by multiple subscribers. In contrast, ZeroMQ follows a one-to-one or one-to-many pattern, where each message is sent to a specific destination. It does not provide built-in replication or distribution capabilities.
Complexity: Kafka is a more complex messaging system compared to ZeroMQ. It has a sophisticated architecture with features like brokers, partitions, and consumer groups, which require careful setup and configuration. ZeroMQ, on the other hand, is lightweight and easy to deploy. It can be used as a simple messaging library without the need for additional infrastructure components.
Community Support: Kafka has a large and active community behind it, as it is developed and maintained by Confluent, a company founded by the creators of Kafka. This means there is extensive documentation, resources, and community support available. ZeroMQ, while still supported by a community, may not have the same level of resources and support as Kafka.

Kafka vs ZeroMQ

Overview

Kafka vs ZeroMQ: What are the differences?