What is Clickhouse?
It allows analysis of data that is updated in real time. It offers instant results in most cases: the data is processed faster than it takes to create a query.
Clickhouse is a tool in the Databases category of a tech stack.
Who uses Clickhouse?
68 companies reportedly use Clickhouse in their tech stacks, including Groww, Postclick, and Avito.
210 developers on StackShare have stated that they use Clickhouse.
dbt, Rudderstack, Percona Monitoring and Management, Sematext, and Ebean are some of the popular tools that integrate with Clickhouse. Here's a list of all 21 tools that integrate with Clickhouse.
Pros of Clickhouse
Fast, very very fast
Good compression ratio
Utilizes all CPU resources
Has no transactions
Great number of SQL functions
In IDEA data import via HTTP interface not working
Server crashes its normal :(
Flexible compression options
Flexible connection options
Clickhouse Alternatives & Comparisons
What are some alternatives to Clickhouse?
See all alternatives
Partitioning means that Cassandra can distribute your data across multiple machines in an application-transparent matter. Cassandra will automatically repartition as machines are added and removed from the cluster. Row store means that like relational databases, Cassandra organizes data by rows and columns. The Cassandra Query Language (CQL) is a close relative of SQL.
Elasticsearch is a distributed, RESTful search and analytics engine capable of storing data and searching it in near real time. Elasticsearch, Kibana, Beats and Logstash are the Elastic Stack (sometimes called the ELK Stack).
The MySQL software delivers a very fast, multi-threaded, multi-user, and robust SQL (Structured Query Language) database server. MySQL Server is intended for mission-critical, heavy-load production systems as well as for embedding into mass-deployed software.
InfluxDB is a scalable datastore for metrics, events, and real-time analytics. It has a built-in HTTP API so you don't have to write any server side code to get up and running. InfluxDB is designed to be scalable, simple to install and manage, and fast to get data in and out.
Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power exploratory dashboards in multi-tenant environments. Druid excels as a data warehousing solution for fast aggregate queries on petabyte sized data sets. Druid supports a variety of flexible filters, exact calculations, approximate algorithms, and other useful calculations.