Need advice about which tool to choose?Ask the StackShare community!
ClustrixDB vs RocksDB: What are the differences?
ClustrixDB: Clustered SQL database built for e-commerce. ClustrixDB is a scale-out SQL database built from the ground up with a distributed shared nothing architecture, automatic data redistribution (so you never need to shard), with built in fault tolerance, all accessible by a simple SQL interface and support for business critical MySQL features – replication, triggers, stored routines, etc; RocksDB: Embeddable persistent key-value store for fast storage, developed and maintained by Facebook Database Engineering Team. RocksDB is an embeddable persistent key-value store for fast storage. RocksDB can also be the foundation for a client-server database but our current focus is on embedded workloads. RocksDB builds on LevelDB to be scalable to run on servers with many CPU cores, to efficiently use fast storage, to support IO-bound, in-memory and write-once workloads, and to be flexible to allow for innovation.
ClustrixDB and RocksDB can be categorized as "Databases" tools.
Some of the features offered by ClustrixDB are:
- Is built from the ground up with a shared-nothing architecture. There is no MySQL code in ClustrixDB
- Is built to scale transactions while maintaning ACID
- Scales to add capacity by simply adding commodity servers to the cluster
On the other hand, RocksDB provides the following key features:
- Designed for application servers wanting to store up to a few terabytes of data on locally attached Flash drives or in RAM
- Optimized for storing small to medium size key-values on fast storage -- flash devices or in-memory
- Scales linearly with number of CPUs so that it works well on ARM processors
RocksDB is an open source tool with 14.3K GitHub stars and 3.12K GitHub forks. Here's a link to RocksDB's open source repository on GitHub.
I am researching different querying solutions to handle ~1 trillion records of data (in the realm of a petabyte). The data is mostly textual. I have identified a few options: Milvus, HBase, RocksDB, and Elasticsearch. I was wondering if there is a good way to compare the performance of these options (or if anyone has already done something like this). I want to be able to compare the speed of ingesting and querying textual data from these tools. Does anyone have information on this or know where I can find some? Thanks in advance!
You've probably come to a decision already but for those reading...here are some resources we put together to help people learn more about Milvus and other databases https://zilliz.com/comparison and https://github.com/zilliztech/VectorDBBench. I don't think they include RocksDB or HBase yet (you could could recommend on GitHub) but hopefully they help answer your Elastic Search questions.
Pros of ClustrixDB
- ClustrixDB is a scale-out RDBMS and drop-in replacement1
- Very High Connection Count1
- Relational Scale-Out database1
Pros of RocksDB
- Very fast5
- Made by Facebook3
- Consistent performance2
- Ability to add logic to the database layer where needed1