Aerospike vs Scylla

Overview

Aerospike

Stacks200

Followers288

Votes48

GitHub Stars1.3K

Forks196

ScyllaDB

Stacks143

Followers197

Votes8

Aerospike vs Scylla: What are the differences?

Aerospike vs. Scylla: Key Differences

Introduction

Aerospike and Scylla are both highly scalable NoSQL databases used for big data applications. While both offer high performance and low-latency data storage, they have key differences that distinguish them from each other.

Data Model: Aerospike uses a key-value data model, where each record is identified by a unique key and contains a set of bins (name-value pairs). On the other hand, Scylla is based on the Apache Cassandra data model, using a wide-column data model where data is organized into rows, columns, and column families. This allows Scylla to handle complex and dynamic data with a flexible schema.
Consistency Model: Aerospike provides strong consistency by default, ensuring that all replicas in a cluster are updated before acknowledging a write operation. In contrast, Scylla, like Cassandra, adopts a tunable consistency model that allows users to choose the level of consistency required for each operation. This allows for better trade-offs between latency and consistency based on specific application needs.
Distribution Strategy: Aerospike distributes data using a data replication strategy called "replication factor," where each record is replicated on multiple nodes. This ensures high availability and fault tolerance. On the other hand, Scylla uses a consistent hashing algorithm to distribute data across a cluster using virtual nodes. This allows for better load balancing and automatic data redistribution when nodes are added or removed from the cluster.
Query Language: Aerospike uses a SQL-like query language called Aerospike Query Language (AQL) to perform queries and aggregations on data. It offers a powerful set of functionalities for data manipulation. In contrast, Scylla uses Cassandra Query Language (CQL), which is based on SQL but also offers additional features specific to Cassandra's data model, such as wide-column support and secondary indexes.
Write Path Optimization: Aerospike optimizes the write path by storing data in memory and persisting it to disk asynchronously. This allows for extremely fast write operations and high throughput. Scylla, on the other hand, utilizes a log-structured merge (LSM) tree data structure to optimize write operations. It writes data sequentially to disk, minimizing disk I/O and enabling efficient compaction and recovery processes.
Community Support and Adoption: Aerospike has a dedicated and active community, with a strong focus on high-performance and low-latency use cases. Scylla, on the other hand, has gained significant traction among the developer community due to its compatibility with Apache Cassandra and its ability to handle large-scale workloads with low latencies.

In summary, Aerospike and Scylla differ in terms of their data models, consistency models, distribution strategies, query languages, write path optimization, and community support. These differences highlight the unique strengths and use cases of each database, allowing developers to choose the one that best suits their specific requirements.

Share your Stack

Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.

View Docs

CLI (Node.js)

Manual

Advice on Aerospike, ScyllaDB

Tom

CEO at Gentlent

Jun 9, 2020

Decided

The Gentlent Tech Team made lots of updates within the past year. The biggest one being our database:

We decided to migrate our #PostgreSQL -based database systems to a custom implementation of #Cassandra . This allows us to integrate our product data perfectly in a system that just makes sense. High availability and scalability are supported out of the box.

387k views387k

Comments

Vinay

Head of Engineering

Sep 19, 2019

Needs advice

The problem I have is - we need to process & change(update/insert) 55M Data every 2 min and this updated data to be available for Rest API for Filtering / Selection. Response time for Rest API should be less than 1 sec.

The most important factors for me are processing and storing time of 2 min. There need to be 2 views of Data One is for Selection & 2. Changed data.

174k views174k

Comments

Detailed Comparison

Aerospike	ScyllaDB
Aerospike is an open-source, modern database built from the ground up to push the limits of flash storage, processors and networks. It was designed to operate with predictable low latency at high throughput with uncompromising reliability – both high availability and ACID guarantees.	ScyllaDB is the database for data-intensive apps that require high performance and low latency. It enables teams to harness the ever-increasing computing power of modern infrastructures – eliminating barriers to scale as data grows.
99% of reads/writes complete in under 1 millisecond.;Predictable low latency at high throughput – second to none. Read the YCSB Benchmark.;The secret sauce? A thousand things done right. Server code in ‘C’ (not Java or Erlang) precisely tuned to avoid context switching and memory copies. Highly parallelized multi-threaded, multi-core, multi-cpu, multi-SSD execution.;Indexes are always stored in RAM. Pure RAM mode is backed by spinning disks. In hybrid mode, individual tables are stored in either RAM or flash.	High availability; horizontal scalability; vertical scalability; Cassandra compatible; DynamoDB compatible; wide column; NoSQL; lightweight transactions; change data capture; workload prioritization; shard-per-core; IO scheduler; self-tuning
Statistics
GitHub Stars 1.3K	GitHub Stars -
GitHub Forks 196	GitHub Forks -
Stacks 200	Stacks 143
Followers 288	Followers 197
Votes 48	Votes 8
Pros & Cons
Pros 16 Ram and/or ssd persistence 12 Easy clustering support 5 Easy setup 4 Acid 3 Performance better than Redis	Pros 2 Replication 1 High availability 1 Scale up 1 High performance 1 Distributed
Integrations
No integrations available	KairosDB Wireshark JanusGraph Grafana Hackolade Prometheus Kubernetes Datadog Kafka Apache Spark

What are some alternatives to Aerospike, ScyllaDB?

MongoDB

MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema. MongoDB was also designed for high availability and scalability, with built-in replication and auto-sharding.

Redis

Redis is an open source (BSD licensed), in-memory data structure store, used as a database, cache, and message broker. Redis provides data structures such as strings, hashes, lists, sets, sorted sets with range queries, bitmaps, hyperloglogs, geospatial indexes, and streams.

MySQL

The MySQL software delivers a very fast, multi-threaded, multi-user, and robust SQL (Structured Query Language) database server. MySQL Server is intended for mission-critical, heavy-load production systems as well as for embedding into mass-deployed software.

PostgreSQL

PostgreSQL is an advanced object-relational database management system that supports an extended subset of the SQL standard, including transactions, foreign keys, subqueries, triggers, user-defined types and functions.

Microsoft SQL Server

Microsoft® SQL Server is a database management and analysis system for e-commerce, line-of-business, and data warehousing solutions.

SQLite

SQLite is an embedded SQL database engine. Unlike most other SQL databases, SQLite does not have a separate server process. SQLite reads and writes directly to ordinary disk files. A complete SQL database with multiple tables, indices, triggers, and views, is contained in a single disk file.

Cassandra

Partitioning means that Cassandra can distribute your data across multiple machines in an application-transparent matter. Cassandra will automatically repartition as machines are added and removed from the cluster. Row store means that like relational databases, Cassandra organizes data by rows and columns. The Cassandra Query Language (CQL) is a close relative of SQL.

Memcached

Memcached is an in-memory key-value store for small chunks of arbitrary data (strings, objects) from results of database calls, API calls, or page rendering.

MariaDB

Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry. MariaDB is designed as a drop-in replacement of MySQL(R) with more features, new storage engines, fewer bugs, and better performance.

RethinkDB

RethinkDB is built to store JSON documents, and scale to multiple machines with very little effort. It has a pleasant query language that supports really useful queries like table joins and group by, and is easy to setup and learn.

Related Comparisons

Aerospike vs Scylla: What are the differences?

Aerospike vs. Scylla: Key Differences

Introduction

Data Model: Aerospike uses a key-value data model, where each record is identified by a unique key and contains a set of bins (name-value pairs). On the other hand, Scylla is based on the Apache Cassandra data model, using a wide-column data model where data is organized into rows, columns, and column families. This allows Scylla to handle complex and dynamic data with a flexible schema.
Consistency Model: Aerospike provides strong consistency by default, ensuring that all replicas in a cluster are updated before acknowledging a write operation. In contrast, Scylla, like Cassandra, adopts a tunable consistency model that allows users to choose the level of consistency required for each operation. This allows for better trade-offs between latency and consistency based on specific application needs.
Distribution Strategy: Aerospike distributes data using a data replication strategy called "replication factor," where each record is replicated on multiple nodes. This ensures high availability and fault tolerance. On the other hand, Scylla uses a consistent hashing algorithm to distribute data across a cluster using virtual nodes. This allows for better load balancing and automatic data redistribution when nodes are added or removed from the cluster.
Query Language: Aerospike uses a SQL-like query language called Aerospike Query Language (AQL) to perform queries and aggregations on data. It offers a powerful set of functionalities for data manipulation. In contrast, Scylla uses Cassandra Query Language (CQL), which is based on SQL but also offers additional features specific to Cassandra's data model, such as wide-column support and secondary indexes.
Write Path Optimization: Aerospike optimizes the write path by storing data in memory and persisting it to disk asynchronously. This allows for extremely fast write operations and high throughput. Scylla, on the other hand, utilizes a log-structured merge (LSM) tree data structure to optimize write operations. It writes data sequentially to disk, minimizing disk I/O and enabling efficient compaction and recovery processes.
Community Support and Adoption: Aerospike has a dedicated and active community, with a strong focus on high-performance and low-latency use cases. Scylla, on the other hand, has gained significant traction among the developer community due to its compatibility with Apache Cassandra and its ability to handle large-scale workloads with low latencies.

Aerospike vs Scylla

Overview

Aerospike vs Scylla: What are the differences?

Aerospike vs. Scylla: Key Differences