Neo4j vs OrientDB

Overview

Neo4j

Stacks1.2K

Followers1.4K

Votes351

GitHub Stars15.3K

Forks2.5K

OrientDB

Stacks77

Followers107

Votes14

Neo4j vs OrientDB: What are the differences?

Introduction

In this article, we will discuss the key differences between Neo4j and OrientDB, two popular graph databases. Graph databases are designed to handle highly connected data, making them suitable for use cases such as social networks, recommendation engines, and fraud detection systems. While both Neo4j and OrientDB offer graph database functionalities, they differ in several aspects, which are outlined below.

Data Model: Neo4j uses a property graph data model, where nodes represent entities and edges represent relationships between those entities. Each node and edge can have key-value properties associated with it. On the other hand, OrientDB supports multiple data models, including graph, document, and key-value models. This flexibility allows OrientDB to handle different types of data more seamlessly.
Language Compatibility: Neo4j uses the Cypher query language, which is specifically designed for querying graph data. It offers a simple and expressive syntax, making it easy to work with graph structures. In contrast, OrientDB supports multiple query languages, including SQL for querying relational data, and Gremlin for querying graph data. This provides more flexibility in terms of language choice, but may require additional learning if using OrientDB.
Scalability: Neo4j is known for its scalability and ability to handle large graph datasets. It offers horizontal scalability through a clustering feature called Neo4j Causal Clustering, which allows for high availability and fault tolerance. OrientDB also supports horizontal scalability, but its clustering mechanism, called Distributed Multi-Master Architecture, requires manual configuration and does not provide the same level of automatic failover as Neo4j.
Consistency: Neo4j guarantees strict consistency, meaning that all nodes and relationships are always in a fully consistent state. This ensures data integrity but can impact the availability of the database, especially in distributed environments. OrientDB, on the other hand, offers eventual consistency, which allows for better availability but may result in temporary inconsistency during concurrent updates.
Native Graph Storage: Neo4j has a native graph storage engine optimized for handling graph data structures efficiently. This enables fast traversal and querying of graph structures, making it suitable for use cases that heavily rely on graph traversals. OrientDB, on the other hand, does not have a dedicated native graph storage engine, as it is designed to support multiple data models. While OrientDB can handle graph data efficiently, it may not provide the same level of performance as Neo4j for graph-specific operations.
Community and Ecosystem: Neo4j has a large and active community, with extensive documentation, tutorials, and a wide range of third-party integrations and tools. This vibrant ecosystem makes it easy to find support and resources when working with Neo4j. Although OrientDB also has a community and ecosystem, it is relatively smaller compared to Neo4j, which may result in fewer readily available resources and integrations.

In summary, Neo4j and OrientDB differ in their data models, query languages, scalability mechanisms, consistency guarantees, native graph storage optimization, and community/ecosystem size. Each database has its own strengths and considerations, and the choice between them depends on the specific requirements and use case of the project.

Share your Stack

Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.

View Docs

CLI (Node.js)

Manual

Advice on Neo4j, OrientDB

Thiru

Jul 16, 2020

Needs adviceon

MySQL

Neo4j

OrientDB

We have an in-house build experiment management system. We produce samples as input to the next step, which then could produce 1 sample(1-1) and many samples (1 - many). There are many steps like this. So far, we are tracking genealogy (limited tracking) in the MySQL database, which is becoming hard to trace back to the original material or sample(I can give more details if required). So, we are considering a Graph database. I am requesting advice from the experts.

Is a graph database the right choice, or can we manage with RDBMS?
If RDBMS, which RDMS, which feature, or which approach could make this manageable or sustainable
If Graph database(@{Neo4j}|tool:1482|, @{OrientDB}|tool:1611|, @{Azure Cosmos DB}|tool:1369|, @{Amazon Neptune}|tool:7979|, @{ArangoDB}|tool:1440|), which one is good, and what are the best practices?

I am sorry that this might be a loaded question.

217k views217k

Comments

Detailed Comparison

Neo4j	OrientDB
Neo4j stores data in nodes connected by directed, typed relationships with properties on both, also known as a Property Graph. It is a high performance graph store with all the features expected of a mature and robust database, like a friendly query language and ACID transactions.	It is an open source NoSQL database management system written in Java. It is a Multi-model database, supporting graph, document, key/value, and object models, but the relationships are managed as in graph databases with direct connections between records.
intuitive, using a graph model for data representation;reliable, with full ACID transactions;durable and fast, using a custom disk-based, native storage engine;massively scalable, up to several billion nodes/relationships/properties;highly-available, when distributed across multiple machines;expressive, with a powerful, human readable graph query language;fast, with a powerful traversal framework for high-speed graph queries;embeddable, with a few small jars;simple, accessible by a convenient REST interface or an object-oriented Java API	-
Statistics
GitHub Stars 15.3K	GitHub Stars -
GitHub Forks 2.5K	GitHub Forks -
Stacks 1.2K	Stacks 77
Followers 1.4K	Followers 107
Votes 351	Votes 14
Pros & Cons
Pros 69 Cypher – graph query language 61 Great graphdb 33 Open source 31 Rest api 27 High-Performance Native API Cons 9 Comparably slow 4 Can't store a vertex as JSON 1 Doesn't have a managed cloud service at low cost	Pros 4 Great graphdb 2 Open source 2 Great support 1 Multi-Model/Paradigm 1 Highly-available Cons 4 Unstable

What are some alternatives to Neo4j, OrientDB?

MongoDB

MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema. MongoDB was also designed for high availability and scalability, with built-in replication and auto-sharding.

MySQL

The MySQL software delivers a very fast, multi-threaded, multi-user, and robust SQL (Structured Query Language) database server. MySQL Server is intended for mission-critical, heavy-load production systems as well as for embedding into mass-deployed software.

PostgreSQL

PostgreSQL is an advanced object-relational database management system that supports an extended subset of the SQL standard, including transactions, foreign keys, subqueries, triggers, user-defined types and functions.

Microsoft SQL Server

Microsoft® SQL Server is a database management and analysis system for e-commerce, line-of-business, and data warehousing solutions.

SQLite

SQLite is an embedded SQL database engine. Unlike most other SQL databases, SQLite does not have a separate server process. SQLite reads and writes directly to ordinary disk files. A complete SQL database with multiple tables, indices, triggers, and views, is contained in a single disk file.

Cassandra

Partitioning means that Cassandra can distribute your data across multiple machines in an application-transparent matter. Cassandra will automatically repartition as machines are added and removed from the cluster. Row store means that like relational databases, Cassandra organizes data by rows and columns. The Cassandra Query Language (CQL) is a close relative of SQL.

Memcached

Memcached is an in-memory key-value store for small chunks of arbitrary data (strings, objects) from results of database calls, API calls, or page rendering.

MariaDB

Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry. MariaDB is designed as a drop-in replacement of MySQL(R) with more features, new storage engines, fewer bugs, and better performance.

RethinkDB

RethinkDB is built to store JSON documents, and scale to multiple machines with very little effort. It has a pleasant query language that supports really useful queries like table joins and group by, and is easy to setup and learn.

ArangoDB

A distributed free and open-source database with a flexible data model for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.

Related Comparisons

Neo4j vs OrientDB: What are the differences?

Introduction

Data Model: Neo4j uses a property graph data model, where nodes represent entities and edges represent relationships between those entities. Each node and edge can have key-value properties associated with it. On the other hand, OrientDB supports multiple data models, including graph, document, and key-value models. This flexibility allows OrientDB to handle different types of data more seamlessly.
Language Compatibility: Neo4j uses the Cypher query language, which is specifically designed for querying graph data. It offers a simple and expressive syntax, making it easy to work with graph structures. In contrast, OrientDB supports multiple query languages, including SQL for querying relational data, and Gremlin for querying graph data. This provides more flexibility in terms of language choice, but may require additional learning if using OrientDB.
Scalability: Neo4j is known for its scalability and ability to handle large graph datasets. It offers horizontal scalability through a clustering feature called Neo4j Causal Clustering, which allows for high availability and fault tolerance. OrientDB also supports horizontal scalability, but its clustering mechanism, called Distributed Multi-Master Architecture, requires manual configuration and does not provide the same level of automatic failover as Neo4j.
Consistency: Neo4j guarantees strict consistency, meaning that all nodes and relationships are always in a fully consistent state. This ensures data integrity but can impact the availability of the database, especially in distributed environments. OrientDB, on the other hand, offers eventual consistency, which allows for better availability but may result in temporary inconsistency during concurrent updates.
Native Graph Storage: Neo4j has a native graph storage engine optimized for handling graph data structures efficiently. This enables fast traversal and querying of graph structures, making it suitable for use cases that heavily rely on graph traversals. OrientDB, on the other hand, does not have a dedicated native graph storage engine, as it is designed to support multiple data models. While OrientDB can handle graph data efficiently, it may not provide the same level of performance as Neo4j for graph-specific operations.
Community and Ecosystem: Neo4j has a large and active community, with extensive documentation, tutorials, and a wide range of third-party integrations and tools. This vibrant ecosystem makes it easy to find support and resources when working with Neo4j. Although OrientDB also has a community and ecosystem, it is relatively smaller compared to Neo4j, which may result in fewer readily available resources and integrations.

Neo4j vs OrientDB

Overview

Neo4j vs OrientDB: What are the differences?