Datadog vs Prometheus

Overview

Datadog
Stacks: 9.8K
Followers: 8.2K
Votes: 861

Prometheus
Stacks: 4.8K
Followers: 3.8K
Votes: 239
GitHub Stars: 61.1K
Forks: 9.9K

Datadog vs Prometheus: What are the differences?

Introduction

In this article, we will explore the key differences between Datadog and Prometheus, two popular monitoring and observability tools.

Data Collection: Datadog collects and visualizes metrics, traces, and logs from various sources, including custom applications, cloud providers, and infrastructure. It supports a wide range of integrations and provides an agent-based or agentless data collection approach. Prometheus, on the other hand, focuses on monitoring infrastructure and application metrics. It uses a pull-based model, where a Prometheus server scrapes metrics from configured targets at regular intervals.
Scalability and Performance: Datadog is a hosted SaaS platform, so storage and query capacity are scaled by the vendor and users only deploy agents and integrations. Prometheus is self-hosted, and each server runs as a single node whose capacity the operator manages. A single server is often enough for small and mid-sized environments; larger deployments typically shard targets across several servers and add federation or a remote long-term storage system such as Thanos or Cortex.
Alerting and Notification: Datadog provides a comprehensive alerting and notification system. It offers flexible alerting rules, integrates with various notification channels like email, SMS, and PagerDuty, and supports advanced features like anomaly detection and machine learning-based alerting. Prometheus evaluates alerting rules written in PromQL and forwards firing alerts to Alertmanager, a separate component that groups, silences, and routes notifications to receivers such as email, PagerDuty, and webhooks. However, it lacks some advanced features available in Datadog, such as built-in anomaly detection.
Data Storage and Retention: Datadog stores metrics and logs on its hosted platform, with retention periods that depend on the data type and plan, and provides query and visualization capabilities for historical data. Prometheus stores data in a local on-disk time series database with a default retention of 15 days, configurable through the --storage.tsdb.retention.time flag. Local storage is not replicated or clustered, so durable long-term retention usually relies on the remote write and remote read interfaces and an external storage system.
Query Language and Analysis: Datadog uses its own query syntax for metrics, which combines an aggregator, a metric name, a tag scope, and an optional grouping, for example avg:system.cpu.user{env:prod} by {host}. It offers built-in functions, arithmetic operators, and visualizations for analyzing data. Prometheus uses its own query language called PromQL (Prometheus Query Language) for data retrieval and analysis. It provides label-based filtering, aggregation, and time series functions such as rate(), but may have a steeper learning curve for beginners.
Ecosystem and Integrations: Datadog offers a rich ecosystem of integrations with popular services, frameworks, and platforms like AWS and Kubernetes, alongside its own APM and Log Management products. It provides pre-built dashboards and out-of-the-box integrations with third-party tools. Prometheus also has a strong ecosystem, with exporters available for various services and applications. However, it may require additional configuration and customization to integrate with certain systems.
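
To make the pull-based model from the Data Collection point more concrete, here is a minimal sketch in Python of what a scrape involves: a target renders its current metric values in the Prometheus text exposition format, and the scraper fetches and parses that text. The function names (render_metric, parse_metrics, scrape) are hypothetical, the fetch callables stand in for an HTTP GET of a target's /metrics endpoint, and the parser is deliberately simplified (no timestamps, no escaped characters in label values). It is an illustration of the idea, not how the Prometheus server itself is implemented.

```python
import re

# Simplified patterns for one sample line: name{label="value",...} value
SAMPLE_RE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)$')
LABEL_RE = re.compile(r'([a-zA-Z_][a-zA-Z0-9_]*)="([^"]*)"')


def render_metric(name, metric_type, help_text, samples):
    """Render one metric family as text, the way a target would expose it.

    samples is a list of (labels_dict, value) pairs.
    """
    lines = [f"# HELP {name} {help_text}", f"# TYPE {name} {metric_type}"]
    for labels, value in samples:
        if labels:
            label_str = ",".join(f'{k}="{v}"' for k, v in sorted(labels.items()))
            lines.append(f"{name}{{{label_str}}} {value}")
        else:
            lines.append(f"{name} {value}")
    return "\n".join(lines) + "\n"


def parse_metrics(text):
    """Parse exposition text into (name, labels, value) tuples.

    Comment lines (# HELP, # TYPE) and unparseable lines are skipped.
    """
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        match = SAMPLE_RE.match(line)
        if match is None:
            continue
        name, raw_labels, value = match.groups()
        labels = dict(LABEL_RE.findall(raw_labels or ""))
        samples.append((name, labels, float(value)))
    return samples


def scrape(targets):
    """Pull metrics from every target.

    targets maps a target name to a zero-argument callable returning
    exposition text (a stand-in for an HTTP GET of /metrics).
    """
    return {name: parse_metrics(fetch()) for name, fetch in targets.items()}


if __name__ == "__main__":
    exposed = render_metric(
        "http_requests_total",
        "counter",
        "Total HTTP requests.",
        [({"method": "get", "code": "200"}, 1027)],
    )
    print(exposed)
    print(scrape({"web-1": lambda: exposed}))
```

The key design point is the direction of the connection: the monitoring side decides when to collect and from whom, while the application only has to expose its current values. An agent-based push model reverses this, with the agent sending data to the backend.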

In summary, the key differences between Datadog and Prometheus lie in their data collection approaches, scalability, alerting capabilities, data storage and retention, query languages, and ecosystem and integration options. The choice between these tools depends on specific monitoring and observability requirements, infrastructure size, and resources available for management and scaling.
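
As an illustration of the kind of time series function a query language like PromQL offers, the sketch below approximates what an expression such as rate(http_requests_total[1m]) computes: the per-second increase of a counter over a window, treating any drop in value as a counter reset. The function name simple_rate is hypothetical, and the sketch is a simplification: the real PromQL rate() also extrapolates to the window boundaries, which is omitted here.

```python
def simple_rate(samples):
    """Approximate per-second rate of increase of a counter.

    samples is a list of (timestamp_seconds, counter_value) pairs sorted
    by time. Returns None when there are too few samples to compute a rate.
    """
    if len(samples) < 2:
        return None

    increase = 0.0
    for (_, prev), (_, cur) in zip(samples, samples[1:]):
        if cur < prev:
            # Counter reset: assume the counter restarted from zero.
            increase += cur
        else:
            increase += cur - prev

    elapsed = samples[-1][0] - samples[0][0]
    if elapsed <= 0:
        return None
    return increase / elapsed


if __name__ == "__main__":
    # Scraped every 15 seconds; the process restarts between t=30 and t=45.
    window = [(0, 100), (15, 130), (30, 160), (45, 10), (60, 40)]
    print(simple_rate(window))
```

Counters that only ever increase, combined with a rate function at query time, are what let a pull-based system tolerate a missed scrape: the next sample still carries the cumulative total.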


Advice on Datadog, Prometheus

Raja Subramaniam

Aug 27, 2019

Needs advice on

Prometheus

Kubernetes

Sysdig

We have Prometheus as the monitoring engine in our stack, which contains a Kubernetes cluster, container images, and other open source tools. I am aware that Sysdig can be integrated with Prometheus, but I would like to know whether Sysdig alone or Sysdig plus Prometheus will make the better monitoring solution.

779k views

Farzeem Diamond

Software Engineer at IVP

Jul 21, 2020

Needs advice on

Datadog

Dynatrace

AppDynamics

Hey there! We are looking at Datadog, Dynatrace, AppDynamics, and New Relic as options for our web application monitoring.

Current Environment: .NET Core Web app hosted on Microsoft IIS

Future Environment: Web app will be hosted on Microsoft Azure

Tech Stacks: IIS, RabbitMQ, Redis, Microsoft SQL Server

Requirements: Infra Monitoring, APM, Real User Monitoring (user activity monitoring, i.e., time spent on a page, most active page, etc.), Service Tracing, Root Cause Analysis, and Centralized Log Management.

Please advise on the above. Thanks!

1.59M views

Medeti

Jun 27, 2020

Needs advice on

Amazon EKS

Kubernetes

AWS Elastic Load Balancing (ELB)

We are looking for a centralised monitoring solution for our application deployed on Amazon EKS. We would like to monitor using metrics from Kubernetes, AWS services (NeptuneDB, AWS Elastic Load Balancing (ELB), Amazon EBS, Amazon S3, etc.), and our application microservices' custom metrics.

We expect to use around 80 microservices (not counting replicas). I think there will be a total of 200-250 microservices in the system, with 10-12 slave nodes.

We tried Prometheus, but it looks like maintenance is a big issue. We need to manage scaling, maintain the storage, and deal with multiple exporters and Grafana. I felt this alone needs a few dedicated resources (at least 2-3 people) to manage. Not sure if I am thinking in the correct direction. Please confirm.

You mentioned that Datadog and Sysdig charge per host. Do they charge per slave node?

1.51M views

Detailed Comparison

Datadog	Prometheus
Datadog is the leading service for cloud-scale monitoring. It is used by IT, operations, and development teams who build and operate applications that run on dynamic or hybrid cloud infrastructure. Start monitoring in minutes with Datadog!	Prometheus is a systems and service monitoring system. It collects metrics from configured targets at given intervals, evaluates rule expressions, displays the results, and can trigger alerts if some condition is observed to be true.
14-day Free Trial for an unlimited number of hosts; 200+ turn-key integrations for data aggregation; Clean graphs of StatsD and other integrations; Slice and dice graphs and alerts by tags, roles, and more; Easy-to-use search for hosts, metrics, and tags; Alert notifications via e-mail and PagerDuty; Receive alerts on any metric, for a single host or an entire cluster; Full API access in more than 15 languages; Overlay metrics and events across disparate sources; Out-of-the-box and customizable monitoring dashboards; Easy way to compute rates, ratios, averages, or integrals; Sampling intervals of 10 seconds; Mute all alerts with 1 click during upgrades and maintenance; Tools for team collaboration	Dimensional data; Powerful queries; Great visualization; Efficient storage; Precise alerting; Simple operation
Statistics
GitHub Stars -	GitHub Stars 61.1K
GitHub Forks -	GitHub Forks 9.9K
Stacks 9.8K	Stacks 4.8K
Followers 8.2K	Followers 3.8K
Votes 861	Votes 239
Pros & Cons (vote counts in parentheses)
Pros: Monitoring for many apps (databases, web servers, etc) (140); Easy setup (107); Powerful UI (87); Powerful integrations (84); Great value (70). Cons: Expensive (20); No errors exception tracking (4); External network goes down, you won't be logging (2); Complicated (1)	Pros: Powerful easy to use monitoring (47); Flexible query language (38); Dimensional data model (32); Alerts (27); Active and responsive community (23). Cons: Just for metrics (12); Bad UI (6); Needs monitoring to access metrics endpoints (6); Not easy to configure and use (4); Supports only active agents (3)
Integrations
NGINX, Google App Engine, Apache HTTP Server, Java, Docker, Pingdom, MySQL, Ruby, Python, Memcached	Grafana

What are some alternatives to Datadog, Prometheus?

New Relic

The world’s best software and DevOps teams rely on New Relic to move faster, make better decisions and create best-in-class digital experiences. If you run software, you need to run New Relic. More than 50% of the Fortune 100 do too.

Grafana

Grafana is a general purpose dashboard and graph composer. It's focused on providing rich ways to visualize time series metrics, mainly through graphs, but supports other ways to visualize data through a pluggable panel architecture. It currently has rich support for Graphite, InfluxDB, and OpenTSDB, and supports other data sources via plugins.

Kibana

Kibana is an open source (Apache Licensed), browser based analytics and search dashboard for Elasticsearch. Kibana is a snap to setup and start using. Kibana strives to be easy to get started with, while also being flexible and powerful, just like Elasticsearch.

Raygun

Raygun gives you a window into how users are really experiencing your software applications. Detect, diagnose and resolve issues that are affecting end users with greater speed and accuracy.

Nagios

Nagios is a host/service/network monitoring program written in C and released under the GNU General Public License.

AppSignal

AppSignal gives you and your team alerts and detailed metrics about your Ruby, Node.js or Elixir application. Sensible pricing, no aggressive sales & support by developers.

Netdata

Netdata collects metrics per second and presents them in low-latency dashboards. It's designed to run on all of your physical and virtual servers, cloud deployments, Kubernetes clusters, and edge/IoT devices, to monitor systems, containers, and apps.

AppDynamics

AppDynamics develops application performance management (APM) solutions that deliver problem resolution for highly distributed applications through transaction flow monitoring and deep diagnostics.

Zabbix

Zabbix is a mature and effortless enterprise-class open source monitoring solution for network monitoring and application monitoring of millions of metrics.

Sensu

Sensu is the future-proof solution for multi-cloud monitoring at scale. The Sensu monitoring event pipeline empowers businesses to automate their monitoring workflows and gain deep visibility into their multi-cloud environments.
