Need advice about which tool to choose?Ask the StackShare community!
CDAP vs Amundsen: What are the differences?
Developers describe CDAP as "Open source virtualization platform for Hadoop data and apps". Cask Data Application Platform (CDAP) is an open source application development platform for the Hadoop ecosystem that provides developers with data and application virtualization to accelerate application development, address a broader range of real-time and batch use cases, and deploy applications into production while satisfying enterprise requirements. On the other hand, Amundsen is detailed as "A metadata driven application for improving the productivity of data analysts, data scientists and engineers". It is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
CDAP and Amundsen can be categorized as "Big Data" tools.
Some of the features offered by CDAP are:
- Streams for data ingestion
- Reusable libraries for common Big Data access patterns
- Data available to multiple applications and different paradigms
On the other hand, Amundsen provides the following key features:
- Datasets (Tables) schema and usage frequency/popularity
- Users bookmark, owner, frequent user
- Dashboard popularity, lineage to datasets
CDAP and Amundsen are both open source tools. It seems that Amundsen with 889 GitHub stars and 163 forks on GitHub has more adoption than CDAP with 430 GitHub stars and 233 GitHub forks.