Jawahir Kasim
jawahirak
Data Engineer
University of Utah
3 points
Tools jawahirak is Following
Apache Spark
spark.apache.org
Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters th...
YARN Hadoop
hadoop.apache.org/docs/curr...
Its fundamental idea is to split up the functionalities of resource management and job scheduling/monitorin...
Dataiku
dataiku.com
It is the platform democratizing access to data and enabling enterprises to build their own path to AI in a...
Databricks
databricks.com
Databricks Unified Analytics Platform, from the original creators of Apache Spark™, unifies data science an...
Delta Lake
delta.io
An open-source storage layer that brings ACID transactions to Apache Spark™ and big data workloads.