Data Engineer at Tata Consultancy Services

I need to collect data from multiple sources and store it in a single cloud location, then clean and transform it using PySpark and push the results to other applications, such as reporting tools. What would be the best solution? I can only think of Azure Data Factory + Databricks. Are there any alternatives to #AWS services + Databricks?

Vamshi Krishna
