Amazon Athena vs Azure Functions

Overview

Azure Functions

Stacks785

Followers705

Votes62

Amazon Athena

Stacks521

Followers840

Votes49

Amazon Athena vs Azure Functions: What are the differences?

Key Differences between Amazon Athena and Azure Functions

Amazon Athena and Azure Functions are both cloud-based services provided by Amazon Web Services (AWS) and Microsoft Azure respectively. While they are both used in the cloud computing domain, they have some key differences.

Purpose and Functionality: Amazon Athena is an interactive query service that allows users to analyze data in Amazon S3 using SQL. It is primarily used for ad-hoc querying and analysis of data. On the other hand, Azure Functions is a serverless computing service that allows developers to run event-driven code in Azure. It enables the development of scalable, event-based applications and offers various triggers and bindings to integrate with other Azure services.
Language Support: Amazon Athena supports SQL as its query language, making it accessible to users familiar with SQL. However, Azure Functions supports a wider range of programming languages including C#, JavaScript, PowerShell, Python, and TypeScript. This gives developers more flexibility in choosing the language they are most comfortable with.
Billing Model: Amazon Athena follows a pay-per-query model, where users are charged based on the amount of data scanned by their queries. This provides cost-effectiveness for occasional and small-scale queries. On the other hand, Azure Functions follows a pay-as-you-go pricing model, where users are charged based on the number of executions and the resources utilized by their functions. This allows users to pay only for the actual usage of their functions.
Event Triggers: Azure Functions supports a wide range of triggers such as HTTP requests, timers, message queues, and database events. These triggers can automatically invoke the functions when specific events occur. In contrast, Amazon Athena does not have built-in event triggers. It relies on users manually invoking queries through the Athena API or the AWS Management Console.
Scalability and Infrastructure: Azure Functions automatically scales based on the number of incoming requests and the configured scale settings. It abstracts away the underlying infrastructure, allowing developers to focus on writing code. Amazon Athena, on the other hand, is built on top of Amazon S3 and uses a distributed query engine to process queries. It scales automatically to handle large datasets but may require users to optimize their queries for better performance.
Integration with Other Services: Azure Functions has tight integration with various Azure services like Azure Storage, Azure Event Hubs, Azure Cosmos DB, and Azure Service Bus. This enables seamless integration and data processing between different services. Amazon Athena can integrate with other AWS services such as AWS Glue, Amazon QuickSight, and Amazon QuickSight to form a comprehensive data analytics stack within the AWS ecosystem.

In summary, Amazon Athena is primarily focused on ad-hoc querying and analysis of data using SQL, while Azure Functions is geared towards building event-driven applications with support for multiple programming languages and a wide range of event triggers.

Share your Stack

Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.

View Docs

CLI (Node.js)

Manual

Advice on Azure Functions, Amazon Athena

Mark

Nov 2, 2020

Needs adviceon

Microsoft Azure

Need advice on what platform, systems and tools to use.

Evaluating whether to start a new digital business for which we will need to build a website that handles all traffic. Website only right now. May add smartphone apps later. No desktop app will ever be added. Website to serve various countries and languages. B2B and B2C type customers. Need to handle heavy traffic, be low cost, and scale well.

We are open to either build it on AWS or on Microsoft Azure.

Apologies if I'm leaving out some info. My first post. :) Thanks in advance!

133k views133k

Comments

Kevin

Co-founder at Transloadit

Dec 18, 2020

Review

Hey there, the trick to keeping costs under control is to partition. This means you split up your source files by date, and also query within dates, so that Athena only scans the few files necessary for those dates. I hope that makes sense (and I also hope I understood your question right). This article explains better https://aws.amazon.com/blogs/big-data/analyze-your-amazon-cloudfront-access-logs-at-scale/.

5.08k views5.08k

Comments

Pavithra

Mar 12, 2020

Needs adviceon

Amazon S3

Amazon Athena

Amazon Redshift

Hi all,

Currently, we need to ingest the data from Amazon S3 to DB either Amazon Athena or Amazon Redshift. But the problem with the data is, it is in .PSV (pipe separated values) format and the size is also above 200 GB. The query performance of the timeout in Athena/Redshift is not up to the mark, too slow while compared to Google BigQuery. How would I optimize the performance and query result time? Can anyone please help me out?

522k views522k

Comments

Detailed Comparison

Azure Functions	Amazon Athena
Azure Functions is an event driven, compute-on-demand experience that extends the existing Azure application platform with capabilities to implement code triggered by events occurring in virtually any Azure or 3rd party service as well as on-premises systems.	Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.
Easily schedule event-driven tasks across services;Expose Functions as HTTP API endpoints;Scale Functions based on customer demand;Develop how you want, using a browser-based UI or existing tools;Get continuous deployment, remote debugging, and authentication out of the box	-
Statistics
Stacks 785	Stacks 521
Followers 705	Followers 840
Votes 62	Votes 49
Pros & Cons
Pros 14 Pay only when invoked 11 Great developer experience for C# 9 Multiple languages supported 7 Great debugging support 5 Can be used as lightweight https service Cons 1 Sporadic server & language runtime issues 1 Poor support for Linux environments 1 No persistent (writable) file system available 1 Not suited for long-running applications	Pros 16 Use SQL to analyze CSV files 8 Glue crawlers gives easy Data catalogue 7 Cheap 6 Query all my data without running servers 24x7 4 No data base servers yay
Integrations
Azure DevOps Java Bitbucket Node.js Microsoft Azure GitHub Visual Studio Code JavaScript Azure Cosmos DB C#	Amazon S3 Presto

What are some alternatives to Azure Functions, Amazon Athena?

AWS Lambda

AWS Lambda is a compute service that runs your code in response to events and automatically manages the underlying compute resources for you. You can use AWS Lambda to extend other AWS services with custom logic, or create your own back-end services that operate at AWS scale, performance, and security.

Apache Spark

Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters through YARN or Spark's standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both batch processing (similar to MapReduce) and new workloads like streaming, interactive queries, and machine learning.

Presto

Distributed SQL Query Engine for Big Data

Google Cloud Run

A managed compute platform that enables you to run stateless containers that are invocable via HTTP requests. It's serverless by abstracting away all infrastructure management.

Apache Flink

Apache Flink is an open source system for fast and versatile data analytics in clusters. Flink supports batch and streaming analytics, in one system. Analytical programs can be written in concise and elegant APIs in Java and Scala.

lakeFS

It is an open-source data version control system for data lakes. It provides a “Git for data” platform enabling you to implement best practices from software engineering on your data lake, including branching and merging, CI/CD, and production-like dev/test environments.

Druid

Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power exploratory dashboards in multi-tenant environments. Druid excels as a data warehousing solution for fast aggregate queries on petabyte sized data sets. Druid supports a variety of flexible filters, exact calculations, approximate algorithms, and other useful calculations.

Serverless

Build applications comprised of microservices that run in response to events, auto-scale for you, and only charge you when they run. This lowers the total cost of maintaining your apps, enabling you to build more logic, faster. The Framework uses new event-driven compute services, like AWS Lambda, Google CloudFunctions, and more.

Google Cloud Functions

Construct applications from bite-sized business logic billed to the nearest 100 milliseconds, only while your code is running

Apache Kylin

Apache Kylin™ is an open source Distributed Analytics Engine designed to provide SQL interface and multi-dimensional analysis (OLAP) on Hadoop/Spark supporting extremely large datasets, originally contributed from eBay Inc.

Related Comparisons

Amazon Athena vs Azure Functions: What are the differences?

Key Differences between Amazon Athena and Azure Functions

Purpose and Functionality: Amazon Athena is an interactive query service that allows users to analyze data in Amazon S3 using SQL. It is primarily used for ad-hoc querying and analysis of data. On the other hand, Azure Functions is a serverless computing service that allows developers to run event-driven code in Azure. It enables the development of scalable, event-based applications and offers various triggers and bindings to integrate with other Azure services.
Language Support: Amazon Athena supports SQL as its query language, making it accessible to users familiar with SQL. However, Azure Functions supports a wider range of programming languages including C#, JavaScript, PowerShell, Python, and TypeScript. This gives developers more flexibility in choosing the language they are most comfortable with.
Billing Model: Amazon Athena follows a pay-per-query model, where users are charged based on the amount of data scanned by their queries. This provides cost-effectiveness for occasional and small-scale queries. On the other hand, Azure Functions follows a pay-as-you-go pricing model, where users are charged based on the number of executions and the resources utilized by their functions. This allows users to pay only for the actual usage of their functions.
Event Triggers: Azure Functions supports a wide range of triggers such as HTTP requests, timers, message queues, and database events. These triggers can automatically invoke the functions when specific events occur. In contrast, Amazon Athena does not have built-in event triggers. It relies on users manually invoking queries through the Athena API or the AWS Management Console.
Scalability and Infrastructure: Azure Functions automatically scales based on the number of incoming requests and the configured scale settings. It abstracts away the underlying infrastructure, allowing developers to focus on writing code. Amazon Athena, on the other hand, is built on top of Amazon S3 and uses a distributed query engine to process queries. It scales automatically to handle large datasets but may require users to optimize their queries for better performance.
Integration with Other Services: Azure Functions has tight integration with various Azure services like Azure Storage, Azure Event Hubs, Azure Cosmos DB, and Azure Service Bus. This enables seamless integration and data processing between different services. Amazon Athena can integrate with other AWS services such as AWS Glue, Amazon QuickSight, and Amazon QuickSight to form a comprehensive data analytics stack within the AWS ecosystem.

Amazon Athena vs Azure Functions

Overview

Amazon Athena vs Azure Functions: What are the differences?