Amazon Rekognition vs Google Cloud Vision API

Need advice about which tool to choose?Ask the StackShare community!

Amazon Rekognition

+ 1
Google Cloud Vision API

+ 1
Add tool

Amazon Rekognition vs Google Cloud Vision API: What are the differences?


Amazon Rekognition and Google Cloud Vision API are two popular computer vision services that provide image and video analysis capabilities. While both services offer similar functionalities, there are several key differences between them. This article aims to highlight these differences in order to help users make an informed decision when choosing between the two.

  1. Pricing model: Amazon Rekognition and Google Cloud Vision API have different pricing models. Amazon Rekognition charges users based on the number of API calls, the amount of data processed, and the storage used. On the other hand, Google Cloud Vision API has a tiered pricing structure that takes into account the number of features requested, such as label detection or face detection.

  2. Customization options: Amazon Rekognition allows users to create custom models based on their specific use cases. This feature enables users to train the system to recognize specific objects or entities that are relevant to their applications. In contrast, Google Cloud Vision API does not currently offer custom model training, limiting the level of customization that users can achieve.

  3. Supported platforms: While both services can be used in various programming languages and platforms, Amazon Rekognition provides SDKs (Software Development Kits) for a wider range of platforms, including mobile platforms like iOS and Android. Google Cloud Vision API, on the other hand, has SDKs available for popular programming languages but does not have dedicated SDKs for mobile platforms at the time of writing.

  4. Integration with other services: Amazon Rekognition seamlessly integrates with other AWS (Amazon Web Services) services, such as Amazon S3 (Simple Storage Service) for storing and retrieving images and videos. It also integrates well with Amazon Kinesis Video Streams for real-time streaming analysis. In comparison, Google Cloud Vision API integrates with other Google Cloud Platform services, such as Google Cloud Storage for image storage and Google Cloud Pub/Sub for real-time messaging.

  5. Supported image formats: Amazon Rekognition supports a wide range of image formats, including JPEG, PNG, BMP, and GIF, allowing users to analyze images in different formats. In contrast, Google Cloud Vision API primarily supports JPEG and PNG formats, limiting the types of images that can be processed.

  6. Text extraction capabilities: When it comes to text extraction from images, Amazon Rekognition provides more advanced capabilities. It can detect text in images and also extract text embedded in the image itself, such as text within signs or labels. Google Cloud Vision API, on the other hand, focuses more on general text detection rather than extracting text from specific image elements.

In summary, Amazon Rekognition and Google Cloud Vision API differ in terms of pricing model, customization options, supported platforms, integration with other services, supported image formats, and text extraction capabilities. These differences highlight the unique strengths of each service, allowing users to choose the one that best aligns with their specific requirements.

Decisions about Amazon Rekognition and Google Cloud Vision API
Vladyslav Holubiev
Sr. Directory of Technology at Shelf · | 1 upvote · 47.3K views

AWS Rekognition has an OCR feature but can recognize only up to 50 words per image, which is a deal-breaker for us. (see my tweet).

Also, we discovered fantastic speed and quality improvements in the 4.x versions of Tesseract. Meanwhile, the quality of AWS Rekognition's OCR remains to be mediocre in comparison.

We run Tesseract serverlessly in AWS Lambda via aws-lambda-tesseract library that we made open-source.

See more
Get Advice from developers at your company using StackShare Enterprise. Sign up for StackShare Enterprise.
Learn More
Pros of Amazon Rekognition
Pros of Google Cloud Vision API
  • 4
    Integrate easily with AWS
  • 9
    Image Recognition
  • 7
    Built by Google

Sign up to add or upvote prosMake informed product decisions

Cons of Amazon Rekognition
Cons of Google Cloud Vision API
  • 1
    Be the first to leave a con

    Sign up to add or upvote consMake informed product decisions

    What is Amazon Rekognition?

    Amazon Rekognition is a service that makes it easy to add image analysis to your applications. With Rekognition, you can detect objects, scenes, and faces in images. You can also search and compare faces. Rekognition’s API enables you to quickly add sophisticated deep learning-based visual search and image classification to your applications.

    What is Google Cloud Vision API?

    Google Cloud Vision API enables developers to understand the content of an image by encapsulating powerful machine learning models in an easy to use REST API.

    Need advice about which tool to choose?Ask the StackShare community!

    What companies use Amazon Rekognition?
    What companies use Google Cloud Vision API?
    See which teams inside your own company are using Amazon Rekognition or Google Cloud Vision API.
    Sign up for StackShare EnterpriseLearn More

    Sign up to get full access to all the companiesMake informed product decisions

    What tools integrate with Amazon Rekognition?
    What tools integrate with Google Cloud Vision API?
    What are some alternatives to Amazon Rekognition and Google Cloud Vision API?
    TensorFlow is an open source software library for numerical computation using data flow graphs. Nodes in the graph represent mathematical operations, while the graph edges represent the multidimensional data arrays (tensors) communicated between them. The flexible architecture allows you to deploy computation to one or more CPUs or GPUs in a desktop, server, or mobile device with a single API.
    OpenCV was designed for computational efficiency and with a strong focus on real-time applications. Written in optimized C/C++, the library can take advantage of multi-core processing. Enabled with OpenCL, it can take advantage of the hardware acceleration of the underlying heterogeneous compute platform.
    JavaScript is most known as the scripting language for Web pages, but used in many non-browser environments as well such as node.js or Apache CouchDB. It is a prototype-based, multi-paradigm scripting language that is dynamic,and supports object-oriented, imperative, and functional programming styles.
    Git is a free and open source distributed version control system designed to handle everything from small to very large projects with speed and efficiency.
    GitHub is the best place to share code with friends, co-workers, classmates, and complete strangers. Over three million people use GitHub to build amazing things together.
    See all alternatives