JSON vs Protobuf: What are the differences?
Introduction
When choosing a data serialization format for applications, developers often come across JSON and Protobuf. Both JSON (JavaScript Object Notation) and Protobuf (Protocol Buffers) serve the purpose of exchanging data between systems, but they have key differences that distinguish them from each other.
Data Size: One significant difference between JSON and Protobuf is the data size. JSON is text-based and therefore tends to be bulkier compared to Protobuf, which is a binary format. This difference in data size can impact network bandwidth utilization and storage space requirements, especially in situations where large volumes of data need to be transferred quickly and efficiently.
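To make the size difference concrete, here is a small illustration; the Protobuf figure in the comment is an estimate based on its varint wire format, since encoding it for real requires a compiled .proto schema:

import json

# A small record serialized as compact JSON: field names and punctuation
# travel with every message.
record = {"user_id": 123456, "active": True}
as_json = json.dumps(record, separators=(",", ":")).encode("utf-8")
print(len(as_json))  # 32 bytes of text

# The same two fields defined in a .proto schema encode as numbered
# tag + varint pairs -- roughly 6 bytes -- because field names never
# appear on the wire (an estimate, not a measurement).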
Schema Definition: Another key difference between JSON and Protobuf is the way they handle schema definition. JSON is schema-less, meaning that it does not require a predefined schema to serialize or deserialize data. On the other hand, Protobuf relies on a defined schema (a message definition in a .proto file) to serialize and deserialize data. This schema-based approach gives Protobuf a clear contract between services, generated code with compile-time type checking, and better data consistency.
Parsing Performance: JSON and Protobuf also differ in parsing performance. Because JSON is human-readable text that must be tokenized and converted into native values, parsing it is generally slower than decoding Protobuf's compact binary format, which is designed for efficient serialization and deserialization. This difference in parsing performance can be critical in high-throughput or real-time systems where processing speed is a priority.
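The JSON half of this can be measured with nothing but the standard library; the comparison to a Protobuf decoder is left qualitative here because it would need generated message classes:

import json
import timeit

# 1,000 small records rendered as one JSON string.
payload = json.dumps(
    [{"id": i, "name": f"user{i}", "active": i % 2 == 0} for i in range(1000)]
)

# Time 1,000 parses of the text back into Python objects.
seconds = timeit.timeit(lambda: json.loads(payload), number=1000)
print(f"{seconds:.3f}s for 1000 parses")

# A Protobuf decoder for equivalent data skips text tokenization entirely,
# which is why it is typically faster; the actual ratio varies by library and runtime.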
Support for Data Types: JSON has limited support for complex data types, primarily dealing with simple data structures like strings, numbers, arrays, and objects. In contrast, Protobuf provides robust support for defining complex data types, including nested structures, enums, and more. This difference in data type support can make Protobuf a better choice for scenarios involving intricate data models.
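For example, Python values outside JSON's native types have to be flattened by hand before serialization (the Status enum and field names below are made up purely for illustration):

import json
from datetime import datetime
from enum import Enum

class Status(Enum):
    ACTIVE = 1
    SUSPENDED = 2

# JSON natively understands only strings, numbers, booleans, null, arrays,
# and objects, so enums and timestamps must be converted manually.
event = {"user": "alice", "status": Status.ACTIVE, "created": datetime(2024, 1, 1)}
encoded = json.dumps(
    {**event, "status": event["status"].name, "created": event["created"].isoformat()}
)
print(encoded)

# In a .proto schema, an enum and a timestamp field can be declared as
# first-class types and round-trip through serialization without this manual step.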
Compatibility and Versioning: JSON is more forgiving when it comes to changes in data structure or schema, as it allows for flexible evolution of data models. Protobuf, however, enforces stricter schema rules: fields are identified by numbered tags, so evolving a schema safely requires following compatibility conventions (for example, never reusing or renumbering existing fields) to avoid breaking existing implementations. This difference in compatibility and versioning handling can impact the maintainability and extensibility of systems using JSON or Protobuf.
Language Support: JSON has widespread support in various programming languages due to its simplicity and readability. In comparison, while Protobuf has libraries available for multiple languages, it may not be as universally supported as JSON. Developers need to consider the language ecosystem of their project when choosing between JSON and Protobuf to ensure seamless integration and interoperability.
In summary, JSON and Protobuf differ in data size, schema definition, parsing performance, support for data types, compatibility and versioning, and language support, which can influence the choice of serialization format based on specific project requirements.
Hi. Currently, I have a requirement where I have to create a new JSON file based on an input CSV file, validate the generated JSON file, and upload the JSON file into the application (which runs in AWS) using an API. Kindly suggest the best language that can meet the above requirement. I feel Python would be better, but I am not sure how to justify why Python. Can you provide your views on this?
Python is very flexible and definitely up to the job (although, in reality, any language will be able to cope with this task!). Python has some good libraries built in, and also some third-party libraries that will help here:
1. Convert CSV -> JSON
2. Validate against a schema
3. Deploy to AWS
- The built-in json and csv libraries cover the first step, and, depending on the complexity of the CSV file, the conversion is fairly simple:
import csv
import json

# Read every row of the CSV into a list of dictionaries keyed by the header row.
with open("your_input.csv", "r") as f:
    rows = list(csv.DictReader(f))

# Write the rows out as a single JSON array.
with open("your_output.json", "w") as f:
    json.dump(rows, f)
- The validation part is handled nicely by this library: https://pypi.org/project/jsonschema/. It allows you to create a schema and check whether what you have created works for what you want to do. It is based on the JSON Schema standard, allowing annotation and validation of any JSON document.
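A minimal sketch of that validation step, assuming the generated JSON is an array of row objects; the "name" and "age" fields are hypothetical, and csv.DictReader yields every value as a string:

import json
from jsonschema import ValidationError, validate

# Hypothetical schema: every row object must carry "name" and "age" keys.
schema = {
    "type": "array",
    "items": {
        "type": "object",
        "properties": {
            "name": {"type": "string"},
            "age": {"type": "string"},  # CSV values arrive as strings
        },
        "required": ["name", "age"],
    },
}

with open("your_output.json") as f:
    data = json.load(f)

try:
    validate(instance=data, schema=schema)
    print("JSON is valid")
except ValidationError as e:
    print(f"Validation failed: {e.message}")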
- There is an AWS library (boto3) to automate the upload - or in fact do pretty much anything with AWS - from within your codebase: https://aws.amazon.com/sdk-for-python/. It will handle authentication to AWS and uploading/deploying the file to wherever it needs to go.
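If the destination turns out to be an S3 bucket, the upload is a one-liner with boto3; the bucket name and key below are placeholders, and since the question says "using an API" the real target may instead be an HTTP endpoint, in which case a library like requests would do the POST:

import boto3

# Assumes AWS credentials are already configured (environment variables,
# ~/.aws/credentials, or an IAM role on the machine running the script).
s3 = boto3.client("s3")

# "my-app-bucket" and the object key are placeholders for illustration.
s3.upload_file("your_output.json", "my-app-bucket", "uploads/your_output.json")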
A lot depends on the last two pieces, but the converting itself is really pretty neat.
I would use Go. Since CSV files are flat (no hierarchy), you could use the encoding/csv package to read each row, and write out the values as JSON. See https://medium.com/@ankurraina/reading-a-simple-csv-in-go-36d7a269cecd. You just have to figure out in advance what the key is for each row.
This should be pretty doable in any language. Go with whatever you're most familiar with.
That being said, there's a case to be made for using Node.js since it's trivial to convert an object to JSON and vice versa.
Pros of JSON
- Simple
- Widely supported