Gensim vs SpaCy: What are the differences?

What is Gensim? Gensim is a Python library for topic modelling, document indexing, and similarity retrieval with large corpora. Its target audience is the natural language processing (NLP) and information retrieval (IR) community.
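To make "document indexing and similarity retrieval" concrete, here is a minimal sketch of the idea using only the Python standard library. It does not use Gensim's API: the function names (`tokenize`, `build_index`, `most_similar`), the whitespace tokenizer, the smoothed TF-IDF weighting, and the three sample documents are all assumptions chosen for illustration. A library such as Gensim is designed to handle the same kind of task on corpora too large for a toy like this.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def to_vector(tokens, idf):
    """Build a sparse TF-IDF vector; terms unseen at indexing time are dropped."""
    counts = Counter(tokens)
    return {term: count * idf[term] for term, count in counts.items() if term in idf}


def build_index(documents):
    """Index the documents: return the IDF weights and one TF-IDF vector per document."""
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: the number of documents each term appears in.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    # Smoothed IDF (an assumed weighting, picked for simplicity).
    idf = {term: math.log((1 + n_docs) / (1 + df)) + 1.0 for term, df in doc_freq.items()}
    vectors = [to_vector(tokens, idf) for tokens in tokenized]
    return idf, vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(weight * weight for weight in a.values()))
    norm_b = math.sqrt(sum(weight * weight for weight in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, documents, idf, vectors):
    """Rank all indexed documents by similarity to the query, best match first."""
    query_vector = to_vector(tokenize(query), idf)
    scored = [(cosine(query_vector, vector), i) for i, vector in enumerate(vectors)]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(documents[i], score) for score, i in scored]


# Hypothetical sample corpus, for illustration only.
documents = [
    "topic models describe document collections",
    "word vectors capture word meaning",
    "similarity retrieval ranks documents against a query",
]
idf, vectors = build_index(documents)
ranked = most_similar("word meaning", documents, idf, vectors)
```

Here `ranked` lists the documents in descending order of similarity to the query. The second document comes first because it is the only one that shares terms with "word meaning".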

What is SpaCy? SpaCy is an industrial-strength library for advanced natural language processing (NLP) in Python and Cython. It is built on recent research and was designed from day one to be used in real products. It comes with pre-trained statistical models and word vectors, and currently supports tokenization for 49+ languages.

Gensim and SpaCy both belong to the "NLP / Sentiment Analysis" category of the tech stack.

Gensim is an open source tool with 9.65K GitHub stars and 3.52K GitHub forks.

According to the StackShare community, SpaCy has broader approval: it is mentioned in 14 company stacks and 11 developer stacks, whereas Gensim is listed in 3 company stacks and 5 developer stacks.

Pros of Gensim
    • None listed

Pros of SpaCy
    • Speed (10 upvotes)
    • No vendor lock-in (1 upvote)

Cons of Gensim
    • None listed

Cons of SpaCy
    • Requires creating a training set and managing training (1 upvote)

What are some alternatives to Gensim and SpaCy?

NLTK
NLTK is a suite of libraries and programs for symbolic and statistical natural language processing for English, written in Python.

Keras
Keras is a deep learning library for Python that supports convnets, recurrent neural networks, and more. It runs on TensorFlow or Theano. https://keras.io/

FastText
FastText is a free, open-source, lightweight library for learning text representations and text classifiers. It works on standard, generic hardware, and models can later be reduced in size to fit even on mobile devices.

TensorFlow
TensorFlow is an open source software library for numerical computation using data flow graphs. Nodes in the graph represent mathematical operations, while the graph edges represent the multidimensional data arrays (tensors) communicated between them. The flexible architecture allows you to deploy computation to one or more CPUs or GPUs in a desktop, server, or mobile device with a single API.

rasa NLU
rasa NLU (Natural Language Understanding) is a tool for intent classification and entity extraction. You can think of rasa NLU as a set of high-level APIs for building your own language parser using existing NLP and ML libraries.