Spark NLP logo

Spark NLP

State of the Art Natural Language Processing
+ 1

What is Spark NLP?

It is a Natural Language Processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines that scale easily in a distributed environment. It comes with 160+ pretrained pipelines and models in more than 20+ languages.
Spark NLP is a tool in the NLP / Sentiment Analysis category of a tech stack.
Spark NLP is an open source tool with 3.7K GitHub stars and 703 GitHub forks. Here’s a link to Spark NLP's open source repository on GitHub

Who uses Spark NLP?

5 companies reportedly use Spark NLP in their tech stacks, including Newzera, Ukuli Data, and Multivac DSL.

22 developers on StackShare have stated that they use Spark NLP.

Spark NLP Integrations

Spark NLP's Features

  • Tokenization
  • Stop Words Removal
  • Normalizer
  • Stemmer
  • Lemmatizer
  • NGrams
  • Regex Matching
  • Text Matching
  • Chunking
  • Date Matcher
  • Part-of-speech tagging
  • Sentence Detector
  • Dependency parsing (Labeled/unlabled)
  • Sentiment Detection (ML models)
  • Spell Checker (ML and DL models)
  • Word Embeddings (GloVe and Word2Vec)
  • BERT Embeddings
  • ELMO Embeddings
  • Universal Sentence EncoderSentence Embeddings
  • Chunk Embeddings

Spark NLP Alternatives & Comparisons

What are some alternatives to Spark NLP?
JavaScript is most known as the scripting language for Web pages, but used in many non-browser environments as well such as node.js or Apache CouchDB. It is a prototype-based, multi-paradigm scripting language that is dynamic,and supports object-oriented, imperative, and functional programming styles.
Git is a free and open source distributed version control system designed to handle everything from small to very large projects with speed and efficiency.
GitHub is the best place to share code with friends, co-workers, classmates, and complete strangers. Over three million people use GitHub to build amazing things together.
Python is a general purpose programming language created by Guido Van Rossum. Python is most praised for its elegant syntax and readable code, if you are just beginning your programming career python suits you best.
jQuery is a cross-platform JavaScript library designed to simplify the client-side scripting of HTML.
See all alternatives

Spark NLP's Followers
38 developers follow Spark NLP to keep up with related blogs and decisions.