State of the Art Natural Language Processing

John Snow Labs’ Spark NLP is an open source text processing library for Python, Java, and Scala.

It provides production-grade, scalable, and trainable versions of the latest research in
natural language processing.

Unmatched
Speed & Scale

Spark NLP was 80x faster than spaCy to train locally on 2.6MB of data.

Scale to a Spark cluster with zero code changes.

Read more

State of the Art
Accuracy

First production-grade versions of novel deep learning NLP research.

Use pre-trained models to train to fit your data.

Read more

Most Widely Used in
the Enterprise

Widely deployed production-grade codebase.

New releases every 2 weeks since 2017.

Growing community.

Read more

The most widely used NLP library in the Enterprise, by far

Why Spark NLP?

Accuracy

Spark NLP delivered the best performing results on academic peer-reviewed benchmarks.

Scalability

Zero code changes are needed to scale a pipeline to any spark cluster.

Speed

Optimized builds for the latest Intel & Nvidia chips enable the fastest training & tuning of state-of-the-art models.

Out Of The Box Functionality

Entity Recognition
Algorithms
Split Text
  • Sentence Detector
  • Deep Sentence Detector
  • Tokenizer
  • nGram Generator
Understand Grammar
  • Stemmer
  • Lemmatizer
  • Part of Speech Tagger
  • Dependency Parser
Information Extraction
Algorithms
Clean Text
  • Spell Checking
  • Spell Correction
  • Normalizer
  • Stopword Cleaner
Find in Text
  • Text Matcher
  • Regex Matcher
  • Date Matcher
  • Chunker
Sentiment Analysis
Content
Transformers
GloVeELMOBERTALBERTXLNetUSESmall BERTELECTRABioBERTLaBSE
Pre-trained Models
250+
Pretrained
Information Extraction
Content
46 Languages
Pre-trained Pipelines
90+
Pretrained
Trainable & Tunable
Scalable to a Cluster
Fast Inference
Hardware Optimized
Community

Trainable to understand your language

Spark NLP is optimized for training domain-specific NLP models, so you can adapt it to learn the nuances of jargon and documents you must support.

Introducing Spark NLP at Top Level AI Conferences

Strata

Spark NLP: How Roche automates knowledge extraction from pathology and radiology reports

Read More

Strata

Spark NLP in action: Intelligent, high-accuracy fact extraction from long financial documents

Read More

Strata

Spark NLP in action: How SelectData uses AI to better understand home health patients

Read More