AI Platform As A Service

Spark OCR

High-accuracy text recognition for real-world noisy images

Try Free

By all accounts, John Snow Labs has created the most accurate software in history to extract facts from unstructured text.

Healthcare Tech Outlook

What’s in the box

Read
AI in Medical Field

Text or PDF

AI in Medical Field

Scanned PDF

AI in Medical Field

Image

AI in Medical Field

DICOM

Transform
AI in Medical Field

Binarizer

AI in Medical Field

Adaptive Tresholding

AI in Medical Field

Erosion

AI in Medical Field

Layout analyzer

AI in Medical Field

Skew corrections

AI in Medical Field

Scaler

Enriched Healthcare Data

Adaptive scaler

Enriched Healthcare Data

Split Regions

Enriched Healthcare Data

Noise Scorer

Enriched Healthcare Data

Remove objects

AI in Healthcare

Morphology opening

AI in Healthcare

Cropper

Annotate
AI in Healthcare

Extract text from images

AI in Healthcare

Extract data from tables

AI in Healthcare

Entity Recognition

AI in Healthcare

De-identification

Write
AI & NLP in Healthcare

Structured data

AI & NLP in Healthcare

Highlighted entities

AI & NLP in Healthcare

De-identified text, PDF or DICOM

AI & NLP in Healthcare

Images & Regions

Trainable & Tunable
Healthcare Data Market
Scalable to a Cluster
Healthcare Data Market
Fast Inference
Healthcare Data Market
Hardware Optimized
Healthcare Data Market
Healthcare Data Market
Community
Healthcare Data Market

Spark OCR in Action

AI Business Solutions
Recognize entities
in scanned PDFs

End-to-end example of regular NER pipeline: import scanned images from cloud storage, preprocess them for improving their quality, recognize text using Spark OCR, correct the spelling mistakes for improving OCR results and finally run NER for extracting entities.

AI in Medical Field
AI & NLP in Healthcare
AI & NLP in Healthcare
AI Business Solutions
Correct skewness in
scanned documents

Correct the skewness of your scanned documents will highly improve the results of the OCR. Spark OCR is the only library that allows you to finetune the image preprocessing for excellent OCR results.

AI in Medical Field
AI in Healthcare
Recognize text in natural scenes

By using image segmentation and preprocessing techniques Spark OCR recognizes and extracts text from natural scenes.

AI in Medical Field
AI in Healthcare
AI in Healthcare
AI in Healthcare
Remove background noise from scanned documents

Removing the background noise in a scanned document will highly improve the results of the OCR. Spark OCR is the only library that allows you to finetune the image preprocessing for excellent OCR results.

AI in Medical Field
Advanced Analytics And AI
DICOM to Text

Recognize text from DICOM format documents. This feature explores both the text on the image and the text from the metadata file.

AI in Medical Field
Advanced Analytics And AI

Optimized for Accuracy

AI in Healthcare
OCR Software F1 Score

Combine image transformers that optimize accuracy for your image sources. Here are the F1 scores on a tuned Spark OCR pipeline

Advanced Analytics And AI

Proven customer success

INTERPRETING MILLIONS OF PATIENT STORIES WITH DEEP LEARNED OCR AND NLP

Stacy Ashworth
Chief Clinical Officer, SelectData
Alberto Andreotti
Data Scientist, John Snow labs

Read More