Visual NLP

Understand Visual Documents with High-Accuracy OCR, Form Summarization, Table Extraction, PDF Parsing, and more.

By all accounts, John Snow Labs has created the most accurate software in history to extract facts from unstructured text.

Healthcare Tech Outlook

Proven Success across

What’s in the box

Trainable & Tunable

Scalable to a Cluster

Fast Inference

Hardware Optimized

Community

Visual NLP in Action

De-identify Images, PDF, and DICOM files

Combine computer vision, OCR, and NLP models to classify documents, extract normalized entities and figures, find signatures on forms, extract data from tables, and de-identify images.

Extract data from images & forms

Extract and normalize specific facts and figures from custom images and forms by training your own models to learn where the facts you need appear in the image, which words they sit next to, and how they are formatted.

Extract whole tables

Find tables in images, visually identify rows and columns, and extract data from cells into data frames. Turn scans from financial disclosures, academic papers, lab results and more into usable data.
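As a rough illustration of the last step (turning recognized cells into rows and columns), the sketch below groups word boxes by their coordinates. It is not the Visual NLP implementation, which identifies rows and columns visually; the function name `boxes_to_table`, the `(x, y, text)` input format, and the tolerance values are all assumptions made for this example.

```python
def boxes_to_table(words, row_tol=10, col_tol=30):
    """Arrange OCR word boxes into a grid (list of rows of cell strings).

    `words` is a list of (x, y, text) tuples, where (x, y) is the top-left
    corner of each recognized word in pixels (hypothetical input format).
    """
    def cluster(values, tol):
        # Start a new cluster whenever a value lies more than `tol`
        # past the first value of the current cluster.
        centers = []
        for v in sorted(values):
            if not centers or v - centers[-1] > tol:
                centers.append(v)
        return centers

    def nearest(centers, v):
        # Index of the cluster whose anchor is closest to `v`.
        return min(range(len(centers)), key=lambda i: abs(centers[i] - v))

    row_centers = cluster([y for _, y, _ in words], row_tol)
    col_centers = cluster([x for x, _, _ in words], col_tol)

    grid = [["" for _ in col_centers] for _ in row_centers]
    for x, y, text in words:
        r, c = nearest(row_centers, y), nearest(col_centers, x)
        # Words that fall into the same cell are joined with a space.
        grid[r][c] = (grid[r][c] + " " + text).strip()
    return grid
```

The resulting list of rows can be handed to any data frame library. Real scans usually need tolerances tuned to the page resolution.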

Recognize entities in scanned PDFs

An end-to-end example of a standard NER pipeline: import scanned images from cloud storage, preprocess them to improve their quality, recognize the text with Spark OCR, correct spelling mistakes to improve the OCR output, and finally run NER to extract entities.

Correct skewness in scanned documents

Correcting the skew of your scanned documents greatly improves OCR results. Spark OCR is the only library that allows you to fine-tune the image preprocessing for excellent OCR results.
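To make the idea of skew correction concrete, here is a minimal sketch of one common approach, the projection-profile method: try a range of rotation angles and keep the one that packs the dark pixels into the fewest, sharpest rows. This is a generic technique shown for illustration, not a description of how Spark OCR estimates skew; the function name and parameters are assumptions.

```python
import math

def estimate_skew_degrees(pixels, max_angle=10.0, step=0.5):
    """Return the rotation (in degrees) that best aligns text rows.

    `pixels` is a list of (x, y) coordinates of dark (foreground) pixels.
    """
    best_angle, best_score = 0.0, -1.0
    steps = int(round(2 * max_angle / step))
    for i in range(steps + 1):
        angle = -max_angle + i * step
        theta = math.radians(angle)
        sin_t, cos_t = math.sin(theta), math.cos(theta)
        # Rotate every pixel by `angle` and bin it by its new row.
        profile = {}
        for x, y in pixels:
            row = round(x * sin_t + y * cos_t)
            profile[row] = profile.get(row, 0) + 1
        # Well-aligned text rows give a peaky profile, which maximizes
        # the sum of squared bin counts.
        score = sum(count * count for count in profile.values())
        if score > best_score:
            best_angle, best_score = angle, score
    return best_angle
```

The returned angle is the rotation to apply to the page under this coordinate convention. A smaller `step` gives a finer estimate at the cost of more passes over the pixels.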

Recognize text in natural scenes

Using image segmentation and NLP preprocessing techniques, Spark OCR recognizes and extracts text from natural scenes.

Remove background noise from scanned documents

Removing background noise from a scanned document greatly improves OCR results. Visual NLP is the only OCR tool that allows you to fine-tune the image preprocessing for excellent OCR results.
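One simple form of noise removal is a median filter, which replaces each pixel with the median of its neighbourhood so that isolated specks vanish while solid strokes survive. The sketch below is a generic 3x3 median filter on a grayscale image stored as a list of rows; it is meant only to illustrate the kind of preprocessing involved and is not the Visual NLP implementation. The function name is an assumption.

```python
def median_filter(image):
    """Apply a 3x3 median filter to a grayscale image (list of rows)."""
    height, width = len(image), len(image[0])
    out = [row[:] for row in image]
    for r in range(height):
        for c in range(width):
            # Collect the pixel and its neighbours, clipping at the borders.
            window = sorted(
                image[rr][cc]
                for rr in range(max(0, r - 1), min(height, r + 2))
                for cc in range(max(0, c - 1), min(width, c + 2))
            )
            # Take the middle element (upper middle for even-sized windows).
            out[r][c] = window[len(window) // 2]
    return out
```

A single dark speck on a white page is removed, while a stroke several pixels wide is left untouched. Larger windows remove bigger specks but also start to erode thin characters.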

DICOM to Text

Recognize text in DICOM files. This feature extracts both the text on the image and the text from the metadata.

Extract Signatures & Dates from Signed Forms

Detect signatures in image-based documents.

Learn More from Webinars

Accurate Table Extraction from Documents & Images with Spark OCR

Accurate de-identification, obfuscation, and editing of scanned medical documents and images

Visual Document Understanding with Multi-Modal Image & Text Mining in Spark OCR

Frequently Asked Questions

No. However, even though OCR can be considered a separate task from NLP, many NLP pipelines naturally start with an OCR stage, and many Visual NLP pipelines return textual results.

These are complementary technologies, and John Snow Labs' Visual NLP integrates seamlessly with all other NLP products.

Visual NLP delivers visually enriched versions of NLP tasks such as Visual NER, Visual Document Classification, and Visual Question Answering.

The most important are: Text Detection and Extraction, Layout Analysis, Visual Document NER, Visual Document Classification, Visual Question Answering, Table Detection & Extraction, De-identification, and DICOM Processing.

Input document quality can be a limiting factor. Extremely distorted or damaged inputs can lower the quality of the results.

No. But we provide trial periods and convenient licensing offerings for your team and organization.

Visual NLP is designed to deliver state-of-the-art results with the best runtime performance. It is a production-grade, scalable, and secure product.

Proven Success across

NLP for Finance – Automated Invoice Classification for Submission Compliance

Interpreting millions of patient stories with deep learned OCR and NLP

A unified CV, OCR, and NLP approach for scalable document understanding at DocuSign
