
Multimodal AI Blog

Build AI Models That Learn from Text, Images, and Audio Together.

Accurate Table Extraction from Documents & Images with Spark OCR

Extracting data formatted as a table (tabular data) is a common task — whether you’re analyzing financial statements, academic research papers, or clinical trial documentation. Table-based information varies heavily in...

Annotation Lab Improves Performance and Layout for OCR tasks

Annotation Lab improves the performance of the Project Setup Page, adds a "View as" option in the Labeling Page, improves the layout of OCR-ed documents, adds the option to stop...

Search, Export, and Labeling of Multi-page PDF Documents in the Annotation Lab

The Annotation Lab supports labeling entities and relationships directly on multi-page PDF documents and, with release 2.5, adds support for search within PDF documents and export of annotated named entities...

Visual NLP – Combining Computer Vision and Text Mining for Intelligent Document Processing

Many businesses depend on paper documents or documents stored as images, such as receipts, manifests, invoices, medical reports, contracts, waivers, leases, forms, and audit records digitized with scanners. Up until...
