
Multimodal AI Blog

Build AI Models That Learn from Text, Images, and Audio Together.

Converting tables in scanned documents & images into structured data
Motivation: Extracting data formatted as a table is a common task - whether you’re analyzing financial statements, academic research papers,...

The Transformer architecture in NLP has truly changed the way we analyze text. NLP models are great at processing digital text, but many real-world applications use documents with more complex...

Our expert data science team worked around the clock to deliver the major release of Spark OCR 3.0 at the same time as our Spark NLP for Healthcare 3.0. Spark...