Home » Case study Spark NLP in action: intelligent, high-accuracy fact extraction from long financial documents (UIPath)

UIPath

Spark NLP in action: intelligent, high-accuracy fact extraction from long financial documents

Read the full case study

INDUSTRY: Finance

Introduction: UiPath is a global software company that develops a platform for robotic process automation. Following its acquisition of both ProcessGold and StepShot in 2019, UiPath has become the first vendor of scale to bring together both process mining and robotic process automation.

Challenge: “Answering questions accurately based on information from financial documents, which can be a hundred or more pages long, is a challenge even for human domain experts. While traditional rule-based or expression-matching techniques work for simple fields in templated documents, it is harder to infer facts based on implied statements, on the absence of certain statements, or on the combination of other facts.

Answering such questions at a very high level of accuracy requires state-of-the-art deep learning techniques applied to NLP.”

Solution: “Spark NLP was used to augment the UiPath smart data extraction platform in order to automatically infer fuzzy, implied, and complex facts from long financial documents. About Spark NLP:

Industrial Grade NLP for Apache Spark ecosystem
Design Goals
- 1. Performance & Scale
- 2. Frictionless Reuse
- 3. Enterprise Grade
Built on top of Spark ML API’s
Open Source Apache 2.0 licensed
Active development & support”

38x faster to train on 100kb of data

80 times faster to train on 2.6mb of data

“UiPath is excited to support this technology partnership and support a seamless integration of John Snow Labs’ state-of-the-art NLP technology inside UiPath Activities. The joint capability is already providing value to business customers and is broadly applicable.”

Senior Manager for Partnerships and Alliances at UiPath