Spark NLP in action: intelligent, high-accuracy fact extraction from long financial documents



Introduction: UiPath is a global software company that develops a platform for robotic process automation. Following its acquisition of both ProcessGold and StepShot in 2019, UiPath has become the first vendor of scale to bring together both process mining and robotic process automation.

Challenge: “Answering questions accurately based on information from financial documents, which can be a hundred or more pages long, is a challenge even for human domain experts. While traditional rule-based or expression-matching techniques work for simple fields in templated documents, it is harder to infer facts based on implied statements, on the absence of certain statements, or on the combination of other facts.

Answering such questions at a very high level of accuracy requires state-of-the-art deep learning techniques applied to NLP.”

Solution: “Spark NLP was used to augment the UiPath smart data extraction platform in order to automatically infer fuzzy, implied, and complex facts from long financial documents.

About Spark NLP:

  • Industrial-grade NLP for the Apache Spark ecosystem
  • Design Goals
    • 1. Performance & Scale
    • 2. Frictionless Reuse
    • 3. Enterprise Grade
  • Built on top of Spark ML APIs
  • Open source, Apache 2.0 licensed
  • Active development & support”
38x faster to train on 100 KB of data
80x faster to train on 2.6 MB of data

“UiPath is excited to support this technology partnership and support a seamless integration of John Snow Labs’ state-of-the-art NLP technology inside UiPath Activities. The joint capability is already providing value to business customers and is broadly applicable.”

Senior Manager for Partnerships and Alliances at UiPath