Spark NLP in Action

Spark NLP – English: Analyze Non-English Text & Documents
PDF to Text (Non-English Text)
Extract non-English text from generated (selectable-text) PDF documents while keeping the original structure of the document, using our out-of-the-box Spark OCR library.
Image to Text (Non-English Text)
Recognize non-English text in images and scanned PDF documents by using our out-of-the-box Spark OCR library.
DOCX to Text (Non-English Text)
Extract non-English text from Word documents by using our out-of-the-box Spark OCR library.