
Content by Mykola Melnyk

Mykola Melnyk is a senior Scala, Python, and Spark software engineer with 15 years of industry experience. He has led teams and projects building machine learning and big data solutions in a variety of industries – and is currently the lead developer of the Spark OCR library at John Snow Labs.

Blog

How to detect signature in image-based documents
For document comprehension pipelines in healthcare and finance, we sometimes need to detect the signature of the document or...

Motivation: Spark OCR already contains an ImageToText transformer for recognising text in an image. It works fine for documents in general, but needs custom preprocessing to recognise text contained on...

Converting tables in scanned documents & images into structured data
Motivation: Extracting data formatted as a table is a common task - whether you’re analyzing financial statements, academic research papers,...

The Transformer architecture in NLP has truly changed the way we analyze text. NLP models are great at processing digital text, but many real-world applications use documents with more complex...

Our expert data science team worked around the clock to deliver the major release of Spark OCR 3.0 at the same time as our Spark NLP for Healthcare 3.0. Spark...