How to automatically identify different languages using out of the box pretrained models with Spark NLP in Python. Language detection is the process of automatically identifying the language of a...
De-identification is detecting privacy-related entities in text, such as person, organization names, emails, and other contact data, and masking them with different techniques. This task, also called anonymization or redaction, can help you:...
See how Bert-based models in Spark NLP can effortlessly resolve co-reference in your text data Coreference resolution is the task of identifying and linking all expressions within a text that...
Using Spark NLP in Python to identify named entities in texts at scale. Named Entity Recognition with Python TL;DR: Named Entity Recognition (NER) is a Natural Language Processing (NLP) technique...
Using machine learning and regex patterns to identify and extract date information in Spark NLP TL; DR: Dates extraction from a text is a common Natural Language Processing (NLP) task...