Skip to main content
was successfully added to your cart.

Frequently Asked Questions

Licensing

Free, forever, unlimited, for personal and commercial use. Spark NLP is released under an Apache 2.0 open-source license – including the pre-trained models and documentation.

Each license includes the software libraries in all supported languages, the pre-trained models that are included with it, premium support, and all updates to the software & models that are released during the subscription period.

Healthcare NLP, Visual NLP, Finance NLP and Legal NLP are licensed as annual subscriptions, payable either once a year in full or each month in advance.

No. There is no limitation imposed on the number of characters, words or documents processed with the free or paid libraries. However, one license only gives you the right to run one Spark job at a given moment.

The software will stop processing documents – for both training and inference. If you choose to buy a license, we will provide you with a new license key. Otherwise, you must uninstall the software. In any case, data you have already processed is yours to keep.

Running the Software

Python, Java, and Scala.

Spark 2.3.x, 2.4.x, 3.0.x, 3.1.x and 3.2.x.

We officially support AWS, Azure, Databricks, Cloudera, Colab, Kaggle and GCP.

Yes. Spark NLP is used heavily in high-compliance industries like healthcare, life science, finance, and insurance where on-premise deployments are common. Most single-machine, Spark, Hadoop, and Kubernetes distributions are supported.

Yes. Make sure to allocate enough memory & compute power for your use case.

Yes. Make sure to allocate enough memory & compute power for your use case.

This depends heavily on your use case. For training custom models based on the BERT family of embeddings, at least 8 cores and 64GB of memory are recommended. For inference, as little as 1 core and 8GB may be enough. Using GPU’s will usually provide faster execution at a higher cost.

Payments

The cost depends on which edition(s) you need (Healthcare, Visual, Finance or/and Legal), the level of support (8x5 or 24x7), and the number of licenses you need. Please email us with those details at sales@johnsnowlabs.com and we’ll reply with an exact quote.

Online bank transfers (ACH or wire), checks, and all major credit cards.

Yes! Please email us to describe your situation and needs.

Privacy

No. You install and run the software on your infrastructure. The software does not “call home” and no data or results are sent to John Snow Labs.

You do. We will never even see them.

This is not a SaaS solution – instead, you run the software on your infrastructure. Nothing ever gets sent to John Snow Labs or another third party. Spark NLP is designed for high-compliance, locked-down environments.

No, after an initial installation & downloading of pre-trained models.

Yes, an internet connection is necessary to establish the validity of your license key when starting your Spark session.

No.

Customization

Yes. Spark NLP is designed to enable you to train & tune your own models for most tasks.

The full list is available here. Expect the list to keep growing over time.

Support

Email support@johnsnowlabs.com, call us at +1-302-786-5227, or start a chat on spark-nlp.slack.com. Paying customers get a private Slack channel, so that you can ask your questions privately.

Same business day 8x5 support is included with all subscriptions. We can also provide 24x7 support for production systems – please email us if you require it.

Yes. The Demos and Notebooks Hub includes links to runnable Google Colab notebooks in Python.