was successfully added to your cart.

Generative AI Lab

Implement Human-in-the-loop Workflows
to build Regulatory-Grade AI Faster
on a No-Code, Enterprise-Grade platform

Speed up data labeling, train, tune, test and improve your models — all within one unified, scalable, and HIPAA compliant platform.

Install on:
Request a License
Install on:

Powering the Future of AI with 7.5+ Million Expert-Led Annotation Hours – Trusted by 500+ Leading Pharma and Healthcare Organizations

Build Regulatory Grade Human-in-the-Loop Workflows

High Throughput Document Annotation

Eliminate manual spreadsheets and annotate thousands of documents daily with built-in Quality Assurance.

  • High Productivity UI
  • Shareable Guidelines & Reviewer Comments
  • Consensus Analysis to Reach Agreement Fast

Jumpstart your Annotations with AI-Powered Labeling

Speed up annotation without compromising quality — whether you’re labeling medical records, coding diagnoses, or preparing training data.

  • AI-Powered Labeling for automatic text pre-labeling, minimizing initial setup.
  • Integrate with your custom workflows via API – Push and pull data from EHRs, data lakes, or ML pipelines.

Train, Tune, Test and Automatically Improve your Models

Fine-tune models based on domain-specific data — no ML engineering required.

  • Train-as-you-go with Active Learning
  • Compare Model Performance with Built-In Metrics
  • Transfer Learning from Available Models
  • Test your models and automatically improve them

De-Identify Patient Data for Research & Collaboration

Remove PHI from medical records by combining automated detection with expert review to meet regulatory standards.

  • Obfuscation or Masking
  • Entity Level Configuration
  • Consistent De-Identification Throughout a Task
  • Human-in-the-Loop Validation

Manage Models, Prompts and Rules

Access ready-to-go NLP models and pipelines to reduce time-to-deploy.

  • Search and Filter by Task or Domain
  • Private Hub of Models, Rules and Prompts for Your Team
  • Experiment with your data in a Live Playground

Leverage LLMs to bootstrap annotations

Use LLMs to experiment and prepare training examples for classification, NER, or relation detection. Includes:

  • Zero-Shot Prompts support classification, NER, and relation extraction in your environment.
  • OpenAI Integration for text classification and NER with secure data transfer.

Split Long Documents into Sections for More Precise Labeling

Focus on the sections you need to analyze and ignore the noise in your long documents.

  • Flexible splitting – Break documents into sentences, paragraphs, or pages
  • Adaptive taxonomy – Customize your taxonomy to match the specific details of each section

Regulatory Grade Data Curation Processes

Support complex workflows with transparency and auditability.

  • Custom Review Workflows & Access Control
  • Track Changes with Versioning
  • Monitor Quality Metrics and Team Output

Manage Multiple Projects and Teams

Support complex workflows with transparency and auditability.

  • Advanced Analytics
  • Guidelines for consistency
  • Custom Workflows
  • Comments and feedback

Enterprise-Grade Security

Trusted by healthcare, finance, and legal organizations.

  • Role-Based Access Control,
  • Strong Authentication: MFA, AD/LDAP
  • Cloud, On-Prem, or Air-Gapped Deployments

What Can You Annotate?

Automatically extract insights from clinical notes, unstructured EHR data, medical literature, and patient documentation.

Convert scanned medical reports, lab results, discharge summaries, or research papers into structured, actionable data.

Annotate medical imaging like X-rays, MRIs, CT scans, and pathology slides for diagnostic support.

Label entities on HTML content, rate, and compare LLM responses saved as HTML with references and links.

Transcribe doctor-patient conversations, medical dictation, and telemedicine consultations into text.

Analyze surgical procedures, patient examinations, and medical training videos for clinical insights.

Built for Scalable Real-World Projects with High Regulatory Requirements

Protect sensitive data and meet strict regulatory standards with scalable, no-code solutions, ensuring cost efficiency and trusted outcomes for healthcare and enterprise needs.

Validate Data with
Expert Oversight

Achieve high data accuracy through rigorous Human-in-the-Loop validation

Learn More
Air-Gapped
Deployments

Deploy HIPAA-compliant software that analyzes your data in isolation

Install Software
Real-Time Control & Security

Precise control over data access and user permissions with full audit trail

See How It Works
Enterprise-Grade Data Protection

Robust encryption & real-time events monitoring dashboards

Consult an Expert

Powered by Top Tech Partners

Partnering with AWS, Azure, and Oracle ensures Kubernetes-based scalability for millions of annotations daily

Master AI Skills John Snow Labs Certifications

Enhance your team’s AI proficiency with hands-on expertise in text annotation, Spark NLP, and healthcare-specific NLP through live workshops and certification exams, tailored to your preferred track

Awarded for Pioneering Healthcare AI

With more than 40 awards and 10 years of innovation, John Snow Labs is recognized as a leader in no-code, privacy-first AI powered solutions for healthcare.

2025
Data Partner Award

2024
Data Partner Award
AI Breakthrough Award

2024
Data Partner Award

2024
Corporate America Today
Corporate America

Watch & Learn

Explore our Generative AI Lab tutorial videos, client case studies, and solutions. Learn how to leverage advanced AI for data annotation, model training, and compliance to get proven results

Identifying opioid-related adverse events from unstructured text in electronic health records using rule-based algorithms and deep learning methods

Frequently Asked Questions

To mention the most important: Text Detection and Extraction, Layout Analysis, Visual Document NER, Visual Document Classification, Visual Question Answering, Table Detection & Extraction, De-identification, Dicom Processing.

Input document quality can be a limiting factor. Extremely distorted or damaged inputs can lower the final quality of the results obtained.

No. But we provide trial periods and convenient licensing offerings for your team and organization.

Security, Compliance, and Scalability

Generative AI Lab supports HIPAA compliance with air-gapped or on-premise deployments, zero data sharing, full audit trails, and Human-in-the-Loop (HITL) workflows for expert validation. It offers enterprise-grade security with role-based access control, multi-factor authentication (MFA), tamper-proof audit logs, and identity provider integration, ensuring data privacy for healthcare, legal, and finance sectors.

Yes, Generative AI Lab supports annotation of sensitive and proprietary data in air-gapped or on-premise deployments, ensuring zero data sharing with John Snow Labs or third parties.

The platform ensures accuracy and reliability through Human-in-the-Loop (HITL) workflows with expert oversight, pre-trained models, quality checks, and full audit trails, delivering consistent outcomes for healthcare and research tasks.

Generative AI Lab eliminates the need for data science expertise to deploy, test, train, and tune AI models through its no-code interface. The platform provides immediate access through subscription-based configuration without requiring custom development or IT infrastructure setup.

Cost optimization is achieved through two pricing models: pay-as-you-go billing that charges only for active feature usage and duration, and on-premise deployment options designed for enterprise teams requiring continuous annotation workflows.

The platform's Kubernetes-based auto-scaling architecture automatically adjusts computational resources based on demand, supporting concurrent multi-user access and processing high numbers of documents per day while maintaining consistent performance metrics across varying workloads.

The platform provides a high-productivity UI with keyboard shortcuts, pre-annotations, and AI-assisted labelling, plus a QuickStart guide and video tutorials for immediate use.

Licensing

Generative AI Lab is offered as a pay-as-you-go solution on cloud marketplaces. The software subscription includes:

  • GPU support for enhanced speed of pre-annotation and training,
  • Visual Document Understanding features as well as support for training Visual Document Classification and Visual NER models,
  • Preannotation with LLM prompts and Zero-Shot prompts,
  • 2,000+ Healthcare tuned embeddings and AI models for classification, entity extraction, entity resolution, relation extraction, and assertion status detection,
  • Preannotation via rules, models, or prompts,
  • Premium support.

No. There is no limitation imposed on the number of projects, users, or documents that can be annotated within the Generative AI Lab. No limitation on the number of models, prompts or rules you can define, test, train or tune with the Generative AI Lab.

Also, there is no limitation on the number of pre-annotations you run or on the number of models you can train.

No, installation through the marketplaces will generate a cost, and using an on-premise deployment will require a license key. Contact us at support@johnsnowlabs.com for on-premise deployments.

Installation instructions are available here: https://nlp.johnsnowlabs.com/docs/en/alab/install

Yes. The Generative AI Lab will replace the NLP Lab products on the AWS and Azure Marketplaces. You can continue to use your existing subscription until the end of 2024, when we will end support for these products in these marketplaces.

Running the Software

We currently offer the product on AWS Marketplace and Azure Marketplace.

Yes. The Generative AI Lab can be used in high-compliance industries like healthcare, life science, finance, and insurance where on-premise deployments are common.

Most single-machine, and Kubernetes distributions are supported.

Yes. Make sure to allocate enough memory & compute power for your use case.

This depends heavily on your use case. The minimal required configuration for on-premise deployments is an 8-Core CPU, 32GB RAM of memory, and 512 GB of SSD storage.

The recommended configuration to support model training and AI-assisted pre-annotations for a team building or validating text models is a 16-Core CPU, 64 GB of memory, and 512 GB of SSD storage. The recommended configuration for teams using Document Understanding features and Visual model training is a 4-GPU instance with 48 CPU cores and 192 GB RAM, equivalent to g4dn.12xlarge AWS instances.

Payments

The software price is calculated based on usage and on the type of server where it is deployed.

Usage of Visual Document Understanding and of Healthcare features are charged based on consumption per vCPU per hour.

Charges are reflected on your AWS or Azure bill, and billed through your cloud provider.

Yes! Please email us to describe your situation and needs.

Privacy

No. The software is designed to be installed and operated entirely within your own infrastructure. It is built with privacy and data sovereignty in mind, ensuring that it does not transmit any data or results outside of your controlled environment.

This architecture guarantees that your data remains within your jurisdiction, providing you with full control over its security and privacy.

You do. We will never even see them.

Our software is engineered specifically for environments that require strict compliance and robust security measures. It operates directly within your infrastructure, ensuring that all data processing occurs locally.

This means your data, including any Protected Health Information (PHI) or Personally Identifiable Information (PII), remains within your control at all times and is never sent to John Snow Labs.

We also provide the option to integrate with third-party Large Language Model (LLM) services, such as OpenAI, to leverage features like prompt-based pre-annotation and synthetic data generation. Should you decide to utilize these functionalities, the responsibility for implementing appropriate safeguards to securely and privately share PHI data with these external services rests with you.

It is crucial to ensure that any data sharing complies with your organization's privacy policies and relevant regulatory requirements to maintain the confidentiality and integrity of sensitive information.

Generative AI Lab does necessitate an active Internet connection for its operation. This requirement serves two primary purposes:

  • Metering Usage Collection: An Internet connection is essential for transmitting usage data. This allows for accurate metering of your usage of the platform, ensuring that you are billed correctly based on your actual usage.
  • Model and Pipeline Downloads: In order to utilize the advanced features of Generative AI Lab, such as pre-annotating documents, an Internet connection is required to download the necessary models and pipelines from the NLP Models Hub. This ensures that you have access to the latest and most efficient tools for your data processing needs.

We understand the importance of consistent and reliable access to these resources and functionalities, and an Internet connection ensures that Generative AI Lab can deliver its full capabilities to enhance your projects.

Support
Email support@johnsnowlabs.com, call us at +1-302-786-5227, or start a chat on spark-nlp.slack.com. Paying customers get a private Slack channel, so that you can ask your questions privately.

Same business day 8x5 support is included with all subscriptions. We can also provide 24x7 support for production systems - please email us if you require it.

Resources & Guides

Quick Start Guide

Quickly set up Generative AI Lab

View Guide

On-Prem Installation

Deploy securely on your infrastructure

Request Installation

Tutorials

Master Generative AI Lab step-by-step

View Tutorials
Schedule a call

Available on AWS and Azure Marketplaces, or for on-premise deployment