Home » Academic Research

John Snow Labs for Academia

You have our full support for using John Snow Labs’ state-of-the-art Medical Language Models, Healthcare NLP & LLM libraries, and the Generative AI Lab for open research & teaching projects.

Free Software Licenses for Academic Research and Teaching

John Snow Labs’ commercial libraries for state-of-the-art medical language models are available under a free license for academic researchers, educators, and students.
This includes over 2,500 pre-trained healthcare-specific LLM & NLP models.
You can get a free personal license if you are doing academic research that will be publicly published under open-access, open-source, and open-data principles.
If you are teaching a course that makes use of the library, you and your students can get a free license for it.
The free license includes the full capabilities of the software, all pre-trained models, and regular updates. Its goal is to enable you to easily reuse, reproduce, and improve production-grade, state-of-the-art NLP in your research & teaching.
Fill in the form to apply for your free license. Please use your university’s email address and briefly explain how you will use the academic license.

Sign up

for a Free Academic License

Medical Language Models

gives you access to state-of-the-art:

Clinical information extraction

train your own or use pre-trained models to extract 400+ entities including clinical facts (symptoms, facts (symptoms, diagnoses, treatments, procedures), drug facts (name, strength, dosage, route, frequency, duration), biomedical terms (organism, tissue, gene, gene product, chemical, …) and much more.

Clinical text summarization

apply healthcare-specific LLMs to summarize discharge summaries, radiology reports, patient reported outcomes and questions, or biomedical research. Translate across language and paraphrase to reduce medical jargon.

Entity resolution and coding

Use pre-trained models or train your own to resolve recognized entities standard medical terminologies including SNOMED-CT, ICD-10-CM, ICD-10-PCS, ICD-O, CPT, LOINC, RxNorm, NDC, MeSH, HPO, and UMLS.

Medical question answering

apply fine-tuned medical LLMs to answer questions about clinical narratives, radiology or pathology reports, clinical trial protocols, or medical research papers. Both open-book and closed-book question answering is supported.

Medical data normalization

normalize medications, lab results, vital signs, and demographic data – to simplify downstream analysis for extracted clinical information.

De-identification

Anonymize either structured tables or unstructured free text including all GDPR and HIPAA-required fields as well as and then either remove, mask, or obfuscate PHI.

Our company is named after Dr. John Snow – the medical doctor who helped stop the outbreak of cholera in 1854 London by analyzing data.

We exist for the very purpose of empowering many more like him in the 21st century.

Sign up

for a Free Academic License