How it works

1

Find

Search 2,000+ data sets, read descriptions, browse the schemas, and view sample data

2

Subscribe

The commercial license for the entire catalog is provided as an annual subscription, which includes all updates, support and integrations

3

Analyze

Download the full data sets or use one of 26 prebuilt integrations for analysts & data scientists

Frequently Asked Questions

1. General Questions

Data coverage

Through the Data Market, John Snow Labs offers a wide range of health and life science datasets and data packages.

 

Is the data trustworthy?

John Snow Labs offers access to datasets that have been curated by a team of specialists in the health and life science domains. Thanks to the team's extensive expertise and experience in data acquisition, curation, normalization and publishing, our datasets are cleaner, better documented, better structured and more richly annotated than the free equivalents offered by various well-established and trustworthy data publishers.

Our datasets are easy to understand, use and integrate into your existing systems and tools. You can find a list of our databases on our vendors page.

Every single dataset on John Snow Labs has a fully transparent link back to its source. This means you can always verify the data as published by its original source. Transparency is the ultimate enabler of trust.

Is this data for me?

The main customers targeted by the John Snow Labs Data Library are:
– Healthcare and life science application providers;
– Data integrators that want to provide data-centered services and are interested in John Snow Labs datasets;
– SMEs that want to develop new products based on health and life science data;
– CIOs, CEOs and CTOs of healthcare-related businesses;
– Data scientists;
– Data publishers that want to integrate their datasets with complementary health and life science datasets for richer context and relevance.

What is the John Snow Labs Data Library?

The John Snow Labs Data Library is an online data repository that allows users to access, download, and use datasets or data packages (groups of related datasets) curated by the John Snow Labs team of experts. It is a quick, easy-to-access gateway to the John Snow Labs data catalog, a unique collection of normalized, clean and enriched health and life science datasets.

The data library contains virtual products in the form of datasets and data packages that can be downloaded and used:

  • for research purposes for free and
  • for commercial purposes after paying a subscription fee.

As long as the subscription is valid, the user has a commercial license to use the datasets and receives all available updates.

3. Data Info

Why is John Snow Labs data premium?

The datasets published on the John Snow Labs Data Library are premium-quality datasets, already tested, optimized and customized in a ready-to-use format.
Extensive effort has been invested in preparing and optimizing these datasets for immediate use:
– They have been curated by human experts;
– They come in out-of-the-box optimized data formats for R, Python, SAS, Hadoop, Spark, SQL & BI tools;
– Daily updates are integrated and published, so the user gets automatic, versioned, clean & tested updates as they happen;
– All data is under one license with royalty-free, commercial redistribution rights;
– Datasets are triple-checked, both automatically and manually, to make sure that they are error-free and ready for production use;
– Our datasets are clean and interoperable, because we use a unified, standards-based data model that covers numbers, dates, units, currency, null values, identifiers & references.

 

Why would I not buy/take the data directly from the data publisher?

By using our datasets you will save more than 4,000 hours in data preparation (cleaning, transformation, normalization, etc.) each month.

We offer you turnkey data for analysis, already tested, optimized and customized in a ready-to-use format for your big data, data science or visualization platform.

4. Subscriptions

Can I cancel my order?

A user can cancel any order that has on-hold status. On-hold status means that the payment has not been processed yet. Once the payment is processed, the user receives a commercial license agreement for the entire data catalog and the order can no longer be canceled.
Canceling an order does not incur any payment or penalty.

Can I cancel my subscription?

Any active subscription on the John Snow Labs Data Library can be canceled at any time. Canceling a subscription stops future renewal charges but does not result in a refund of your order.

Commercial use of the dataset(s) is still allowed until the day the current subscription expires.

Do you offer discounts for academics, researchers and students?

The use of John Snow Labs datasets is free forever for academics, researchers, and students.

How can I get a commercial license?

The Data Library allows users to easily buy a subscription to the entire catalog. The subscription functionality is accessible from the Data Library main page.

Clicking one of the Subscribe to Data Library buttons, on the Data Library main page or on a dataset details page, adds the subscription to your cart. Once the order is placed and the payment is confirmed, the user gains commercial rights to all datasets.

The subscription is valid for one year and entitles the user to instantly access all available data updates and an unlimited number of downloads.

How can I pay for my order?

The payment methods currently supported by John Snow Labs Data Library are:

  • Credit card directly on our website;
  • Bank transfer to the account received via e-mail once the order is confirmed.

RETURNS AND REFUNDS

Subscriptions to John Snow Labs Data Library are not returnable or refundable after purchase.

Orders with on-hold status can be canceled free of charge.

A user can cancel any order that has on-hold status. On-hold status means that the payment has not been processed yet. Once the payment is processed, the order can no longer be canceled. Canceling an order does not incur any payment or penalty.

Active subscriptions can be canceled, but we do not provide any reimbursement for subscriptions that have already been paid.

Any active subscription on the John Snow Labs Data Library can be canceled at any time. Canceling a subscription stops future renewal charges but does not result in a refund of your order. Commercial exploitation rights to the datasets remain valid until the day the current subscription expires.

Where can I see my current subscriptions?

Your list of subscriptions can be accessed in your account section of the Data Library.

2. Data Library Functionalities

How can I find the dataset(s) I want?

The Data Library provides a dedicated web page where users can search for the datasets they are interested in and explore the available data catalog.

The search functionality works on both dataset name and dataset description. By default, all available datasets are displayed as a list of products.

The following information is available for each product on the main shop page:
– name of the dataset;
– relevant short description;
– image that identifies the name of the data package that includes the current dataset;
– data download button for logged-in users.

What details are available for each product?

The Data Library provides dedicated pages for all available datasets and data packages.
The dataset details page includes the following information:
– the dataset name;
– license information for the logged-in user;
– direct download links for the CSV data, the PDF reference file, and the JSON metadata file;
– the image associated with the dataset;
– a short description of the dataset;
– a detailed description of the dataset;
– a clear description of the list of fields, together with type information;
– a data preview;
– a data package section that briefly describes the data package that includes the current dataset;
– a related datasets section containing all datasets that are in the same accelerator as the current dataset.
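
As an illustration of how the CSV data and the JSON metadata file might be used together after download, here is a minimal Python sketch. The metadata layout (a "fields" list with "name" and "type" keys), the type names, and the sample rows are all assumptions made up for this example; the actual files may be structured differently, so check the PDF reference file for the real field definitions.

```python
import csv
import io
import json

# Hypothetical metadata layout: the real JSON metadata file may differ.
SAMPLE_METADATA = """
{"fields": [
  {"name": "hospital_id", "type": "integer"},
  {"name": "hospital_name", "type": "string"},
  {"name": "bed_count", "type": "integer"}
]}
"""

# Made-up sample rows standing in for a downloaded CSV file.
SAMPLE_CSV = """hospital_id,hospital_name,bed_count
1,General Hospital,250
2,Lakeside Clinic,
"""

# Map the (assumed) type names to Python converters.
CONVERTERS = {"integer": int, "number": float, "string": str}


def load_typed_rows(csv_text, metadata_text):
    """Parse CSV rows and convert each value using the field types in the metadata."""
    fields = json.loads(metadata_text)["fields"]
    types = {f["name"]: CONVERTERS.get(f["type"], str) for f in fields}
    rows = []
    for raw in csv.DictReader(io.StringIO(csv_text)):
        row = {}
        for name, value in raw.items():
            # Treat empty cells as missing values instead of failing on conversion.
            row[name] = None if value == "" else types.get(name, str)(value)
        rows.append(row)
    return rows


rows = load_typed_rows(SAMPLE_CSV, SAMPLE_METADATA)
for row in rows:
    print(row)
```

With real downloads, the two sample strings would be replaced by the contents of the CSV and JSON files read from disk.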

What is a data package?

A data package is a group of datasets that are related. In other words, datasets included in the same package describe the same data from different points of view or describe complementary data or data that is somehow related.

Need help?

Talk to our healthcare & life science experts
for help finding the data you need.

Schedule a call