How it works



Search 2,000+ data sets, read descriptions, browse the schemas, and view sample data



Premium data sets are licensed as an annual subscription, which includes all updates, support and integrations



Download the full data sets or use one of 26 prebuilt integrations for analysts & data scientists

Frequently Asked Questions

1. General Questions

Data coverage

Through the Data Market, John Snow Labs offers a wide range of health and life science datasets and data packages.


Is the data trustworthy?

John Snow Labs offers access to both free and premium datasets that have been curated by a team of specialists in the health and life science domains. Thanks to the team's expertise and experience in data acquisition, curation, normalization and publishing, our datasets are cleaner, better documented, better structured and richer in useful information than the free equivalents offered by well-established, trustworthy data publishers.

Our datasets are extremely easy to understand, use and integrate into your existing systems and tools. You can find a list of our databases on our vendors page.

Every single dataset on John Snow Labs has a fully transparent link back to its source. This means you can always verify the data as published by its original source. Transparency is the ultimate enabler of trust.

Is this data for me?

The main customers targeted by the John Snow Labs Data Market are:
– Healthcare and life science application providers;
– Data integrators that want to provide data-centered services and are interested in John Snow Labs datasets;
– SMEs that want to develop new products based on health and life science data;
– CIOs, CEOs and CTOs of healthcare-related businesses;
– Data scientists;
– Data publishers that want to integrate their datasets with complementary health and life science datasets for richer context and relevance.

What is John Snow Labs Data Market?

The John Snow Labs Data Market is an online data store where users can purchase subscriptions to John Snow Labs datasets or data packages (groups of related datasets). It is a quick and easy gateway to the John Snow Labs data catalog, a unique collection of normalized, clean and enriched health and life science datasets.

The products sold here are virtual products: datasets that users can download after paying a subscription fee. As long as the subscription is valid, the user has access to the datasets and all available updates. Datasets that are thematically related (they contain similar or related data) are grouped into data packages, and the user can subscribe to a data package to get a better price.

2. Data Market Functionalities

How can I find the dataset(s) I want?

The Data Market provides a dedicated web page where users can search for the datasets they are interested in and explore the available data catalog.

The search functionality works on both dataset name and dataset description. By default, all available datasets are displayed as a list of products.
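As a rough illustration of what matching on both name and description means, the sketch below filters a small in-memory catalog. The catalog entries, the `Product` fields and the `search_catalog` function are hypothetical and do not describe the Data Market's actual implementation; the sketch assumes a simple case-insensitive substring match.

```python
from dataclasses import dataclass


@dataclass
class Product:
    name: str
    description: str


# Hypothetical catalog entries, for illustration only.
CATALOG = [
    Product("Hospital Directory", "Names and addresses of hospitals."),
    Product("Drug Labels", "Label text for approved medications."),
    Product("Provider Ratings", "Quality scores for hospitals and clinics."),
]


def search_catalog(query, catalog=CATALOG):
    """Return products whose name or description contains the query.

    An empty query returns the whole catalog, mirroring the default
    listing of all available datasets.
    """
    needle = query.strip().lower()
    if not needle:
        return list(catalog)
    return [
        p for p in catalog
        if needle in p.name.lower() or needle in p.description.lower()
    ]


if __name__ == "__main__":
    for product in search_catalog("hospital"):
        print(product.name)
```

In this sketch, a query can match a product through its description even when the product name does not contain the search term.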

The following information is available for each product on the main shop page:
– the name of the dataset;
– a relevant short description;
– an image that identifies the name of the data package that includes the dataset;
– a button for downloading a sample CSV;
– a logo that specifies the type of dataset (premium/free).

What details are available for each product?

The Data Market provides dedicated product pages for all available products.
The product details page for a dataset includes the following information:
– the dataset name;
– the price of the dataset;
– the image associated with the dataset;
– a short description of the dataset;
– a detailed description of the dataset;
– a clear description of the list of fields together with typing information;
– a data preview;
– CSV sample data for immediate download;
– a data package section that briefly describes the data package that includes the current dataset;
– a related datasets section containing all datasets that are in the same accelerator as the current dataset.

What is a data package?

A data package is a group of related datasets. In other words, datasets included in the same package describe the same data from different points of view, or describe data that is complementary or otherwise related.
The advantage of data packages is that the user can quickly subscribe to all datasets included in the package. This group subscription comes at a better price.

3. Data Info

What is premium data?

Premium data consists of higher-quality datasets that have already been tested, optimized and customized in a ready-to-use format.
Extensive effort has been invested in preparing and optimizing these datasets for immediate use:
– They have been curated by human experts;
– They come in out-of-the-box optimized data formats for R, Python, SAS, Hadoop, Spark, SQL & BI tools;
– Daily updates are integrated and published, so the user gets automatic, versioned, clean & tested updates as they happen;
– All data is under one license with royalty-free, commercial redistribution rights;
– Datasets are triple checked, automatically and manually, to make sure that they are error-free and ready for production use;
– Our datasets are clean and interoperable, because we use a unified, standards-based data model covering numbers, dates, units, currency, null values, identifiers & references.


Why would I not buy/take the data directly from the data publisher?

By using our datasets you will save more than 4,000 hours in data preparation (cleaning, transformation, normalization, etc.) each month.

We offer you turnkey data for analysis, already tested, optimized and customized in a ready-to-use format for your big data, data science or visualization platform.

4. Subscriptions

Can I cancel my order?
A user can cancel any order that has "on hold" status. On hold status means that the payment has not been processed yet and the user has not yet received access to the ordered data products. Once the payment is processed and the user gets access to the ordered subscriptions, the order can no longer be cancelled.
Cancelling an order does not incur any payment or penalty.

Can I cancel my subscription?

Any active subscription on John Snow Labs Data Market can be cancelled at any time. Cancelling a subscription stops future renewal charges but does not result in a refund of your order.

Access to the dataset(s) included in the cancelled subscription remains available until the day the current subscription expires.

Do I get a discount if I subscribe to multiple datasets?

Discounts are offered when subscribing to data packages. On average, a data package costs 25% less than the total price of the datasets it includes.
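To make the arithmetic concrete, here is a small sketch using made-up prices. The 25% figure is the average quoted above; the dataset prices and the `package_price` helper are assumptions for illustration only, and the actual price of any given package may differ.

```python
from decimal import Decimal

# Average package discount quoted in this FAQ; individual packages may vary.
AVERAGE_PACKAGE_DISCOUNT = Decimal("0.25")


def package_price(dataset_prices, discount=AVERAGE_PACKAGE_DISCOUNT):
    """Estimate a package price from the prices of its datasets."""
    total = sum(dataset_prices, Decimal("0"))
    return total * (Decimal("1") - discount)


# Hypothetical annual prices for three datasets in one package.
prices = [Decimal("400"), Decimal("250"), Decimal("150")]
print(package_price(prices))
```

With these hypothetical prices, the three datasets would cost 800 in total when bought separately, and the estimated package price comes to 600.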

Do you offer discounts for academics, researchers and students?

Yes, please email us to discuss your needs.

How can I get access to the premium datasets/data packages?

The Data Market allows users to easily subscribe to premium datasets or data packages. The subscription functionality is accessible from the Data Market main page at both dataset and data package level.

Clicking the Subscribe to dataset or Subscribe to data package button on the Data Market main page adds the subscription to your cart. Once the order is placed and the payment is confirmed, the user has instant access to the dataset download links.

The subscription is valid for one year and entitles the user to instantly access all available data updates and an unlimited number of downloads.

How can I pay for my order?

The payment methods currently supported by John Snow Labs Data Market are:

  • Credit card directly on our website;
  • Bank transfer to the account received via e-mail once the order is confirmed.

Where can I see my current subscriptions?

Your list of subscriptions can be accessed in your account section of the Data Market.

Need help?

Talk to our healthcare & life science experts for help finding the data you need.

Schedule a call