Pharmacologic Substance Concepts and Types

$716 / year

This dataset contains the entire concept structure of UMLS Metathesaurus for the semantic type “Pharmacologic Substance”. One of the primary purposes of this dataset is to connect different names for all the concepts for a specific Semantic Type. There are 125 semantic types in the Semantic Network. Every Metathesaurus concept is assigned at least one semantic type; very few terms are assigned as many as five semantic types.


The UMLS, or Unified Medical Language System, is a set of files and software that brings together many health and biomedical vocabularies and standards to enable interoperability between computer systems. One powerful use of the UMLS is linking health information, medical terms, drug names, and billing codes across different computer systems. Some examples of this are:
– Linking terms and codes between your doctor, your pharmacy, and your insurance company
– Patient care coordination among several departments within a hospital
The UMLS has many other uses, including search engine retrieval, data mining, public health statistics reporting, and terminology research.
The UMLS has three tools (Knowledge Sources):
– Metathesaurus: Terms and codes from many vocabularies, including CPT, ICD-10-CM, LOINC, MeSH, RxNorm, and SNOMED CT
– Semantic Network: Broad categories (semantic types) and their relationships (semantic relations)
– SPECIALIST Lexicon and Lexical Tools: Natural language processing tools

The 2018AB Metathesaurus contains approximately 3.44 million concepts and 13.7 million unique concept names from 199 source vocabularies. The Metathesaurus is a very large, multi-purpose, and multi-lingual vocabulary database that contains information about biomedical and health-related concepts, their various names, and the relationships among them. It is built from the electronic versions of many different thesauri, classifications, code sets, and lists of controlled terms used in patient care, health services billing, public health statistics, indexing and cataloging biomedical literature, and/or basic, clinical, and health services research. In this documentation, these are referred to as the “source vocabularies” of the Metathesaurus. In the Metathesaurus, all the source vocabularies are available in a single, fully-specified database format.

The Metathesaurus is organized by concept or meaning. In essence, its purpose is to link alternative names and views of the same concept together and to identify useful relationships between different concepts. All concepts in the Metathesaurus are assigned to at least one semantic type from the Semantic Network. This provides consistent categorization of all concepts in the Metathesaurus at the relatively general level represented in the Semantic Network.
The purpose of the Semantic Network is to provide a consistent categorization of all concepts represented in the UMLS Metathesaurus and to provide a set of useful relationships between these concepts. All information about specific concepts is found in the Metathesaurus; the Network provides information about the set of basic semantic types, or categories, which may be assigned to these concepts, and it defines the set of relationships that may hold between the semantic types. The current release of the Semantic Network contains 125 semantic types and 54 relationships. The Semantic Network serves as an authority for the semantic types that are assigned to concepts in the Metathesaurus. The Network defines these types, both with textual descriptions and by means of the information inherent in its hierarchies.
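As a rough illustration of the concept-centred organization described above, the sketch below links names to concepts and concepts to semantic types, then checks that every concept has at least one semantic type. The identifiers and names are taken from this dataset's sample records; the in-memory layout and the function name `concepts_missing_types` are assumptions made for illustration and do not reflect the Metathesaurus file format.

```python
from collections import defaultdict

# (CUI, name) pairs; values are taken from the dataset's sample records.
names = [
    ("C0065828", "MMR"),
    ("C0086413", "HIV"),
    ("C0026782", "mumps"),
]

# (CUI, TUI) pairs; T121 is the semantic type identifier shown for these records.
semantic_types = [
    ("C0065828", "T121"),
    ("C0086413", "T121"),
    ("C0026782", "T121"),
]

# Group every name under the concept it belongs to.
names_by_concept = defaultdict(list)
for cui, name in names:
    names_by_concept[cui].append(name)

# Group every semantic type under the concept it is assigned to.
types_by_concept = defaultdict(set)
for cui, tui in semantic_types:
    types_by_concept[cui].add(tui)


def concepts_missing_types(names_by_concept, types_by_concept):
    """Return the CUIs that have names but no semantic type assigned."""
    return sorted(
        cui for cui in names_by_concept if not types_by_concept.get(cui)
    )


print(concepts_missing_types(names_by_concept, types_by_concept))
```

A concept with several names would simply accumulate more entries in `names_by_concept`, which is the linking role the Metathesaurus plays across source vocabularies.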

Source License URL

John Snow Labs Standard License

UMLS, Metathesaurus, UMLS concepts, Semantic Type, RRF, NLM

Other Titles

Metathesaurus Concepts and Their Semantic Types, Substances Pharmacologic Substance Concepts and Types, MRCONSO & MRSTY

| Field | Description | Type | Constraints |
|---|---|---|---|
| Concept_Unique_Identifier | Concept Unique Identifier (CUI) is the unique identifier for a Metathesaurus concept to which strings with the same meaning are linked. A CUI starts with C followed by 7 digits. | string | required: 1 |
| Language_of_Terms | Language of terms in the source vocabulary | string | required: 1 |
| Term_Status | Status of the term. P = Preferred LUI of the CUI, S = Non-Preferred LUI of the CUI | string | required: 1 |
| Lexical_Unique_Identifier | Lexical Unique Identifier (LUI) is the unique identifier of a term in the Metathesaurus. Terms differ from strings in that they group together strings that are lexical variants of one another. An LUI starts with L followed by 7 digits. | string | required: 1 |
| String_Type | Type of string. PF = Preferred form of term, VCW = Case and word-order variant of the preferred form, VC = Case variant of the preferred form, VO = Variant of the preferred form, VW = Word-order variant of the preferred form. | string | required: 1 |
| String_Unique_Identifier | String Unique Identifier (SUI) is a unique identifier for each unique string in the Metathesaurus. Strings that differ in any way, e.g., by upper or lower case, have different SUIs. An SUI starts with S followed by 7 digits. | string | required: 1 |
| Is_Preferred | Indicates whether the atom status is preferred (true) or not (false) for this string within this concept. | boolean | required: 1 |
| Atom_Unique_Identifier | Atom Unique Identifier (AUI) is an identifier for the atom in the UMLS. It is the primary key to the concepts table. An AUI starts with A followed by 7 or 8 digits. Atoms are the concept names or strings from each of the source vocabularies. | string | required: 1 |
| Source_Asserted_Atom_Identifier | Source asserted identifier for an atom | string | - |
| Source_Asserted_Concept_Identifier | Source asserted identifier for a concept | string | - |
| Source_Asserted_Descriptor_Identifier | Source asserted identifier for a descriptor in the Metathesaurus | string | - |
| Source_Abbreviation | Abbreviation of the source vocabulary | string | required: 1 |
| Term_Type | Type of term within the source vocabulary. A value indicating the kind of role an atom plays in its source. Examples include PT for "preferred term," SY for "synonym," and MH for "main heading." | string | - |
| Source_String_Code | Source string code (CODE) is the unique identifier or code for the string in the source | string | - |
| String_Name | Name of the string in the Metathesaurus | string | - |
| Source_Restriction_Level | | integer | required: 1, level: Nominal |
| Suppressible_Flag | Suppressible flag. In the UMLS Metathesaurus, terms can be marked as "suppressible" so that they can be removed from a subset. Terms are most often identified as suppressible because of ambiguity in meaning or lack of face validity. Values: E = Suppressible due to editor decision, N = Not suppressible, O = Obsolete, Y = Suppressible due to SAB | string | required: 1 |
| Content_View_Flag_1 | Content View Flag. Bit field used to flag rows included in Content View. This field is a varchar field to maximize the number of bits available for use. | integer | level: Nominal |
| Semantic_Type_Unique_Identifier | Unique id assigned to the semantic type, e.g., T004 = Fungus, T005 = Virus | string | required: 1 |
| Semantic_Type_Tree_Identifier | Semantic type tree number | string | required: 1 |
| Semantic_Type_Name | Name of the semantic type | string | required: 1 |
| Attribute_Type_Identifier | Each concept has specific attributes defining its meaning and is linked to the corresponding concept names in the various source vocabularies | string | required: 1 |
| Content_View_Flag_2 | A bit field used to flag rows included in Content View. This field is a varchar field to maximize the number of bits available for use. | integer | level: Nominal |
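| Concept_Unique_Identifier | Language_of_Terms | Term_Status | Lexical_Unique_Identifier | String_Type | String_Unique_Identifier | Is_Preferred | Atom_Unique_Identifier | Source_Asserted_Atom_Identifier | Source_Asserted_Concept_Identifier | Source_Asserted_Descriptor_Identifier | Source_Abbreviation | Term_Type | Source_String_Code | String_Name | Source_Restriction_Level | Suppressible_Flag | Content_View_Flag_1 | Semantic_Type_Unique_Identifier | Semantic_Type_Tree_Identifier | Semantic_Type_Name | Attribute_Type_Identifier | Content_View_Flag_2 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| C0065828 | ENG | S | L1218041 | PF | S1459573 | true | A27800910 | | | | CVX | AB | 03 | MMR | 0 | N | | T121 | A1. | Substance | AT17569911 | 256 |
| C0086413 | ENG | S | L0019682 | PF | S0048631 | true | A27801658 | | | | CVX | AB | 61 | HIV | 0 | N | | T121 | A1. | Substance | AT17584380 | 256 |
| C0718003 | ENG | S | L0398824 | PF | S0490600 | true | A27802887 | | | | CVX | AB | 10 | IPV | 0 | N | | T121 | A1. | Substance | AT17573385 | 256 |
| C0004886 | ENG | S | L0004885 | PF | S0018024 | false | A27793046 | | | | CVX | AB | 19 | BCG | 0 | N | | T121 | A1. | Substance | AT60657770 | 256 |
| C0065829 | ENG | S | L5525057 | PF | S6318557 | true | A27794865 | | | | CVX | AB | 94 | MMRV | 0 | N | | T121 | A1. | Substance | AT17582233 | 256 |
| C0301503 | ENG | S | L0605773 | PF | S0722181 | true | A27791770 | | | | CVX | AB | 34 | RIG | 0 | N | | T121 | A1. | Substance | AT37407898 | 4352 |
| C1589699 | ENG | P | L1225038 | PF | S6418129 | true | A13431996 | | | | MTH | PN | NOCODE | HSS | 0 | N | | T121 | A1. | Substance | AT55538083 | |
| C0026782 | ENG | S | L0026780 | PF | S0422284 | true | A27793733 | | | | CVX | AB | 07 | mumps | 0 | N | | T121 | A1. | Substance | AT17572985 | 256 |
| C0259714 | ENG | P | L0269574 | PF | S0361622 | true | A6788118 | | | | MTH | PN | NOCODE | CP-10 | 0 | N | | T121 | A1. | Substance | AT07963277 | |
| C0360506 | ENG | S | L0585951 | PF | S0664820 | true | A27793016 | | | | CVX | AB | 86 | IG | 0 | N | 256 | T121 | A1. | Substance | AT07937217 | 256 |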
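The identifier conventions and flags described in the field list can be checked mechanically. The following is a minimal sketch, assuming each row has been loaded as a Python dictionary keyed by the field names above; the loading step, the helper names `invalid_fields` and `usable_names`, and the choice to accept 7 or 8 digits for the AUI (the sample rows contain both lengths) are assumptions, not part of the dataset documentation.

```python
import re

# Prefix letters follow the field descriptions; the AUI digit count is
# relaxed to 7 or 8 because the sample rows contain both (an assumption).
ID_PATTERNS = {
    "Concept_Unique_Identifier": re.compile(r"C\d{7}"),
    "Lexical_Unique_Identifier": re.compile(r"L\d{7}"),
    "String_Unique_Identifier": re.compile(r"S\d{7}"),
    "Atom_Unique_Identifier": re.compile(r"A\d{7,8}"),
}


def invalid_fields(record):
    """Return the identifier fields whose values do not match the expected shape."""
    return [
        field
        for field, pattern in ID_PATTERNS.items()
        if not pattern.fullmatch(record.get(field, ""))
    ]


def usable_names(records):
    """Return names of rows that are not suppressible (N) and flagged as preferred."""
    return [
        r["String_Name"]
        for r in records
        if r["Suppressible_Flag"] == "N" and r["Is_Preferred"]
    ]


# Three rows abridged from the sample records (only the fields used here).
records = [
    {"Concept_Unique_Identifier": "C0065828", "Lexical_Unique_Identifier": "L1218041",
     "String_Unique_Identifier": "S1459573", "Atom_Unique_Identifier": "A27800910",
     "String_Name": "MMR", "Is_Preferred": True, "Suppressible_Flag": "N"},
    {"Concept_Unique_Identifier": "C0004886", "Lexical_Unique_Identifier": "L0004885",
     "String_Unique_Identifier": "S0018024", "Atom_Unique_Identifier": "A27793046",
     "String_Name": "BCG", "Is_Preferred": False, "Suppressible_Flag": "N"},
    {"Concept_Unique_Identifier": "C0259714", "Lexical_Unique_Identifier": "L0269574",
     "String_Unique_Identifier": "S0361622", "Atom_Unique_Identifier": "A6788118",
     "String_Name": "CP-10", "Is_Preferred": True, "Suppressible_Flag": "N"},
]

for r in records:
    print(r["Concept_Unique_Identifier"], invalid_fields(r))
print(usable_names(records))
```

Filtering on `Suppressible_Flag` before any downstream use mirrors the stated purpose of that flag, which is to let suppressible terms be removed from a subset.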