ChEMBL Approved Drug Targets Human Swiss-Prot Accessions


The ChEMBL dataset released on 17 August 2013 includes a download option for approved drug targets. This list converted to 251 human Swiss-Prot accessions; note that the count does not include additional protein IDs from target groups.

Complexity

This dataset is one of a selection of attributed target lists extracted from the literature and provided as supplementary data to the paper "Analysis of in vitro bioactivity data extracted from drug discovery literature and patents: Ranking 1654 human protein targets by assayed compounds and molecular scaffolds". The criterion for inclusion is drug target coverage of human proteins.

The dataset supports two main uses: a) following the database links and b) comparing lists for intersects (protein IDs in common) and differentials (protein IDs unique to particular lists or subsets). Such comparisons can be extended to lists generated in the course of studying other published work (e.g. expression data or disease-association gene candidates).
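The intersect and differential comparison described above can be sketched with Python sets. This is a minimal illustration, not part of the dataset: the function name `compare_lists` and the second list `other_list` are hypothetical, and the accessions are taken from the sample rows of this dataset purely as example values.

```python
def compare_lists(list_a, list_b):
    """Return the intersect and the two differentials of two accession lists."""
    a, b = set(list_a), set(list_b)
    return {
        "intersect": sorted(a & b),  # protein IDs in common
        "only_a": sorted(a - b),     # protein IDs unique to list A
        "only_b": sorted(b - a),     # protein IDs unique to list B
    }


# A few accessions from this dataset's sample rows.
approved_targets = ["P00797", "P10275", "P06213", "P00813"]
# Hypothetical list, e.g. candidates drawn from an expression study.
other_list = ["P10275", "P06213", "P14679"]

result = compare_lists(approved_targets, other_list)
for name, accessions in result.items():
    print(name, accessions)
```

Sorting the output keeps the result stable between runs, which helps when lists are compared repeatedly against newer versions.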

The metadata descriptions are minimal because context is provided in the references and/or the download descriptions for the appropriate databases. The lists are Excel sheets containing live links to UniProtKB, HGNC and ChEMBL. These entry points should lead to most of the information available from other sources.

Lists that are not UniProtKB accessions are normalized to them (e.g. by mapping HUGO Gene Nomenclature Committee (HGNC) symbols or Entrez Gene IDs (EGID) to UniProtKB). They are then filtered to human Swiss-Prot entries (i.e. any TrEMBL entries are removed) and, where the original list offers the option, to approved drug targets. Hosted lists of this kind are therefore transformations, rather than direct facsimiles, of the primary sources. Because such ID cross-mappings are not perfect, absolute correctness cannot be guaranteed. The versions are, however, supplied in good faith, and the originals are available in every case.
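The normalization and filtering steps above might look roughly like the following sketch. It is an assumption-laden illustration rather than the procedure actually used: the function `normalize_to_swissprot`, the in-memory mapping table, the record fields (`accession`, `reviewed`, `organism`) and the placeholder TrEMBL accession are all hypothetical. A real mapping would come from a UniProtKB or HGNC download.

```python
def normalize_to_swissprot(symbols, symbol_to_entries):
    """Map gene symbols to human Swiss-Prot accessions.

    Returns (accessions, unmapped). Unmapped symbols are reported because
    ID cross-mappings are not perfect.
    """
    accessions, unmapped = [], []
    for symbol in symbols:
        hits = [
            entry["accession"]
            for entry in symbol_to_entries.get(symbol, [])
            # Keep reviewed (Swiss-Prot) human entries only; drop TrEMBL.
            if entry["reviewed"] and entry["organism"] == "Homo sapiens"
        ]
        if hits:
            accessions.extend(hits)
        else:
            unmapped.append(symbol)
    # De-duplicate while preserving the original order.
    return list(dict.fromkeys(accessions)), unmapped


# Hypothetical mapping table; "TREMBL_PLACEHOLDER" is not a real accession.
mapping = {
    "REN": [{"accession": "P00797", "reviewed": True, "organism": "Homo sapiens"}],
    "AR": [
        {"accession": "P10275", "reviewed": True, "organism": "Homo sapiens"},
        {"accession": "TREMBL_PLACEHOLDER", "reviewed": False, "organism": "Homo sapiens"},
    ],
}

accessions, unmapped = normalize_to_swissprot(["REN", "AR", "UNKNOWN1"], mapping)
print(accessions)
print(unmapped)
```

Returning the unmapped symbols alongside the accessions makes the loss introduced by the transformation visible, so it can be checked against the original list.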

Date Created

2013-08-13

Last Modified

2017-01-27

Version

2017-01-27

Update Frequency

Irregular

Temporal Coverage

N/A

Spatial Coverage

N/A

Source

John Snow Labs; International Union of Basic and Clinical Pharmacology (IUPHAR) and the British Pharmacological Society (BPS) Guide to PHARMACOLOGY

Source License URL

Source License Requirements

The Guide to PHARMACOLOGY database is licensed under the Open Data Commons Open Database License (ODbL). Its contents are licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported license.

Source Citation

Rask-Andersen M, Masuram S, Schiöth HB. The druggable genome: Evaluation of drug targets in clinical trials suggests major shifts in molecular class and indication.

Keywords

Protein Database, Target Bedside Table, Block Targets, Free Database

Other Titles

Approved Drug Targets, Drug Targets ChEMBL

Name | Description | Type | Constraints
Entry_Report | This subsection of the ‘Entry information’ section provides a mnemonic identifier for a UniProtKB entry, but it is not a stable identifier. Each reviewed entry is assigned a unique entry name upon integration into UniProtKB/Swiss-Prot. | string | unique: 1, required: 1
Protein_Names | This subsection of the ‘Names and Taxonomy’ section provides a list of names of the human protein, from commonly used to obsolete, to allow unambiguous identification of a protein. It may also include information on the activity of the protein, such as a precise description of the catalytic mechanism of enzymes, or information about individual protein chains or functional domains contained within it, if pertinent. | string | required: 1
Cross_Reference_HGNC_Id | HGNC ID for each protein name. | string | required: 1
Cross_Reference_HGNC_URL | Links to the HGNC database entries cross-referenced in UniProtKB/Swiss-Prot. | string | required: 1
Cross_Reference_CHEMBL_Id | ChEMBL ID for each protein name. | string | required: 1
Cross_Reference_CHEMBL_URL | Links to the ChEMBL database entries cross-referenced in UniProtKB/Swiss-Prot. | string | required: 1

Entry_Report | Protein_Names | Cross_Reference_HGNC_Id | Cross_Reference_HGNC_URL | Cross_Reference_CHEMBL_Id | Cross_Reference_CHEMBL_URL
P00797 | Renin | HGNC:9958, REN | http://www.genenames.org/data/hgnc_data.php?hgnc_id=9958 | CHEMBL286 | https://www.ebi.ac.uk/chembldb/target/inspect/CHEMBL286
P02751 | Fibronectin | HGNC:3778, FN1 | http://www.genenames.org/data/hgnc_data.php?hgnc_id=3778 | CHEMBL3810 | https://www.ebi.ac.uk/chembldb/target/inspect/CHEMBL3810
P00747 | Plasminogen | HGNC:9071, PLG | http://www.genenames.org/data/hgnc_data.php?hgnc_id=9071 | CHEMBL1801 | https://www.ebi.ac.uk/chembldb/target/inspect/CHEMBL1801
P09237 | Matrilysin | HGNC:7174, MMP7 | http://www.genenames.org/data/hgnc_data.php?hgnc_id=7174 | CHEMBL4073 | https://www.ebi.ac.uk/chembldb/target/inspect/CHEMBL4073
P0DJD9 | Pepsin A-5 | HGNC:8887, PGA5 | http://www.genenames.org/data/hgnc_data.php?hgnc_id=8887 | CHEMBL3295 | https://www.ebi.ac.uk/chembldb/target/inspect/CHEMBL3295
P14679 | Tyrosinase | HGNC:12442, TYR | http://www.genenames.org/data/hgnc_data.php?hgnc_id=12442 | CHEMBL1973 | https://www.ebi.ac.uk/chembldb/target/inspect/CHEMBL1973
P10275 | Androgen receptor | HGNC:644, AR | http://www.genenames.org/data/hgnc_data.php?hgnc_id=644 | CHEMBL1871 | https://www.ebi.ac.uk/chembldb/target/inspect/CHEMBL1871
P16444 | Dipeptidase 1 | HGNC:3002, DPEP1 | http://www.genenames.org/data/hgnc_data.php?hgnc_id=3002 | CHEMBL1989 | https://www.ebi.ac.uk/chembldb/target/inspect/CHEMBL1989
P00813 | Adenosine deaminase | HGNC:186, ADA | http://www.genenames.org/data/hgnc_data.php?hgnc_id=186 | CHEMBL1910 | https://www.ebi.ac.uk/chembldb/target/inspect/CHEMBL1910
P06213 | Insulin receptor | HGNC:6091, INSR | http://www.genenames.org/data/hgnc_data.php?hgnc_id=6091 | CHEMBL1981 | https://www.ebi.ac.uk/chembldb/target/inspect/CHEMBL1981
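
In the sample rows above, the Cross_Reference_HGNC_Id value combines an HGNC ID and a gene symbol (e.g. "HGNC:9958, REN"), and the HGNC URL ends in the numeric part of that ID. The following sketch splits the field and checks that the two columns agree. The helper names are hypothetical, and the "ID, symbol" layout is inferred from the sample rows only, so other rows may need additional handling.

```python
def split_hgnc_field(value):
    """Split a value such as 'HGNC:9958, REN' into ('HGNC:9958', 'REN')."""
    hgnc_id, _, symbol = value.partition(",")
    return hgnc_id.strip(), symbol.strip()


def hgnc_number(hgnc_id):
    """Return the numeric part of an ID such as 'HGNC:9958'."""
    prefix, _, number = hgnc_id.partition(":")
    if prefix != "HGNC" or not number.isdigit():
        raise ValueError(f"unexpected HGNC ID format: {hgnc_id!r}")
    return int(number)


def url_matches_id(hgnc_field, url):
    """Check that the HGNC URL points at the ID given in the HGNC field."""
    hgnc_id, _ = split_hgnc_field(hgnc_field)
    return url.endswith(f"hgnc_id={hgnc_number(hgnc_id)}")


# Values copied from the Renin row of the sample data.
field = "HGNC:9958, REN"
url = "http://www.genenames.org/data/hgnc_data.php?hgnc_id=9958"

print(split_hgnc_field(field))
print(url_matches_id(field, url))
```

A check like this is a cheap way to spot rows where the ID and link columns drifted apart during cross-mapping.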