Approved and Researched Drug Targets Human SwissProt Accessions

$179 / year

This dataset is a supplementary data from “Analysis of in vitro bioactivity data extracted from drug discovery literature and patents: Ranking 1654 human protein targets by assayed compounds and molecular scaffolds” (2011). In this case the Entrez Gene IDs were mapped to 1651 human Swiss-Prot accessions but this includes both approved and research targets.

Complexity

This dataset is a selection of both approved and research drug targets extracted from the literature, “Analysis of in vitro bioactivity data extracted from drug discovery literature and patents: Ranking 1654 human protein targets by assayed compounds and molecular scaffolds” of 2011. The criteria for inclusion are drug target coverage for approved and researched drug targets.

There are utilities that can be explored in this dataset including the two: a) following the database links and b) comparing them for intersects (protein IDs in common) and differentials (protein IDs unique to particular lists or subsets). This can be extended to comparisons with lists that may be generated in the course of studying other published work (e.g. expression data or disease association gene candidates).

The metadata descriptions are minimal since context is provided either in the references and/or the download descriptions for the appropriate databases. The lists are Excel sheets of UniProtKB, HGNC and ChEMBL live links. These entry points should be able to get to the most from other sources.

Lists that are not UniProtKB Accessions are normalized to these (e.g. mappings of Human Gene Nomenclature Committee (HGNC) Symbols or Entrez Gene IDs (EGID) to UniProtKB). They are then filtered to human and Swiss-Prot (i.e. any TrEMBL entries are removed) and to approved drug targets if this is an option in the original list. In such cases, lists that are hosted thus become transformations, rather than direct facsimiles, of the primary sources. Given such ID cross-mappings are not perfect; absolute correctness cannot be guaranteed. Versions, however, are supplied in good faith and the originals are available in every case.

Date Created

2011-05-13

Last Modified

2017-01-27

Version

2017-01-27

Update Frequency

Irregular

Temporal Coverage

N/A

Spatial Coverage

N/A

Source

John Snow Labs; International Union of Basic and Clinical Pharmacology (IUPHAR) and the British Pharmacological Society (BPS) Guide to PHARMACOLOGY;

Source License URL

Source License Requirements

The Guide to PHARMACOLOGY database is licensed under the Open Data Commons Open Database License (ODbL). Its contents are licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported license.

Source Citation

Southan C, Boppana K, Jagarlapudi SA, Muresan S. Analysis of in vitro bioactivity data extracted from drug discovery literature and patents Ranking 1654 human protein targets by assayed compounds and molecular scaffolds. J Cheminform. 2011 May 13;3(1)14. doi 10.1186/1758-2946-3-14.

Keywords

Drug on Target, Approved and Researched Drug, Drug Discovery and Development, Pharmaceutical Process Development, New Drug Development, List of Proteins, Drug Information, Protein Database, Free Database

Other Titles

Approved and Researched Drug Targets ChEMBL HGNC, Researched Drug Targets Human SwissProt Accessions, Approved Drug Targets Human SwissProt Accessions, Drug Targets ChEMBL HGNC, ChEMBL HGNC Drug Targets Database

NameDescriptionTypeConstraints
Entry_ReportThis subsection of the ‘Entry information’ section provides a mnemonic identifier for a UniProtKB entry, but it is not a stable identifier. Each reviewed entry is assigned a unique entry name upon integration into UniProtKB/Swiss-Prot.stringunique : 1
Protein_NamesThis subsection of the Protein Names on ‘Names and Taxonomy’ section provides a list of names of the human protein, from commonly used to obsolete, to allow unambiguous identification of a protein. This subsection may also include information on the activity of the protein, such as a precise description of the catalytic mechanism of enzymes, or information about individual protein chains or functional domains contained within it, if pertinent.string-
Cross_Reference_HGNC_IdHGNC ID for each protein name.string-
Approved_Gene_SymbolApproved gene symbol for each HGNC cross-referenced protein name.string-
Cross_Reference_HGNC_URLLinks to the HGNC databases cross-referenced in UniProtKB Swiss-Prot > HGNCstring-
Cross_Reference_CHEMBL_IdChEMBL ID for each protein name.string-
Cross_Reference_CHEMBL_URLLinks to the ChEMBL databases cross-referenced in UniProtKB Swiss-Prot > CHeMBLstring-
Entry_ReportProtein_NamesCross_Reference_HGNC_IdApproved_Gene_SymbolCross_Reference_HGNC_URLCross_Reference_CHEMBL_IdCross_Reference_CHEMBL_URL
F7VJQ1Alternative prion protein
HGNC:34447LGALS7Bhttp://www.genenames.org/data/hgnc_data.php?hgnc_id=34447
P04040CatalaseHGNC:1516CAThttp://www.genenames.org/data/hgnc_data.php?hgnc_id=1516
P01350GastrinHGNC:4164GASThttp://www.genenames.org/data/hgnc_data.php?hgnc_id=4164
P04118ColipaseHGNC:2085CLPShttp://www.genenames.org/data/hgnc_data.php?hgnc_id=2085
P29972Aquaporin-1HGNC:633AQP1http://www.genenames.org/data/hgnc_data.php?hgnc_id=633
P41181Aquaporin-2HGNC:634AQP2http://www.genenames.org/data/hgnc_data.php?hgnc_id=634
P51671EotaxinHGNC:10610CCL11http://www.genenames.org/data/hgnc_data.php?hgnc_id=10610
P21333Filamin-AHGNC:3754.FLNAhttp://www.genenames.org/data/hgnc_data.php?hgnc_id=3754
P01258CalcitoninHGNC:1437CALCAhttp://www.genenames.org/data/hgnc_data.php?hgnc_id=1437