Others titles
- Chemical Effect Integrated into Pathways
- Pathways Response to Chemical Exposure
- Pathway Involved and Affected Upon Chemical Exposure
- Chemical Pathway Enriched Associations Gene Ontology
- Chemical Pathway Enriched Associations Kegg Pathway
- Chemical Pathway Enriched Associations Enrichment
- Chemical Pathway Enriched Associations Gene Set Enrichment Analysis (GSEA)
Keywords
- Gene Pathway Enrichment
- Gene Pathway Analyses
- Gene Common Pathways
- Pathways Between Genes
- Genome Annotation
- Gene Enrichment
Chemical Pathway Enriched Associations
This dataset contains the results of pathway enrichment analyses performed for gene groups that are in some way affected by a chemical. The enrichment analysis system permits to find common pathways between genes. Each analysis is based in the list of genes related to the specified chemical, resulting in the output of pathways shared between the genes, creating information used in the biological processes that might be involved in the effect of the chemical over the genes or disease mechanism.
Get The Data
- ResearchNon-Commercial, Share-Alike, Attribution Free Forever
- CommercialCommercial Use, Remix & Adapt, White Label Log in to download
Description
Genome annotation is the description of a gene correlated with a structure, function, products, regulation and other enriching information. “The “unit” of genome annotation is the description of an individual gene and its protein (or RNA) product, and the focal point of each such record is the function assigned to the gene product. The record may also include a brief description of the evidence for this assigned function.” (1) One of the elements that can be annotated to a gene is the pathway(s) of biological processes in which the gene participate or generate.
Gene enrichment is a statistical analysis to identify over-represented (overexpressed) genes from a large pool of genes or proteins that share a common characteristic (i.e. microarray); these over-represented genes could be associated with a disease. Enrichment is statistically comparing descriptions to the genes set to understand the biological processes of those genes. The test determines if the pathway is enriched for the genes calculating the statistical significance of each annotation to find significantly enriched pathways associated with the list of genes. By performing this analysis scientists can determine if the genes share a common characteristic by finding the annotated pathways that are more strongly associated (by annotation significance) to the group of genes. The hypergeometric distribution statistical formula is used to find the significance of the pathways.
1. Koonin EV, Galperin MY. Sequence – Evolution – Function: Computational Approaches in Comparative Genomics. Boston: Kluwer Academic; 2003.
About this Dataset
Data Info
Date Created | 2004-01-20 |
---|---|
Last Modified | 2024-02-28 |
Version | 2023-04-27 |
Update Frequency |
Monthly |
Temporal Coverage |
N/A |
Spatial Coverage |
N/A |
Source | John Snow Labs; Comparative Toxicogenomics Database; |
Source License URL | |
Source License Requirements |
Publicly available and free for research application but citation is required. Permission asked for commercial uses |
Source Citation |
Publicly available and free for research application but citation is required. Permission asked for commercial uses |
Keywords | Gene Pathway Enrichment, Gene Pathway Analyses, Gene Common Pathways, Pathways Between Genes, Genome Annotation, Gene Enrichment |
Other Titles | Chemical Effect Integrated into Pathways, Pathways Response to Chemical Exposure, Pathway Involved and Affected Upon Chemical Exposure, Chemical Pathway Enriched Associations Gene Ontology, Chemical Pathway Enriched Associations Kegg Pathway, Chemical Pathway Enriched Associations Enrichment, Chemical Pathway Enriched Associations Gene Set Enrichment Analysis (GSEA) |
Data Fields
Name | Description | Type | Constraints |
---|---|---|---|
Chemical_Name | Name of the chemical associated with the group of genes in the pathway analysis. | string | required : 1 |
Chemical_ID | Identification number of the chemical by the US National Library of Medicine’s Medical Subject Headings (MeSH). MeSH is a controlled vocabulary of thousands of biomedical terms that serves to standardize the terminology used in published texts that belong to life sciences. Each MeSH term has a unique identifier, which can be from 7 to 8-character length. The MeSH unique identifier was changed to 10-character length after November 2013. | string | required : 1 |
Cas_Registry_Number | Unique numeric identifier designated by CAS for the chemical substance. CAS registry number also serves as a reference to find information on the specific chemical. CAS is a division of the American Chemical Society (ACS); the CAS registry collects information of millions of chemical substances identified since the early 1900’s. | string | - |
Pathway_Name | The official name of the pathway involved in the interaction. A pathway is a series of steps in biological processes at a molecular level that leads to a biological change; a pathway is possible due to the activities of multiple gene products and other elements. Pathways involve molecular interactions and reactions pertaining to metabolism, genetic information processing, environmental information processing, cellular processes, organismal systems, human diseases and drug development. Every pathway has a generic or standardized name that helps on obtaining an overview of the pathway’s nature, others have unique but well-known names, for example, the Krebs cycle. | string | required : 1 |
Pathway_ID | KEGG or REACTOME identifier. Alphanumerical code that identifies the pathway in the lookup of the KEGG or the REACTOME databases. KEGG Pathway Database is a collection of manually drawn pathway maps representing the knowledge on the molecular interaction and reaction networks for metabolism, genetic information processing, environmental information processing, cellular processes, organismal systems, human diseases and drug development. | string | required : 1 |
P_Value | Raw p-value. The p-value indicates how significant is the pathway to the group of genes related to the chemical; the closer to zero, the greater the probability that the pathway is shared by these genes due to reasons other than by chance. The p-value is calculated using hypergeometric distribution method, which compares the pathways shared by the genes with the background distribution of the annotation. | string | required : 1 |
Corrected_P_Value | P-value after applying Bonferroni adjustment. Bonferroni correction is made when a group of variables (in this case a group of genes) is being tested, to give a more accurate significance. | string | required : 1 |
Target_Match_Quantity | Number of genes that interact with the chemical and are annotated to the pathway. | integer | level : Ratiorequired : 1 |
Target_Total_Quantity | Total number of genes that interact with the chemical. | integer | level : Ratiorequired : 1 |
Background_Match_Quantity | Total number of genes that are annotated to the pathway. | integer | level : Ratiorequired : 1 |
Background_Total_Quantity | Total number of human genes. | number | level : Ratiorequired : 1 |
Data Preview
Chemical Name | Chemical ID | Cas Registry Number | Pathway Name | Pathway ID | P Value | Corrected P Value | Target Match Quantity | Target Total Quantity | Background Match Quantity | Background Total Quantity |
10074-G5 | C534883 | Cyclin A:Cdk2-associated events at S phase entry | REACT:R-HSA-69656 | 1.4499999999999997e-05 | 0.00116 | 2 | 4 | 71 | 45326 | |
10074-G5 | C534883 | Cyclin E associated events during G1/S transition | REACT:R-HSA-69202 | 1.49e-05 | 0.00119 | 2 | 4 | 72 | 45326 | |
10074-G5 | C534883 | G1/S Transition | REACT:R-HSA-69206 | 4.09e-05 | 0.0032700000000000003 | 2 | 4 | 119 | 45326 | |
10074-G5 | C534883 | Mitotic G1-G1/S phases | REACT:R-HSA-453279 | 5.91e-05 | 0.00472 | 2 | 4 | 143 | 45326 | |
10074-G5 | C534883 | Pathways in cancer | KEGG:hsa05200 | 2.61e-06 | 0.000209 | 3 | 4 | 395 | 45326 | |
10074-G5 | C534883 | Small cell lung cancer | KEGG:hsa05222 | 2.03e-05 | 0.0016300000000000002 | 2 | 4 | 84 | 45326 | |
10074-G5 | C534883 | S Phase | REACT:R-HSA-69242 | 4.88e-05 | 0.0039 | 2 | 4 | 130 | 45326 | |
10074-G5 | C534883 | Transcriptional misregulation in cancer | KEGG:hsa05202 | 9.26e-05 | 0.007409999999999998 | 2 | 4 | 179 | 45326 | |
10,10-bis(4-pyridinylmethyl)-9(10H)-anthracenone | C112297 | Cholinergic synapse | KEGG:hsa04725 | 3.53e-11 | 2.93e-09 | 4 | 4 | 112 | 45326 | |
10,10-bis(4-pyridinylmethyl)-9(10H)-anthracenone | C112297 | Interaction between L1 and Ankyrins | REACT:R-HSA-445095 | 2.54e-06 | 0.000211 | 2 | 4 | 30 | 45326 |