Schema for GWAS Catalog - NHGRI-EBI Catalog of Published Genome-Wide Association Studies
|
|
Database: hg38 Primary Table: gwasCatalog Row Count: 42,077
Format description: NHGRI's collection of Genome-Wide Association Studies SNPs
field | example | SQL type | info | description |
bin | 591 | smallint(5) unsigned | range | Indexing field to speed chromosome range queries. |
chrom | chr1 | varchar(255) | values | Reference sequence chromosome or scaffold |
chromStart | 845016 | int(10) unsigned | range | Start position in chromosome |
chromEnd | 845017 | int(10) unsigned | range | End position in chromosome |
name | rs141175086 | varchar(255) | values | ID of SNP associated with trait |
pubMedID | 26955885 | int(10) unsigned | range | PubMed ID of publication of the study |
author | Lane JM | varchar(255) | values | First author of publication |
pubDate | 2016-03-09 | varchar(255) | values | Date of publication |
journal | Nat Commun | varchar(255) | values | Journal of publication |
title | Genome-wide association ana... | varchar(1024) | values | Title of publication |
trait | Morning vs. evening chronotype | varchar(255) | values | Disease or trait assessed in study |
initSample | 8,724 European ancestry eve... | longblob | | Initial sample size |
replSample | NA | longblob | | Replication sample size |
region | 1p36.33 | varchar(255) | values | Chromosome band / region of SNP |
genes | LINC01128 | longblob | | Reported Gene(s) |
riskAllele | rs141175086-C | varchar(255) | values | Strongest SNP-Risk Allele |
riskAlFreq | 0.998 | varchar(255) | values | Risk Allele Frequency |
pValue | 4E-8 | varchar(255) | values | p-Value |
pValueDesc | | varchar(255) | values | p-Value Description |
orOrBeta | 2.16 | varchar(255) | values | Odds ratio or beta |
ci95 | [1.34-3.49] | varchar(255) | values | 95% Confidence Interval |
platform | Affymetrix [73355677] (impu... | varchar(255) | values | Platform and [SNPs passing QC] |
cnv | N | enum('Y', 'N') | values | Y if Copy Number Variant |
|
| |
|
|
Sample Rows
|
|
bin | chrom | chromStart | chromEnd | name | pubMedID | author | pubDate | journal | title | trait | initSample | replSample | region | genes | riskAllele | riskAlFreq | pValue | pValueDesc | orOrBeta | ci95 | platform | cnv |
---|
591 | chr1 | 845016 | 845017 | rs141175086 | 26955885 | Lane JM | 2016-03-09 | Nat Commun | Genome-wide association analysis identifies novel loci for chronotype in 100,420 individuals from the UK Biobank. | Morning vs. evening chronotype | 8,724 European ancestry evening chronotype individuals, 26,948 European ancestry morning chronotype individuals | NA | 1p36.33 | LINC01128 | rs141175086-C | 0.998 | 4E-8 | | 2.16 | [1.34-3.49] | Affymetrix [73355677] (imputed) | N |
593 | chr1 | 1070425 | 1070426 | rs3934834 | 19851299 | Johansson A | 2009-10-22 | Obesity (Silver Spring) | Linkage and genome-wide association analysis of obesity-related phenotypes: association of weight with the MGAT1 gene. | Body mass index | 1,079 South Tyrolian individuals, 790 Dutch founder individuals, 2,060 European ancestry individuals | NA | 1p36.33 | NR | rs3934834-G | 0.80 | 6E-7 | (females + males) | 0.11 | [NR] kg increase | Illumina [318237] | N |
593 | chr1 | 1143817 | 1143818 | rs11260603 | 23382691 | Lauc G | 2013-01-31 | PLoS Genet | Loci associated with N-glycosylation of human immunoglobulin G show pleiotropy with autoimmune diseases and haematological cance ... | IgG glycosylation | 2,247 European ancestry individuals | NA | 1p36.33 | NR | rs11260603-C | 0.217058243435692 | 4E-7 | (IGP7) | 0.2608 | [0.16-0.36] unit increase | Illumina [~ 2500000] (imputed) | N |
595 | chr1 | 1312113 | 1312114 | rs12103 | 26192919 | Liu JZ | 2015-07-20 | Nat Genet | Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across pop ... | Ulcerative colitis | 6,968 European ancestry cases, 20,464 European ancestry controls | 10,679 European ancestry cases, 26,715 European ancestry controls, 397 Iranian ancestry cases, 342 Iranian ancestry controls, 1, ... | 1p36.33 | NR | rs12103-A | 0.1834 | 1E-9 | (EA) | 1.1046834 | [1.07-1.14] | Affymetrix, Illumina [~ 9000000] (imputed) | N |
595 | chr1 | 1312113 | 1312114 | rs12103 | 26192919 | Liu JZ | 2015-07-20 | Nat Genet | Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across pop ... | Inflammatory bowel disease | 12,882 European ancestry cases, 21,770 European ancestry controls | 25,273 European ancestry cases, 26,715 European ancestry controls, 548 Iranian ancestry cases, 342 Iranian ancestry control, 1,4 ... | 1p36.33 | NR | rs12103-A | 0.1834 | 3E-11 | (EA) | 1.0906081 | [1.06-1.12] | Affymetrix, Illumina [~ 9000000] (imputed) | N |
595 | chr1 | 1312113 | 1312114 | rs12103 | 26192919 | Liu JZ | 2015-07-20 | Nat Genet | Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across pop ... | Crohn's disease | 5,956 European ancestry cases, 14,927 European ancestry controls | 14,594 European ancestry cases, 26,715 European ancestry controls, 151 Iranian ancestry cases, 342 Iranian ancestry controls, 18 ... | 1p36.33 | NR | rs12103-A | 0.1834 | 6E-7 | (EA) | 1.082054 | [1.05-1.11] | Affymetrix, Illumina [~ 9000000] (imputed) | N |
595 | chr1 | 1312113 | 1312114 | rs12103 | 23128233 | Jostins L | 2012-11-01 | Nature | Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. | Inflammatory bowel disease | 12,924 European ancestry cases, 21,442 European ancestry controls | 25,683 European ancestry cases, 17,015 European ancestry controls | 1p36.33 | TNFRSF18, TNFRSF4 | rs12103-A | 0.182 | 8E-13 | | 1.099 | [1.059-1.139] | Affymetrix, Illumina [1230000] (imputed) | N |
595 | chr1 | 1351674 | 1351675 | rs186507655 | 27197191 | Fehringer G | 2016-04-20 | Cancer Res | Cross-cancer genome-wide analysis of lung, ovary, breast, prostate and colorectal cancer reveals novel pleiotropic associations. | Cancer (pleiotropy) | 5,020 European ancestry lung cancer cases, 3,718 European ancestry lung adenocarcinoma cases, 3,422 European ancestry lung squam ... | NA | 1p36.33 | Intergenic | rs186507655-A | NR | 5E-10 | (subset analysis) | | | Affymetrix, Illumina [9916564] (imputed) | N |
598 | chr1 | 1791591 | 1791592 | rs9660180 | 25673413 | Locke AE | 2015-02-12 | Nature | Genetic studies of body mass index yield new insights for obesity biology. | Body mass index | up to 104,666 European ancestry male individuals, up to 132,115 European ancestry female individuals, 370 African American male ... | up to 48,274 European ancestry male individuals, up to 39,864 European ancestry female individuals, 2,441 African American or Af ... | 1p36.33 | GNB1 | rs9660180-A | 0.499 | 8E-7 | (EA) | 0.017 | [0.01-0.024] kg/m2 increase | Affymetrix, Illumina [2550021] | N |
598 | chr1 | 1791591 | 1791592 | rs9660180 | 25673413 | Locke AE | 2015-02-12 | Nature | Genetic studies of body mass index yield new insights for obesity biology. | Body mass index | up to 104,666 European ancestry male individuals, up to 132,115 European ancestry female individuals, 370 African American male ... | up to 48,274 European ancestry male individuals, up to 39,864 European ancestry female individuals, 2,441 African American or Af ... | 1p36.33 | GNB1 | rs9660180-A | 0.491 | 6E-7 | | 0.016 | [0.0099-0.0229] kg/m2 increase | Affymetrix, Illumina [2550021] | N |
|
Note: all start coordinates in our database are 0-based, not
1-based. See explanation
here.
| |
|
|
GWAS Catalog (gwasCatalog) Track Description
|
|
Description
This track displays single nucleotide polymorphisms (SNPs) identified by published
Genome-Wide Association Studies (GWAS), collected in the
NHGRI-EBI GWAS Catalog
published jointly by the National
Human Genome Research Institute (NHGRI) and the European Bioinformatics Institute (EMBL-EBI).
Some abbreviations
are used above.
From http://www.ebi.ac.uk/gwas/docs/about:
The Catalog is a quality controlled, manually curated, literature-derived
collection of all published genome-wide association studies assaying at least
100,000 SNPs and all SNP-trait associations with p-values < 1.0 x
10-5 (Hindorff et al., 2009). For more details about the Catalog
curation process and data extraction procedures, please refer to the
Methods page.
Methods
From http://www.ebi.ac.uk/gwas/docs/methods:
The GWAS Catalog data is extracted from the literature. Extracted information
includes publication information, study cohort information such as cohort size,
country of recruitment and subject ethnicity, and SNP-disease association
information including SNP identifier (i.e. RSID), p-value, gene and risk
allele. Each study is also assigned a trait that best represents the phenotype
under investigation. When multiple traits are analysed in the same study either
multiple entries are created, or individual SNPs are annotated with their
specific traits. Traits are used both to query and visualise the data in the
Catalog's web form and diagram-based query interfaces.
Data extraction and curation for the GWAS Catalog is an expert activity; each
step is performed by scientists supported by a web-based tracking and data
entry system which allows multiple curators to search, annotate, verify and
publish the Catalog data. Papers that qualify for inclusion in the Catalog are
identified through weekly PubMed searches. They then undergo two levels of
curation. First all data, including association information for SNPs, traits
and general information about the study, are extracted by one curator. A second
curator then performs an additional round of curation to double-check the
accuracy and consistency of all the information. Finally, an automated pipeline
performs validation of the extracted data, see the
Quality control and SNP mapping section below for more
details. This information is then used for queries and in the production of the
diagram.
References
Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, Manolio TA.
Potential etiologic and functional implications of genome-wide association loci for human diseases
and traits.
Proc Natl Acad Sci U S A. 2009 Jun 9;106(23):9362-7.
PMID: 19474294; PMC: PMC2687147
| |
|
|
|