Schema for Modern Derived - Modern Human Derived, Denisova Ancestral
Database: hg19
Primary Table: dhcHumDerDenAncCcdsFrameshiftCodingHighFreq
Data last updated: 2012-10-02
Big Bed File Download: /gbdb/hg19/dhcHumDerDenAnc/dhcHumDerDenAncCcdsFrameshiftCodingHighFreq.bb
Item Count: 2
The data is stored in the binary BigBed format.

Format description: Human Derived, Denisova Ancestral variants and functional effect predictions from high-coverage Denisova sequencing project
field         example                           description
chrom         chr17                             Reference sequence chromosome
chromStart    43318777                          Start position in chromosome
chromEnd      43318778                          End position in chromosome
name          C/-                               Human allele / Denisova ancestral allele
feature       ENST00000331495                   Ensembl Transcript ID or Regulatory Region ID or ID of TFB profile from JASPAR or TRANSFAC
gene          ENSG00000184922                   Ensembl Gene ID
extra         ENSP=ENSP00000329219; HGNC=FMNL1  Extra info: for coding genes, Ensembl Protein ID and/or HGNC; for Regulatory Motifs, scores & matrix ID
consequence   FRAMESHIFT_CODING                 Variant Effect Predictor (VEP) consequence term
cdnaPosition  1561                              Offset in transcript, if applicable
cdsPosition   1361                              Offset in coding sequence (CDS), if applicable
protPosition  454                               Offset in protein sequence, if applicable
aminoAcids    -                                 Amino acid change, if applicable
codons        -                                 Codon change, if applicable
humanAl       C                                 Modern human fixed (or major) allele on positive strand
denAl         -                                 Denisova (ancestral) allele
chimpAl       -                                 Chimpanzee ancestral allele
gorAl         -                                 Gorilla ancestral allele
orangAl       N/A                               Orangutan ancestral allele
denZyg        HOMO                              Denisova zygosity of ancestral allele (homozygous/heterozygous)
dbSNP         .                                 dbSNP rs ID, if available
tgpFreq       0.96                              1000 Genomes Project frequency of modern human allele
flag          .                                 Flag(s): CpG if in CpG island; RM if in repeat masked region; LowQual if conflicting GATK calls; SysErr if prone to systematic errors
geneStrand    .                                 Gene strand: '+' or '-', if applicable; otherwise '.'

Sample Rows
 
chrom	chromStart	chromEnd	name	feature	gene	extra	consequence	cdnaPosition	cdsPosition	protPosition	aminoAcids	codons	humanAl	denAl	chimpAl	gorAl	orangAl	denZyg	dbSNP	tgpFreq	flag	geneStrand
chr17	43318777	43318778	C/-	ENST00000331495	ENSG00000184922	ENSP=ENSP00000329219; HGNC=FMNL1	FRAMESHIFT_CODING	1561	1361	454	-	-	C	-	-	-	N/A	HOMO	.	0.96	.	.
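
As an illustration of how the 23 schema fields line up with a sample row, the following minimal Python sketch parses one row into a dictionary. It assumes the row is available as tab-separated text (for example after converting the BigBed file to BED text); the helper names parse_row and parse_extra are hypothetical, and the "KEY=value; KEY=value" layout of the extra field is inferred from the single example shown here.

```python
# Field names in schema order, copied from the format description.
FIELDS = [
    "chrom", "chromStart", "chromEnd", "name", "feature", "gene", "extra",
    "consequence", "cdnaPosition", "cdsPosition", "protPosition",
    "aminoAcids", "codons", "humanAl", "denAl", "chimpAl", "gorAl",
    "orangAl", "denZyg", "dbSNP", "tgpFreq", "flag", "geneStrand",
]


def parse_extra(extra):
    """Split 'KEY=value; KEY=value' into a dict (layout is an assumption)."""
    pairs = {}
    for part in extra.split(";"):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            pairs[key.strip()] = value.strip()
    return pairs


def parse_row(line):
    """Parse one tab-separated row into a dict keyed by schema field name."""
    values = line.rstrip("\n").split("\t")
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} fields, got {len(values)}")
    row = dict(zip(FIELDS, values))
    row["chromStart"] = int(row["chromStart"])
    row["chromEnd"] = int(row["chromEnd"])
    # Assumption: '.' marks a missing frequency, as it does for dbSNP and flag.
    row["tgpFreq"] = None if row["tgpFreq"] == "." else float(row["tgpFreq"])
    row["extra"] = parse_extra(row["extra"])
    return row


# The sample row from this page, joined with tabs.
SAMPLE = "\t".join([
    "chr17", "43318777", "43318778", "C/-", "ENST00000331495",
    "ENSG00000184922", "ENSP=ENSP00000329219; HGNC=FMNL1",
    "FRAMESHIFT_CODING", "1561", "1361", "454", "-", "-", "C", "-", "-",
    "-", "N/A", "HOMO", ".", "0.96", ".", ".",
])

row = parse_row(SAMPLE)
print(row["extra"]["HGNC"], row["consequence"], row["tgpFreq"])
```

The position offsets (cdnaPosition, cdsPosition, protPosition) are left as strings in this sketch because the schema marks them "if applicable", so non-numeric placeholders may occur.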

Modern Derived (dhcHumDerDenAnc) Track Description
 

Description

This track shows mutations in the modern human lineage that rose to fixation or near fixation since the split from the last common ancestor with Denisovans, along with predicted functional effects from Ensembl's Variant Effect Predictor (VEP).

Methods

Methods and analysis are described in detail in Note 19 of the supplementary online materials of Meyer et al., 2012.

Whole-genome Enredo-Pecan-Ortheus (EPO) alignments of human, chimpanzee, gorilla and orangutan were combined with modern human genotypes from the 1000 Genomes Project Phase 1 (1000G) to identify sites where the derived allele is fixed (>99.0% frequency in 1000G) or at high frequency (>90.0% frequency in 1000G) in modern humans, while chimpanzee and at least one other great ape (gorilla or orangutan) carry the ancestral allele. To avoid paralogous regions, human and chimpanzee sequences were required to appear in only one EPO alignment block. Some "fixed" sites are in dbSNP; these were separated from fixed sites not in dbSNP, so three frequency categories are displayed: Fixed, Fixed+dbSNP, and High Frequency.
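
The frequency categories and the ancestral-state condition described above can be sketched in Python as follows. This is not the pipeline used to build the track, which works on EPO alignments and 1000G genotypes; it only restates the stated thresholds against the per-site fields in this table. The function names, the strict ">" comparison, and the treatment of "N/A" and "." as missing alleles are assumptions.

```python
FIXED_THRESHOLD = 0.99      # ">99.0% frequency in 1000G"
HIGH_FREQ_THRESHOLD = 0.90  # ">90.0% frequency in 1000G"


def frequency_category(tgp_freq, dbsnp_id="."):
    """Return 'Fixed', 'Fixed+dbSNP', 'High Frequency', or None."""
    # Assumption: '.' (or an empty value) means the site has no dbSNP rs ID.
    in_dbsnp = dbsnp_id not in (".", "", None)
    if tgp_freq > FIXED_THRESHOLD:
        return "Fixed+dbSNP" if in_dbsnp else "Fixed"
    if tgp_freq > HIGH_FREQ_THRESHOLD:
        return "High Frequency"
    return None  # below both thresholds: not part of this track


def ancestral_in_apes(human, chimp, gorilla, orang, missing=("N/A", ".")):
    """True if chimp differs from human and agrees with gorilla or orangutan.

    '-' is treated as a real allele (a gap), not as missing data.
    """
    if chimp in missing or chimp == human:
        return False
    return any(ape == chimp for ape in (gorilla, orang) if ape not in missing)


# Values from the sample row: tgpFreq 0.96, no dbSNP ID, alleles C / - / - / N/A.
print(frequency_category(0.96, "."))
print(ancestral_in_apes("C", "-", "-", "N/A"))
```

With the sample row's values, the site falls in the High Frequency category, which is consistent with the "HighFreq" suffix of the primary table name.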

Several quality filters were applied to Denisova genotypes: a minimum PHRED-scaled genotype likelihood of 40 from the Genome Analysis Toolkit (GATK); a minimum RMS map quality score of 30; coverage of at least 14X and at most 66X; and exclusion of positions identified as systematic errors or deemed low quality because of conflicting genotype calls in a second iteration of GATK (Note 6, supplementary online materials of Meyer et al., 2012).
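
The filters listed above amount to a simple per-site predicate, sketched below in Python. The DenisovaCall record and its attribute names are hypothetical, not the actual pipeline's data model; the thresholds are read as inclusive ("minimum", "at least", "at most"), and the SysErr and LowQual labels are borrowed from the flag field of this table.

```python
from dataclasses import dataclass

MIN_GENOTYPE_QUALITY = 40   # PHRED-scaled, from GATK
MIN_RMS_MAP_QUALITY = 30
MIN_COVERAGE = 14
MAX_COVERAGE = 66
EXCLUDING_FLAGS = frozenset({"SysErr", "LowQual"})


@dataclass
class DenisovaCall:
    """Hypothetical per-site record for a Denisova genotype call."""
    genotype_quality: float
    rms_map_quality: float
    coverage: int
    flags: frozenset = frozenset()


def passes_filters(call):
    """Return True if the call satisfies every filter described in the text."""
    return (
        call.genotype_quality >= MIN_GENOTYPE_QUALITY
        and call.rms_map_quality >= MIN_RMS_MAP_QUALITY
        and MIN_COVERAGE <= call.coverage <= MAX_COVERAGE
        and not (call.flags & EXCLUDING_FLAGS)
    )


print(passes_filters(DenisovaCall(55, 37, 30)))
print(passes_filters(DenisovaCall(55, 37, 30, frozenset({"SysErr"}))))
```

In this sketch, CpG and RM flags do not exclude a site, since the text lists only systematic errors and conflicting calls as grounds for exclusion.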

The derived-in-modern-human sites were intersected with the high-confidence-in-Denisova sites and annotated using VEP to predict effects on protein structure and transcriptional regulation.

Credits

Thanks to the Max Planck Institute for Evolutionary Anthropology for providing the data files used for this track.

References

Meyer M, Kircher M, Gansauge MT, Li H, Racimo F, Mallick S, Schraiber JG, Jay F, Prüfer K, de Filippo C et al. A high-coverage genome sequence from an archaic Denisovan individual. Science. 2012 Oct 12;338(6104):222-6. PMID: 22936568; PMC: PMC3617501; supplementary online materials, Note 19