Schema for UMass ChIP-Seq - U Mass ChIP-Seq
  Database: danRer7    Primary Table: UMassME3    Row Count: 43,002   Data last updated: 2011-04-19
Format description: Wiggle track values to display as y-values (first 6 fields are bed6)
On download server: MariaDB table dump directory
fieldexampleSQL type info description
bin 585smallint(5) unsigned range Indexing field to speed chromosome range queries.
chrom Zv9_NA1varchar(255) values Reference sequence chromosome or scaffold
chromStart 3880int(10) unsigned range Start position in chromosome
chromEnd 4820int(10) unsigned range End position in chromosome
name Zv9_NA1.0varchar(255) values Name of item
span 10int(10) unsigned range each value spans this many bases
count 94int(10) unsigned range number of values in this block
offset 0int(10) unsigned range offset in File to fetch data
file /gbdb/danRer7/wib/UMassME3.wibvarchar(255) values path name to data file, one byte per value
lowerLimit 10double range lowest data value in this block
dataRange 9double range lowerLimit + dataRange = upperLimit
validCount 44int(10) unsigned range number of valid data values in this block
sumData 605double range sum of the data points, for average and stddev calc
sumSquares 8707double range sum of data points squared, for stddev calc

Sample Rows
 
binchromchromStartchromEndnamespancountoffsetfilelowerLimitdataRangevalidCountsumDatasumSquares
585Zv9_NA138804820Zv9_NA1.010940/gbdb/danRer7/wib/UMassME3.wib109446058707
585Zv9_NA11998021520Zv9_NA1.11015494/gbdb/danRer7/wib/UMassME3.wib106414785662
585Zv9_NA100722015510Zv9_NA100.210829248/gbdb/danRer7/wib/UMassME3.wib101142156135391087
585Zv9_NA1001856024820Zv9_NA100.3106261077/gbdb/danRer7/wib/UMassME3.wib101698148424222
585Zv9_NA101590960Zv9_NA101.410371703/gbdb/danRer7/wib/UMassME3.wib104293373949
585Zv9_NA1011296020170Zv9_NA101.5107211740/gbdb/danRer7/wib/UMassME3.wib103192102336
585Zv9_NA1013886045470Zv9_NA101.6106612461/gbdb/danRer7/wib/UMassME3.wib104313534065
585Zv9_NA1034992050710Zv9_NA103.710793122/gbdb/danRer7/wib/UMassME3.wib102272843002
585Zv9_NA1051190011930Zv9_NA105.81033201/gbdb/danRer7/wib/UMassME3.wib100330300
585Zv9_NA107302013190Zv9_NA107.91010173204/gbdb/danRer7/wib/UMassME3.wib109526829332

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

UMass ChIP-Seq (UMassChIP-SeqComposite) Track Description
 

Description and Methods

H3K4me1 and H3K4me3 binding at 24 hours post fertilization in the zebrafish embryo.

Methylation of lysine 4 on histone H3 can serve as a landmark of cis regulatory elements in the genome. In particular, tri-methylation of lysine 4 (H3K4me3) usually marks promoters of active genes, while mono-methylation (H3K4me1) can be more widely distributed and associates with multiple regulatory elements, including enhancers and insulators. Given the increasing importance and value of the zebrafish as a model system, an initial assay of H3K4me1 and H3K4me3 binding in the zebrafish embryo at 24 hours post fertilization was undertaken, a time at which most of the organs systems have begun to develop. These data provide a valuable resource for preliminary identification of cis regulatory features in the zebrafish embryonic genome.

Display Conventions and Configuration

  • Signal - Density coverage of sequence tags for a given sample.
  • Peaks - Genome coordinates of highest tag density within a hotspot; identified using MACS.
  • Hotspots - Regions of enrichment for binding over input alone (p<1x10-5); identified using MACS.

Methods

For each ChIP, 200 wild-type zebrafish embryos staged at 24 hours post fertilization (hpf) were dissociated into single cell suspensions. Cells were cross-linked with a 1% formaldehyde/PBS solution for 10 minutes at room temperature. After adding glycine, the samples were centrifuged and pellets resuspended in lysis buffer. The samples were sonicated using a Microsonicator (Cole and Palmer Instruments) to a size distribution between 200-1,000 bp. Immunoprecipitation was performed using anti-H3K4me1 (ab8895, Abcam) or anti-H3K4me3 antibody (ab8580, Abcam) overnight at 4°C. Both antibody-treated and untreated samples were incubated with protein A agarose beads for one hour at 4°C and subsequently washed. Immunoprecipitated DNAs were purified using a PCR purification kit. DNA, from both antibody and input samples, was polished and A-tailed followed by ligation to Illumina adapters. Adaptor-ligated fragments were gel-purified and amplified by PCR, followed by a final gel purification. Single end 36 or 72 bp reads from adaptor-ligated libraries were obtained by deep sequencing on Illumina Genome Analyzers operated by the UMass Center for AIDS Research Deep Sequencing Core.

Sequence tags were mapped to the zebrafish genome (Zv7) using Bowtie, and only tags with unique genomic coordinates and less than 2 bp of mismatch were included in subsequent analyses. The "Signal" track was generated by extending each uniquely mapped sequence tag to 250 bp in length, followed by coverage calculation for each position in a 250 bp moving window. To minimize file size of these annotation tracks, only genomic positions with at least 10 sequence tags were used to generate wiggle files. Regions of H3K4me1 or H3K4me3 enrichment over input (referred to hereafter as "hotspots") and "peaks" of binding within these enriched regions were determined using the Model-based Analysis of ChIP-Seq algorithm (MACS, version 1.4; (Zhang, et al. 2008) with the default settings and the following modified parameters: -f BED -w --single-wig --gsize 1700000000 --tsize 36 --bw 150).

Verification

Several of the identified peaks have been functionally validated using in vivo reporter assays in zebrafish.

Credits

These data were generated by the Lawson Lab. Aaron Aday generated the ChIP-Seq libraries. Computational analysis was performed by Abirami Lakshmanan and Dr. Julie Zhu.

Contact: Nathan Lawson (Nathan.lawson@um-null-assmed.edu)

References

Aday AW, Zhu LJ, Lakshmanan A, Wang J, Lawson ND. Identification of cis regulatory features in the embryonic zebrafish genome through large-scale profiling of H3K4me1 and H3K4me3 binding sites. Dev Biol. 2011 Sep 15;357(2):450-62. PMID: 21435340; PMC: PMC3273848

Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 2008;9(9):R137. PMID: 18798982; PMC: PMC2592715