Schema for Lung Locat - Lung cells 10x method binned by location from Travaglini et al 2020
Database: hg38
Primary Table: lungTravaglini2020Location10x
Data last updated: 2022-05-12
Big Bed File Download: /gbdb/hg38/bbi/lungTravaglini2020/droplet/location.bb
Item Count: 31,492
The data is stored in the binary BigBed format.

Format description: BED6+5 with additional fields for category count and median values, and sample matrix fields
field       example                              description
chrom       chr1                                 Reference sequence chromosome or scaffold
chromStart  166069298                            Start position in chromosome
chromEnd    166167001                            End position in chromosome
name        FAM78B                               Name or ID of item
score       0                                    Score from 0-1000, typically derived from total of median value from all categories
strand      -                                    + or - for strand. Use . if not applicable
name2       NM_001017961                         Alternative name for item
expCount    4                                    Number of categories
expScores   0,0.000884358,0.00068286,0.00218706  Comma separated list of category values

Sample Rows
 
chrom  chromStart  chromEnd   name       score  strand  name2               expCount  expScores
chr1   166069298   166167001  FAM78B     0      -       NM_001017961        4         0,0.000884358,0.00068286,0.00218706
chr1   166154742   166154798  MIR921     0      -       NR_030626           4         0,0,0,0
chr1   166603915   166625236  FMO9P      0      +       NR_002925           4         0,0.000209453,0.000390206,0.000257301
chr1   166839474   166856346  POGK       0      +       NM_017542           4         0.029985,0.102399,0.0994049,0.10768
chr1   166856511   166876264  TADA1      0      -       NM_053053           4         0.0220604,0.0209221,0.018925,0.0258587
chr1   166908186   166975540  ILDR2      0      -       NM_199351           4         0,0.00286253,0.00399961,0.0012865
chr1   166975581   167022212  MAEL       0      +       NM_001286378        4         0,0.000186181,0.000195103,0.000771903
chr1   167052835   167090377  GPA33      0      -       NM_005814           4         0.0177768,0.149317,0.0644815,0.127235
chr1   167094339   167128608  DUSP27     0      +       ENSG00000198842.10  4         0,0.0118457,0.00517023,0.010292
chr1   167175361   167195805  LINC01363  0      -       NR_110811           4         0.000214179,0,0,0.00012865
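
As an illustration of how the fields listed above fit together, the following is a minimal Python sketch that parses one row into a typed record. It assumes the row has been exported from the bigBed file as tab-separated text; the names BarChartRow and parse_bar_chart_row are hypothetical and not part of any UCSC tool. Any fields beyond the nine listed above are ignored.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class BarChartRow:
    """One bar chart item, using the nine fields from the schema table."""
    chrom: str
    chrom_start: int
    chrom_end: int
    name: str
    score: int
    strand: str
    name2: str
    exp_count: int
    exp_scores: List[float]


def parse_bar_chart_row(line: str) -> BarChartRow:
    """Parse a tab-separated row (assumed export format, see lead-in)."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) < 9:
        raise ValueError(f"expected at least 9 fields, got {len(fields)}")
    exp_count = int(fields[7])
    # Skip empty pieces so a trailing comma in expScores is tolerated.
    exp_scores = [float(x) for x in fields[8].split(",") if x]
    if len(exp_scores) != exp_count:
        raise ValueError(
            f"expCount is {exp_count} but expScores has {len(exp_scores)} values"
        )
    return BarChartRow(
        chrom=fields[0],
        chrom_start=int(fields[1]),
        chrom_end=int(fields[2]),
        name=fields[3],
        score=int(fields[4]),
        strand=fields[5],
        name2=fields[6],
        exp_count=exp_count,
        exp_scores=exp_scores,
    )


# The POGK sample row, written with explicit tab separators.
row = parse_bar_chart_row(
    "chr1\t166839474\t166856346\tPOGK\t0\t+\tNM_017542\t4\t"
    "0.029985,0.102399,0.0994049,0.10768"
)
# Index of the category with the highest value for this gene.
top = max(range(row.exp_count), key=row.exp_scores.__getitem__)
print(row.name, top, row.exp_scores[top])  # POGK 3 0.10768
```

The expCount check is a cheap guard against rows whose score list was truncated or mis-split during export.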

Lung Locat (lungTravaglini2020Location10x) Track Description
 

Description

This track displays data from "A molecular cell atlas of the human lung from single-cell RNA sequencing". Using droplet-based and plate-based single-cell RNA sequencing (scRNA-seq), 58 lung cell type populations were identified: 15 epithelial, 9 endothelial, 9 stromal, and 25 immune. This dataset covers ~75,000 human cells across all lung tissue compartments and circulating blood.

This track collection contains 19 bar chart tracks of RNA expression in the human lung, in which cells are grouped by categories such as cell type (Lung Cells, Lung Cells FACS), tissue compartment (Lung Compart, Lung Compart FACS), detailed cell type (Lung Detail, Lung Detail FACS), organ donor (Lung Donor, Lung Donor FACS), halfway detailed cell type (Lung Half Det, Lung Half Det FACS), sample location (Lung Locat, Lung Locat FACS), or organ (Lung Organ, Lung Organ FACS). The default track displayed is Lung Cells.

Display Conventions

The cell types are colored according to the class they belong to, as listed in the following table.

Color Cell classification
fibroblast
immune
muscle
secretory
ciliated
epithelial
endothelial

Cells that fall into multiple classes will be colored by blending the colors associated with those classes. The colors will be purest in the Lung Cells subtrack, where the bars represent relatively pure cell types. They can give an overview of the cell composition within other categories in other subtracks as well.
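
To make the blending idea concrete, here is a small Python sketch that mixes class colors in proportion to how many cells of each class a bar contains. This is only one plausible reading of "blending": the weighted RGB average, the blend function, and the placeholder colors below are all assumptions, and the track's actual colors and blending method are not reproduced here.

```python
from typing import Dict, Tuple

RGB = Tuple[int, int, int]

# Placeholder colors chosen for illustration only; these are NOT the
# colors used by the track.
CLASS_COLORS: Dict[str, RGB] = {
    "immune": (255, 0, 0),
    "epithelial": (0, 0, 255),
}


def blend(weights: Dict[str, float], colors: Dict[str, RGB]) -> RGB:
    """Weighted average of class colors (assumed blending rule)."""
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return tuple(
        round(sum(colors[cls][i] * w for cls, w in weights.items()) / total)
        for i in range(3)
    )


# A bar made of 3 parts immune cells and 1 part epithelial cells.
print(blend({"immune": 3, "epithelial": 1}, CLASS_COLORS))  # (191, 0, 64)
```

Under this rule a bar dominated by one class stays close to that class's pure color, which matches the observation that colors are purest where bars represent relatively pure cell types.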

Data Access

The raw bar chart data can be explored interactively with the Table Browser or the Data Integrator. For automated analysis, the data may be queried from our REST API. Please refer to our mailing list archives for questions, or our Data Access FAQ for more information.
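
For automated access, a request to the REST API might be built as in the Python sketch below. It assumes the API's /getData/track endpoint with genome, track, chrom, start, and end parameters, and it assumes the track name matches the primary table name lungTravaglini2020Location10x; check both against the REST API documentation before relying on them. The helper names are hypothetical, and no particular response layout is assumed.

```python
import json
import urllib.request

API_BASE = "https://api.genome.ucsc.edu"


def track_data_url(genome: str, track: str, chrom: str, start: int, end: int) -> str:
    """Build a getData/track URL (endpoint and parameter names assumed)."""
    params = [
        ("genome", genome),
        ("track", track),
        ("chrom", chrom),
        ("start", str(start)),
        ("end", str(end)),
    ]
    # Parameters are joined with semicolons; values here need no escaping.
    query = ";".join(f"{key}={value}" for key, value in params)
    return f"{API_BASE}/getData/track?{query}"


def fetch_track_data(url: str) -> dict:
    """Download and decode the JSON response. Requires network access."""
    with urllib.request.urlopen(url, timeout=30) as response:
        return json.load(response)


# Region of the FAM78B item from the sample rows.
url = track_data_url(
    "hg38", "lungTravaglini2020Location10x", "chr1", 166069298, 166167001
)
print(url)
# data = fetch_track_data(url)  # uncomment to perform the request
```

Keeping URL construction separate from the download makes it easy to inspect or log the request before sending it.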

Method

Healthy lung tissue and peripheral blood were surgically removed from 2 male patients (ages 46 and 75) and 1 female patient (age 51) undergoing lobectomy for focal lung tumors. Lung tissue was sampled from the bronchi (proximal), bronchiole (medial), and alveolar (distal) regions. Lung samples were dissociated and enriched with magnetic columns before being sorted into epithelial, endothelial/immune, and stromal cell suspensions. Lung and peripheral blood libraries were prepared using the 10x Genomics 3' v2 kit. In parallel, Smart-Seq2 (SS2) cDNA libraries were prepared using the Nextera XT library kit. Both 10x and SS2 libraries were sequenced on a NovaSeq 6000.

The cell/gene matrix and cell-level metadata were downloaded from the UCSC Cell Browser. The UCSC command line utilities matrixClusterColumns, matrixToBarChart, and bedToBigBed were used to transform these into a bar chart format bigBed file that can be visualized. The coloring was done by defining colors for the broad level cell classes and then using another UCSC utility, hcaColorCells, to interpolate the colors across all cell types. The UCSC utilities can be found on our download server.
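
The core idea of collapsing a cell/gene matrix into per-category values can be sketched in Python as follows. This is not the implementation of matrixClusterColumns or matrixToBarChart; the function names, the toy data, and the choice of the mean as the summary statistic are all assumptions made for illustration.

```python
from collections import defaultdict
from statistics import mean
from typing import Callable, Dict, List, Sequence


def summarize_by_category(
    matrix: Dict[str, Sequence[float]],
    cell_ids: Sequence[str],
    cell_category: Dict[str, str],
    categories: Sequence[str],
    stat: Callable[[List[float]], float] = mean,
) -> Dict[str, List[float]]:
    """Reduce each gene's per-cell values to one value per category."""
    # Map each category to the column indexes of its cells.
    columns: Dict[str, List[int]] = defaultdict(list)
    for index, cell in enumerate(cell_ids):
        columns[cell_category[cell]].append(index)

    summary: Dict[str, List[float]] = {}
    for gene, values in matrix.items():
        summary[gene] = [
            stat([values[i] for i in columns[cat]]) if columns[cat] else 0.0
            for cat in categories
        ]
    return summary


def format_exp_scores(scores: Sequence[float]) -> str:
    """Join values into a comma separated list, as in the expScores field."""
    return ",".join(f"{s:g}" for s in scores)


# Toy data: four hypothetical cells from two sample locations.
cells = ["c1", "c2", "c3", "c4"]
location = {"c1": "proximal", "c2": "proximal", "c3": "distal", "c4": "distal"}
matrix = {"GENE_A": [0.0, 2.0, 1.0, 3.0]}

summary = summarize_by_category(matrix, cells, location, ["proximal", "distal"])
print(format_exp_scores(summary["GENE_A"]))  # 1,2
```

Passing the statistic as a parameter keeps the sketch neutral about which summary the real utilities compute.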

Credit

Thanks to Kyle J. Travaglini, Ahmad N. Nabhan, and to the many authors who worked on producing and publishing this data set. The data were integrated into the UCSC Genome Browser by Jim Kent and Brittney Wick, then reviewed by Gerardo Perez. The UCSC work was paid for by the Chan Zuckerberg Initiative.

References

Travaglini KJ, Nabhan AN, Penland L, Sinha R, Gillich A, Sit RV, Chang S, Conley SD, Mori Y, Seita J et al. A molecular cell atlas of the human lung from single-cell RNA sequencing. Nature. 2020 Nov;587(7835):619-625. PMID: 33208946; PMC: PMC7704697