Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Data Access

Broadly Consented Data Downloads

We provide access to broadly consented Fiber-seq data that can be used by researchers for various genomic studies.

Available Datasets

Our broadly consented datasets include the following samples:

  • CHM1: Complete hydatidiform mole cell line
  • CHM13: Complete hydatidiform mole cell line (GRCh38 alignment)
  • CHM13_DSA: CHM13 data aligned against the DSA (T2Tv2.0)
  • GM12878: Lymphoblastoid cell line from Coriell Cell Repository
  • HG002: Human genome reference sample (Ashkenazi Jewish trio son)
  • K562: Chronic myelogenous leukemia cell line

All samples are processed with FIRE (Fiber-seq Inferred Regulatory Elements).

Data Formats

All datasets are provided in standard formats:

  • Fiber-seq CRAM files: Primary data format containing m6A calls and nucleosome positions
  • Processed FIRE outputs: Summary statistics, peaks, and aggregated results

Download Instructions

Browse Available Data

You can explore all available samples using the AWS CLI:

aws s3 ls --no-sign-request --endpoint-url https://s3.kopah.uw.edu s3://stergachis/public/FIRE/broadly-consented/

Download Data

To download specific samples or datasets, use the AWS CLI with the following pattern:

aws s3 sync --no-sign-request --endpoint-url https://s3.kopah.uw.edu s3://stergachis/public/FIRE/broadly-consented/[SAMPLE_NAME]/ ./[LOCAL_DIRECTORY]/

For example, to download CHM13 data:

aws s3 sync --no-sign-request --endpoint-url https://s3.kopah.uw.edu s3://stergachis/public/FIRE/broadly-consented/CHM13/ ./CHM13_data/

Data Use Guidelines

When using our broadly consented data, please:

  • Cite the original studies and data contributors