This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
|
4dn:phase1:working_groups:omics_data_standards:minutes-06-13-2016 [2019/02/14 15:49] rcalandrelli created |
4dn:phase1:working_groups:omics_data_standards:minutes-06-13-2016 [2025/04/22 16:21] (current) |
||
|---|---|---|---|
| Line 5: | Line 5: | ||
| ===== Experiment Procedures, meta-data ===== | ===== Experiment Procedures, meta-data ===== | ||
| - | * | + | * Finalizing the experiment protocol. Derive the Hi-C standard from existing, published in situ Hi-C protocols. Determine what is variable and what should be unified and precise documentation of such variables |
| - | + | * Digestion time for DNAs may vary, depending on the exact type of DNAs (e.g. mitochondrial vs genomic). | |
| - | Finalizing the experiment protocol. Derive the Hi-C standard from existing, published in situ Hi-C protocols\\ Determine what is variable and what should be unified and precise documentation of such variables | + | * in-situ is a fairly standard protocol that different labs are using. Two variations were brought up as being worthwhile improvements: |
| - | + | ||
| - | * | + | |
| - | + | ||
| - | Digestion time for DNAs may vary, depending on the exact type of DNAs (e.g. mitochondrial vs genomic). | + | |
| - | + | ||
| - | * | + | |
| - | + | ||
| - | in-situ is a fairly standard protocol that different labs are using. Two variations were brought up as being worthwhile improvements: | + | |
| * Job Dekker: Dangling sequence removal is an important step that should be added to the standard protocol to improve library quality. | * Job Dekker: Dangling sequence removal is an important step that should be added to the standard protocol to improve library quality. | ||
| * ?: For temperature during overhang filling, 23C is better than 37 C (enzyme dependent?) | * ?: For temperature during overhang filling, 23C is better than 37 C (enzyme dependent?) | ||
| - | * | + | * Other similar procedures that may need a different protocol. May put different categories (like the narrow and broad for ChIP-seq in ENCODE) and may be based on Hi-C protocol before |
| - | + | * Micro-C procedure | |
| - | Other similar procedures that may need a different protocol\\ May put different categories (like the narrow and broad for ChIP-seq in ENCODE) and may be based on Hi-C protocol before | + | * DNase Hi-C (a different protocol from the ones with restriction enzymes) |
| - | + | * How will the protocol be encoded in the meta-data structure? | |
| - | * | + | * Trade-off between tracking information in searchable fields vs. ease of logging. |
| - | + | * Follow ENCODE's example: standard protocols are provided as pdf, variations become meta-data fields. | |
| - | Micro-C procedure | + | * Medata for ENCODE ChIP-seq: |
| - | + | ||
| - | * | + | |
| - | + | ||
| - | DNase Hi-C (a different protocol from the ones with restriction enzymes) | + | |
| - | + | ||
| - | * | + | |
| - | + | ||
| - | How will the protocol be encoded in the meta-data structure? | + | |
| - | + | ||
| - | * | + | |
| - | + | ||
| - | Trade-off between tracking information in searchable fields vs. ease of logging. | + | |
| - | + | ||
| - | * | + | |
| - | + | ||
| - | Follow ENCODE's example: standard protocols are provided as pdf, variations become meta-data fields. | + | |
| - | + | ||
| - | * | + | |
| - | + | ||
| - | Medata for ENCODE ChIP-seq: | + | |
| * Biosample: description of cell, condition, treatment. We may need to expand to describe genome editing. | * Biosample: description of cell, condition, treatment. We may need to expand to describe genome editing. | ||
| * Antibody: We will not need. | * Antibody: We will not need. | ||
| * Protocol: fragmentation method, fragment length. handful others(?) | * Protocol: fragmentation method, fragment length. handful others(?) | ||
| * For Hi-C, we will need to add a field for restriction enzyme (information about crosslinking time, temperature etc, may be in pdf or have separate fields, probably pdf is fine.) | * For Hi-C, we will need to add a field for restriction enzyme (information about crosslinking time, temperature etc, may be in pdf or have separate fields, probably pdf is fine.) | ||
| - | * | + | * Do we need to keep metadata for Hi-C QC prior to sequencing. |
| - | + | ||
| - | Do we need to keep metadata for Hi-C QC prior to sequencing. | + | |
| * e.g. like antibody validation in ENCODE | * e.g. like antibody validation in ENCODE | ||
| * We can't think of examples for Hi-C that need to be reported. | * We can't think of examples for Hi-C that need to be reported. | ||
| * Assume data prep passed basic wetlab QC in the prep stages. (there are natural checkpoints, go/no-go. All data submitted to DCIC is go/go/go. | * Assume data prep passed basic wetlab QC in the prep stages. (there are natural checkpoints, go/no-go. All data submitted to DCIC is go/go/go. | ||
| - | * | + | * computational QC data (e.g. read depth, PCR duplicate proportion, reproducibility, intra/inter) |
| - | + | ||
| - | computational QC data (e.g. read depth, PCR duplicate proportion, reproducibility, intra/inter) | + | |
| * can be determined after data is taken in by DCIC. | * can be determined after data is taken in by DCIC. | ||
| - | * | + | * Cross consortium data sharing/merging with ENCODE |
| - | + | * DCIC: this will naturally be easy since we are starting with their metadata and database model. | |
| - | Cross consortium data sharing/merging with ENCODE | + | |
| - | + | ||
| - | * | + | |
| - | + | ||
| - | DCIC: this will naturally be easy since we are starting with their metadata and database model. | + | |
| ===== Data formats ===== | ===== Data formats ===== | ||