User Tools

Site Tools


4dn:phase1:working_groups:omics_data_standards:minutes-for-2-22

This is an old revision of the document!


Omics Data Standards WG - Minutes for 2-22

Minutes recorded by Erez Lieberman Aiden.

Apologies to all if I did not do justice to everything that was said!

Minutes:

1) Uniform processing of data

Data Serialization

Everyone agrees that there should be a uniform pipeline

It may take a while for a uniform pipeline to emerge

then…BAM files

then…Final Format for Contact Maps, Annotations

2) Separating the various information items and stages

some things are harder to analyze than others

3) Post-standardization: everything goes through uniform pipeline.

4) Pre-standardization:

Option 1; do what encode did: Share processed data alone at center's discretion

Suggestion: Open source code for all feature annotations

How do you code compatibility with various hardware setups?

DCIC: Working with Amazon to have standardized hardware.

5) When to release data to public?

Put data in public when mss available?

Share data with consortium when mss submitted?

Share data with public after QC is available?

What if quality standards change over time?

Version control on quality standards

What should be the nature of the embargo on data?

Several people feel that the data sharing should be immediate and open

But there is a need for acknowledgment

Still, data sharing is in the interest of data sharing

6) We need a session on file formats

4dn/phase1/working_groups/omics_data_standards/minutes-for-2-22.1600883232.txt.gz · Last modified: 2025/04/22 16:21 (external edit)