Agenda:
Discussion an updated ChIA-PET protocol and guidelineUpdated ChIA-PET protocol has been forwarded to WG members in email. QC steps of ChIA-PET
About the number of QCs needed in all the intermediate steps and QC data included in data submission
Antibody coating: coat the beads with nonspecific antibodies to cover all the spaces left. While tagmentation will generate smaller tags while streptavidin pull-down should filter the tags so there won’t be lots of extra tags coming from the longer fragments being further fragmented. Two steps sequencing
The data processed pipeline can generate data for three types of features: binding peaks, interaction clusters and singletons
Signal and noise in PET clusters
Interspecies experiments that include fly and human data: based on the sequence data, the mix or hybrid reflect interchromatin complex ligations. Higher dilution is less the noise. In this particular an 1:1 ratio was prepared. How uniquely maps are human or fly? How many reads can be random ligations? If this is 1:1, then 50% noise rate will be expected from purely random ligations. Random ligation from another species then should be used as quality control metrics in this experiment. However, mixing 1:1 control species with sample may cost too much and currently Dr. Ruan’s lab is exploring 10:1 mixture of human sample with fly controls. Encode-4 data processing pipeline: Data generated> data mapping> pet file was ready for downstream analysis, then PET data can be fed into Juicebox for visualization, be classified into inter-ligation PETs, get through peak-calling process or samtools pile-up analysis. Intra PETs are more likely to have lower coverage and inter PETs are more likely to have higher coverage All the mappable PETs and self-/inter-ligation PETs are unbinned data and can be available to DCIC. The experiment and data analysis protocols can be approved separately and when discussing data analysis protocols it is possible to relate to HiC analysis (file format, mapping algorithm, conversion to Juicer format, etc.) Data Standards group should also release the standard data format soon. Since Dr. Ruan’s group is the primary contributor, DCIC may need to discuss with him to incorporate much of what’s in the workflow. Most of the ChIA-PET pipeline is already using common methods from the other pipelines (mapping, peak calling and others). Conversion between the contact file format from Dr. Ruan’s group and DCIC (the one readable by Juicebox and Cooler) may needs to be done. ChIA-PET data visualization: SNP-based validation of CTCF binding and looping Allele specific analysis is very important for the 4DN. This topic need to bring up soon to discuss. |