Loading…

Spatial distribution of predicted transcription factor binding sites in Drosophila ChIP peaks

In the development of the Drosophila embryo, gene expression is directed by the sequence-specific interactions of a large network of protein transcription factors (TFs) and DNA cis-regulatory binding sites. Once the identity of the typically 8–10bp binding sites for any given TF has been determined...

Full description

Saved in:
Bibliographic Details
Published in:Mechanisms of development 2016-08, Vol.141, p.51-61
Main Authors: Pettie, Kade P., Dresch, Jacqueline M., Drewell, Robert A.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In the development of the Drosophila embryo, gene expression is directed by the sequence-specific interactions of a large network of protein transcription factors (TFs) and DNA cis-regulatory binding sites. Once the identity of the typically 8–10bp binding sites for any given TF has been determined by one of several experimental procedures, the sequences can be represented in a position weight matrix (PWM) and used to predict the location of additional TF binding sites elsewhere in the genome. Often, alignments of large (>200bp) genomic fragments that have been experimentally determined to bind the TF of interest in Chromatin Immunoprecipitation (ChIP) studies are trimmed under the assumption that the majority of the binding sites are located near the center of all the aligned fragments. In this study, ChIP/chip datasets are analyzed using the corresponding PWMs for the well-studied TFs; CAUDAL, HUNCHBACK, KNIRPS and KRUPPEL, to determine the distribution of predicted binding sites. All four TFs are critical regulators of gene expression along the anterio-posterior axis in early Drosophila development. For all four TFs, the ChIP peaks contain multiple binding sites that are broadly distributed across the genomic region represented by the peak, regardless of the prediction stringency criteria used. This result suggests that ChIP peak trimming may exclude functional binding sites from subsequent analyses. •ChIP datasets are analyzed for CAUDAL, HUNCHBACK, KNIRPS and KRUPPEL to determine the distribution of predicted binding sites.•In all four cases the ChIP peaks contain multiple binding sites that are broadly distributed.•Trimming of ChIP peaks may result in the exclusion of functional binding sites from subsequent analyses.
ISSN:0925-4773
1872-6356
DOI:10.1016/j.mod.2016.06.001