F Element Project
Resources & Tools
Faculty Resources
Documentation
Contacts
One of the unusual features of eukaryotic genomes is the discordance between genome size and the complexity of the organism (i.e., the C-value paradox; Eddy, 2012). The smallest chromosome in Drosophila melanogaster is chromosome 4 (also known as the Muller F element), with an estimated size of ~5.2 Mb (Locke and McDermid, 1993). The D. melanogaster F element is generally packaged as heterochromatin: it has a high repeat content, is packaged throughout with HP1a and H3K9me2/3; it exhibits late replication and little or no recombination. However, the banded portion (~1.4 Mb) of this chromosome also contains ~80 protein-coding genes. These F element genes exhibit a range of expression levels similar to genes that reside in euchromatic domains — indicating that F element genes have acquired distinct features that enable them to function in a heterochromatic environment (reviewed in Riddle and Elgin, 2018).
While the F element has maintained a similar size in many other Drosophila species, it is substantially larger in at least four Drosophila species (i.e., D. ananassae, D. bipectinata, D. kikkawai, and D. takahashii). For example, the D. ananassae Muller F element is more than 18.7 Mb in size. This study will examine the factors (e.g., transposon density) that have contributed to the expansion of the F element in these four Drosophila species, and assess the impact of this expansion on gene characteristics (e.g., codon bias, intron size).
GEP students will produce coding region and transcription start site annotations for F element genes in D. ananassae, D. bipectinata, D. kikkawai, and D. takahashii, as well as for genes in a euchromatic reference region derived from the Muller D element. (Euchromatic regions have not expanded in these species.) Comparative analyses using these datasets will provide insights into the evolutionary impacts of changes in chromosome and gene size, and will facilitate the identification of factors that enable genes to function in a heterochromatic environment. We anticipate that this work will move us toward a better understanding of how and why eukaryotic genomes became so large, for mammals, ~1000X larger than that of E. coli.
About
(26 minutes) Slideset
The same material is covered in more detail in the “Annotated Lecture Slides.”
In this lecture for the July 2022 GEP New Member Training, Professor Sarah C. R. Elgin from Washington University in St. Louis describes the research aims for the Drosophila F Element Expansion project. The lecture begins with a brief overview of the C-value paradox, the impact of transposable elements on eukaryotic genomes, and the packaging of DNA into chromatin. The lecture then presents the unusual characteristics of the Muller F Elements in D. melanogaster and in other Drosophila species based on past studies by the Elgin Lab, GEP faculty and students, and other researchers (e.g., the modENCODE project). The presentation concludes with the goals for the comparative analysis of four Drosophila species where the F Elements have undergone different levels of expansion, and a summary of the unusual features (e.g., expansion of coding spans, pseudogene clusters) that have been identified by GEP students as part of their annotations of the D. ananassae F Element.
Project Curriculum
Annotated Lecture Slides
Gene Annotation: Constructing a Defendable Exon/Intron Gene Model
TSS Annotation (Under Development)
Investigation of Motifs
Investigation of Repetitious Elements (Under Development)
- Page Last Updated: September 24, 2025