Creating the DSPR
The Recombinant Inbred Lines
The recombinant inbred lines (RILs) are derived from two synthetic populations. Each synthetic populations is derived from a set of 8 inbred P-free lines representing a worldwide sample. For detailed information about the founder lines used to create the two synthetic populations (A and B), see the table below. The two synthetic populations share one line (AB8: Samarkand ry506; obtained from Trudy Mackay, NC State) in common, thus the two populations are constructed from 15 lines overall. The two synthetic populations were created in Fall 2006. In the first generation, the 8 founder lines were crossed in a round-robin mating scheme to bring all founder alleles together. The populations were subsequently maintained as large, random-mating cohorts for a total of 50 generations, at which point the RILs were created. A set of 750 RILs was created from each synthetic population by 25 generations of full-sib mating. This design creates a panel of RILs in which the genome of each RIL is a fine-scale mosaic of segments from the 8 founder lines.
The Founder Lines
Founder | Vial Code | Stock Center | Stock Number | Stock Name | Details | ||
A1 | 1 | Bloomington | 1 | Canton-S | i18+ | ||
A2 | 3841 | Bloomington | 3841 | BOG1 | i18+ | ||
A3 | 3844 | Bloomington | 3844 | BS1 | i18+ | ||
A4 | 3852 | Bloomington | 3852 | KSA2 | i18+ | ||
A5 | 3875 | Bloomington | 3875 | VAG1 | i18+ | ||
A6 | 3886 | Bloomington | 3886 | wild5B | i18+ | ||
A7 | T.7 | Tucson | 14021-0231.7 | n/a | i18+ | ||
AB8 | Sam | TFC Mackay | n/a | Sam; ry506 | + | ||
B1 | 3839 | Bloomington | 3839 | BER1 | i18+, tetracyline treated | ||
B2 | 3846 | Bloomington | 3846 | CA1 | i18+ | ||
B3 | 3864 | Bloomington | 3864 | QI2 | i9+, tetracyline treated | ||
B4 | 3870 | Bloomington | 3870 | RVC3 | i3+ | ||
B5 | T.0 | Tucson | 14021-0231.0 | n/a | i18+ | ||
B6 | T.1 | Tucson | 14021-0231.1 | n/a | i18+ | ||
B7 | T.4 | Tucson | 14021-0231.4 | n/a | i18+ | ||
A, B | Refers to which synthetic population(s) the line founded. | ||||||
i3, i9, i18 | Refers to the # of generations of inbreeding on arrival from the stock center. | ||||||
+ | Refers to the fact that line was re-initiationed (post-inbreeding) from 4 males and 4 vigin females. Each of these 8 founding flies was genotyped for strain-specific, diagnostic SNP to check stock integrity. | ||||||
tetracyline | Stock harbored Wolbachia, which was cleared with two generations on tetracyline-containing food. |
Whole Genome Re-sequencing of Founder Lines
Using Illumina paired-end 54bp sequencing, we generated 50X genome sequence coverage for each founder, aligning the millions of short reads to the D. melanogaster reference sequence to create an alignment of the 15 founders.
Uncovering the mosaic structure of the RILS
We used 96-plex RAD (restriction-site associated DNA markers) to discover and genotype ~10,000 SNP markers in the panel of RILs. This set of linked SNP markers allows us to uncover the mosaic founder structure of each RIL using a Hidden Markov Model (HMM) we have developed. The HMM uses this set of markers to determine the probability that each segment is derived from each of the 8 founder lines. While a single biallelic SNP can only distinguish between two allelic classes, a set of linked SNPs can distinguish between all 8 founders. This fact can be visualized in the example to the right where the pattern of markers (black and gray representing different alleles) allows identification of the founder ancestry. As SNP density increases, the level of genetic information also increases. The high SNP density used in the DSPR allows us to distinguish among the 8 founder lines with a high degree of confidence. The fact that we can infer the founder ancestry of each RIL chromosomal segment and have the full genome sequences of the founders, means we can essentially determine the full genome sequence of all the RILs with near perfect certainty.