abstract |
The disclosure provides methods and systems for designing and synthesizing probes to capture a representative sample of genomic variants of a target genome from a sample. The methods include providing a multiple sequence alignment (MSA), designing a plurality of representative subsequences, and optionally synthesizing a nucleic acid probe. The designing step can comprise designating a plurality of intervals in the MSA, shifting start positions for each MSA subset, clustering the aligned subsequences within each adjusted subset, and determining a representative sequence for each reduced MSA subset. The disclosure also encompasses methods of isolating a plurality of nucleic acid variants of a targeted genomic subregion from a sample using the disclosed probe design, as well as the probe compositions themselves. |