megasat: automated inference of microsatellite genotypes from sequence data

Luyao Zhan, Ian G. Paterson, Bonnie A. Fraser, Beth Watson, Ian R. Bradbury, Praveen Nadukkalam Ravindran, David Reznick, Robert G. Beiko, Paul Bentzen

Research output: Contribution to journal › Article › peer-review

68 Citations (Scopus)

Abstract

megasat is software that enables genotyping of microsatellite loci using next-generation sequencing data. Microsatellites are amplified in large multiplexes, and then sequenced in pooled amplicons. megasat reads sequence files and automatically scores microsatellite genotypes. It uses fuzzy matches to allow for sequencing errors and applies decision rules to account for amplification artefacts, including nontarget amplification products, replication slippage during PCR (amplification stutter) and differential amplification of alleles. An important feature of megasat is the generation of histograms of the length–frequency distributions of amplification products for each locus and each individual. These histograms, analogous to electropherograms traditionally used to score microsatellite genotypes, enable rapid evaluation and editing of automatically scored genotypes. megasat is written in Perl, runs on Windows, Mac OS X and Linux systems, and includes a simple graphical user interface. We demonstrate megasat using data from guppy, Poecilia reticulata. We genotype 1024 guppies at 43 microsatellites per run on an Illumina MiSeq sequencer. We evaluated the accuracy of automatically called genotypes using two methods, based on pedigree and repeat genotyping data, and obtained estimates of mean genotyping error rates of 0.021 and 0.012. In both estimates, three loci accounted for a disproportionate fraction of genotyping errors; conversely, 26 loci were scored with 0–1 detected error (error rate ≤0.007). Our results show that with appropriate selection of loci, automated genotyping of microsatellite loci can be achieved with very high throughput, low genotyping error and very low genotyping costs.
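The two ideas the abstract describes, fuzzy matching against sequencing errors and decision rules applied to a per-locus length-frequency histogram, can be illustrated with a minimal sketch. This is not megasat's code (megasat is written in Perl) and it does not reproduce its actual rules. The function names, the Hamming-distance matching, the tetranucleotide repeat unit and every threshold below are assumptions chosen only for illustration.

```python
from collections import Counter


def starts_with_fuzzy(read, primer, max_mismatches=2):
    """Return True if the read begins with the primer, tolerating a few
    substitutions (Hamming distance). Hypothetical stand-in for fuzzy matching."""
    prefix = read[:len(primer)]
    if len(prefix) != len(primer):
        return False
    return sum(a != b for a, b in zip(prefix, primer)) <= max_mismatches


def length_histogram(amplicon_lengths):
    """Count reads per amplicon length for one locus in one individual."""
    return Counter(amplicon_lengths)


def call_genotype(hist, repeat_unit=4, min_depth=20,
                  min_ratio=0.3, stutter_ratio=0.5):
    """Call a diploid genotype from a length histogram.

    All thresholds are illustrative assumptions, not megasat defaults.
    Returns a sorted (allele, allele) tuple, or None if depth is too low.
    """
    if sum(hist.values()) < min_depth:
        return None  # too few reads to score this locus

    # Rank peaks by read count, breaking ties by shorter length first.
    ranked = sorted(hist.items(), key=lambda kv: (-kv[1], kv[0]))
    top_len, top_count = ranked[0]

    for length, count in ranked[1:]:
        if count < min_ratio * top_count:
            break  # remaining peaks are too small to be a second allele
        # A modest peak exactly one repeat shorter than the main peak is
        # treated as amplification stutter rather than a second allele.
        if length == top_len - repeat_unit and count < stutter_ratio * top_count:
            continue
        return tuple(sorted((top_len, length)))

    return (top_len, top_len)  # no credible second peak: homozygote
```

In this sketch a histogram such as `{120: 100, 116: 35}` is scored as a homozygote for the 120 allele because the 116 peak sits in the stutter position and is small, whereas `{120: 100, 128: 80}` is scored as a heterozygote. Plotting the same histogram gives the electropherogram-like view the abstract describes for manual review.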

Original language: English
Pages (from-to): 247-256
Number of pages: 10
Journal: Molecular Ecology Resources
Volume: 17
Issue number: 2
DOI
Status: Published - Mar 1 2017

Bibliographical note

Funding Information:
This research benefitted from a Canadian Natural Sciences and Engineering Research Council (NSERC) Strategic Grant to PB and RGB, an NSERC Discovery Grant to PB and National Science Foundation (NSF) support to DR. The sequence data reported in this study were obtained using a DNA sequencer acquired using a generous bequest from Elizabeth Ann Nielsen.

Publisher Copyright:
© 2016 John Wiley & Sons Ltd

ASJC Scopus Subject Areas

  • Biotechnology
  • Ecology, Evolution, Behavior and Systematics
  • Genetics
