PMERGE: Computational filtering of paralogous sequences from RAD-seq data

Praveen Nadukkalam Ravindran, Paul Bentzen, Ian R. Bradbury, Robert G. Beiko

Producción científica: Contribución a una revistaArtículorevisión exhaustiva

11 Citas (Scopus)

Resumen

Restriction-site associated DNA sequencing (RAD-seq) can identify and score thousands of genetic markers from a group of samples for population-genetics studies. One challenge of de novo RAD-seq analysis is to distinguish paralogous sequence variants (PSVs) from true single-nucleotide polymorphisms (SNPs) associated with orthologous loci. In the absence of a reference genome, it is difficult to differentiate true SNPs from PSVs, and their impact on downstream analysis remains unclear. Here, we introduce a network-based approach, PMERGE that connects fragments based on their DNA sequence similarity to identify probable PSVs. Applying our method to de novo RAD-seq data from 150 Atlantic salmon (Salmo salar) samples collected from 15 locations across the Southern Newfoundland coast allowed the identification of 87% of total PSVs identified through alignment to the Atlantic salmon genome. Removal of these paralogs altered the inferred population structure, highlighting the potential impact of filtering in RAD-seq analysis. PMERGE is also applied to a green crab (Carcinus maenas) data set consisting of 242 samples from 11 different locations and was successfully able to identify and remove the majority of paralogous loci (62%). The PMERGE software can be run as part of the widely used Stacks analysis package.

Idioma originalEnglish
Páginas (desde-hasta)7002-7013
Número de páginas12
PublicaciónEcology and Evolution
Volumen8
N.º14
DOI
EstadoPublished - jul. 2018

Nota bibliográfica

Funding Information:
Canada Foundation for Innovation; Compute Canada; Canada Research Chairs; Canadian Natural Sciences and Engineering Research Council (NSERC)

Funding Information:
Computational infrastructure used to carry out the analyses were supported by the Canada Foundation for Innovation and Compute Canada. RGB acknowledges the support of the Canada Research Chairs program. This research benefitted from a Canadian Natural Sciences and Engineering Research Council (NSERC) Strategic Grant to PB and RGB.

Publisher Copyright:
© 2018 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd.

ASJC Scopus Subject Areas

  • Ecology, Evolution, Behavior and Systematics
  • Ecology
  • Nature and Landscape Conservation

PubMed: MeSH publication types

  • Journal Article

Huella

Profundice en los temas de investigación de 'PMERGE: Computational filtering of paralogous sequences from RAD-seq data'. En conjunto forman una huella única.

Citar esto

Nadukkalam Ravindran, P., Bentzen, P., Bradbury, I. R., & Beiko, R. G. (2018). PMERGE: Computational filtering of paralogous sequences from RAD-seq data. Ecology and Evolution, 8(14), 7002-7013. https://doi.org/10.1002/ece3.4219