Evaluation of genomic island predictors using a comparative genomics approach

Morgan G.I. Langille, William W.L. Hsiao, Fiona S.L. Brinkman

Résultat de recherche: Articleexamen par les pairs

226 Citations (Scopus)

Résumé

Background: Genomic islands (GIs) are clusters of genes in prokaryotic genomes of probable horizontal origin. GIs are disproportionately associated with microbial adaptations of medical or environmental interest. Recently, multiple programs for automated detection of GIs have been developed that utilize sequence composition characteristics, such as G+C ratio and dinucleotide bias. To robustly evaluate the accuracy of such methods, we propose that a dataset of GIs be constructed using criteria that are independent of sequence composition-based analysis approaches. Results: We developed a comparative genomics approach (IslandPick) that identifies both very probable islands and non-island regions. The approach involves 1) flexible, automated selection of comparative genomes for each query genome, using a distance function that picks appropriate genomes for identification of GIs, 2) identification of regions unique to the query genome, compared with the chosen genomes (positive dataset) and 3) identification of regions conserved across all genomes (negative dataset). Using our constructed datasets, we investigated the accuracy of several sequence composition-based GI prediction tools. Conclusion: Our results indicate that AlienHunter hasthe highest recall, but the lowest measured precision, while SIGI-HMM is the most precise method. SIGI-HMM and IslandPath/DIMOB have comparable overall highest accuracy. Our comparative genomics approach, IslandPick, was the most accurate, compared with a curated list of GIs, indicating that we have constructed suitable datasets. This represents the first evaluation, using diverse and, independent datasets that were not artificially constructed, of the accuracy of several sequence composition-based GI predictors. The caveats associated with this analysis and proposals for optimal island prediction are discussed.

Langue d'origineEnglish
Numéro d'article329
JournalBMC Bioinformatics
Volume9
DOI
Statut de publicationPublished - août 5 2008
Publié à l'externeOui

Note bibliographique

Funding Information:
We gratefully acknowledge the SFU/UBC Bioinformatics Training Program funded by the Canadian Institutes of Health Research (CIHR) and Michael Smith Foundation for Health Research (MSFHR) for providing initial funding. In addition, MGIL and WWLH are both recipients of MSFHR scholarships, and WWLH was supported by a CIHR scholarship. FSLB is the recipient of MSFHR Scholar and Senior Scholar awards, and a CIHR New Investigator award. Support was also provided by the Genome Canada Pathogenomics Projects (FPMI and PI2), the Cystic Fibrosis Foundation, as well as by IBM.

ASJC Scopus Subject Areas

  • Structural Biology
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Applied Mathematics

PubMed: MeSH publication types

  • Comparative Study
  • Evaluation Study
  • Journal Article

Empreinte numérique

Plonger dans les sujets de recherche 'Evaluation of genomic island predictors using a comparative genomics approach'. Ensemble, ils forment une empreinte numérique unique.

Citer