Power for detecting genetic divergence: Differences between statistical methods and marker loci

Nils Ryman; Stefan Palm; Carl André; Gary R. Carvalho; Thomas G. Dahlgren; Per Erik Jorde; Linda Laikre; Lena C. Larsson; Anna Palmé; Daniel E. Ruzzante

doi:10.1111/j.1365-294X.2006.02839.x

Medicine

Research output: Article › peer-review

219 Citations (Scopus)

Abstract

Information on statistical power is critical when planning investigations and evaluating empirical data, but actual power estimates are rarely presented in population genetic studies. We used computer simulations to assess and evaluate power when testing for genetic differentiation at multiple loci through combining test statistics or P values obtained by four different statistical approaches, viz. Pearson's chi-square, the log-likelihood ratio G-test, Fisher's exact test, and an FST-based permutation test. Factors considered in the comparisons include the number of samples, their size, and the number and type of genetic marker loci. It is shown that power for detecting divergence may be substantial for frequently used sample sizes and sets of markers, also at quite low levels of differentiation. The choice of statistical method may be critical, though. For multi-allelic loci such as microsatellites, combining exact P values using Fisher's method is robust and generally provides a high resolving power. In contrast, for few-allele loci (e.g. allozymes and single nucleotide polymorphisms) and when making pairwise sample comparisons, this approach may yield a remarkably low power. In such situations chi-square typically represents a better alternative. The G-test without Williams's correction frequently tends to provide an unduly high proportion of false significances, and results from this test should be interpreted with great care. Our results are not confined to population genetic analyses but applicable to contingency testing in general.
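The abstract refers to combining per-locus exact P values using Fisher's method. As a minimal sketch of that step only (not the authors' simulation code), the combined statistic is X = -2 Σ ln(p_i), which is compared against a chi-square distribution with 2k degrees of freedom for k independent loci. The function name `fisher_combined_p` and the example P values are hypothetical and chosen purely for illustration; the sketch uses the closed-form chi-square tail for even degrees of freedom so that it needs only the standard library.

```python
import math


def fisher_combined_p(p_values):
    """Combine independent P values with Fisher's method.

    Returns (statistic, combined_p), where statistic = -2 * sum(ln p_i)
    is referred to a chi-square distribution with 2k degrees of freedom.
    Hypothetical helper for illustration; assumes the per-locus tests
    are independent.
    """
    if not p_values:
        raise ValueError("need at least one P value")
    if any(p <= 0.0 or p > 1.0 for p in p_values):
        raise ValueError("P values must lie in (0, 1]")

    k = len(p_values)
    half_x = -sum(math.log(p) for p in p_values)  # X / 2

    # For even degrees of freedom 2k the chi-square upper tail is
    # exp(-X/2) * sum_{i=0}^{k-1} (X/2)^i / i!
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half_x / i
        total += term

    return 2.0 * half_x, math.exp(-half_x) * total


if __name__ == "__main__":
    # Illustrative per-locus P values (made up, not from the paper).
    per_locus = [0.20, 0.08, 0.35, 0.04, 0.11]
    stat, p = fisher_combined_p(per_locus)
    print(f"X = {stat:.3f} on {2 * len(per_locus)} df, combined P = {p:.4f}")
```

With a single locus the combined P value reduces to the input P value, which is a quick sanity check on the tail formula.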

Original language	English
Pages (from-to)	2031-2045
Number of pages	15
Journal	Molecular Ecology
Volume	15
Issue number	8
DOI	https://doi.org/10.1111/j.1365-294X.2006.02839.x
Publication status	Published - Jul 2006

ASJC Scopus Subject Areas

Ecology, Evolution, Behavior and Systematics
Genetics

PubMed: MeSH publication types

Journal Article
Research Support, Non-U.S. Gov't

Access to document

10.1111/j.1365-294X.2006.02839.x

Cite

@article{214ec9a5ae4e4b238d0f55081923c7af,

title = "Power for detecting genetic divergence: Differences between statistical methods and marker loci",

abstract = "Information on statistical power is critical when planning investigations and evaluating empirical data, but actual power estimates are rarely presented in population genetic studies. We used computer simulations to assess and evaluate power when testing for genetic differentiation at multiple loci through combining test statistics or P values obtained by four different statistical approaches, viz. Pearson's chi-square, the log-likelihood ratio G-test, Fisher's exact test, and an FST-based permutation test. Factors considered in the comparisons include the number of samples, their size, and the number and type of genetic marker loci. It is shown that power for detecting divergence may be substantial for frequently used sample sizes and sets of markers, also at quite low levels of differentiation. The choice of statistical method may be critical, though. For multi-allelic loci such as microsatellites, combining exact P values using Fisher's method is robust and generally provides a high resolving power. In contrast, for few-allele loci (e.g. allozymes and single nucleotide polymorphisms) and when making pairwise sample comparisons, this approach may yield a remarkably low power. In such situations chi-square typically represents a better alternative. The G-test without Williams's correction frequently tends to provide an unduly high proportion of false significances, and results from this test should be interpreted with great care. Our results are not confined to population genetic analyses but applicable to contingency testing in general.",

author = "Nils Ryman and Stefan Palm and Carl Andr{\'e} and Carvalho, {Gary R.} and Dahlgren, {Thomas G.} and Jorde, {Per Erik} and Linda Laikre and Larsson, {Lena C.} and Anna Palm{\'e} and Ruzzante, {Daniel E.}",

year = "2006",

month = jul,

doi = "10.1111/j.1365-294X.2006.02839.x",

language = "English",

volume = "15",

pages = "2031--2045",

journal = "Molecular Ecology",

issn = "0962-1083",

publisher = "Wiley-Blackwell",

number = "8",

}

TY - JOUR

T1 - Power for detecting genetic divergence

T2 - Differences between statistical methods and marker loci

AU - Ryman, Nils

AU - Palm, Stefan

AU - André, Carl

AU - Carvalho, Gary R.

AU - Dahlgren, Thomas G.

AU - Jorde, Per Erik

AU - Laikre, Linda

AU - Larsson, Lena C.

AU - Palmé, Anna

AU - Ruzzante, Daniel E.

PY - 2006/7

Y1 - 2006/7

N2 - Information on statistical power is critical when planning investigations and evaluating empirical data, but actual power estimates are rarely presented in population genetic studies. We used computer simulations to assess and evaluate power when testing for genetic differentiation at multiple loci through combining test statistics or P values obtained by four different statistical approaches, viz. Pearson's chi-square, the log-likelihood ratio G-test, Fisher's exact test, and an FST-based permutation test. Factors considered in the comparisons include the number of samples, their size, and the number and type of genetic marker loci. It is shown that power for detecting divergence may be substantial for frequently used sample sizes and sets of markers, also at quite low levels of differentiation. The choice of statistical method may be critical, though. For multi-allelic loci such as microsatellites, combining exact P values using Fisher's method is robust and generally provides a high resolving power. In contrast, for few-allele loci (e.g. allozymes and single nucleotide polymorphisms) and when making pairwise sample comparisons, this approach may yield a remarkably low power. In such situations chi-square typically represents a better alternative. The G-test without Williams's correction frequently tends to provide an unduly high proportion of false significances, and results from this test should be interpreted with great care. Our results are not confined to population genetic analyses but applicable to contingency testing in general.

AB - Information on statistical power is critical when planning investigations and evaluating empirical data, but actual power estimates are rarely presented in population genetic studies. We used computer simulations to assess and evaluate power when testing for genetic differentiation at multiple loci through combining test statistics or P values obtained by four different statistical approaches, viz. Pearson's chi-square, the log-likelihood ratio G-test, Fisher's exact test, and an FST-based permutation test. Factors considered in the comparisons include the number of samples, their size, and the number and type of genetic marker loci. It is shown that power for detecting divergence may be substantial for frequently used sample sizes and sets of markers, also at quite low levels of differentiation. The choice of statistical method may be critical, though. For multi-allelic loci such as microsatellites, combining exact P values using Fisher's method is robust and generally provides a high resolving power. In contrast, for few-allele loci (e.g. allozymes and single nucleotide polymorphisms) and when making pairwise sample comparisons, this approach may yield a remarkably low power. In such situations chi-square typically represents a better alternative. The G-test without Williams's correction frequently tends to provide an unduly high proportion of false significances, and results from this test should be interpreted with great care. Our results are not confined to population genetic analyses but applicable to contingency testing in general.

UR - http://www.scopus.com/inward/record.url?scp=33745055761&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33745055761&partnerID=8YFLogxK

U2 - 10.1111/j.1365-294X.2006.02839.x

DO - 10.1111/j.1365-294X.2006.02839.x

M3 - Article

C2 - 16780422

AN - SCOPUS:33745055761

SN - 0962-1083

VL - 15

SP - 2031

EP - 2045

JO - Molecular Ecology

JF - Molecular Ecology

IS - 8

ER -
