TY - JOUR
T1 - Power for detecting genetic divergence
T2 - Differences between statistical methods and marker loci
AU - Ryman, Nils
AU - Palm, Stefan
AU - André, Carl
AU - Carvalho, Gary R.
AU - Dahlgren, Thomas G.
AU - Jorde, Per Erik
AU - Laikre, Linda
AU - Larsson, Lena C.
AU - Palmé, Anna
AU - Ruzzante, Daniel E.
PY - 2006/7
Y1 - 2006/7
N2 - Information on statistical power is critical when planning investigations and evaluating empirical data, but actual power estimates are rarely presented in population genetic studies. We used computer simulations to assess and evaluate power when testing for genetic differentiation at multiple loci through combining test statistics or P values obtained by four different statistical approaches, viz. Pearson's chi-square, the log-likelihood ratio G-test, Fisher's exact test, and an FST-based permutation test. Factors considered in the comparisons include the number of samples, their size, and the number and type of genetic marker loci. It is shown that power for detecting divergence may be substantial for frequently used sample sizes and sets of markers, also at quite low levels of differentiation. The choice of statistical method may be critical, though. For multi-allelic loci such as microsatellites, combining exact P values using Fisher's method is robust and generally provides a high resolving power. In contrast, for few-allele loci (e.g. allozymes and single nucleotide polymorphisms) and when making pairwise sample comparisons, this approach may yield a remarkably low power. In such situations chi-square typically represents a better alternative. The G-test without Williams's correction frequently tends to provide an unduly high proportion of false significances, and results from this test should be interpreted with great care. Our results are not confined to population genetic analyses but applicable to contingency testing in general.
AB - Information on statistical power is critical when planning investigations and evaluating empirical data, but actual power estimates are rarely presented in population genetic studies. We used computer simulations to assess and evaluate power when testing for genetic differentiation at multiple loci through combining test statistics or P values obtained by four different statistical approaches, viz. Pearson's chi-square, the log-likelihood ratio G-test, Fisher's exact test, and an FST-based permutation test. Factors considered in the comparisons include the number of samples, their size, and the number and type of genetic marker loci. It is shown that power for detecting divergence may be substantial for frequently used sample sizes and sets of markers, also at quite low levels of differentiation. The choice of statistical method may be critical, though. For multi-allelic loci such as microsatellites, combining exact P values using Fisher's method is robust and generally provides a high resolving power. In contrast, for few-allele loci (e.g. allozymes and single nucleotide polymorphisms) and when making pairwise sample comparisons, this approach may yield a remarkably low power. In such situations chi-square typically represents a better alternative. The G-test without Williams's correction frequently tends to provide an unduly high proportion of false significances, and results from this test should be interpreted with great care. Our results are not confined to population genetic analyses but applicable to contingency testing in general.
UR - http://www.scopus.com/inward/record.url?scp=33745055761&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33745055761&partnerID=8YFLogxK
U2 - 10.1111/j.1365-294X.2006.02839.x
DO - 10.1111/j.1365-294X.2006.02839.x
M3 - Article
C2 - 16780422
AN - SCOPUS:33745055761
SN - 0962-1083
VL - 15
SP - 2031
EP - 2045
JO - Molecular Ecology
JF - Molecular Ecology
IS - 8
ER -