TY - JOUR
T1 - Detecting the signatures of adaptive evolution in protein-coding genes
AU - Bielawski, Joseph P.
PY - 2013
Y1 - 2013
N2 - The field of molecular evolution, which includes genome evolution, is devoted to finding variation within and between groups of organisms and explaining the processes responsible for generating this variation. Many DNA changes are believed to have little to no functional effect, and a neutral process will best explain their evolution. Thus, a central task is to discover which changes had positive fitness consequences and were subject to Darwinian natural selection during the course of evolution. Due the size and complexity of modern molecular datasets, the field has come to rely extensively on statistical modeling techniques to meet this analytical challenge. For DNA sequences that encode proteins, one of the most powerful approaches is to employ a statistical model of codon evolution. This unit provides a general introduction to the practice of modeling codon evolution using the statistical framework of maximum likelihood. Four real-data analysis activities are used to illustrate the principles of parameter estimation, robustness, hypothesis testing, and site classification. Each activity includes an explicit analytical protocol based on programs provided by the Phylogenetic Analysis by Maximum Likelihood (PAML) package.
AB - The field of molecular evolution, which includes genome evolution, is devoted to finding variation within and between groups of organisms and explaining the processes responsible for generating this variation. Many DNA changes are believed to have little to no functional effect, and a neutral process will best explain their evolution. Thus, a central task is to discover which changes had positive fitness consequences and were subject to Darwinian natural selection during the course of evolution. Due the size and complexity of modern molecular datasets, the field has come to rely extensively on statistical modeling techniques to meet this analytical challenge. For DNA sequences that encode proteins, one of the most powerful approaches is to employ a statistical model of codon evolution. This unit provides a general introduction to the practice of modeling codon evolution using the statistical framework of maximum likelihood. Four real-data analysis activities are used to illustrate the principles of parameter estimation, robustness, hypothesis testing, and site classification. Each activity includes an explicit analytical protocol based on programs provided by the Phylogenetic Analysis by Maximum Likelihood (PAML) package.
UR - http://www.scopus.com/inward/record.url?scp=84878693606&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84878693606&partnerID=8YFLogxK
U2 - 10.1002/0471142727.mb1901s101
DO - 10.1002/0471142727.mb1901s101
M3 - Article
C2 - 23288462
AN - SCOPUS:84878693606
SN - 1934-3639
SP - 19.1
JO - Current Protocols in Molecular Biology
JF - Current Protocols in Molecular Biology
IS - SUPPL.101
ER -