On the Statistical Interpretation of Site-Specific Variables in Phylogeny-Based Substitution Models.

Rodrigue, N. (2013). "On the Statistical Interpretation of Site-Specific Variables in Phylogeny-Based Substitution Models.", Genetics, 193(2), pp. 557-564. doi : 10.1534/genetics.112.145722  Access to full text

Abstract

Phylogeny-based modeling of heterogeneity across the positions of multiple-sequence-alignments has generally been approached from two main perspectives. The first treats site-specificities as random variables drawn from a statistical law, and the likelihood function takes the form of an integral over this law. The second assigns distinct variables to each position, and, in a maximum-likelihood context, adjusts these variables, along with global parameters, to optimize a joint likelihood function. Here, it is emphasized that while the first approach directly enjoys the statistical guaranties of traditional likelihood theory, the latter does not, and should be approached with particular caution when the site-specific variables are high-dimensional. Using a phylogeny-based mutation-selection framework, it is shown that the difference in interpretation of site-specific variables explains the incongruities in recent studies regarding distributions of selection coefficients.

Date modified: