×

A semialgebraic description of the general Markov model on phylogenetic trees. (English) Zbl 1300.60085

Summary: Many of the stochastic models used in inference of phylogenetic trees from biological sequence data have polynomial parameterization maps. The image of such a map–the collection of joint distributions for all parameter choices–forms the model space. Since the parameterization is polynomial, the Zariski closure of the model space is an algebraic variety which is typically much larger than the model space but amenable to study with algebraic methods. Of practical interest, however, is not the full variety but the subset formed by the model space. Here we develop complete semialgebraic descriptions of the model space arising from the \(k\)-state general Markov model on a tree, with slightly restricted parameters. Our approach depends upon both recently formulated analogues of Cayley’s hyperdeterminant and on the construction of certain quadratic forms from the joint distribution whose positive (semi)definiteness encodes information about parameter values.

MSC:

60J20 Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.)
92D15 Problems related to evolution
92D20 Protein sequences, DNA sequences
62P10 Applications of statistics to biology and medical sciences; meta analysis
14P10 Semialgebraic sets and related spaces
PDFBibTeX XMLCite
Full Text: DOI arXiv