×

Inferring the demographic history from DNA sequences: an importance sampling approach based on non-homogeneous processes. (English) Zbl 1366.92086

Summary: In [“A new method for estimating the demographic history from DNA sequences: an importance sampling approach”, Front. Genet. 6, 13 p. (2015; doi:10.3389/fgene.2015.00259)], the first author et al. introduced an importance sampling (IS) approach for estimating the demographic history of a sample of DNA sequences, the skywis plot. More precisely, we proposed a new nonparametric estimate of a population size that changes over time. We showed on simulated data that the skywis plot can work well in typical situations where the effective population size does not undergo very steep changes. In this paper, we introduce an iterative procedure which extends the previous method and gives good estimates under such rapid variations. In the iterative calibrated skywis plot we approximate the effective population size by a piecewise constant function, whose values are re-estimated at each step. These piecewise constant functions are used to generate the waiting times of non homogeneous Poisson processes related to a coalescent process with mutation under a variable population size model. Moreover, the present IS procedure is based on a modified version of the M. Stephens and P. Donnelly [J. R. Stat. Soc., Ser. B, Stat. Methodol. 62, No. 4, 605–655 (2000; Zbl 0962.62107)] proposal distribution. Finally, we apply the iterative calibrated skywis plot method to a simulated data set from a rapidly expanding exponential model, and we show that the method based on this new IS strategy correctly reconstructs the demographic history.

MSC:

92D15 Problems related to evolution
92D10 Genetics and epigenetics
62P10 Applications of statistics to biology and medical sciences; meta analysis
60G55 Point processes (e.g., Poisson, Cox, Hawkes processes)

Citations:

Zbl 0962.62107

Software:

Matlab; fastsimcoal
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Ait Kaci Azzou, S.; Larribe, F.; Froda, S., A new method for estimating the demographic history from dna sequences: an importance sampling approach, Front. Genet., 6, 259 (2015) · Zbl 1366.92086
[2] Donnelly, P.; Tavaré, S., Coalescents and genealogical structure under neutrality, Annu. Rev. Genet., 29, 401-421 (1995)
[3] Drummond, A. J.; Nicholls, G. K.; Rodrigo, A. G.; Solomon, W., Estimating mutation parameters, population history and genealogy simultaneously from temporally spaced sequence data, Genetics, 161, 1307-1320 (2002)
[4] Drummond, A.; Rambaut, A.; Shapiro, B.; Pybus, O., Bayesian coalescent inference of past population dynamics from molecular sequences, Mol. Biol. Evol., 22, 1185-1192 (2005)
[5] Durbin, R.; Li, H., Inference of human population history from individual whole-genome sequences, Nature, 475, 493-496 (2011)
[6] Excoffier, L.; Foll, M., fastsimcoal: a continuous-time coalescent simulator of genomic diversity under arbitrarily complex evolutionary scenarios, Bioinformatics, 27, 1332-1334 (2011)
[7] Fu, Y., A phylogenetic estimator of effective population size or mutation rate, Genetics, 136, 685-692 (1994)
[8] Griffiths, R. C.; Tavaré, S., Ancestral inference in population genetics, Statist. Sci., 9, 307-319 (1994) · Zbl 0955.62644
[9] Gutierrez, S.; Yvon, M.; Pirolles, E.; Garzo, E.; Fereres, A.; Michalakis, Y.; Blanc, S., Circulating virus load determines the size of bottlenecks in viral populations progressing within a host, PLoS Pathog., 8, 11 (2012)
[10] Hein, J.; Schierup, M.; Wiuf, C., Gene Genealogies, Variation and Evolution:A Primer in Coalescent Theory (2005), Oxford University Press: Oxford University Press USA · Zbl 1113.92048
[11] Heled, J.; Drummond, A., Bayesian inference of population size history from multiple loci, BMC Evol. Biol., 8-15 (2008)
[12] Jukes, T. H.; Cantor, C. R., Evolution of protein molecules, (Munro, H. N., Mammalian Protein Metabolism (1969), Academic Press: Academic Press New York), 21-123
[13] Kingman, J., On the genealogy of large populations, J. Appl. Probab. (1982) · Zbl 0516.92011
[14] Kuhner, M. K.; Yamato, J.; Felsenstein, J., Maximum likelihood estimation of population growth rates based on the coalescent, Genetics, 149, 429-434 (1998)
[17] Minin, V.; Bloomquist, E.; Suchard, M., Smooth skyride through a rough skyline: Bayesian coalescent-based inference of population dynamics, Mol. Biol. Evol., 25, 1459-1471 (2008)
[18] Nordborg, M., (Balding, D. J.; Bishop, M. J.; Cannings, C., Coalescent Theory. In Handbook of Statistical Genetics (2007), John Wiley and Sons, Inc.: John Wiley and Sons, Inc. Chichester, U.K.) · Zbl 1125.62117
[19] Opgen-Rhein, R.; Fahrmeir, L.; Strimmer, K., Inference of demographic history from genealogical trees using reversible jump Markov chain Monte Carlo, BMC Evol. Biol., 6, 5 (2005)
[21] Pybus, O.; Rambaut, A.; Harvey, P. H., An integrated framework for the inference of viral population history from reconstructed genealogies genetics, Genetics, 155, 3, 1429-1437 (2000)
[22] Rodrigo, A. G.; Shaper, E. G.; Delwart, E. L.; Iversen, A. K.; Gallo, M. V.; Brojatsch, J., Coalescent estimates of HIV-1 generation time in vivo, Proc. Natl. Acad. Sci. USA, 96, 5, 2187-2191 (1999)
[23] Ross, S., Stochastic Processes (1995), Wiley: Wiley New York, NY
[24] Ross, S., Simulation (2013), Academic Press · Zbl 1255.65001
[25] Slatkin, M.; Hudson, R. R., Pairwise comparisons of mitochondrial dna sequences in stable and exponentially growing populations, Genetics, 129, 2, 555-562 (1991)
[26] Stephens, M., Inference under the coalescent, (Handbook of Statistical Genetics (2001)), 213-238
[27] Stephens, M.; Donnelly, P., Inference in molecular population genetics, J. Roy. Statist. Soc., 62, 605-655 (2000) · Zbl 0962.62107
[28] Strimmer, K.; Pybus, O. G., Exploring the demographic history of dna sequences using the generalized skyline plot, Mol. Biol. Evol., 18, 12, 2298-2305 (2001)
[29] Tavaré, S.; Zeitouni, O., (Picard, J., Stochastic Processes. Stochastic Processes, LNM, vol. 1837 (2004), Springer-Verlag: Springer-Verlag Berlin, Heidelberg)
[30] Wakeley, J., Coalescent Theory: An Introduction (2008), Roberts and Company Publishers · Zbl 1366.92001
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.