Learning hidden Markov models with unknown number of states. (English) Zbl 07491730

Summary: This paper proposes a novel method for learning hidden Markov models (HMMs) with an unknown number of states based on a valuable feature set. The feature set is built using the hitting times of HMMs. Based on the feature set, we obtain a parameter estimation for HMMs by SVD and the clustering algorithm. The advantages of the proposed method are that it can accurately and automatically identify the number of hidden states, it is robust to misspecified emission distributions, it is less sensitive to initialization, and is asymptotically consistent. Numerical experiments show that the proposed method performs better than other methods when the observed time series is long enough.


82-XX Statistical mechanics, structure of matter
Full Text: DOI


[1] Baum, L. E.; Petrie, T., Statistical inference for probabilistic functions of finite state Markov chains, Ann. Math. Stat., 37, 6, 1554-1563 (1966) · Zbl 0144.40902
[2] Rabiner, L. R., A tutorial on hidden Markov models and selected applications in speech recognition, Proc. IEEE, 77, 257-286 (1989)
[3] Lambert, M. F.; Whiting, J. P.; Metcalfe, A. V., A non-parametric hidden Markov model for climate state identification, Hydrol. Earth Syst. Sci. Discuss., 7, 652-667 (2003)
[4] Lefevre, F., Non-parametric probability estimation for HMM-based automatic speech recognition, Comput. Speech Lang., 17, 113-136 (2003)
[5] L. Shang, K.-P. Chan, Nonparametric discriminant HMM and application to facial expression recognition, in: Computer Vision and Pattern Recognition, 2009. CVPR. IEEE Conference on, 2009, pp. 2090-2096.
[6] Kang, M.; Ahn, J.; Lee, K., Opinion mining using ensemble text hidden markov models for text classification, Expert Syst. Appl., 94 (2017)
[7] Aghdam, M., Context-aware recommender systems using hierarchical hidden Markov models, Physica A, 518, 89-98 (2019)
[8] Yau, C.; Papaspiliopoulos, O.; Roberts, G. O.; Holmes, C., BayesIan non-parametric hidden Markov models with applications in genomics, J. R. Stat. Soc. Ser. B Stat. Methodol., 73, 37-57 (2011) · Zbl 1411.62247
[9] Fuh, C. D., Asymptotic operating characteristics of an optimal change point detection in hidden markov models, Ann. Statist., 32, 5, 2305-2339 (2005) · Zbl 1073.60047
[10] Douc, R.; Moulines, E., Asymptotic properties of the maximum likelihood estimation in misspecified hidden markov models (2011)
[11] Andersson, S.; Rydén, T.; Johansson, R., Linear optimal prediction and innovations representations of hidden Markov models, Stochastic Process. Appl., 108, 1, 131-149 (2003) · Zbl 1075.62626
[12] Rousseeuw, K.; Poisson Caillault, E.; Lefebvre, A.; Hamad, D., Hybrid hidden markov model for marine environment monitoring, IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens., 8, 1, 204-213 (2017)
[13] Zheng, K.; Li, Y.; Xu, W., Regime switching model estimation: spectral clustering hidden Markov model, Ann. Oper. Res. (2019)
[14] Nystrup, P.; Lindström, E.; Madsen, H., Learning hidden markov models with persistent states by penalizing jumps - sciencedirect, Expert Syst. Appl., 150 (2020)
[15] Zheng, J.; Huang, J.; Tong, C., The order estimation for hidden Markov models, Physica A, 2019, 527, Article 121462 pp. (2019)
[16] Teh, Y. W.; Jordan, M. I.; Beal, M. J.; Blei, D. M., Hierarchical Dirichlet processes, J. Amer. Statist. Assoc., 101, 1566-1581 (2006) · Zbl 1171.62349
[17] Blei, D. M.; Jordan, M. I., Variational inference for Dirichlet process mixtures, Bayesian Anal., 1, 121-144 (2006) · Zbl 1331.62259
[18] Gassiat, E.; Keribin, C., The likelihood ratio test for the number of components in a mixture with Markov regime, ESAIM Probab. Stat., 4, 25-52 (2000) · Zbl 0982.62016
[19] R. Jin, C. Ding, F. Kang, A probabilistic approach for optimizing spectral clustering, in: Advances in Neural Information Processing Systems 18 Neural Information Processing Systems, NIPS 2005, December (2005) 5-8, Vancouver, British Columbia, Canada, 2005, DBLP.
[20] Hardy, M. R., A regime-switching model of long-term stock returns, N. Am. Actuar. J., 5, 2, 41-53 (2001) · Zbl 1083.62530
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.