×

The combinatorial structure of beta negative binomial processes. (English) Zbl 1358.60069

This paper treats a characterization for the combinatorial structure of conditionally i.i.d sequences of negative binomial processes \(X_j\) \(\sim\) \(\mathrm{NBP}(r, B_0)\) with a common beta process base measure \(B_0\) on \(( \Omega, {\mathcal A})\), where \(r > 0\) is a parameter. In a Bayesian non-parametric applicatory framework, such processes have served as models for latent multisets of features underlying certain data. Analogously, random subsets arise from conditionally i.i.d. sequences of Bernoulli processes with a common beta process base measure. In this case, the combinatorial structure is described by the Indian buffet process (IBP). The principal feature of the proof consists in the description of the key Markov kernels needed to use a negative binomial Indian buffet process (NB-IBP) representation in a Markov chain Monte Carlo algorithm targeting a posterior distribution. For other related works, see, e.g., [T. Broderick et al., “Combinatorial clustering and the beta-negative binomial process”, IEEE Trans. Pattern Anal. Mach. Intell. 37, No. 2, 290–306, (2014; doi:10.1109/TPAMI.2014.2318721); N. L. Hjort, Ann. Stat. 18, No. 3, 1259–1294 (1990; Zbl 0711.62033); G. Gregoire, Stochastic Processes Appl. 16, 179–188 (1984; Zbl 0522.60048)].

MSC:

60G99 Stochastic processes
60G09 Exchangeability for stochastic processes
60G57 Random measures
60G55 Point processes (e.g., Poisson, Cox, Hawkes processes)
60J10 Markov chains (discrete-time Markov processes on discrete state spaces)
60J22 Computational methods in Markov chains
65C05 Monte Carlo methods
65C40 Numerical analysis or methods applied to Markov chains
PDFBibTeX XMLCite
Full Text: DOI arXiv Euclid

References:

[1] Barndorff-Nielsen, O. and Yeo, G.F. (1969). Negative binomial processes. J. Appl. Probab. 6 633-647. · Zbl 0186.50801 · doi:10.2307/3212108
[2] Broderick, T., Jordan, M.I. and Pitman, J. (2013). Cluster and feature modeling from combinatorial stochastic processes. Statist. Sci. 28 289-312. · Zbl 1331.62124 · doi:10.1214/13-STS434
[3] Broderick, T., Mackey, L., Paisley, J. and Jordan, M.I. (2014). Combinatorial clustering and the beta-negative binomial process. IEEE Trans. Pattern Anal. Mach. Intell. 37 290-306. Special issue on Bayesian nonparametrics.
[4] Broderick, T., Pitman, J. and Jordan, M.I. (2013). Feature allocations, probability functions, and paintboxes. Bayesian Anal. 8 801-836. · Zbl 1329.62278 · doi:10.1214/13-BA823
[5] Canny, J. (2004). Gap: A factor model for discrete data. In Proceedings of the 27 th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval , Sheffield , United Kingdom .
[6] Devroye, L. (1992). Random variate generation for the digamma and trigamma distributions. J. Stat. Comput. Simul. 43 197-216. · doi:10.1080/00949659208811438
[7] Ghahramani, Z., Griffiths, T.L. and Sollich, P. (2007). Bayesian nonparametric latent feature models. In Bayesian Statistics 8. Oxford Sci. Publ. 201-226. Oxford: Oxford Univ. Press. · Zbl 1252.62004
[8] Gopalan, P., Ruiz, F.J.R., Ranganath, R. and Blei, D.M. (2014). Bayesian nonparametric Poisson factorization for recommendation systems. In Proceedings of the 17 th International Conference on Artificial Intelligence and Statistics , Reykjavik , Iceland .
[9] Grégoire, G. (1984). Negative binomial distributions for point processes. Stochastic Process. Appl. 16 179-188. · Zbl 0522.60048 · doi:10.1016/0304-4149(84)90018-8
[10] Griffiths, T.L. and Ghahramani, Z. (2006). Infinite latent feature models and the Indian buffet process. In Advances in Neural Information Processing Systems 19, Vancouver , Canada .
[11] Hjort, N.L. (1990). Nonparametric Bayes estimators based on beta processes in models for life history data. Ann. Statist. 18 1259-1294. · Zbl 0711.62033 · doi:10.1214/aos/1176347749
[12] Kallenberg, O. (2002). Foundations of Modern Probability , 2nd ed. Probability and Its Applications ( New York ). New York: Springer. · Zbl 0996.60001
[13] Kim, Y. (1999). Nonparametric Bayesian estimators for counting processes. Ann. Statist. 27 562-588. · Zbl 0980.62078 · doi:10.1214/aos/1018031207
[14] Kingman, J.F.C. (1967). Completely random measures. Pacific J. Math. 21 59-78. · Zbl 0155.23503 · doi:10.2140/pjm.1967.21.59
[15] Kozubowski, T.J. and Podgórski, K. (2009). Distributional properties of the negative binomial Lévy process. Probab. Math. Statist. 29 43-71. · Zbl 1170.60021
[16] Lo, A.Y. (1982). Bayesian nonparametric statistical inference for Poisson point processes. Z. Wahrsch. Verw. Gebiete 59 55-66. · Zbl 0482.62078 · doi:10.1007/BF00575525
[17] Meeds, E., Ghahramani, Z., Neal, R.M. and Roweis, S.T. (2007). Modeling dyadic data with binary latent factors. In Advances in Neural Information Processing Systems 20, Vancouver , Canada .
[18] Neal, R.M. (2003). Slice sampling. Ann. Statist. 31 705-767. With discussions and a rejoinder by the author. · Zbl 1051.65007 · doi:10.1214/aos/1056562461
[19] Paisley, J., Zaas, A., Woods, C.W., Ginsburg, G.S. and Carin, L. (2010). A stick-breaking construction of the beta process. In Proceedings of the 27 th International Conference on Machine Learning , Haifa , Israel .
[20] Roy, D.M. (2014). The continuum-of-urns scheme, generalized beta and Indian buffet processes, and hierarchies thereof. Preprint. Available at . arXiv:1501.0020
[21] Sibuya, M. (1979). Generalized hypergeometric, digamma and trigamma distributions. Ann. Inst. Statist. Math. 31 373-390. · Zbl 0448.62008 · doi:10.1007/BF02480295
[22] Sudderth, E.B., Torralba, A., Freeman, W.T. and Willsky, A.S. (2005). Describing visual scenes using transformed Dirichlet processes. In Advances in Neural Information Processing Systems 18, Vancouver , Canada .
[23] Teh, Y.W., Görür, D. and Ghahramani, Z. (2007). Stick-breaking construction for the Indian buffet process. In Proceedings of the 11 th International Conference on Artificial Intelligence and Statistics , San Juan , Puerto Rico .
[24] Thibaux, R. and Jordan, M.I. (2007). Hierarchical beta processes and the Indian buffet process. In Proceedings of the 11 th International Conference on Artificial Intelligence and Statistics , San Juan , Puerto Rico .
[25] Thibaux, R.J. (2008). Nonparametric Bayesian models for machine learning. Ph.D. thesis, EECS Department, Univ. California, Berkeley.
[26] Titsias, M. (2007). The infinite gamma-Poisson feature model. In Advances in Neural Information Processing Systems 20.
[27] Wolpert, R.L. and Ickstadt, K. (1998). Poisson/gamma random field models for spatial statistics. Biometrika 85 251-267. · Zbl 0951.62082 · doi:10.1093/biomet/85.2.251
[28] Zhou, M., Hannah, L., Dunson, D. and Carin, L. (2012). Beta-negative binomial process and Poisson factor analysis. In Proceedings of the 29 th International Conference on Machine Learning , Edinburgh , United Kingdom .
[29] Zhou, M., Madrid, O. and Scott, J.G. (2014). Priors for random count matrices derived from a family of negative binomial processes. Preprint. Available at . arXiv:1404.3331v2
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.