A new approach for determining the prior probabilities in the classification problem by Bayesian method. (English) Zbl 1414.62261

Summary: In this article, we suggest a new algorithm to identify the prior probabilities for classification problem by Bayesian method. The prior probabilities are determined by combining the information of populations in training set and the new observations through fuzzy clustering method (FCM) instead of using uniform distribution or the ratio of sample or Laplace method as the existing ones. We next combine the determined prior probabilities and the estimated likelihood functions to classify the new object. In practice, calculations are performed by Matlab procedures. The proposed algorithm is tested by the three numerical examples including bench mark and real data sets. The results show that the new approach is reasonable and gives more efficient than existing ones.


62H30 Classification and discrimination; cluster analysis (statistical aspects)
68T10 Pattern recognition, speech recognition


Full Text: DOI


[1] Bora DJ, Gupta AK (2014) Impact of exponent parameter value for the partition matrix on the performance of fuzzy C means Algorithm. arXiv:1406.4007 (arXiv preprint)
[2] Cannon, RL; Dave, JV; Bezdek, JC, Efficient implementation of the fuzzy c-means clustering algorithms, IEEE Trans Pattern Anal Mach Intell, 2, 248-255, (1986) · Zbl 0602.68084
[3] Fadili, MJ; etal., On the number of clusters and the fuzziness index for unsupervised FCA application to BOLD fMRI time series, Med Image Anal, 5, 55-67, (2001)
[4] Ghosh, AK; Chaudhuri, P.; Sengupta, D., Classification using Kernel density estimates, Technometrics, 48, 120-132, (2006)
[5] Hall, LO; etal., A comparison of neural network and fuzzy clustering techniques in segmenting magnetic resonance images of the brain, IEEE Trans Neural Netw, 3, 672-682, (1992)
[6] Inman, HF; Bradley, EL, The overlapping coefficient as a measure of agreement between probability distributions and point estimation of the overlap of two normal densities, Commun Stat Theory Methods, 18, 3851-3874, (1989) · Zbl 0696.62131
[7] Mardia KV, Kent JT, Bibby JM (1979) Multivariate analysis. Academic Press, Cambridge · Zbl 0432.62029
[8] Martinez WL, Martinez AR (2007) Computational statistics handbook with MATLAB. CRC Press, Boca Raton · Zbl 0986.62104
[9] McLachlan GJ, Basford KE (1988) Mixture models: inference and applications to clustering. Statistics: textbooks and monographs. Dekker, New York
[10] Miller, G.; etal., Bayesian prior probability distributions for internal dosimetry, Radiat Prot Dosim, 94, 347-352, (2001)
[11] Pal, NR; Bezdek, JC, On cluster validity for the fuzzy c-means model, IEEE Trans Fuzzy Syst, 3, 370-379, (1995)
[12] Pham-Gia, T.; Turkkan, N.; Vovan, T., Statistical discrimination analysis using the maximum function, Commun Stat Simul Comput, 37, 320-336, (2008) · Zbl 1132.62049
[13] Scott DW (1992) Multivariate density estimation: theory, practice, and visualization. Wiley · Zbl 0850.62006
[14] Silverman BW (1986) Density estimation for statistics and data analysis, vol 26. CRC Press, Boca Raton · Zbl 0617.62042
[15] Vo, T.; Pham-Gia, T., Clustering probability distributions, J Appl Stat, 37, 1891-1910, (2010)
[16] Webb AR (2003) Statistical pattern recognition. Wiley, New York · Zbl 1102.68639
[17] Yu, J.; Cheng, Q.; Huang, H., Analysis of the weighting exponent in the FCM, IEEE Trans Syst Man Cybern Part B Cybern, 34, 634-639, (2004)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.