×

A population-size model for protein spot detection in proteomic studies. (English) Zbl 1342.62170

Summary: In proteomic studies, a population of proteins are often examined on a gel using a technique called two-dimensional gel eletrophoresis. The technique separates the protein population into individual protein spots on a two-dimensional gel by isoelectric charge and molecular weight. The resulting gel images are then processed by a software system for spot detection and subsequent analysis. The performance of a spot-detection program is evaluated by the total number of spots that are detected. A popular spot-detection program uses the “master-slave” approach, where all spots on “slave images” are subsets of the spots on the “master image”. We argue that this approach potentially misses a large proportion of proteins and propose a model that quantifies the lack of performance. We provide nonparametric estimators for the protein population size and the expected number of proteins to be detected if a “fusion-gel” approach was used. Using the data from a rat liver proteome study, we estimate that more than half of the protein population is missed by the master-slave approach.

MSC:

62P10 Applications of statistics to biology and medical sciences; meta analysis
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Anderson, N. and N. Anderson (1998). Proteome and proteomics: new technologies, new concepts, and new words. Electrophoresis19, 1853-1861.
[2] Berth, M., F. M. Moser, M. Kolbe, and J. Bernhardt (2007). The state of the art in the analysis of two-dimensional gel electrophoresis images. Applied Microbiology and Biotechnology76, 1223-1243.
[3] Chao, A. (2001). An overview of closed capture-recapture models. Journal of Agricultural, Biological, and Environmental Statistics6, 138-155.
[4] Gascuel, O. and G. Caraux (1992). Bounds on expectations of order statistics via extremal dependences. Statistics & Probability Letters15, 143-148. · Zbl 0753.62030
[5] GE Healthcare (2007). DeCyder 2-D differential analysis software, user manual.http://www.gelifesciences.com.
[6] Görg, A., W. Weiss, and M. J. Dunn (2004). Current two-dimensional electrophoresis technology for proteomics. Proteomics4, 3665-3685.
[7] Lai, T. L. and H. Robbins (1976). Maximally dependent random variables. Proceedings of the National Academy of Sciences73, 286-288. · Zbl 0352.60013
[8] Luhn, S., M. Berth, M. Hecker, and J. Bernhardt (2003). Using standard positions and image fusion to create proteome maps from collections of two-dimensional gel electrophoresis images. Proteomics3, 1117-1127.
[9] Mao, C. X. (2007). Estimating population sizes for capture-recapture sampling with binomial mixtures. Computational Statistics and Data Analysis51, 5211-5219. · Zbl 1445.62280
[10] ——- (2008). Computing an NPMLE for a mixing distribution in two closed heterogeneous population size models. Biometrical Journal50, 983 - 992. · Zbl 1442.62524
[11] Mao, C. X., R. K. Colwell, and J. Chang (2005). Estimating the species accumulation curve using mixtures. Biometrics61, 433-441. · Zbl 1077.62125
[12] O’Farrell, P. H. (1975). High resolution two-dimensional electrophoresis of proteins. Journal of Biological Chemistry250, 4007-4021.
[13] Teicher, H. (1961). Identifiability of mixtures. The Annals of Mathematical Statistics32, 244-248. · Zbl 0146.39302
[14] Ünlü, M., M. Morgan, and J. Minden (1997). Difference gel electrophoresis: a single gel method for detecting changes in protein extracts. Electrophoresis18, 2071-2077.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.