Madigan, David; Raghavan, Nandini; Dumouchel, William; Nason, Martha; Posse, Christian; Ridgeway, Greg Likelihood-based data squashing: A modeling approach to instance construction. (English) Zbl 0996.68564 Data Mining and Knowledge Discovery 6, No. 2, 173-190 (2002). Summary: Squashing is a lossy data compression technique that preserves statistical information. Specifically, squashing compresses a massive dataset to a much smaller one so that outputs from statistical analyses carried out on the smaller (squashed) dataset reproduce outputs from the same statistical analyses carried out on the original dataset. Likelihood-based Data Squashing (LDS) differs from a previously published squashing algorithm insofar as it uses a statistical model to squash the data. The results show that LDS provides excellent squashing performance even when the target statistical analysis departs from the model used to squash the data. Cited in 9 Documents MSC: 68U99 Computing methodologies and applications 68P05 Data structures Keywords:likelihood-based data squashing; data compression technique Software:R PDF BibTeX XML Cite \textit{D. Madigan} et al., Data Min. Knowl. Discov. 6, No. 2, 173--190 (2002; Zbl 0996.68564) Full Text: DOI OpenURL