Large datasets in biomedicine: a discussion of salient analytic issues.
Advances in high-throughput and mass-storage technologies have led to an information explosion in both biology and medicine, presenting novel challenges for analysis and modeling. With regard to multivariate analysis techniques such as clustering, classification, and regression, large datasets present unique and often misunderstood challenges. The authors' goal is to provide a discussion of the salient problems encountered in the analysis of large datasets as they relate to modeling and inference [...]
Author(s): Sinha, Anshu; Hripcsak, George; Markatou, Marianthi
DOI: 10.1197/jamia.M2780