On the statistical analysis of the GS-NS0 cell proteome: Imputation, clustering and variability testing

We have undertaken two-dimensional gel electrophoresis proteomic profiling on a series of cell lines with different recombinant antibody production rates. Due to the nature of gel-based experiments not all protein spots are detected across all samples in an experiment, and hence datasets are invaria...

Full description

Saved in:
Bibliographic Details
Main Authors: Ahmad, Norhaiza, Zhang, Jian, Brown, Phillip J., James, David C., Birch, John R., Racher, Andrew J., Smales, C. Mark
Format: Article
Published: Elsevier BV 2006
Online Access:http://eprints.utm.my/id/eprint/9046/
http://dx.doi.org/10.1016/j.bbapap.2006.05.002
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We have undertaken two-dimensional gel electrophoresis proteomic profiling on a series of cell lines with different recombinant antibody production rates. Due to the nature of gel-based experiments not all protein spots are detected across all samples in an experiment, and hence datasets are invariably incomplete. New approaches are therefore required for the analysis of such graduated datasets. We approached this problem in two ways. Firstly, we applied a missing value imputation technique to calculate missing data points. Secondly, we combined a singular value decomposition based hierarchical clustering with the expression variability test to identify protein spots whose expression correlates with increased antibody production. The results have shown that while imputation of missing data was a useful method to improve the statistical analysis of such data sets, this was of limited use in differentiating between the samples investigated, and highlighted a small number of candidate proteins for further investigation.