We present a method for Bayesian model-based hierarchical coclustering of gene expression data and use it to study the temporal transcription responses of an Anopheles gambiae cell line upon challenge with multiple microbial elicitors. The method fits statistical regression models to the gene expression time series for each experiment and performs coclustering on the genes by optimizing a joint probability model, characterizing gene coregulation between multiple experiments. We compute the model using a two-stage Expectation-Maximization-type algorithm, first fixing the cross-experiment covariance structure and using efficient Bayesian hierarchical clustering to obtain a locally optimal clustering of the gene expression profiles and then, conditional on that clustering, carrying out Bayesian inference on the cross-experiment covariance using Markov chain Monte Carlo simulation to obtain an expectation. For the problem of model choice, we use a cross-validatory approach to decide between individual experiment modeling and varying levels of coclustering. Our method successfully generates tightly coregulated clusters of genes that are implicated in related processes and therefore can be used for analysis of global transcript responses to various stimuli and prediction of gene functions.
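
The clustering stage described above can be illustrated with a heavily simplified sketch. It is not the authors' implementation: it covers a single experiment only, omits the cross-experiment covariance and the Markov chain Monte Carlo stage, and uses no prior on partitions. Everything specific here is an assumption made for illustration: a straight-line regression for each cluster, a conjugate normal-inverse-gamma prior with hypothetical hyperparameters `v`, `a`, and `b`, the function names `log_marginal` and `greedy_cluster`, and the toy data. Clusters are merged greedily as long as a merge raises the total log marginal likelihood.

```python
import math
from itertools import combinations


def log_marginal(profiles, times, v=10.0, a=2.0, b=1.0):
    """Log marginal likelihood of a cluster whose genes share one linear trend.

    Assumed model (illustrative only): y = b0 + b1 * t + noise, with
    beta | sigma^2 ~ N(0, v * sigma^2 * I) and sigma^2 ~ InvGamma(a, b).
    """
    n_genes = len(profiles)
    n_obs = n_genes * len(times)
    # Sufficient statistics of the stacked regression: X'X, X'y and y'y.
    s0 = n_obs
    s1 = n_genes * sum(times)
    s2 = n_genes * sum(t * t for t in times)
    xy0 = sum(y for p in profiles for y in p)
    xy1 = sum(t * y for p in profiles for t, y in zip(times, p))
    yy = sum(y * y for p in profiles for y in p)
    # Posterior precision P = I / v + X'X (symmetric 2x2 matrix).
    p00, p01, p11 = s0 + 1.0 / v, s1, s2 + 1.0 / v
    det = p00 * p11 - p01 * p01
    # Quadratic form (X'y)' P^{-1} (X'y), using the closed-form 2x2 inverse.
    quad = (p11 * xy0 * xy0 - 2.0 * p01 * xy0 * xy1 + p00 * xy1 * xy1) / det
    a_n = a + n_obs / 2.0
    b_n = b + 0.5 * (yy - quad)
    return (
        -0.5 * n_obs * math.log(2.0 * math.pi)
        - 0.5 * math.log(det) - math.log(v)  # 0.5*log|V_n| - 0.5*log|V_0|
        + a * math.log(b) - a_n * math.log(b_n)
        + math.lgamma(a_n) - math.lgamma(a)
    )


def greedy_cluster(data, times):
    """Merge clusters while a merge raises the total log marginal likelihood."""
    def score(members):
        return log_marginal([data[i] for i in members], times)

    clusters = [[i] for i in range(len(data))]  # start with one gene per cluster
    while len(clusters) > 1:
        best_gain, best_pair = 0.0, None
        for c1, c2 in combinations(clusters, 2):
            gain = score(c1 + c2) - score(c1) - score(c2)
            if gain > best_gain:
                best_gain, best_pair = gain, (c1, c2)
        if best_pair is None:
            break  # no merge improves the score: a locally optimal partition
        c1, c2 = best_pair
        clusters = [c for c in clusters if c is not c1 and c is not c2]
        clusters.append(c1 + c2)
    return clusters


# Toy data (made up): three rising and three falling expression profiles.
times = [0.0, 1.0, 2.0, 3.0]
data = [
    [0.0, 2.1, 3.9, 6.0],
    [0.1, 2.0, 4.0, 5.9],
    [-0.1, 1.9, 4.1, 6.1],
    [6.0, 4.1, 1.9, 0.0],
    [5.9, 4.0, 2.0, 0.1],
    [6.1, 3.9, 2.1, -0.1],
]
clusters = greedy_cluster(data, times)
print(sorted(sorted(c) for c in clusters))
```

Scoring merges by marginal likelihood means the stopping point falls out of the model rather than a user-chosen number of clusters: a merge is accepted only when one shared regression explains the combined profiles better than two separate ones.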

Original publication

DOI

10.1073/pnas.0408393102

Type

Journal article

Journal

Proceedings of the National Academy of Sciences of the United States of America

Publication Date

15/11/2005

Volume

102

Pages

16939 - 16944

Addresses

Department of Mathematics, Imperial College London, Huxley Building, 180 Queens Gate, London SW7 2AZ, United Kingdom. n.heard@imperial.ac.uk

Keywords

Cell Line; Animals; Anopheles gambiae; Zymosan; Cluster Analysis; Bayes Theorem; Gene Expression Profiling; Immunity; Gene Expression; Algorithms; Models, Genetic