Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

Single-cell RNA-seq data allows insight into normal cellular function and various disease states through molecular characterization of gene expression on the single cell level. Dimensionality reduction of such high-dimensional data sets is essential for visualization and analysis, but single-cell RNA-seq data are challenging for classical dimensionality-reduction methods because of the prevalence of dropout events, which lead to zero-inflated data. Here, we develop a dimensionality-reduction method, (Z)ero (I)nflated (F)actor (A)nalysis (ZIFA), which explicitly models the dropout characteristics, and show that it improves modeling accuracy on simulated and biological data sets.

Original publication

DOI

10.1186/s13059-015-0805-z

Type

Journal article

Journal

Genome biology

Publication Date

02/11/2015

Volume

16

Addresses

Department of Statistics, University of Oxford, 1 South Parks Road, OX1 3TG, Oxford, UK. emma.pierson@st-annes.ox.ac.uk.

Keywords

Models, Statistical, Gene Expression Profiling, Sequence Analysis, RNA, Principal Component Analysis, Software, Single-Cell Analysis