SINCERA: A Computational Pipeline for Single Cell RNA-seq Profiling Analysis

=================================
DESCRIPTION
=================================

Author: Minzhe Guo (minzhe.guo@cchmc.org)
Version: a04142015
License: GNU General Public License v3

This is the R source code of the SINCERA analytic pipeline for the analysis of single-cell RNA-seq data from heterogeneous cell populations.

To use this script, you will need the R statistical computing environment (version 3.1.3 or later) and several packages freely available through Bioconductor and CRAN.

SINCERA is in the ALPHA stage of development and is under heavy active development. Core features have been implemented. We are improving the documentation, completing features, and refining the interfaces.

Updates of SINCERA will be distributed primarily through the SINCERA website at http://research.cchmc.org/pbge/sincera.html. We are also in the process of submitting the package to Bioconductor (http://www.bioconductor.org).

=================================
FILES
=================================

1) E16.5.Rd: R data object file containing single-cell RNA-seq expression data from mouse lung at E16.5, and gene-celltype association data downloaded from EBI Expression Atlas.

    exprs.m: the FPKM values of 36188 Ensembl genes in 148 cells
    cells: the information of the 148 cells; the cluster membership of cells used in the manuscript is encoded in the column "CLUSTER"
    genes: the information of the 36188 Ensembl genes
    celltype.gene.association: processed cell type and gene association data downloaded from EBI Expression Atlas (http://www.ebi.ac.uk/gxa/)
    celltype.genome.count: the number of hits of each cell type among the 36188 Ensembl genes
    data4cm: data for demonstrating the consensus-maximization-based refinement of regulatory target prediction for Nkx2-1 in epithelial cells

2) sincera: R functions that implement the pipeline.

3) demo.R: R functions that use SINCERA to analyze the E16.5 data.
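The contents of the data file listed above can be checked before running the pipeline. The following is a minimal sketch, not part of SINCERA: it assumes that E16.5.Rd was written with R's save() and can therefore be read with load(), and the helper name summarize_sincera_data is hypothetical.

```r
# Sketch: load the demo data into its own environment and summarize it.
# Assumption: the file was created with save(), so load() can restore it.
# summarize_sincera_data is a hypothetical helper, not a SINCERA function.
summarize_sincera_data <- function(path = "E16.5.Rd") {
  if (!file.exists(path)) {
    stop("Data file not found: ", path)
  }
  env <- new.env()
  # load() returns the names of the objects it restored.
  loaded <- load(path, envir = env)
  # Report dim() for matrices and data frames, length() for everything else.
  sizes <- lapply(loaded, function(name) {
    obj <- get(name, envir = env)
    if (is.null(dim(obj))) length(obj) else dim(obj)
  })
  names(sizes) <- loaded
  sizes
}
```

Calling summarize_sincera_data("E16.5.Rd") from the SINCERA directory returns a named list of object sizes. If the file matches the description above, the entry for exprs.m should report 36188 genes and 148 cells.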
=================================
INSTALLATION AND RUNNING THE DEMO
=================================

1) Download and unzip the SINCERA package.

2) Open the R GUI (instructions for downloading and installing the latest version of the R computing environment can be found at http://cran.rstudio.com/).

3) Change the working directory of the R GUI to the SINCERA directory.

4) Run the demo by sourcing demo.R in the R GUI.

=================================
DEPENDENCIES
=================================

The following R and Bioconductor packages are needed by SINCERA:

    * Biobase
    * ROCR
    * RobustRankAggreg
    * G1DBN
    * igraph

SINCERA will try to resolve the dependencies automatically. If the dependencies cannot be resolved, please try the following script for installation or refer to the website of each package for more information.

    # Biobase is installed through the Bioconductor installer
    if (!require(Biobase)) {
        source("http://bioconductor.org/biocLite.R")
        biocLite(c("Biobase"))
        library(Biobase)
    }

    # The remaining packages are installed with install.packages()
    if (!require(ROCR)) {
        install.packages("ROCR")
        library(ROCR)
    }
    if (!require(RobustRankAggreg)) {
        install.packages("RobustRankAggreg")
        library(RobustRankAggreg)
    }
    if (!require(G1DBN)) {
        install.packages("G1DBN")
        library(G1DBN)
    }
    if (!require(igraph)) {
        install.packages("igraph")
        library(igraph)
    }

=================================
CHANGE LOG
=================================

[03/01/2015] SINCERA a03012015
    Core functions are ready.

[04/14/2015] SINCERA a04142015
    Refined interfaces.

We expect to release the next version of SINCERA, with refined functionality and documentation, in May 2015.