Skip to main content
Figure 1 | Genome Medicine

Figure 1

From: Calling genotypes from public RNA-sequencing data enables identification of genetic variants that affect gene-expression levels

Figure 1

Growth of publicly available RNA-seq data and analysis workflow. (a) Over the past years the number of available public RNA-seq samples has increased exponentially (exponential fit r2 > 0.991). (b) General overview of the steps taken to process, quality control and integrate all samples. LCL, lymphoblastoid cell line; PCA, principal component analysis. (c) Overview of the diversity of 4,978 samples used for expression clustering. Three samples having read lengths >140 (365, 452, 151 bases) are omitted from the read length plot.

Back to article page