- Open Access
Development of double-positive thymocytes at single-cell resolution
Genome Medicine volume 13, Article number: 49 (2021)
T cells generated from thymopoiesis are essential for the immune system, and recent single-cell studies have contributed to our understanding of the development of thymocytes at the genetic and epigenetic levels. However, the development of double-positive (DP) T cells, which comprise the majority of thymocytes, has not been well investigated.
We applied single-cell sequencing to mouse thymocytes and analyzed the transcriptome data using Seurat. By applying unsupervised clustering, we defined thymocyte subtypes and validated DP cell subtypes by flow cytometry. We classified the cell cycle phases of each cell according to expression of cell cycle phase-specific genes. For immune synapse detection, we used immunofluorescent staining and ImageStream-based flow cytometry. We studied and integrated human thymocyte data to verify the conservation of our findings and also performed cross-species comparisons to examine species-specific gene regulation.
We classified blast, rearrangement, and selection subtypes of DP thymocytes and used the surface markers CD2 and Ly6d to identify these subtypes by flow cytometry. Based on this new classification, we found that the proliferation of blast DP cells is quite different from that of double-positive cells and other cell types, which tend to exit the cell cycle after a single round. At the DP cell selection stage, we observed that CD8-associated immune synapses formed between thymocytes, indicating that CD8sp selection occurred among thymocytes themselves. Moreover, cross-species comparison revealed species-specific transcription factors (TFs) that contribute to the transcriptional differences of thymocytes from humans and mice.
Our study classified DP thymocyte subtypes of different developmental stages and provided new insight into the development of DP thymocytes at single-cell resolution, furthering our knowledge of the fundamental immunological process of thymopoiesis.
The thymus gland is a specialized organ that is highly conserved in vertebrates . In the thymus, most CD4/CD8 double-negative (DN) precursors develop into the αβ-T cell lineage and then go through CD4/CD8 double-positive (DP) and CD4/CD8 single-positive (SP) stages to achieve maturation . While developing into DP cells, thymocytes undergo rapid and extensive proliferation, which generates a burst of DP cells and supplies a large pool of T cell clones [3, 4]. When DP cells become quiescent from expansion, rearrangement of the TCRα chain is initiated to form mature TCRs. Then DP thymocytes are subjected to positive and negative selection, which depend on TCR interaction with the peptide/major histocompatibility complex (MHC) complex (pMHC). In general, the TCRα chain is continuously rearranged until a MHC-restricted TCR αβ heterodimer is achieved [5, 6]. Although DP cells are strongly sensitive to TCR stimulation [7, 8], most DP cells appear to be neglected with no appropriate pMHC signals in their lifespan and undergo programmed cell death. Only DP thymocytes bearing TCRs that interact with self-peptide/MHC complexes of low affinity will acquire MHC restriction and differentiate into CD4/CD8 SP cells. A nonself-reactive TCR repertoire is ultimately shaped through negative selection. Thymocytes that survive the thymopoiesis process become naïve T cells and migrate out of the thymus.
Traditional studies have mainly focused on bulk-level analyses and have largely depended on known markers of relevant developmental stages. In recent years, single-cell sequencing technology has offered researchers the opportunity to explore scientific issues from a much wider and deeper perspective. For example, Kernfeld et al. and Zeng et al. reported a comprehensive depiction of the development of the fetal thymus in mice and humans, respectively [9, 10]. Lavaert et al. studied the single-cell transcriptional dynamics of human postnatal thymus seeding progenitors , and Zhou et al. revealed the regulatory gene expression dynamics leading to lineage commitment in DN thymocytes . These studies have helped us to understand the development of early thymocytes from a variety of aspects. However, the DP cells that account for approximately 80% of the proportion in the mature thymus were either not present among [9, 10] or selectively excluded from [11, 12] the target cell types used in these studies. Recently, Park et al. presented a comprehensive landscape of human thymocyte development . In addition, Le et al. described the lineage specification trajectories and commitment spectrum of human thymocytes . Nevertheless, the development of DP cells in mice has not been well characterized to date.
In this study, we performed single-cell sequencing of mouse thymocytes and reconstructed the entire developmental trajectory in detail. We classified the main cell types of thymocytes and identified three subtypes in the DP stage. We then developed a flow cytometry gating strategy to partition these DP subtypes at the protein level. Based on unsupervised classification and pseudotime analysis, we found that DP thymocytes undergo an unique mechanism of cell division that is different from that of DN cells and other cell types. During the selection stages, the activity of MHC-I molecule-associated antigen presentation was significantly upregulated, suggesting a process of antigen presentation and recognition between thymocytes. To confirm this hypothesis, we examined immune synapses between thymocytes. Moreover, we carried out cross-species comparisons and identified species-specific transcription factors (TFs) that contribute to the transcriptional differences between thymocytes from humans and mice. Together, our study provides new insight into the development of DP thymocytes at single-cell resolution, and these findings help us to better understand this fundamental immunological process.
The objective of this study was to use scRNA-seq to capture the transcriptomes of αβ-T cells in the thymus to describe important events during αβ-T cell development. Flow cytometry was employed to confirm observations from sequencing datasets to validate expression levels at baseline. Immunofluorescence and imaging flow cytometry were used to assess the existence of CD8-specific selection between thymocytes. Details on the sample collection and processing are described below.
C57BL/6 mice were ordered from Beijing Vital River Laboratory Animal Technology and maintained under specific pathogen-free conditions until the experiments were performed. All mouse experiments in this study were reviewed and approved by the Institutional Animal Care and Use Committee of the University of Science and Technology of China.
Thymus tissues were harvested from mice aged 6–8 weeks and gently ground in 1 mL of RPMI-1640. Thymocytes in a single-cell suspension were counted after being passed through a 40-μm nylon mesh filter.
For surface marker staining, cells were labeled with fluorescent antibodies at 4 °C for 30 min and washed twice with 1× phosphate-buffered saline (PBS, Sangon, China). For intracellular marker staining, the cells were fixed with 1% paraformaldehyde (PFA, Sangon) at 4 °C for 10 min, washed with 1× perm/wash buffer (BD Bioscience) once, and incubated in 1× perm/wash buffer for 30 min at 4 °C. Next, the cells were labeled with fluorescent antibodies at 4 °C for 30 min and washed twice with 1× perm/wash buffer. After the final wash, the cells were analyzed or sorted using an SH800S cell sorter (Sony).
Imaging synapses between thymocytes
To maintain the natural attachment between adjacent thymocytes, we gently cut the thymus with surgical scissors in an Eppendorf tube; 1 mL of 1× PBS was added to resuspend the thymocytes, and the thymic debris was allowed to settle. The suspension was pipetted and passed through a 40-μm nylon mesh filter. After counting the cells, approximately 1 × 107 thymocytes were labeled with fluorescent anti-mouse antibodies at 4 °C for 30 min in approximately 100 μL 1× PBS. Then, 30 μL of 4% PFA was directly added and gently mixed, and the cells were fixed at 4 °C for 10 min. The cells were briefly centrifuged at 100×g for 1 min to avoid the formation of a tight cell pellet and resuspended in 200 μL of 1× perm/wash buffer. After staining with 488-labeled phalloidin for 20 min at room temperature, the cells were centrifuged at 100×g for 1 min and resuspended in 60 μL of 1× PBS. For ImageStream experiments, cells were directly examined using an Amnis ImageStream Mk II Imaging Flow Cytometer (Luminex). For immunofluorescence experiments, cells were diluted to a proper concentration, seeded on poly-l-lysine-coated slides and observed by confocal microscopy (ZEISS LMS 880).
Cell proliferation staining
DPbla/re cells were sorted using the surface markers CD45+CD4+CD8+CD2low, and the total thymocyte population and DPbla/re cells were separately labeled with UltraGreen (AAT Bioquest) cell dye. After culturing in serum-free UltraCulture medium (Lonza) for 24 h, proliferative cells were identified by fluorescence intensity analysis.
Anti-mouse CD4 (BV421, PE, APC, and Percp-Cy5.5), CD8 (FITC, APC, and PE-Cy7), Ly6d (FITC), CD2 (PE and APC), CD69 (PE), Ki67 (BV421), CD3e (BV421), and H2 (PE) antibodies were purchased from BioLegend. Anti-mouse RORγt (PE) antibodies were obtained from BD Bioscience. All fluorescent antibodies were used according to the user manuals.
Single cells were captured in droplet emulsions using a GemCode Single-Cell Instrument (10X Genomics), and scRNA-seq libraries were constructed with the 10X Genomics protocol using a GemCode Single-Cell 3′ Gel Bead and Library V2 Kit. The libraries were sequenced using a HiSeq X-10 Sequencing System (Illumina). scATAC-seq experiments were performed as previously described .
Sequences were aligned using Cell Ranger version 1.3.1 from 10X Genomics with default parameters. The GRCm38.p5 assembly was used as the reference genome, and ribosomal RNA, mitochondrial RNA (Mt-RNA), and pseudogenes were removed. Cells with fewer than 500 detected genes and more than 4500 detected genes for which total mitochondrial gene expression exceeded 40% were removed. Genes that were expressed in fewer than three cells were also removed. Quantile normalization was performed using qnorm in R.
Identification of cell types and subtypes
Standard procedures for filtering, variable gene selection, dimensionality reduction, clustering, and identifying marker genes were performed using the Seurat package version 2.3.0.
FindVariableGenes (x.low.cutoff = 0.025, x.high.cutoff = 3, y.cutoff = 0.5).
FindClusters (reduction.type = “pca”, dims.use = 1:20, resolution=2, print.output=0).
RunTSNE (dims.use = 1:20, do.fast = TRUE).
FindAllMarkers (only.pos = TRUE, min.pct = 0.5, thresh.use=0.5).
Merging Tabula Muris data
We downloaded raw thymocyte single-cell RNA-seq data from Tabula Muris . Sequencing data were aligned using Cell Ranger version 1.3.1 from 10X Genomics with default parameters. The genome build used was GRCm38.p5 without pseudogenes, ribosomal genes, and mitochondrial genes. Then, we used Seurat (V 3.1.4) to integrate two datasets for analysis with the following key parameters.
FilterCells (nFeature_RNA > 1000, nFeature_RNA < 5000 and percent.mt < 10).
FindVariableFeatures (nfeatures = 2000).
FindIntegrationAnchors (dims = 1:40), IntegrateData (dims.use = 1:20).
RunPCA (npcs = 40), FindClusters (resolution = 1).
The robustness of clustering was tested by Seurat analysis under 25 different conditions with combinations of five resolution values (Res = 0.8, 0.9, 1, 1.1, 1.2) and five values for the number of neighbors in the initial graph (k = 15, 20, 25, 30, 35). We then calculated the consistency of clustering for each cell pair by their co-occurrence count across the 25 parameter settings.
Developmental trajectory and cell order analysis
After cell filtering, data were prepared for visualization and population balance analysis (PBA)  by constructing a k-nearest neighbor (kNN) graph, in which cells correspond to graph nodes and edges connect cells to their nearest neighbors. The kNN graphs were visualized using a force-directed layout with a custom interactive software interface called SPRING and default parameters. The cell order was predicted by PBA , which calculates a scalar “potential” for each cell that is analogous to a distance, or pseudotime, from an undifferentiated source and a vector of fate probabilities that indicates the distance to fate branch points. For a self-renewing system, the sum of all cells satisfies the constraint ∑i Ri = 0. We assigned negative values to R for the ten cells with the highest expression of CD4sps and CD8sps marker genes. We assigned an SP value to all remaining cells, and the value was chosen to enforce the steady-state condition ∑i Ri = 0. The cell order was scaled according to the stage of cell development.
Identifying dynamically varying genes
For each gene, a sliding window (n = 20 cells) across the cell ordering was used to identify windows with the maximum and minimum average expression, as previously described . Transition points between stages were defined using the frequency of gene inflection points and patterns of PBA-predicted fate probabilities. However, owing to the continuous nature of transcriptional states, the locations of these transitions should be considered approximate. The inflection point density is the number of genes that turn on or off at a given point on the trajectory. For each gene, inflection points were identified at the points with maximally increasing or decreasing expression, as follows. First, the trajectory of each dynamically varying gene was smoothed using Gaussian smoothing with a width σ = 1% of the total trajectory. The gene expression derivative for gene k, denoted dk, was then computed. Inflection points were identified as the points with the maximum or minimum derivatives for each gene. To exclude maxima or minima resulting from relatively small fluctuations in gene expression, only appreciably large extrema were retained for further analysis. Specifically, a point with a derivative for gene k, max (dk), was kept only if
We chose the threshold Q = 3 for this study and then plotted the density of these inflection points over the cell ordering axis. Regions with large-scale changes in gene expression have a high density of inflection points, whereas relatively stable states are characterized by low density .
Enrichment of TFs using SCENIC
Transcription factors from scRNA-seq data were identified using SCENIC  with the default parameters. An enrichment score less than the threshold was defined as no enrichment (0), and nCellsAssigned TFs less than 10% and greater than 90% were removed. The average of each subgroup was obtained based on the previous clustering results, and TFs with large differences (> 0.4) between subgroups were identified.
Mapping the relationship between TFs and global changes in genes
Time-specific TFs and the possible regulatory relationship between TF and genes were obtained from the SCENIC results. Next, TF genes with high correlations were screened out (correlation > 0.4) based on the correlation between TF and gene expression. These TF genes were assigned to different stages according to the TF and gene expression patterns. Circos was then used to map genes regulated by TFs.
We used the general mapping, alignment, peak calling, and motif searching procedures to process the scATAC-seq data from APEC  and ATAC-pipe . We also implemented a Python script in ATAC-pipe to trim adapters in the raw data (in paired-end FASTQ files for each single-cell sample). APEC utilizes BOWTIE2 to map trimmed sequencing data to the corresponding genome index and PICARD for sorting, duplicate removal, and fragment length counting of the aligned data. The pipeline calls peaks from the merged file of all cells using MACS2 and ranks and filters out low-quality peaks based on the false discovery rate (Q-value). The genomic locations of the peaks were annotated by HOMER, and motifs were searched using FIMO. APEC calculates the number of fragments and the percent of reads mapped to the transcriptional start site (TSS) region (±2000 bp) in each cell and filters out high-quality cells. Finally, we obtained 1933 cells with 600 accessons  (peak group with a similar pattern). TF enrichment in each cell was obtained for downstream analysis.
scRNA-seq and scATAC-seq integration analysis
We merged the genes from scRNA-seq data into gene groups corresponding to accessions based on the genes corresponding to the peak in the accessions above. We then integrated the scRNA-seq and scATAC-seq (in the form of cell × accession) data according to the accessions. The mutual nearest neighbor was used to find the anchor and weight of the two data points on the nearest 100 canonical correlation spaces (60 canonical correlation spaces were used when analyzing the integrated data). The t-SNE and PBA cell order from scRNA-seq data were assigned to each cell for the scATAC-seq data according to the anchor and weight. Finally, the motif enrichment information from scATAC-seq was assigned to each scRNA-seq cell based on the average of the 10 cells of the PBA order. The cell type information from scRNA-seq was assigned to scATAC-seq.
Cell cycle analysis
We defined genes from the cell cycle Gene Ontology (GO) category in the Mouse Genome Informatics (MGI) database as genes associated with the cell cycle. The average of all cell cycle-related genes was taken as the cell cycle score for each cell. Genes with periodic expression correlating with the cell cycle  were used to generate a cell cycle phase score for each cell, after which a phase score was calculated for each phase (G1/S, S, G2/M, M, and M/G1) by averaging expression traces for the genes specific to that phase. Finally, the z score of each phase in each cell was calculated, and cells with a z score greater than 1 (one standard deviation) were classified into each phase. If the z score of a cell in each phase was less than 1, the cell was not considered to be in the cell cycle. The status was divided into different phases (G1/S, S, G2/M, M, M/G1, and not cell cycle) for each cell. Read counts were normalized using qnorm, and numpy.polyfit (deg = 10) was used to fit the data.
Removing cell cycle-related genes and reclustering analysis
All genes related to the cell cycle (genes from the cell cycle GO term in the MGI) were removed from the data, and then reclustering was performed as previously described (resolution = 1.9). The overlap ratio of the four subpopulations (DPblas and the original subpopulations) after the removal of cell cycle-related genes was determined.
Reclustering of DPblas was also performed in accordance with the previous parameters (resolution = 1).
Cell cycle phase analysis of the Tabula Muris thymus, erythroid and neuron data sets
The cell cycle phase analysis method used was the same as previously described, and the default parameters in Seurat and PBA were applied to calculate the cell cluster and cell order of the Tabula Muris thymus (Seurat resolution = 2) and neuron (Seurat resolution = 0.2) datasets. We used GSE109774  for the Tabula Muris thymus dataset. We used GSE89754  for the erythroid dataset and GSE93593  for the neuron dataset. The erythroid PBA cell order was based on the results of the original paper.
Cell cycle phase analysis of the human thymus data sets
The cell cycle phase analysis method employed was the same as previously described, and in Seurat (V 3.1.4), integrated methods (min.cells = 3,min.features = 500, PC = 40, resolution = 1.2) and PBA were applied to calculate the cell cluster and cell order of the 24-year-old human thymus E-MTAB-8581  dataset.
Cell cycle phase analysis of the early T cell data sets
The cell cycle phase analysis method was the same as previously described. Seurat (V 3.1.4), integrated methods (min.cells = 3,min.features = 500, PC = 40, resolution = 1.2) and PBA were used to calculate the cell cluster and cell order of the early T cell dataset GSE130812 .
Human mouse cross-species comparison analysis
We used Seurat (V 3.1.4) to integrate the mouse and human single-cell thymocyte datasets. The parameters used for the analysis were as follows:
FilterCells (nFeature_RNA > 1000, nFeature_RNA < 5000 and percent.mt < 10).
FindVariableFeatures (nfeatures = 2000).
FindIntegrationAnchors (dims = 1:20), IntegrateData (dims.use = 1:20).
RunPCA (npcs = 20), FindClusters (resolution = 1).
Genes that were differentially expressed between the species were identified with two steps: (1) genes that were expressed in one species but not the other (expressed: nCells> 20, not expressed: nCells < 3); (2) for each stage, marker genes (fold change > 0.2, background expression < 1) for human and mouse data were combined, and if the list was larger than 400, only the top 400 significant genes were included. Then, differentially expressed genes across species were identified (fold change > 0.25) for all developmental stages. Genes in either (1) or (2) were regarded as differentially expressed genes between humans and mice.
Cells were divided into 100 bins along the developmental pseudotime trajectory, and smoothed gene expression values were calculated with the numpy.polyfit (n = 10) option.
A TF list was obtained from The Human Transcription Factors database , while the transcriptomes for human samples were obtained from different datasets (see “Availability of data and materials”). Genes regulated by each TF were based on gene set enrichment analysis (GSEA) . The significance of whether a TF regulates cross-species differential genes was calculated by Fisher’s exact test.
ChIP-seq data for Gata1 were obtained from Cistrome Data Browser  with the accession numbers GSM912907, GSM867158, and GSM867159. RNA-seq data for the human and mouse thymus were obtained from ENCODE  under accession numbers GSM1220578, GSM1220591, GSM1220592, GSM1220593, GSM1220599, GSM1220601, GSM1010944, GSE78390, GSM970852, GSE93469, and GSE90183. All data were normalized by sequencing depth, and differential analysis was performed with a t-test. Genes with a fold change > 1 and p value < 0.01 were regarded as significantly differentially expressed.
The detailed statistical methods and parameters used in this study are described above. The original analysis codes and scripts can be accessed at https://github.com/QuKunLab/T-cell-development .
Single-cell transcriptome profile depicts αβ-T cell development in the thymus
We profiled lymphocytes in the mouse thymus with a droplet-based single-cell RNA sequencing (scRNA-seq) platform and obtained a total of 2004 cells. Of these, 1986 single cells with an average of 1784 detected genes per cell passed the quality control (see “Methods”) and were used for further analysis (Fig. 1a, Additional file 1: Figure S1A, B). Unsupervised analysis was performed with Seurat , and the cells were clustered into 15 subgroups (Fig. 1b). We divided the cells into 7 developmental stages of thymocytes according to the expression of marker genes: DN progenitors (Il2ra+), immature SP thymocytes (ISPs, Cd4− Cd8+ Mki67+), DP blasts (DPblas; Cd4+ Cd8+ Mki67+), DP thymocytes undergoing rearrangement (DPres; Cd4+ Cd8+ Rag1high), DP cells under selection (DPsels; Cd4+ Cd8+ Itm2a+), and CD4/CD8 SP thymocytes (CD4sps and CD8sps). Cells in different stages were aggregated in a two-dimensional t-distributed stochastic neighbor embedding (t-SNE) plot that outlined the developmental trajectory of αβ-T cells (Fig. 1b).
To verify the developmental stages of the thymocytes we observed, we constructed a pseudotime trajectory using an unsupervised SPRING algorithm and PBA , and the predicted timeline was consistent with the clustering results (Additional file 1: Figure S1C, Fig. 1c). Briefly, CD25 (Il2ra) represents the most DN subset, and β-selection commits most DN cells to an αβ-T cell fate. When progressing to the ISP stage, Cd8 expression is turned on, and the thymocytes proliferate rapidly into the DPbla stage with the onset of Cd4 expression, upregulating cell cycle-associated genes such as Ki-67 (Mki67). With the help of Rag1 and Rag2 genes, TCRα loci are next rearranged in DPre cells to acquire a mature TCR signal for positive selection in the DPsel stage. Finally, CD4sp and CD8sp cells that survive negative selection will migrate out of the thymus via chemotaxis signals (Fig. 1d, e). Notably, this classification is quite similar to the thymocyte subpopulations in the human thymus .
In addition to 13 thymocyte clusters, a group of antigen-presenting cells (APCs) and a group of recently identified nonconventional lymphocytes (NCLs)  were defined (Fig. 1d, Additional file 1: Figure S1D). Consistently, many stage-specific markers of thymocyte development were highly expressed in the coordinated clusters (Fig. 1e). To confirm our single-cell partition results, we reanalyzed the thymocyte profile from the Tabula Muris dataset  and detected 1364 thymocytes; then, we integrated them with our data and obtained 3320 cells passing quality control. As expected, the clustering results were nearly identical (Additional file 1: Figure S2A, B). Moreover, thymocytes from both datasets were distributed proportionally across clusters with the same markers (Additional file 1: Figure S2C, D). We also compared our single-cell clusters with bulk-sorted thymocyte subpopulations  and observed good correlation between the corresponding subtypes of thymocytes (Additional file 1: Figure S2E). To evaluate whether the clustering of the thymocytes was stable, we performed Seurat clustering on the integrated data under 25 conditions with 5 resolutions (Res = 0.8, 0.9, 1.0, 1.1, 1.2) and 5 nearest neighbor values (K = 15, 20, 25, 30, 35) (Additional file 1: Figure S3A) and calculated the consistency of the cluster assignment for each cell. We found that pairs of cells that clustered together in the original analyses consistently clustered together across parameter settings (Additional file 1: Figure S3B), which suggests that the clustering results are quite robust.
Single-cell ATAC-seq revealed new transcription factors involved in thymocyte development
The development of thymocytes is driven by the interplay of multiple TFs. However, a comprehensive depiction of TF regulatory dynamics during thymocyte development is lacking. As we constructed the pseudotime development trajectory from the single-cell snapshot of thymocytes (Fig. 1c), we next investigated the dynamics of gene expression during thymocyte development based on the pseudocoordinate of each cell (Fig. 2a). We reconstructed the dynamics of enriched TFs using single-cell regulatory network interference and clustering (SCENIC) . In addition to the essential TFs previously reported, such as Myc, Rorc, Gata3, Zbtb7b and Runx3 , novel TFs, such as Srf, were found to be candidate TFs that regulate thymocyte maturation (Fig. 2b). Egr3 is reported to be essential in the proliferation of thymocytes at the transition of DN to DP . Interestingly, we found that Egr3 might also function at the DPsel and SP stages (Fig. 2b).
To validate enrichment of TFs in each stage, we used a single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) of thymocytes (Additional file 1: Figure S4A) and anchored the single cells from chromatin accessibility profiles onto those from scRNA-seq (Fig. 2c, Additional file 1: Figure S4B). We measured TF motif enrichment at each developmental stage according to the chromatin-open status of thymocytes and observed TF motif enrichment patterns similar to those of the SCENIC predictions (Additional file 1: Figure S4C). For example, the Srf and Erg3 motifs were highly enriched in DPsels and SP subtypes (Fig. 2d), suggesting that these TFs function in the development of thymocytes. We also mapped scATAC-seq data onto the integrated data and found similar TF enrichment patterns (Additional file 1: Figure S4D-F). Next, we constructed a regulatory network of enriched TFs for each developmental stage according to the coexpression pattern between TFs and marker genes considering the binding motifs in promoter regions (Additional file 1: Figure S5). A Circos plot showed that these stage-specific TFs control expression of marker genes at their stages and helped thymocytes to transition to the next stage (Fig. 2e), illustrating the programmed development progress of thymocytes under the regulation of TFs.
Ly6d and CD2 can serve as new markers to partition DP subtypes
DP cells account for approximately 80% of thymocytes and undergo essential processes of proliferation, rearrangement and selection. However, the subclassification of DP cells remains unclear due to a lack of specific markers for distinguishing thymocytes undergoing different developmental stages. In this study, we classified DP cells into three subtypes at the transcriptome level according to the RNA expression pattern and relative biofunction of marker genes: DPbla, DPre, and DPsel. To identify specific markers, we explored differentially expressed cell surface marker genes across thymocyte subtypes (Fig. 3a) and observed that Ly6d was highly expressed in thymocytes but dramatically decreased in DPsels and SP cells. In contrast, expression of Cd2 was relatively low before the DPsel stage but was extensively elevated in DPsels and SP cells (Fig. 3a, b). When integrating with Tabula Muris data, we observed very similar expression patterns of Ly6d and CD2 across thymocyte subtypes (Additional file 1: Figure S6). These results are consistent with previously reported transcriptional dynamics at the bulk level . To verify whether surface marker expression is consistent with RNA expression, we performed flow cytometry analysis for Ly6d. As expected, Ly6d expression increased from the DN2 stage and remained high until the SP stage (Fig. 3c).
Ly6d is reported to identify the branch point of B cell and T cell development , but its behavior in T cell development has not been described. CD2 is essential for thymic selection . Thus, Ly6d and CD2 are potential markers to distinguish the developmental stages of DP thymocytes, and we performed flow cytometry analysis to assess whether Ly6d and CD2 may serve as new markers to partition DP subtypes. Together with Ki67 and CD3, we successfully identified ISPs and three DP subtypes: ISPs, CD4−CD8+Ly6d+CD3−; DPblas, CD4+CD8+CD2lowKi67+; DPres, CD4+CD8+Ly6dhiCD2lowKi67−; and DPsels, CD4+CD8+CD2hi (Fig. 3d). To verify this gating strategy, we also examined several known stage-specific markers, such as CD24, Rorγt, and CD69. CD24 was highly expressed on early thymocytes  and downregulated from DPblas and DPres. Rorγt, a DP marker , was differentially expressed on DP subtypes and downregulated on DPsels. CD69, a marker for DP cell selection , was upregulated on DPsels. These protein expression dynamics confirmed that the DP subtypes we defined were at different developmental stages (Fig. 3e).
DPblas differentiate into DPres during the cell cycle
When committed to the αβ-lineage, thymocytes undergo rapid proliferation and transition to DP cells. Consistently, Mki67 was highly expressed on ISPs and DPblas (Fig. 1e). To investigate the proliferation process in detail, we quantified the average expression level of all cell cycle-associated genes during the developmental process and observed strong proliferation of ISPs and DPblas (Fig. 4a). Interestingly, we noticed that the expression patterns of cell cycle-associated genes in ISPs and DPbla1–4 were quite different, indicating different proliferation statuses. Therefore, we classified the cell cycle phases (G1/S, S, G2/M, M or M/G1) of all thymocytes based on phase-specific gene expression  (see “Methods”). We found that 40% of ISPs were in M/G1 phase, indicating strong continuous cell cycle proliferation. However, when cells entered the DPbla stage, the M/G1 phase was rarely observed (Fig. 4b), indicating that most DPblas exited the cell cycle after proliferation. Consistent with this, the four cell cycle phases appeared to be distributed in chronological order along the developmental trajectory of DPblas, whereby most DPbla1s were in G1/S phase, DPbla2s were in S and G2/M phases, and DPbla3s and DPbla4s mainly stayed in G2/M and M phases (Fig. 4b). The expression dynamics of cell cycle phase-specific genes also coordinated with the chronological order of the cell cycle phases in DPblas (Fig. 4c).
The observed correlation of cell cycle phases and predicted DPbla stages might be caused by the following: (1) the single-cell clustering and pseudotime prediction analysis were dominated by cell cycle genes or (2) DPbla cells differentiated into DPres during the cell cycle and tended to exit the cell cycle after division. Thus, we excluded all cell cycle-associated genes and reanalyzed the cell clustering and pseudotime trajectory with the same parameters (Additional file 1: Figure S7) and found that the chronological order of cells with or without cell cycle-associated genes correlated significantly (Pearson r = 0.995, p value < 10− 24, Fig. 4d). The distributions of cell identities at the four DPbla stages were also very similar (Fig. 4e). These results suggest that DPbla cells differentiate into DPres during the cell cycle and exit the cell cycle after division.
To test this hypothesis, we analyzed publicly available single-cell RNA profiles of mouse and human thymocytes (Additional file 1: Figure S8, 9) [12, 13, 16] in the same manner. Similar to our data, we observed a conserved chronological distribution of cell cycle phases in mouse and human DPblas (Fig. 4f). However, during the proliferation of DN thymocytes, the cell cycle phases seemed not to be sequentially distributed (Fig. 4g, Additional file 1: Figure S10), suggesting different proliferation mechanisms in DN and DP thymocytes. We also assessed proliferation during the development of other cell types, such as erythroid cells  and neurons  (Additional file 1: Figure S11), though the cell cycle phases were randomly distributed during the proliferation stages in both cell types, suggesting that the chronological distribution of the phases in development may be DPbla specific (Fig. 4f, g).
To validate the tendency of DPblas to exit the cell cycle, we sorted DPblas/DPres, cultured them in vitro, and quantified the proportion of proliferative cells among DPblas/DPres by UltraGreen staining. For DPblas, only 14% of proliferating cells divided more than once compared to 43% in total thymocytes (Fig. 4h), confirming that DPblas tended to exit the cell cycle after division. In addition, we investigated differentially expressed genes between DPbla subgroups and found that genes involved in T cell recombination and selection were enriched at the end of the DPbla stage (Additional file 1: Figure S12). This result suggests that the cell cycle process during the DPbla stage may help in preparing for the initiation of TCRα chain recombination and the positive selection progress, which occurs in DPres.
Thymocytes serve as APCs for CD8-associated thymic selection
Following the DPbla stage, thymocytes in the DPre stage enter a relatively quiescent phase  with little alteration in gene expression (Fig. 2a) to achieve TCRα chain rearrangement. After TCRα chain recombination and TCR maturation are complete, thymocytes are subjected to thymic selection in the DPsel stage. We identified three DPre and two DPsel subtypes. To verify differences among these subtypes, we analyzed differentially expressed genes between the DPre and DPsel subtypes (Additional file 1: Figure S13A, B) and found that these subtypes appear to be in different developmental stages. For example, mitochondrial cytochrome c oxidase-associated genes were enriched in DPre1, suggesting high metabolic activity. In contrast, genes involved in SRP-dependent cotranslational protein targeting the membrane were enriched in DPre2, which might result from the TCR translocation process. Enrichment of the GO term “T cell differentiation” in the DPre3 cluster might indicate maturation of the TCR signal at the late rearrangement stage (Figure S13C). In terms of DPsel subtypes, DPsel1 cells were highly enriched with the GO term “T cell costimulation”, suggesting that these cells undergo an active selection process. The “cytoplasmic translation” GO term was associated with DPsel2 cells, indicating that these cells are preparing for differentiation into the next stage (Figure S13D).
In addition, we observed strong enrichment of the MHC-I antigen presentation process in DPsels, CD4sps, and CD8sps (Fig. 5a). For example, expression of MHC-I, such as H2-D1, H2-K1, and B2 m, which are essential for antigen presentation, was significantly increased in DPsels and SP cells compared to DPre cells (Fig. 5b). Immunoproteasome subunits such as Psme2 and antigen loading-associated genes such as Tap1 and Tapbp were also upregulated in these stages (Fig. 5b). Furthermore, increased MHC-I antigen presentation activity at selection stages was observed in the integrated mouse and human thymocyte data (Additional file 1: Figures S14, S15). We then performed flow cytometry analysis and verified upregulation of surface MHC-I molecules in the DPsels and SP cells (Fig. 5c). These findings suggest an active MHC-I antigen presentation process in DPsels and SP cells.
CD8-associated positive and negative selection of thymocytes depends on the interaction of CD8, TCR, and the peptide-MHC-I (pMHCI) complex. However, the source of pMHCI has not been well studied. Classically, thymic epithelial cells (TECs) and APCs are considered major contributors. Nevertheless, it has been reported that thymocytes still exhibit normal CD8sp differentiation when pMHCI is expressed only on hematopoietic cells . Moreover, soluble pMHCI can effectively rescue CD8sp differentiation in an MHC-I-deficient fetal thymic organ culture system [37, 38]. These data suggest that CD8sp selection may respond to a wide source of pMHCI, which may include pMHCI on thymocytes themselves. In addition, expression of thymoproteasome subunits, which promote CD8 T cell differentiation [39, 40], was detected in thymocytes (Additional file 1: Figure S16); therefore, these cells are capable of providing a pMHCI signal for CD8-associated thymic selection.
We observed high expression of Cd2 by DPsels and SPs, which is critical for the formation of immunological synapses (ISs) [32, 41]. Thus, when DPsels and SPs interact intensively with proper pMHC on APCs during the selection process, ISs can be observed between them. To investigate whether selection occurs between thymocytes, we performed immunofluorescence experiments to detect ISs formed between thymocytes and adjacent APCs. We found that DP and CD4sp cells formed ISs with CD4-CD8-APCs in the thymus, indicating active CD4-associated selection between them (Fig. 5d). In addition to CD4-specific ISs, we detected CD8-associated ISs between thymocytes (Fig. 5d). When we further investigated adjacent thymocytes by imaging flow cytometry (Fig. 5e), we also observed clear CD8-associated ISs between adjacent DP and SP cells with CD4 evenly distributed on the surface (Fig. 5f). This finding suggests active CD8-associated selection between thymocytes. These results demonstrate that thymocytes can serve as APCs for CD8-associated thymic selection.
Species-specific TFs regulate cross-species transcriptional differences during thymocyte development in humans and mice
To investigate cross-species differences in thymocyte development, we integrated the single-cell transcriptome profiles of adult human and mouse thymocytes. Consistent with a previous report , the clusters of human and mouse thymocytes were mixed well (Fig. 6a, Additional file 1: Figure S17A), indicating that the development of thymocytes is conserved between these species. Nonetheless, cross-species transcriptional differences at the single-cell level occur during thymocyte development [13, 14]. Accordingly, we compared the single-cell transcriptomes of human and mouse thymocytes across all clusters and found many differentially expressed homologous genes, including previously reported genes such as AEBP1, BAALC, and DTX1  (Fig. 6b, c).
To investigate the regulation of transcriptional differences between humans and mice, we focused on differentially expressed TFs (Figure S17B). To this end, we overlapped their regulated genes annotated by MSigDB  with the differentially expressed genes between humans and mice and examined the enrichment to determine whether they might drive cross-species transcriptional differences (Fig. 6d). In humans, genes regulated by differential TFs, such as CREB3L4, HES2, ZNF146, ZNF423, and ZNF711, were significantly enriched among the genes differentially expressed from mice, indicating that these TFs might contribute to cross-species transcriptional differences in humans (Additional file 1: Figure S17B, C). In mice, the candidate driver TFs detected were Ets2 and Gata1 (Fig. 6d, e, Figure S17D). Ets2 belongs to the ETS-domain transcription factor family, which is critical for many bioprocesses, including development ; Gata1 is known to be essential for erythroid and megakaryocytic differentiation . To verify whether Gata1 is capable of regulating these differentially expressed genes between humans and mice, we analyzed ChIP-seq data for Gata1 and found that it binds to genomic regions around these differentially expressed genes, such as Skap1, Ap3s1, and P2ry10 (Fig. 6f, g, Additional file 1: Figure S17E, F). Moreover, Skap1 was also among the most differentially expressed genes in the human and mouse thymus (Fig. 6h). Thus, Gata1 might play a role in driving cross-species transcriptional differences between humans and mice.
The proper development of T cells is essential for effective responses against invading pathogens and for tolerance against self-antigens in the adaptive immune system; it is therefore a fundamental subject of investigation. In this study, we surveyed the mouse thymocyte development process at a single-cell resolution and classified subtypes of DP thymocytes that were consistent with those of human thymocytes . Unlike previous classifications of DP cells, for which gating strategies were mainly based on expression of a few well-studied marker genes, we classified cell subtypes via unsupervised analysis of high-throughput single-cell transcriptomes and then confirmed them by flow cytometry, rendering our classification a better representation of the inherent heterogeneity between cell subgroups. In addition to DP cells, DN and SP cell clusters were detected in our data. We classified SP cells into CD4sp and CD8sp clusters. A small proportion of CD4sp cells appeared to express CD8 mRNAs, which made CD4sp and CD8sp cells seemed not to be pure. However, we observed a very robust classification of these two clusters across different analysis parameters, suggesting that the CD4sp and CD8sp clusters we identified are indeed transcriptionally different. The moderate expression of CD8 mRNA in CD4sp cells may be caused by the delayed degradation of CD8 mRNAs. Overall, the relatively low cell number of our dataset limited the power in terms of separating SP clusters, and more experiments, such as cell sorting and single-cell sequencing, are needed to fully determine the cell purity of the CD4sp and CD8sp clusters.
During the transition from DN to DP, thymocytes undergo 6–8 divisions  to generate a large pool of DP cells. Consistent with this, we observed a notable proportion of M/G1 thymocytes in the DN and ISP stages, indicating continuous cell cycles. Thus, DN thymocytes are self-renewable and support the persistence and differentiation of thymocytes without any input from bone marrow progenitors for a relatively long time. When entering the DPbla stage, thymocytes invoke a different proliferation pathway and differentiate into DPres during the cell cycle, which appears not to be self-renewable. This may be caused by upregulation of RORγt in DP cells, which suppresses the proliferation of thymocytes . Typically, DP cells comprise the majority (60 to 80%) of thymocytes in the first 4 to 5 decades of life. Afterwards, as a consequence of thymic involution, expansion at the early DP stage seems to be disrupted, and DP cells decline dramatically and irreversibly (to less than 15%) . Decreased thymocyte levels result in a deficit in naïve T cell output to the peripheral T cell pool, strongly contributing to the immune insufficiency of older individuals . Hence, proliferation at the early DP stage is crucial to maintain the homeostasis of thymic T cell development. Our study revealed a unique proliferation behavior of DPblas that lacks self-renewal ability. The unique division mechanism of DP thymocytes may lead to a decline in the DP population during thymic involution.
Thymic selection is essential for T cells to establish central tolerance, which depends on the TCR affinity with the self-peptide-MHC complex. Unlike CD4 T cells restricted to MHC-II, which is expressed on only TECs and APCs, CD8 T cells recognize peptides presented on MHC-I, which is widely expressed on a variety of cells, including thymocytes themselves. Our data suggest that upregulated activity of MHC-I antigen presentation in DPsels and SP thymocytes may contribute to CD8sp T cell selection between thymocytes, as also indicated by previously reported data [36,37,38]. This phenomenon may also be related to the thymic preference for CD4 T cells over CD8 T cells, which is usually thought to be a consequence of the default CD4 T cell pathway  or homeostatic mechanisms [49, 50]. Our study results suggest another possibility of easier access to pMHCI in CD8sp selection, accelerating fate decisions for CD8sp cells. Consistent with this, CD8 T cells usually become reconstituted faster than CD4 T cells after hematopoietic stem cell transplantation .
In summary, this study classified the main thymocyte cell types and identified three subtypes of DP cells at different developmental stages. We revealed that Ly6d and CD2 can be used as surface markers to partition these subtypes at the protein level. Based on the classification of DP thymocytes, we found that DPblas differentiate into DPres during the cell cycle, which is specific to DP thymocyte development compared to DN and other cell types. We also observed that at the thymic selection stage, thymocytes can serve as APCs in CD8-associated selection. Based on cross-species comparison, we found that species-specific TFs contribute to the transcriptional differences between thymocytes from humans and mice. Together, this study provides new insight into the development of DP thymocytes at a single-cell resolution and helps us to better understand this fundamental immunological process.
Availability of data and materials
The raw sequence data reported in this paper have been deposited in the Gene Expression Omnibus (GEO) under accession numbers GSE166715, which is also publicly accessible from https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE166715 . The processed data are provided on figshare (the thymus single-cell data generated from this study: https://figshare.com/s/1f910888df4afa4ec53b , the integrated thymus single-cell data: https://figshare.com/s/db3f03095c74c16cf38d ). The original analysis codes and scripts can be accessed at https://github.com/QuKunLab/T-cell-development .
The Tabula Muris thymus dataset was downloaded from GSE109774: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE109774 .
The erythroid dataset was downloaded from GSE89754: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE89754 .
The neuron dataset was downloaded from GSE93593: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE93593 .
The 24-year-old human thymus dataset was downloaded from E-MTAB-8581: https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-8581/ .
A TF list was obtained from The Human Transcription Factors database: http://humantfs.ccbr.utoronto.ca/ .
Genes regulated by each TF were based on GSEA: http://www.gsea-msigdb.org/gsea/msigdb/collections.jsp#C3 .
ChIP-seq data for Gata1 were obtained from Cistrome Data Browser: http://cistrome.org/db/#/ .
RNA-seq data for the human and mouse thymus were obtained from ENCODE under accession numbers GSM1220578, GSM1220591, GSM1220592, GSM1220593, GSM1220599, GSM1220601 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE18927 ), GSM1010944 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE16256 ), GSE78390 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE78390 , GSM970852 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE49847 ), GSE93469 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE93469 ) and GSE90183 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE90183 ).
Code related to analyses is available from GitHub (APEC: https://github.com/QuKunLab/APEC , Seurat: https://github.com/satijalab/seurat ).
Immature single positive
DP thymocytes undergoing rearrangement
DP thymocytes undergoing selection
Antigen presentation cell
Thymic epithelial cell
T cell receptor
Boehm T, Swann JB. Origin and evolution of adaptive immunity. Annu Rev Anim Biosci. 2014;2:259–83.
Cheng MH, Anderson MS. Monogenic autoimmunity. Annu Rev Immunol. 2012;30:393–427.
Ciofani M, Zuniga-Pflucker JC. The thymus as an inductive site for T lymphopoiesis. Annu Rev Cell Dev Biol. 2007;23:463–93.
Hernandez-Munain C, Sleckman BP, Krangel MS. A developmental switch from TCR delta enhancer to TCR alpha enhancer function during thymocyte maturation. Immunity. 1999;10:723–33.
Borgulya P, Kishi H, Uematsu Y, von Boehmer H. Exclusion and inclusion of alpha and beta T cell receptor alleles. Cell. 1992;69:529–37.
Brandle D, Muller C, Rulicke T, Hengartner H, Pircher H. Engagement of the T-cell receptor during positive selection in the thymus down-regulates RAG-1 expression. Proc Natl Acad Sci U S A. 1992;89:9529–33.
Davey GM, Schober SL, Endrizzi BT, Dutcher AK, Jameson SC, Hogquist KA. Preselection thymocytes are more sensitive to T cell receptor stimulation than mature T cells. J Exp Med. 1998;188:1867–74.
Lucas B, Stefanova I, Yasutomo K, Dautigny N, Germain RN. Divergent changes in the sensitivity of maturing T cells to structurally related ligands underlies formation of a useful T cell repertoire. Immunity. 1999;10:367–76.
Kernfeld EM, Genga RMJ, Neherin K, Magaletta ME, Xu P, Maehr R. A single-cell transcriptomic atlas of thymus organogenesis resolves cell types and developmental maturation. Immunity. 2018;48:1258–70. e1256
Zeng Y, Liu C, Gong Y, Bai Z, Hou S, He J, Bian Z, Li Z, Ni Y, Yan J, et al. Single-cell RNA sequencing resolves spatiotemporal development of pre-thymic lymphoid progenitors and thymus organogenesis in human embryos. Immunity. 2019;51:930–48. e936
Lavaert M, Liang KL, Vandamme N, Park JE, Roels J, Kowalczyk MS, Li B, Ashenberg O, Tabaka M, Dionne D, et al. Integrated scRNA-Seq identifies human postnatal thymus seeding progenitors and regulatory dynamics of differentiating immature thymocytes. Immunity. 2020;52:1088–104. e1086
Zhou W, Yui MA, Williams BA, Yun J, Wold BJ, Cai L, Rothenberg EV. Single-cell analysis reveals regulatory gene expression dynamics leading to lineage commitment in early T cell development. Cell Syst. 2019;9:321–37. e329
Park JE, Botting RA, Dominguez Conde C, Popescu DM, Lavaert M, Kunz DJ, Goh I, Stephenson E, Ragazzini R, Tuck E, et al. A cell atlas of human thymic development defines T cell repertoire formation. Science. 2020;367(6480):eaay3224.
Le J, Park JE, Ha VL, Luong A, Branciamore S, Rodin AS, Gogoshin G, Li F, Loh YE, Camacho V, et al. Single-cell RNA-Seq mapping of human thymopoiesis reveals lineage specification trajectories and a commitment spectrum in T cell development. Immunity. 2020;52:1105–18. e1109
Li B, Li Y, Li K, Zhu L, Yu Q, Cai P, Fang J, Zhang W, Du P, Jiang C, et al. APEC: an accesson-based method for single-cell chromatin accessibility analysis. Genome Biol. 2020;21:116.
Tabula Muris C, Overall c, Logistical c, Organ c, processing, Library p, sequencing, Computational data a, Cell type a, Writing g, et al. Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris. Nature. 2018;562:367–72.
Tusi BK, Wolock SL, Weinreb C, Hwang Y, Hidalgo D, Zilionis R, Waisman A, Huh JR, Klein AM, Socolovsky M. Population snapshots predict early haematopoietic and erythroid hierarchies. Nature. 2018;555:54.
Macosko EZ, Basu A, Satija R, Nemesh J, Shekhar K, Goldman M, Tirosh I, Bialas AR, Kamitaki N, Martersteck EM, et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell. 2015;161:1202–14.
Aibar S, Gonzalez-Blas CB, Moerman T, Huynh-Thu VA, Imrichova H, Hulselmans G, Rambow F, Marine JC, Geurts P, Aerts J, et al. SCENIC: single-cell regulatory network inference and clustering. Nat Methods. 2017;14:1083–6.
Zuo Z, Jin Y, Zhang W, Lu Y, Li B, Qu K. ATAC-pipe: general analysis of genome-wide chromatin accessibility. Brief Bioinform. 2019;20:1934–43.
Close JL, Yao Z, Levi BP, Miller JA, Bakken TE, Menon V, Ting JT, Wall A, Krostag AR, Thomsen ER, et al. Single-cell profiling of an in vitro model of human interneuron development reveals temporal dynamics of cell type production and maturation. Neuron. 2017;96:949.
Lambert SA, Jolma A, Campitelli LF, Das PK, Yin Y, Albu M, Chen X, Taipale J, Hughes TR, Weirauch MT. The human transcription factors. Cell. 2018;175:598–9.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102:15545–50.
Mei S, Qin Q, Wu Q, Sun H, Zheng R, Zang C, Zhu M, Wu J, Shi X, Taing L, et al. Cistrome Data Browser: a data portal for ChIP-Seq and chromatin accessibility data in human and mouse. Nucleic Acids Res. 2017;45:D658–62.
Kazachenka A, Bertozzi TM, Sjoberg-Herrera MK, Walker N, Gardner J, Gunning R, Pahita E, Adams S, Adams D, Ferguson-Smith AC. Identification, characterization, and heritability of murine metastable epialleles: implications for non-genetic inheritance. Cell. 2018;175:1717.
Young L, Kun L, Lianbang Z, Bin L, Dandan Z, Pengfei C, Chen J, Pengcheng D, Jun L, Kun Q. Development of double-positive thymocytes at single-cell resolution. GitHub. https://github.com/QuKunLab/T-cell-development (2020).
Butler A, Hoffman P, Smibert P, Papalexi E, Satija R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat Biotechnol. 2018;36(5):411–20.
Mingueneau M, Kreslavsky T, Gray D, Heng T, Cruse R, Ericson J, Bendall S, Spitzer MH, Nolan GP, Kobayashi K, et al. The transcriptional landscape of alphabeta T cell differentiation. Nat Immunol. 2013;14:619–32.
Carpenter AC, Bosselut R. Decision checkpoints in the thymus. Nat Immunol. 2010;11:666–73.
Xi H, Schwartz R, Engel I, Murre C, Kersh GJ. Interplay between RORgammat, Egr3, and E proteins controls proliferation in response to pre-TCR signals. Immunity. 2006;24:813–26.
Inlay MA, Bhattacharya D, Sahoo D, Serwold T, Seita J, Karsunky H, Plevritis SK, Dill DL, Weissman IL. Ly6d marks the earliest stage of B-cell specification and identifies the branchpoint between B-cell and T-cell development. Genes Dev. 2009;23:2376–81.
Sasada T, Reinherz EL. A critical role for CD2 in both thymic selection events and mature T cell function. J Immunol. 2001;166:2394–403.
Porritt HE, Rumfelt LL, Tabrizifard S, Schmitt TM, Carlos J, Zuniga-Pflucker JC, Petrie HT. Heterogeneity among DN1 prothymocytes reveals multiple progenitors with different capacities to generate T cell and non-T cell lineages. Immunity. 2004;20:735–45.
He YW, Beers C, Deftos ML, Ojala EW, Forbush KA, Bevan MJ. Down-regulation of the orphan nuclear receptor ROR gamma t is essential for T lymphocyte maturation. J Immunol. 2000;164:5668–74.
Yamashita I, Nagata T, Tada T, Nakayama T. Cd69 cell-surface expression identifies developing thymocytes which audition for T-cell antigen receptor-mediated positive selection. Int Immunol. 1993;5:1139–50.
Ju JM, Jung MH, Nam G, Kim W, Oh S, Kim HD, Kim JY, Chang J, Lee SH, Park GS, et al. Escape from thymic deletion and anti-leukemic effects of T cells specific for hematopoietic cell-restricted antigen. Nat Commun. 2018;9:225.
Mintern JD, Maurice MM, Ploegh HL, Schott E. Thymic selection and peripheral activation of CD8 T cells by the same class I MHC/peptide complex. J Immunol. 2004;172:699–708.
Hogquist KA, Jameson SC, Heath WR, Howard JL, Bevan MJ, Carbone FR. T cell receptor antagonist peptides induce positive selection. Cell. 1994;76:17–27.
Murata S, Sasaki K, Kishimoto T, Niwa S, Hayashi H, Takahama Y, Tanaka K. Regulation of CD8+ T cell development by thymus-specific proteasomes. Science. 2007;316:1349–53.
Murata S, Takahama Y, Kasahara M, Tanaka K. The immunoproteasome and thymoproteasome: functions, evolution and human disease. Nat Immunol. 2018;19:923–31.
Espagnolle N, Depoil D, Zaru R, Demeur C, Champagne E, Guiraud M, Valitutti S. CD2 and TCR synergize for the activation of phospholipase Cgamma1/calcium pathway at the immunological synapse. Int Immunol. 2007;19:239–48.
Liberzon A, Birger C, Thorvaldsdottir H, Ghandi M, Mesirov JP, Tamayo P. The molecular signatures database (MSigDB) hallmark gene set collection. Cell Syst. 2015;1:417–25.
Sharrocks AD. The ETS-domain transcription factor family. Nat Rev Mol Cell Biol. 2001;2:827–37.
Katsumura KR, Bresnick EH, Group GFM. The GATA factor revolution in hematology. Blood. 2017;129:2092–102.
Taghon T, Yui MA, Pant R, Diamond RA, Rothenberg EV. Developmental and molecular characterization of emerging beta- and gammadelta-selected pre-T cells in the adult mouse thymus. Immunity. 2006;24:53–64.
Thome JJ, Grinshpun B, Kumar BV, Kubota M, Ohmura Y, Lerner H, Sempowski GD, Shen Y, Farber DL. Longterm maintenance of human naive T cells through in situ homeostasis in lymphoid tissue sites. Sci Immunol. 2016;1(6):eaah6506.
Aspinall R, Andrew D. Thymic involution in aging. J Clin Immunol. 2000;20:250–6.
Suzuki H, Punt JA, Granger LG, Singer A. Asymmetric signaling requirements for thymocyte commitment to the CD4+ versus CD8+ T cell lineages: a new perspective on thymic commitment and selection. Immunity. 1995;2:413–25.
Crompton T, Lees RK, Pircher H, Macdonald HR. Precommitment of Cd4+Cd8+ thymocytes to either Cd4 or Cd8 lineages. Proc Natl Acad Sci U S A. 1993;90:8982–6.
Corbella P, Moskophidis D, Spanopoulou E, Mamalaki C, Tolaini M, Itano A, Lans D, Baltimore D, Robey E, Kioussis D. Functional commitment to helper T cell lineage precedes positive selection and is independent of T cell receptor MHC specificity. Immunity. 1994;1:269–76.
Alho AC, Kim HT, Chammas MJ, Reynolds CG, Matos TR, Forcade E, Whangbo J, Nikiforow S, Cutler CS, Koreth J, et al. Unbalanced recovery of regulatory and effector T cells after allogeneic stem cell transplantation contributes to chronic GVHD. Blood. 2016;127:646–57.
Young L, Kun L, Lianbang Z, Bin L, Dandan Z, Pengfei C, Chen J, Pengcheng D, Jun L, Kun Q. Development of double-positive thymocytes at single-cell resolution. Datasets GSE166715. Gene Expression Omnibus. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE166715 (2020).
Young L, Kun L, Lianbang Z, Bin L, Dandan Z, Pengfei C, Chen J, Pengcheng D, Jun L, Kun Q. Development of double-positive thymocytes at single-cell resolution. Thymus single-cell data generated from this study. figshare. https://figshare.com/s/db3f03095c74c16cf38d (2020).
Young L, Kun L, Lianbang Z, Bin L, Dandan Z, Pengfei C, Chen J, Pengcheng D, Jun L, Kun Q. Development of double-positive thymocytes at single-cell resolution. Integrated thymus data . figshare. https://figshare.com/s/1f910888df4afa4ec53b (2020).
Tabula Muris C, Overall c, Logistical c, Organ c, processing, Library p, sequencing, Computational data a, Cell type a, Writing g, et al. Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris. Datasets GSE109774. Gene Expression Omnibus. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE109774 (2018).
Tusi BK, Wolock SL, Weinreb C, Hwang Y, Hidalgo D, Zilionis R, Waisman A, Huh JR, Klein AM, Socolovsky M. Fundamental limits on dynamic inference from single-cell snapshots. Datasets GSE89754. Gene Expression Omnibus. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE89754 (2018).
Close JL, Yao Z, Levi BP, Miller JA, Bakken TE, Menon V, Ting JT, Wall A, Krostag AR, Thomsen ER, et al. Single-cell profiling of an in vitro model of human interneuron development reveals temporal dynamics of cell type production and maturation. Datasets GSE93593. Gene Expression Omnibus. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE93593 (2017).
Park JE, Botting RA, Dominguez Conde C, Popescu DM, Lavaert M, Kunz DJ, Goh I, Stephenson E, Ragazzini R, Tuck E, et al. A cell atlas of human thymic development defines T cell repertoire formation. Datasets E-MTAB-8581. EMBL-EBI. https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-8581/ (2020).
Lambert SA, Jolma A, Campitelli LF, Das PK, Yin Y, Albu M, Chen X, Taipale J, Hughes TR, Weirauch MT. The human transcription factors. The Human Transcription Factors database. http://humantfs.ccbr.utoronto.ca/ (2018).
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Molecular Signatures Database. http://www.gsea-msigdb.org/gsea/msigdb/collections.jsp#C3 (2005).
Mei S, Qin Q, Wu Q, Sun H, Zheng R, Zang C, Zhu M, Wu J, Shi X, Taing L, et al. Cistrome Data Browser: a data portal for ChIP-Seq and chromatin accessibility data in human and mouse. Cistrome Data Browser. http://cistrome.org/db/#/ (2017).
JA S: University of Washington Human Reference Epigenome Mapping Project. Datasets GSE18927. Gene Expression Omnibus. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE18927 (2010).
Lister R, Pelizzola M, Dowen R, Hawkins R, Hon G, Tonti-Filippini J, Nery J, Lee L, Ye Z, Ngo Q, et al. UCSD Human Reference Epigenome Mapping Project. Datasets GSE16256. Gene Expression Omnibus. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE16256 (2009).
Consortium EP. An integrated encyclopedia of DNA elements in the human genome. Datasets GSE78390. Gene Expression Omnibus. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE78390 (2016).
Feng Y, Yong C, Alessandra B, Jeff V, Weisheng W, Tyrone R, Consortium ME. A comparative encyclopedia of DNA elements in the mouse genome. Datasets GSE49847. Gene Expression Omnibus. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE49847 (2013).
Consortium EP. An integrated encyclopedia of DNA elements in the human genome. Nature. Datasets GSE93469. Gene Expression Omnibus. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE93469 (2017).
Consortium EP. An integrated encyclopedia of DNA elements in the human genome. Datasets GSE90183. Gene Expression Omnibus. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE90183 (2016).
Li B, Li Y, Li K, Zhu L, Yu Q, Cai P, Fang J, Zhang W, Du P, Jiang C, et al. APEC: an accesson-based method for single-cell chromatin accessibility analysis. GitHub. https://github.com/QuKunLab/APEC (2020).
Butler A, Hoffman P, Smibert P, Papalexi E, Satija R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. GitHub. https://github.com/satijalab/seurat (2018).
We thank the USTC supercomputing center and the School of Life Science Bioinformatics Center for providing supercomputing resources for this project. We thank the CAS interdisciplinary innovation team for helpful discussion.
This work was supported by the National Key R&D Program of China (2017YFA0102900 and 2020YFA0112200 to K.Q.), the National Natural Science Foundation of China grants (91940306, 81788101, 31970858, 31771428 and 91640113 to K.Q. and 81871479 to J.L.), and supported by the Fundamental Research Funds for the Central Universities (WK2070000158, YD2070002019, WK9110000141 to K.Q., WK2070000129, WK9100000001 and WK2070000114 to Y.L.). It was also supported by Anhui Provincial Natural Science Foundation grant (1908085QH326 to Y.L. and BJ2070000097 to B.L.).
Ethics approval and consent to participate
All mouse experiments in this study were reviewed and approved by the Institutional Animal Care and Use Committee of the University of Science and Technology of China (approval number: USTCACUC1701027).
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1: Figure S1.
Data processing workflow and quality control of thymocytes. Figure S2. Integrated analysis with data from Tabula Muris. Figure S3. Robustness of thymocyte clustering. Figure S4. Single-cell ATAC-seq on mouse thymocytes. Figure S5. The transcription regulation network of each stage during thymocyte development. Figure S6. Ly6d and CD2 serve as new markers to gate DP subtypes in integrated data. Figure S7. Single-cell transcriptome clustering and pseudo-time trajectory of thymocytes without cell cycle-related genes. Figure S8. Single-cell transcriptome map of thymocytes from Tabula Muris dataset. Figure S9. Single-cell transcriptome map of thymocytes from Human data. FigureS10. Single-cell transcriptome map of Early T Cell Development. Figure S11. Single-cell transcriptome map of neuronal development. Figure S12. GO terms enriched in subgroups of DPbla stage. Figure S13. The difference in subgroups of DP stage. Figure S14. MHC-I antigen presentation occurred between thymocytes in integrated data. Figure S15. The expression of MHC-I associated genes in thymocytes. Figure S16. The expression of thymoproteasome subunits associated genes in thymocytes. Figure S17. Transcriptional regulation differences between human and mouse thymocyte development.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Li, Y., Li, K., Zhu, L. et al. Development of double-positive thymocytes at single-cell resolution. Genome Med 13, 49 (2021). https://doi.org/10.1186/s13073-021-00861-7
- Single-cell sequencing
- Cell cycle
- Thymic selection