Genes and pathways underlying regional and cell type changes in Alzheimer's disease
© Miller et al.; licensee BioMed Central Ltd. 2013
Received: 2 April 2013
Accepted: 25 May 2013
Published: 25 May 2013
Skip to main content
© Miller et al.; licensee BioMed Central Ltd. 2013
Received: 2 April 2013
Accepted: 25 May 2013
Published: 25 May 2013
Transcriptional studies suggest Alzheimer's disease (AD) involves dysfunction of many cellular pathways, including synaptic transmission, cytoskeletal dynamics, energetics, and apoptosis. Despite known progression of AD pathologies, it is unclear how such striking regional vulnerability occurs, or which genes play causative roles in disease progression.
To address these issues, we performed a large-scale transcriptional analysis in the CA1 and relatively less vulnerable CA3 brain regions of individuals with advanced AD and nondemented controls. In our study, we assessed differential gene expression across region and disease status, compared our results to previous studies of similar design, and performed an unbiased co-expression analysis using weighted gene co-expression network analysis (WGCNA). Several disease genes were identified and validated using qRT-PCR.
We find disease signatures consistent with several previous microarray studies, then extend these results to show a relationship between disease status and brain region. Specifically, genes showing decreased expression with AD progression tend to show enrichment in CA3 (and vice versa), suggesting transcription levels may reflect a region's vulnerability to disease. Additionally, we find several candidate vulnerability (ABCA1, MT1H, PDK4, RHOBTB3) and protection (FAM13A1, LINGO2, UNC13C) genes based on expression patterns. Finally, we use a systems-biology approach based on WGCNA to uncover disease-relevant expression patterns for major cell types, including pathways consistent with a key role for early microglial activation in AD.
These results paint a picture of AD as a multifaceted disease involving slight transcriptional changes in many genes between regions, coupled with a systemic immune response, gliosis, and neurodegeneration. Despite this complexity, we find that a consistent picture of gene expression in AD is emerging.
Alzheimer's disease (AD) is the most common form of dementia, affecting nearly half of the population over the age of 85 years . AD has no cure and although <10% of cases can be linked to genetic mutations in PSEN1, PSEN2, or APP, the majority of AD cases have no known genetic cause, and the underlying genetic modifiers are highly complex and remain elusive . While neurofibrillary tangles (NFTs) and amyloid deposition are pathological hallmarks of AD, transcriptional studies suggest that dysfunction of cellular pathways such as energy metabolism [3–5], synaptic transmission [3–6], and myelin-axon interactions  may precede the neuropathological indicators [7, 8]. Other pathways implicated in AD include inflammation [3, 4, 9], cytoskeletal dynamics [9, 10], signal transduction [3, 4, 9, 11], protein misfolding [3, 12], transcription factors [3, 9], and cell proliferation [3, 9]. Furthermore, these transcriptional changes do not occur throughout the brain in a uniform manner; AD follows a well-characterized progression, with pathology beginning in brain areas involved in learning, memory, perception, and emotion, such as the entorhinal cortex, amygdala, and hippocampus, then spreading throughout the cortex [7, 13]. This regional vulnerability is strikingly apparent in the hippocampus, where CA1 pyramidal neurons are invariably affected earlier and more severely than their neighboring CA3 counterparts.
While many of these transcriptional changes are likely due to dysfunctional cellular pathways, changes in the cellular composition of affected brain regions are also likely to impact gene expression levels . In addition to widespread pyramidal cell loss and diffuse atrophy of affected brain regions , the role of glial cells in AD pathophysiology is becoming more apparent. Microglia, the resident immune cells in the central nervous system, have been shown to cluster around amyloid plaques , increasing in number in the early stages of AD . Reactive astrocytes show similar response to disease pathology, whereas astrocytes not associated with pathology tend to degenerate . Oligodendrocyte dysfunction has also been suggested as an early event in AD progression . Although a few groups have used methods such as laser capture microdissection [19, 20] and microaspiration  to enrich their samples for transcripts expressed in pyramidal neurons, the extent to which cellular composition impacts gene expression remains unclear.
To address these issues and to complement these forward genetic analyses, we have performed a large-scale transcriptional analysis in brain of individuals with advanced AD and non-demented controls, focusing specifically on the CA1 field of the hippocampus and the relatively less affected adjacent region, CA3. For comparisons between brain regions and across disease status, we find consistency between our results and several previous studies; however, with the addition of CA3 samples in AD we are also able to provide novel insights into AD pathophysiology. In CA1 we find that genes related to synaptic transmission and cell-cell signaling tend to show decreased expression in AD, whereas genes related to cell death and cell proliferation tend to show increased expression. Interestingly, many of the changes occurring in CA1 also occur in CA3, although to a lesser extent. Furthermore, genes showing decreased expression with AD progression are likely to also show an initial enrichment in CA3, whereas genes showing increased expression with AD progression are likely to also show an initial enrichment in CA1, indicating that transcription levels in a region may reflect that region's vulnerability to disease. Based on this rubric, we identify ABCA1, MT1H, PDK4, and RHOBTB3 as putative vulnerability genes and FAM13A1, LINGO2, and UNC13C as putative protection genes. To account for the changes in cellular composition that occur in AD, we developed and apply a linear model, finding that the most differentially expressed genes are likely involved in dysfunctional cellular pathways rather than due to cell loss or gliosis. Along the same lines, we use weighted gene co-expression network analysis (WGCNA) to find modules of highly co-expressed genes enriched with markers for major cell types, each of which shows a distinct expression pattern that provides insight into aging and AD. Of particular note is a microglia-associated module that shows increased expression in controls with early signs of NFT pathology, lending support to the idea that microglial activation may be one of the earliest events in AD progression. Together, these findings suggest that large-scale regional vulnerabilities in AD are likely due to the combination of many small differences in gene expression patterns between brain regions, affecting multiple cell types.
Summary of subject information
11 M/5 F
9 M/8 F
81.7 ± 6.9
77.3 ± 9.1
10.8 ± 6.8
11.2 ± 6.3
0.58 ± 0.51
2.59 ± 0.51
1.50 ± 0.52
5.33 ± 0.62
Subjects from the Alzheimer's Disease Center fell into two categories. First, control subjects were participants in brain aging studies at the Oregon Aging/Alzheimer's Disease Center. Subjects received annual neurological and neuropsychological evaluation, with clinical dementia rating assigned by an experienced clinician. Controls had normal cognitive and functional examinations. Second, the AD subjects were diagnosed by a clinical team consensus conference, met National Institute for Neurological and Communicative Disorders and Stroke-Alzheimer's Disease and Related Disorder Association diagnostic criteria for clinical AD, had a clinical dementia rating of greater than 1.0, and neuropathologic confirmation at autopsy (after informed consent). Tissue use conformed to institutional review board-approved protocols. Subjects from the Spinal Fluid Resource Center met comparable criteria. Ordinal scales were used to assess NFT burden (Braak stage of 0 to 6)  and amyloid plaque burden (0 to 3), where higher scores indicate greater pathology.
Hippocampal CA1 and CA3 subfields were isolated using the following method. First, frozen tissue was cut into 60 μm sections, with the first section from each sample stained with cresyl violet. Labeled sections were then photographed and enlarged, and these images were used as reference during dissections. Using a scalpel CA1 and CA3 subfields were dissected from sectioned but unlabeled tissue on dry ice and immediately placed into elution buffer for RNA extraction. Total RNA from each sample was isolated using the RNeasy Micro Kit with DNase I treatment (QIAGEN, Valencia, CA, USA), then tested for quality on the Agilent 2100 Bioanalyzer using RNA 6000 Nano Chips (Agilent Technologies, Palo Alto, CA, USA). For each of the 71 samples passing RNA quality control standards, 360 ng total RNA were sent to the Southern California Genotyping Consortium (Los Angeles, CA, USA) for analysis on the Illumina HumanHT-12 v3 Expression BeadChips (Illumina, San Diego, CA, USA). Samples were randomly assigned to BeadChips in order to minimize the impact of any batch effects on differential expression by region or disease status.
Unprocessed expression data for all 71 samples have been deposited in NCBI's Gene Expression Omnibus (GEO)  and are accessible through GEO Series accession number GSE29378. Illumina HumanHT-12 v3 Expression BeadChips measure the expression of over 25,000 annotated genes using 48,803 probes. Initial expression values were computed from probe intensities using the program GenomeStudio (Illumina). From these data, six samples with low inter-array correlation were removed as outliers (as described in ). The data were then quantile normalized. Two final outlier arrays were removed as above, for a total of 63 samples (32 control, 31 AD) remaining in the analysis. This outlier removal procedure is completely unbiased, since it ignores phenotypic traits.
After preprocessing and outlier removal, the following categories of probes were omitted from the analysis: (i) probes called as present (P < 0.1) in three or fewer samples; (ii) probes not assigned gene symbol annotations; and (iii) duplicate probes for a single gene, but only if these probes had a Pearson's correlation value of R > 0.8 (using the function collapseRows ). When removing duplicate probes for a gene, the probe with the highest average expression level was retained. This final filtering step left a total of 23,696 probes in our analysis corresponding to 17,128 genes. The resulting expression matrix is also available from the same location.
We measured differential expression with respect to region, disease, and Braak stage, often using only a subset of the total data. Unless otherwise specified, an uncorrected P-value cutoff of <0.05 (using a Student's t-test) combined with a fold change (FC) >1.2 was used to determine differential expression (after correcting for multiple comparisons, very few genes showed significant differential expression). When it came to validating findings across data sets, we kept track of the directionality of gene expression (for example, genes that are over-expressed in diseased individuals in one data set should also be over-expressed in the diseased individuals of another data set). For region-enrichment comparisons, paired t-tests were used, since CA1 and CA3 were obtained from each subject.
To characterize lists of differentially expressed genes based on gene ontology annotation, we used Enrichment Analysis Systematic Explorer (EASE) , as previously described [3, 5]. EASE assigns identified genes to Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and other experimentally derived gene categories, and then tests for significant overrepresentation of identified genes within each category using a modified Fisher's exact test. In order to compare our differential expression results with similarly designed previous studies, we first sorted and ranked all genes in our analysis with respect to region in control only, as well as with respect to disease status in CA1 alone. We sorted and ranked the variables using the Z scores. Since a monotonically increasing function relates Z scores to P-values, this is equivalent to sorting by P-values. For each previous study, we then noted where the reported differentially expressed genes were located in our sorted list, and assessed the resulting significance using a Z score to measure divergence from a random distribution. Specifically, we quantify consistency using 'mean gene rank', which is the mean ranked differential expression of a subset of genes, scaled by the number of total genes and offset by 0.5 to set chance = 0.
We also determined putative vulnerability and protection genes with AD. Vulnerability genes are defined as genes showing significantly higher expression in CA1 than CA3 (FC >1.2) and increasing with AD to a significantly greater degree in CA1 compared with CA3 (FC in CA1 >1.2 and FC in CA1 > FC in CA3). Protection genes were defined as genes showing significantly higher expression in CA3 than CA1 (FC >1.2) and also increasing to a greater degree (FC in CA3 >1.2, FC in CA1 <1.2) or decreasing to a lesser degree (FC in CA3 <1.4 × FC in CA1) in CA3 compared with CA1. Both vulnerability and protection genes also must have a Bayes ANOVA significance of P < 0.05 as assessed using the function bayesAnova (parameters: conf = 12, bayes = 1, winSize = 11) , and all of the FC criteria must hold when defining groups based on both the mean and the median expression for each group.
To ensure that our results for region and disease status were not solely a product of neurodegeneration and gliosis, we used a multivariate linear model to regress individual gene expression levels against region, disease status, and marker genes for four major cell types: neurons (SYT1), astrocytes (AQP4), oligodendrocytes (MOG), and microglia (TYROBP), respectively. These particular marker genes met the following three criteria: 1) they had multiple publications linking them to their matched cell type; 2) they showed significant experimental confirmation in two previous microarray studies; and 3) they showed high connectivity with their matched cell type in two previous WGCNA studies in brain [14, 25]. We also note that the model is fairly robust to choice of marker genes for cell type.
We created a network from normalized expression data by following the standard procedure of WGCNA . Briefly, we calculated pair-wise Pearson correlations between each gene pair, and then transformed this matrix into a signed adjacency matrix using a power function. The components of this matrix (connection strengths) were then used to calculate 'topological overlap' (TO), a robust and biologically meaningful measurement of gene similarity based on two genes' co-expression relationships with all other genes in the network. Genes were hierarchically clustered using '1 - TO' as the distance measure, and initial module assignments were determined by using a dynamic tree-cutting algorithm . For computational reasons, initial module formation was performed only on the approximately 15,000 genes with the highest overall connectivity, as previously described . We calculated Pearson correlations between each gene and each module eigengene - referred to as a gene's module membership - along with the corresponding P-values [14, 28]. The module eigengene is commonly used as a representative value for a module, and is defined as the first principal component of a module, and is the component that explains the maximum possible variability for all genes in a module. For the final module characterizations, each gene was (re)assigned to the module for which it had the highest module membership. Thus, genes were each assigned to exactly one module, including genes that were omitted from the initial module formation.
Modules were characterized using the following strategy: first, modules were annotated using EASE (as described above); second, modules were further annotated by measuring their overlap with modules from previous WGCNA studies of human and mouse brain [14, 25]; third, cell type annotations were confirmed by measuring the overlap between our modules and experimentally derived lists of cell type-specific genes using the function userListEnrichment ; fourth, modules were annotated for region and disease specificity by measuring their overlap with lists of differentially expressed genes from the six studies discussed in the text [3, 4, 20, 29–31]; and finally, module eigengenes were associated with all phenotypic traits available in this study (region, disease, age, and so on) in order to gain insight into the role each module might play in AD pathophysiology. To test for significant overlap between gene lists from our study and those from previous lists, the hypergeometric distribution (Fisher's exact test) was used. Modules were graphically depicted using VisANT , as previously described . Network depictions show the 250 strongest reciprocal within-module gene-gene interactions (connections) as measured by TO. A gene was considered a 'hub' if it had at least 15 depicted connections.
RNA for quantitative RT-PCR (qRT-PCR) validations of eight disease- and region-specific genes was collected as for the arrays. Although RNA was collected from the same samples as in the microarray analysis, it was collected from different sections. Total RNA was collected from larger pieces of hippocampus and frontal cortex of five select individuals for qRT-PCR validations of microglial genes. For these samples, the RNeasy Mini Kit with DNase I treatment (QIAGEN) was used for RNA isolation. A list of primer pairs used for qRT-PCR validation is provided (Table S7 in Additional file 6). In total, 13 genes were assessed using qRT-PCR. For qRT-PCR validations of PDPR, results from two separate primer pairs were averaged.
Probes for RNA in situ hybridization analysis were designed using distal forward and reverse primer pairs from two proximal qRT-PCR validation regions to yield a probe of approximately 500 bp that was cloned into the pCR4-TOPO vector (Invitrogen). To produce digitonin-labeled probes, plasmids were first linearized with NotI (New England Biolabs, Ipswich, MA, USA), then transcribed using the DIG RNA Labeling kit (Roche, Indianapolis, IN, USA) according to the manufacturer's protocols. Formalin-fixed paraffin-embedded tissue sections of control and AD case individuals cut to 16 μm thickness were obtained from the UCLA Alzheimer's Disease Research Center. Hybridization was performed according to  with modifications from  using 600 μg RNA per section.
To address the issue of regional vulnerability with disease progression, while also taking into account the complexity of AD, we performed a large genome-wide comparison of CA1 and CA3 gene expression in the brain of individuals with advanced AD and non-demented controls using Illumina Human HT-12 microarrays. The purpose of this study design was several-fold: first, to identify genes that show an association with vulnerable regions in AD progression; second, to quantify the relationship between region and disease using gene expression; third, to bring together the results of several previous studies of disparate design coming to apparently inconsistent results; fourth, to determine how the composition of cell types in hippocampus changes with AD progression; fifth, to identify genes marking early, presymptomatic signs of AD progression; and finally, to provide a gene expression resource for interested scientists. The data discussed in this publication have been deposited in NCBI's GEO  and are accessible through GEO Series accession number GSE29378.
To minimize the possibility of group bias, brain samples from individuals with moderate to severe AD (disease group; N = 17) were matched for gender, age, and post mortem interval (PMI) with individuals showing little to no cognitive deficits (control group; N = 16), as closely as possible (Table 1; Additional file 1). Furthermore, samples were randomly assigned to microarrays to limit batch effects. Simple clustering of the arrays reveals no significant confounding factors: samples cluster by individual, but not by batch, brain bank, location on the array, PMI, gender, or age (Figure S1 in Additional file 6). With the exception of heat shock proteins, no GO categories showed significant enrichment for genes differentially expressed with batch, brain bank, location on the array, or PMI, further suggesting that our results are properly controlled for possible confounding factors.
Annotation for lists of differentially expressed genes
Down with AD (in CA1)
Potassium channel activity
Up with AD (in CA1)
Response to stress
Hs_TGF beta signaling pathway
Enriched in CA3 (in control)
Cell growth and/or maintenance
Cytoskeletal protein binding
Enriched in CA1 (in control)
Signal transducer activity
Response to external stimulus
We next identified genes enriched in either CA1 or CA3 ('region-enriched' genes) in controls. Since both regions were collected from identical tissue sections, removing a major source of variability, we identified more differentially expressed genes than in the disease-related analysis. We find that the list of genes enriched in CA3 shows overrepresentation for genes involved in synaptic transmission, cytoskeletal protein binding, and cholesterol biosynthesis (Table 2; Additional file 2). In the case of CA1-enriched genes, we find over-representation of genes related to signal transduction, immune response, and cell motility (Table 2; Additional file 2). Interestingly, we also find enrichment in metallothioneins, a group of heavy metal binding proteins that have previously been implicated in aging and AD . When we perform the identical region-enrichment analysis in the AD group, we see similar results as with controls; however, fewer genes meet significance (Figure S2B in Additional file 6). This attenuation of region-enriched genes with disease is consistent with previous results in ischemia , and is not due to increased variance in the AD samples, as the standard deviations of the genes differentially expressed in controls are no different than in AD.
Top genes differentially expressed by disease and region
Genes regionally DE in control only
Genes changing with AD in CA1 only
Down with AD
Up with AD
Finally, we confirmed the direction and FC of eight of these highly disease-altered or region-enriched genes by qRT-PCR (Materials and methods; genes in bold in Table 3), thus validating a cross-section of our microarray results by an independent method.
We next extended these analyses to all genes, including those with much more marginal differential expression, in a total of six studies: three assessing changes with AD progression in CA1 (Figure 1e) [3, 4, 20] and three finding CA1- and CA3-enriched genes in control hippocampus (Figure 1f) [29–31]. We ranked all of our genes from the most CA1-enriched to the most CA3-enriched (or the ones most decreasing with AD to the ones most increasing), and then compared lists of differentially expressed genes from previous studies to our ranked lists (Materials and methods). For 9 of the 12 comparisons, we find the distribution of genes significantly shifted in the expected direction of overlap, and in the three other comparisons the direction of change was still correct, but did not reach significance (Figure 1e,f). In other words, genes presented as CA3-enriched in earlier studies are significantly more likely to have higher expression in CA3 than in CA1 in this study, and likewise for the other phenotypes. Thus, despite the many differences in experimental designs between studies, this in silico validation indicates that there is significant and previously unappreciated concordance between functional genomic studies related to AD. These analyses highlight for the first time many common genes and pathways in AD pathogenesis, showing a degree of convergence that has not been well appreciated previously.
To find genes that may play a role in the relative vulnerability of CA1 or protection of CA3, we considered the relative difference in fold change with disease between these brain regions. Our nomenclature of 'protection' and 'vulnerability' genes should be interpreted with a grain of salt, since carefully designed validation studies are needed to show a causal relationship implied by the terminology. Such a strategy has previously been successfully applied in the discovery of potential disease-related genes in AD  and novel neuroprotective genes in frontotemporal dementia . More specifically, we would expect vulnerability genes to have higher expression levels in CA1 than CA3 and also to increase expression to a greater extent in disease, whereas protective genes should show the opposite pattern. Overall, we found four candidates for putative vulnerability genes (ABCA1, MT1H, PDK4, RHOBTB3; Figure 2c) and three candidates for putative protection genes (FAM13A1, LINGO2, UNC13C; Figure 2d) meeting these criteria (Materials and methods). Two of our four vulnerability genes have been previously associated with AD: MT1H is a member of the family of zinc-regulating metallothionein proteins discussed earlier, while ABCA1 is a major cholesterol regulator that can influence amyloid plaque aggregation and clearance (reviewed in ). Furthermore, increasing expression of ABCA1 with increasing severity of AD has been measured both functionally and neuropathologically . Although none of the neuroprotective genes have known roles in AD, two have been associated with neuroprotection or plasticity in other contexts: variants of LINGO2 have been associated with risk and age of onset in Parkinson's disease , while UNC13C is a candidate gene for critical period neuronal plasticity in visual cortex .
Finally, to validate expression of UNC13C, we performed in situ hybridization on tissue from three additional human hippocampi showing no, moderate, and high pathology according to Braak and Braak staging (Figure S5 in Additional file 6). Consistent with both microarray probes for this gene, expression of UNC13C shows increased expression in CA3 relative to CA1 in AD tissue compared with control. These results highlight the importance of including regions of different levels of vulnerability in transcriptional studies to allow for more comprehensive disease gene assessments.
One potential variable that we wished to explore was the role of cell type differences underlying differential expression changes. For example, with neurodegeneration there will be lost neurons, increases in glial cells, and a likely infiltration of inflammatory cells. To address this issue, we created a linear model measuring differential expression with region and with disease, which also takes into account four major cell types in the brain using linear regression (Materials and methods). We chose genes used extensively in the literature as markers, and that have also been labeled as hub genes in previous transcriptional studies of human brain [14, 25] (although we note that choice marker gene makes very little difference in the results). As a caveat, we point out that this linear model ignores within-subject relationships and resulting P-values should only be interpreted as descriptive as opposed to inferential measures.
After accounting for cell type, we found that approximately 60% of differentially expressed genes are still significant (Figure S6 in Additional file 6), and that most of the same GO categories from Table 2 still show significant enrichment, albeit to a lesser extent. This result suggests that, with relatively equal contributions, differentially expressed genes in our analysis mark two distinct phenomena: first, there are differences in cell composition between regions and disease states - a result that we will discuss extensively in the context of WGCNA below - and second, many genes show significant changes in expression even after accounting for changes in cell composition. This second category likely represents the subset of differentially expressed genes marking dysfunctional cellular pathways, which we hypothesize encompasses the most significant gene expression changes, and includes all the genes from Table 3. These results suggest that standard microarray analyses of heterogeneous tissue can accurately pinpoint genes related to dysfunctional intracellular pathways for the most highly differentially expressed genes, but that more sophisticated analyses are required to address cell type composition for the majority of such genes.
Summary of module characterization and trait association
Up with AD
Down with AD
Down with AD duration, down with AD, enriched CA3
Glutatmatergic synapse, neuron
Down with AD, down with age, enriched in CA3
Astrocyte (and other glia), cell death?
Up with AD
Up with AD
Enriched in CA1
Enriched in CA1, up with NFTs in CT, up with AD
Down with AD, enriched in CA1
Down with AD, enriched CA3
Up with AD, down with PMI, batch, Brain Bank
Many mixed categories
Choroid plexus, extracellular signaling
Up with age
Up with AD
Down with age
Up with AD duration, up with AD
To further examine the association between microglia and early tau pathology, we determined which genes showed the most significant increase in expression between Braak stages of 1 and 2 using a t-test, this time including CA1 and CA3 samples together to increase statistical power. Overall, we found 490 significant genes, including many in the light green 'microglial' module and >60 from the 'defense response' GO category (P < 10-18). To validate our results we performed qRT-PCR, adding two new controls to our analysis (Additional file 1). Of the five additional genes tested, three were validated (Figure 4). We then repeated the analysis on frontal cortex from the same individuals, and found that four of these genes validated (Figure 4). Since NFTs have not yet formed in CA3 or frontal cortex by Braak stage 2 and are only isolated in CA1 , this result suggests that microglial activation spreads throughout the brain before NFT pathology, and may therefore be one of the earliest indicators of AD progression.
We have performed a large genome-wide analysis of gene expression in the human hippocampus in the context of AD progression. To address the issue of selective regional vulnerability - that is, why neurons die more readily and earlier in certain areas - we performed microarray-based gene expression analysis on RNA both from CA1 and the nearby, relatively less affected CA3. Using this novel study design, we find that CA3 has a less abnormal expression pattern at baseline than CA1, consistent with the observed pathological gradient in susceptibility. We also find candidate protection and vulnerability markers for AD, some of which have already been implicated in the disease (ABCA1 and MT1H). We perform an in silico validation of previous gene expression studies, identifying significant, previously unrecognized convergence of gene expression abnormalities in AD. Finally, we use WGCNA to find co-expression modules (which turn out to be enriched with markers for major cell types) and measure their expression in the context of aging and AD progression (Table 4). The identification of disease-associated modules permits extending the results from analysis of single genes independently, to the identification of dysregulated pathways. Of note, results from one module suggest that microglial genes show increased expression in controls of Braak stage 2.
To the best of our knowledge, this study represents the first transcriptional snapshot of CA3 in AD human brain, although multiple transcriptional studies of dementia have compared regions of differing degrees of vulnerability in order to gain insight into disease. For example, comparisons between cortex and cerebellum in mice with and without a tau mutation led to the discovery of puromycin-sensitive aminopeptidase (PSA) as a potential tau protease acting as a neuroprotective factor in frontotemporal dementia . In a microarray study of AD, Hata and colleagues  found that calcineurin Aß showed significantly enriched gene expression levels in hippocampus relative to parietal cortex in AD, but not in control. Using in situ hybridization and RT-PCR analysis, they then confirmed that this gene might play a critical role in the pathophysiological mechanisms of AD. Another study of AD in the human brain compared gene expression levels across six brain regions affected by AD at different stages of progression . They found decreased expression of MAPT, CDK5, and various tubulin proteins across multiple AD-affected regions (including CA1), possibly indicating a cellular attempt to inhibit NFT formation. These studies highlight the importance of including regions with differing levels of vulnerability in the analysis of diseases in which there is a specific stereotyped progression.
There are several advantages of using CA3 as the comparison region for CA1, rather than a more distant, unaffected, region. First, CA1 and CA3 are structurally similar: they each have four layers, are directly connected via the Schaffer collateral, and are located in the hippocampus. Because of these similarities, it is more likely that differential changes with disease are due to disease pathology, rather than due to changes in local environment. Second, since both CA1 and CA3 are distinctly laminated, it is relatively easy to dissect these regions in a consistent manner between samples. Thus, we were able to collect reliable data using microscope-aided dissection. Third, the proximity of CA1 and CA3 allows us to collect samples from the same slide, thus eliminating one level of technical bias. Finally, many previous studies have compared CA1 and CA3 in control tissue, providing a valuable test of the validity of our results.
As with most microarray studies of disease, we first determined AD-related genes by finding the most differentially expressed genes between control and disease. Using this method we have found several genes, including SERPINA3  and S100A6 , which have been previously associated with AD (Table 3). We can also find disease genes by filtering our list of differentially expressed genes using data from previous studies (for example, Figure S4E in Additional file 6). One advantage of our methodology is that we can find protection and vulnerability genes by taking into account expression levels in tissues that are differentially affected by disease progression. In this way we find ABCA1  and MT1H , which have also been previously associated with AD (Figure 2), along with several novel disease genes. Using WGCNA we can find additional disease genes in the form of hub genes for modules correlated with AD-related traits. Previous studies have shown that hubs are more likely than other genes to be functionally relevant; for example, in the case of oncogenic signaling networks in glioblastoma, nearly all hubs of a cancer-related module were found to be molecular targets for treatment . In this case we find that RGS4, a gene involved in calcium signaling that has been found to show decreased expression in several studies of AD , was the top hub gene in the light yellow module (Figure S8I in Additional file 6), which also shows decreased expression with AD (Table 4). Together, these results demonstrate that a multifaceted systems biology analysis of expression data increases a study's effectiveness in finding disease-related genes.
By a number of measures, we show remarkable consistency (that was previously unrecognized) between published studies of gene expression in AD: first, genes showing increased expression with AD in CA1 are enriched for synaptic transmission and cell-cell signaling, while those decreasing with AD are enriched for cell death and proliferation genes (Table 2); second, we find that most previously published lists of genes differentially expressed by hippocampal region or disease state are consistent with our results, even if they do not on the surface appear to be in agreement with each other (Figure 1; Figures S3 and S4 in Additional file 6); and finally, we find modules of co-expressed genes that are highly overlapping with previously published modules corresponding to basic cell types and cellular components (Table 4). Such a high level of between-study conformity, particularly regarding differential expression of individual genes, stems from our large sample size and robust statistical methods, adding confidence that our results represent real biological effects.
Our results regarding the changing expression patterns of cell type-specific modules suggest that both neurons and glia are affected by AD progression. Specifically, we found that neuron-associated modules showed decreased expression with AD, astrocyte-associated modules showed increased expression with AD, the oligodendrocyte-associated module showed increased expression with age, and a microglia-associated module showed increased expression with Braak stage in controls (Figure 3). While AD is usually thought of as a neurodegenerative disorder, there is mounting evidence that changes in glial cells occur with AD progression as well. Since oligodendrocytes produce the brain's supply of cholesterol and since progression of neurodegeneration in AD follows the reverse pattern of developmental myelination , oligodendrocyte dysfunction has been suggested as an early event in AD progression , and has been clearly linked to aging . Hundreds of publications have linked astrocytes and microglia to AD progression, generally in the context of inflammation (for example, see ), although the complex issue of whether these effects are protective or pathological is still open to debate (reviewed in ). Increases in inflammatory markers have been seen in many transcriptional studies of AD [3, 4, 9, 53, 54], often occurring early in the disease progression. Finally, both microglia  and reactive astrocytes  have been found to surround amyloid plaques, suggesting that glial dysfunction, along with neurodegeneration, is something that occurs throughout disease progression.
Microglia are extremely sensitive to disease pathologies, and as such could act as diagnostic markers of disease onset or progression . Furthermore, it is widely accepted that microglia often are found near amyloid deposits  and that microglia-mediated inflammation contributes to the progression of AD . What association microglia and neuroinflammatory markers have with NFT pathology is less clear. Whereas microglial cell activation has been linked to NFT burden in some cases [57, 58], this association has not received nearly the same attention as that of microglia and amyloid plaques [15, 59]. Overall, it is clear that microglia activation occurs in the AD brain, but its timing and role in AD progression has been difficult to pin down.
Our finding that microglial markers show increased expression in controls in Braak stage 2 (Figures 3 to 5) lends support to the idea that an increase in inflammatory processes may be one of the earliest events in AD progression. In a longitudinal analysis of blood from approximately 900 subjects, higher protein levels of three inflammatory markers (interleukin 6, α1-antichymotrypsin, and C reactive protein) were associated with an increased risk of dementia in general and of AD specifically . A separate study found that a panel of 18 signaling proteins involved in immune response could accurately predict the transition of mild cognitive impairment to AD when measured in blood plasma . While these are not the same genes that we found differentially expressed with Braak stage, these studies highlight the possibility of using blood biomarkers as a preclinical predictor of AD progression. Immune response genes have also been linked to blood lipid levels , another possible indicator of AD progression. Positron emission tomography (PET) is another non-invasive strategy that has the potential of preclinically predicting AD progression. One group found that approximately 40% of the patients they imaged with mild cognitive impairment showed increased microglial activation . Interestingly, the only place where they found significant microglial activation in amyloid-positive versus amyloid-negative mild cognitive impairment patients was frontal cortex, which is consistent with our qRT-PCR validations (Figure 4b). Thus, several studies suggest that some measure of inflammatory markers could be combined with a longitudinal study design to create a relatively accurate predictor of AD onset.
Our results further demonstrate that these same microglial markers show increased expression in or near neurons bearing NFTs (Table S6 in Additional file 6), suggesting that microglia may react to both major AD pathologies, not only amyloid plaques. The major question that remains is whether the upregulation of microglia reflects immune activation, or some other function, such as synaptic pruning or homeostasis , and whether this process is protective or dysfunctional. Since microglia can cross the blood brain barrier  and since they may be involved in amyloid plaque degradation , we surmise that they serve a protective role. At least two studies of transgenic mice with APP and PSEN1 mutations support this hypothesis. The first found that injection of transgenic mice with macrophage colony-stimulating factor, a protein that stimulates the production of bone marrow-derived microglia, prevents cognitive decline when injected presymptomatically and stabilizes the cognitive decline when injected after the appearance of amyloid pathology . The second study found that activated microglia colocalize with newly formed amyloid plaques within 1 to 2 days, at which point these plaques no longer increase in size, suggesting that microglia may stabilize their growth . Similar results were found in human: in individuals with possible AD, not only were there more microglia and amyloid plaques relative to controls, but amyloid plaques were also never found without an adjacent microglia . Thus, despite the relative lack of success of anti-inflammatory trials to date , our results suggest that approaches to AD treatment involving the mobilization of anti-inflammatory processes may have the potential to be both noninvasive and effective.
Despite a century of study, the number of AD diagnoses continues to increase, suggesting that new strategies for studying AD need to be developed and that previous results need to be confirmed in order to better understand this complex disease. Tollervey and colleagues , for example, used splice junction microarrays to find changes in alternative splicing in temporal cortex, both with age and with neurodegenerative disease, allowing them to distinguish disease-specific changes, which mostly affect neurons, from common changes, which affect both neurons and oligodendrocytes. We have taken a complementary approach by confirming previous transcriptional studies of AD on many levels, but go beyond these studies in a number of ways. We find candidate genes for neuroprotection and vulnerability in the AD hippocampus, as well as a robust relationship between disease- and region-specific gene expression changes. We identify co-expression modules corresponding to major cell types, which show expression patterns consistent with known disease-related changes, and suggest that a more detailed look into the role of microglia in preclinical AD is warranted. Together, these results paint a picture of AD as a multifaceted disease involving slight transcriptional changes in many genes between regions, coupled with a systemic immune response, gliosis, and neurodegeneration. Despite this complexity, we find that a consistent picture of gene expression in AD is emerging.
Enrichment Analysis Systematic Explorer
Gene Expression Omnibus
post mortem interval
weighted gene co-expression network analysis.
We thank Ezra Rosen, Brent Bill, Jennifer Lowe, and Neelroop Parikshak for reading the manuscript for clarity, and Eric Wexler, Chandran Vijayendran, Patricia Kramer, and Jeffrey Kaye for valuable conversations. Tissue and clinical characterization for this experiment was graciously provided by the Oregon Alzheimer's Disease Center and the Oregon Brain Aging Study (OBAS): Office of Research and Development, Clinical Sciences Research and Development Service, funded by NIH NIA P30 AG008017 and the Department of Veterans Affairs, as well as by the Human Brain and Spinal Fluid Resource Center, VA West Los Angeles Healthcare Center, 11301 Wilshire Blvd, Los Angeles, CA 90073, which is sponsored by NINDS/NIMH, National Multiple Sclerosis Society, and the Department of Veterans Affairs. This work was supported by National Research Service Award F31 AG031649 from the National Institute on Aging (NIA) (to JAM); NIH/National Institute of Mental Health Merit Award R37 MH 60233-09S1 (to DHG); NIH/NIA Award R01 AG26938-05 (to DHG); the Oregon Alzheimer's Disease Center grant P30AG008017 (to RLW); and Consortium for Frontotemporal Dementia Research Award 108400 (to DHG and JMG).
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.