Skip to main content

Red blood cell indices and anaemia as causative factors for cognitive function deficits and for Alzheimer’s disease



Studies have shown that low haemoglobin and anaemia are associated with poor cognition, and anaemia is known to be associated with Alzheimer’s disease (AD), but the mechanism of this risk is unknown. Here, we first seek to confirm the association between cognition and anaemia and secondly, in order to further understand the mechanism of this association, to estimate the direction of causation using Mendelian randomisation.


Two independent cohorts were used in this analysis: AddNeuroMed, a longitudinal study of 738 subjects including AD and age-matched controls with blood cell measures, cognitive assessments and gene expression data from blood; and UK Biobank, a study of 502,649 healthy participants, aged 40–69 years with cognitive test measures and blood cell indices at baseline. General linear models were calculated using cognitive function as the outcome with correction for age, sex and education. In UK Biobank, SNPs with known blood cell measure associations were analysed with Mendelian randomisation to estimate direction of causality. In AddNeuroMed, gene expression data was used in pathway enrichment analysis to identify associations reflecting biological function.


Both sample sets evidence a reproducible association between cognitive performance and mean corpuscular haemoglobin (MCH), a measure of average mass of haemoglobin per red blood cell. Furthermore, in the AddNeuroMed cohort, where longitudinal samples were available, we showed a greater decline in red blood cell indices for AD patients when compared to controls (p values between 0.05 and 10−6). In the UK Biobank cohort, we found lower haemoglobin in participants with reduced cognitive function. There was a significant association for MCH and red blood cell distribution width (RDW, a measure of cell volume variability) compared to four cognitive function tests including reaction time and reasoning (p < 0.0001). Using Mendelian randomisation, we then showed a significant effect of MCH on the verbal–numeric and numeric traits, implying that anaemia has causative effect on cognitive performance.


Lower haemoglobin levels in blood are associated to poor cognitive function and AD. We have used UK Biobank SNP data to determine the relationship between cognitive testing and haemoglobin measures and suggest that haemoglobin level and therefore anaemia does have a primary causal impact on cognitive performance.


Dementia, a syndrome increasingly common in our ageing societies, is widely recognised as one of the world’s largest unmet medical needs. Significant progress has been made in identifying the determinative genes of familial diseases that cause dementia, such as early-onset Alzheimer’s disease (AD) or fronto-temporal dementia [1, 2]. For the commonest form of dementia, late-onset AD, genome-wide association studies have identified genes that alter the risk of suffering from the condition [3]. The identification of these genetic factors has driven much of our understanding with respect to the mechanisms of neurodegenerative disease. However, although modifiable environmental factors have also been identified (reviewed in [4]), the role of environmental influences such as cardiovascular risk, depression and social isolation in the disease process is less certain. Most significantly, factors associated with diseases, such as depression and social isolation, could plausibly be consequences, or even prodromal symptoms [5], rather than causes of dementia. It has been suggested that metabolic dysfunction plays a mechanistic role in disease [6] and could be a consequence of the genetically driven molecular pathological process rather than its cause [7, 8]. Clearly, this makes a difference when considering potential interventions to identify or prevent AD.

Another potentially modifiable risk factor for poor cognition in late life is anaemia. Systematic reviews suggest that anaemia is a risk factor for both dementia and for cognitive impairment [9, 10]. In addition to these, Faux et al. [11] found lower haemoglobin and differences in blood measures for mean cell haemoglobin, packed cell volume and higher erythrocyte sedimentation rates in people with AD, while Ferrer et al. [12] found that levels of neuronal haemoglobin are reduced in AD. In the Rush Memory and Aging Project, both high and low levels of haemoglobin were associated with AD and faster cognitive decline [13]. In participants at post-mortem analyses, lower haemoglobin levels were associated with macroscopic infarcts but not other pathologies of neurodegeneration [14]. Although it is reasonably clear that there is a relationship between indices of red blood cell phenotypes and cognition, the directionality and therefore causality of the observation is unknown, just as it is for other environmental factors.

Determining whether potentially modifiable factors associated with dementia are drivers of disease process and hence targets for therapy is of critical importance. A powerful approach to determining such causality is the use of Mendelian randomisation (MR). One of the limitations of MR, however, is the availability of genetic loci strongly associating with the phenotype under consideration. Here, we have utilised growing understanding of the genetic determinants of red blood cell characteristics to explore the role of haemoglobin and anaemia as a causal factor of cognitive phenotypes, including dementia, while integrating this growing understanding with modern MR methods able to combine multiple genetic loci.

We use a range of analyses to draw inferences about the relationship of red blood cell indices, and therefore anaemia, to both cognitive function and AD. Using both UK Biobank and AddNeuroMed data, we confirm the relationship between AD and anaemia. Then, MR methods suggest that altered red blood cell indices are causally associated with reduced cognitive function and finally, we provide transcriptomic evidence for molecular pathways that might underpin this mechanism.


Clinical measures and blood indices

UK Biobank

The UK Biobank study is comprised of 502,649 healthy participants, aged 40–69 years with comprehensive phenotypic measures including cognitive testing and blood cell indices (Additional file 1: Table S1), with measures described in detail online [15]. Briefly, blood cell indices were calculated for participants using a haemotology analyser which generated complete count data, including red blood cell count (RBC) and haemoglobin concentration (HGB). Other parameters were calculated from these same measures, e.g. mean corpuscular haemoglobin (MCH). All indices used in this analysis were taken from the recruitment/baseline visit. Anaemia classification was based on NICE guidelines, specifically males with HGB below 13 g/100 mL and women with HGB below 12 g/100 mL.

Results from tests conducted at baseline were used to measure cognitive function. Full assessment methods are described by Lyall et al. [16] but a brief description of cognitive function test and value treatment follows:

Verbal–numeric reasoning (fluid intelligence)

13 logic-based questions asked within a 2 minute time limit. Total number of correct responses was used for analysis (UKB Field Identifier (FID) 20016).

Numeric memory

Participants were asked to remember a two-digit number after a brief pause. The number of digits was then increased and longest number of digits recalled was used for analysis (FID: 4282).

Reaction time

Time taken for participants to match two identical symbols and press button. Mean reaction time (ms) of eight trials was used for analysis after log transformation (FID: 20023).

Visual memory

Pair matching test based on memory of card location. Number of pairs mismatched for the six pair test was used for analysis after log transformation (FID: 399).

Prospective memory

An instruction was given at the beginning of the assessment, which the participant needs to remember in order to select the correct shape at the end of the interview. A binary success or fail measure of the first attempt was used for further analysis (FID: 20018).


AddNeuroMed was a multi-national longitudinal study of AD in Europe described elsewhere [17, 18]. It included both AD and age-matched control subjects with blood cell measures, neuropsychological assessments and gene expression data [19]. NINCDS-ADRDA criteria and Diagnostic and Statistical Manual of Mental Disorders (DSM-IV) were used to classify AD patients. Blood cell count measurements were generated at King’s College Hospital according to clinical standards for 285 of these subjects. For a subset of samples (n = 71), all these variables were available for two or more visits. Blood measure rate of change was calculated as the slope of a linear model using individual age at visit (years) with blood measure as the dependent variable. Namely, blood measure = β0 + β1 age + ε (where β1 is the slope used, β0 intercept and ε noise).

Statistical analysis

UK Biobank

To test for associations between each cognitive function test and blood measure, we used a general linear model (GLM) per blood measure in which participants were filtered by age (> 60 years) to give a better comparison to AD patients. Cognitive function test was used as the outcome variable, and blood measure as the main exposure in each case. All p values were adjusted for multiple testing using Benjamini and Hochberg correction. A representative residual value for blood count was generated based on a linear model using device and acquisition route as covariants (FID: 30000-30284). This allowed correction for effects of blood collection method without impacting the cognitive function model. Demographic variables were also added as further covariates to correct for age, education, sex (FID: 31) and assessment centre (FID: 54) as described by Nevado–Holgado et al. [20]. Education level impacts on multiple outcome measures [21, 22], here, we included education within our model to adjust for socio-economic factors represented by schooling in different areas. However, we accept that education and cognition are correlated as people with stronger cognitive ability tend to stay in education longer and we have included education as a covariate assuming that as a generic adjustment of residual confounders, it will lead to conservative estimate of cognitive function. The same approach was used to test for the association between AD status and blood traits, with a GLM per blood measure including the same covariates as before. However, the population consisted of all participants older than 60 with a diagnosis of AD, plus a control participant (i.e. without AD) per case matched by age and gender. A representative residual value for blood count was generated based on a linear model using device and acquisition route as covariants (FID: 30000-30284). This allowed correction for effects of blood collection method without impacting the cognitive function model.


To test for differences in the case and control sample sets, different statistical tests were applied depending on the number of available samples. An unpaired t test was used to assess for significant differences between the mean rates of change, while Kolmogorov–Smirnov test was used to discern a difference between the distributions of rates of change. p values were adjusted for false discovery rate in both instances. These simpler methods were required to capture differences in the case of a small sample set while, where sample size was large enough (for MMSE-tested patients), a GLM was applied instead with corrections for sex and age.

Mendelian randomisation

The main genetic data analysis was based on the first released data batch of 152,736 participants from UK Biobank. Samples were filtered by ethnicity (FID: 22006, only keeping those with white genetic background); genetic sex (FID: 22001, removing those where stated gender did not match with real X–Y chromosome); related participants (FID: 22012, removing one from each common pair) and experimental checks (FID: 22050 and 22010) to leave 116,478 samples. A secondary replication analysis was performed on the interim set of genetic data (UK Biobank Release 2) which contained 335,423 participants. The dataset was processed following the method outlined by Bycroft et al. [23].

The SNPs for MR were selected based on two GWAS studies of blood traits with secondary validations as a filter [24, 25]. The SNP list was then filtered using the PhenomeScanner [26] tool to remove all SNPs with a known AD relationship, including SNPs located in the APOE/TOMM40 locus, to reduce the potential of pleiotropy errors. Remaining SNPs, with an info score > 0.9, were extracted from the imputed dataset. Subsets of SNPs specific to the blood measure were prepared to allow testing of instrument choice for pleiotropy. As blood measures are derived from common values, we selected three independent traits to study based on their association with the outcome variables: MCH; red blood cell distribution width (RDW) and reticylocyte count (RET). Association analysis was performed in SNPtest [27] for imputed data.

One-sample MR was implemented using the “Mendelian Randomisation” package from R [28] which incorporates three methods with different assumptions. The median weighted method or two-stage least squares estimation uses a median of the individual causal estimate per SNP, which is calculated from the ratio estimates of outcome’s regression coefficient divided by exposure [29]. The inverse-variance weighted (IVW) method uses the same ratio estimates but incorporates inverse-variance weights into the final summary estimate [30]. The Egger method is sensitive to SNP pleiotropy and allows the estimation of underlying bias by allowing a non-zero estimate for the intercept of the calculated ratio of beta values [31]. Comparing estimates from all of the methods shows the robustness of the overall analysis. Two-sample MR was performed with the “MRBase” R package [32] using the same instrument set.

Gene expression analysis and pathway enrichment

RNA was extracted from blood samples and assayed on Illumina Human HT-12 Expression Beadchips, full details are described by Lunnon et al. [19]. While a subset of these samples was used for this analysis based on data completion, the full raw dataset is available as GEO DataSets with accession numbers GSE63060 and GSE63061. Two approaches were used for array expression analysis, LIMMA models were used for fold change calculations and the SAMr correlation method was used to generate permutated statistics for the patient based approach. Finally, the Kolmogorov–Smirnov test was used to evaluate KEGG pathways for significant enrichment. This pathway approach is described by Nevado–Holgado et al. [33] which, similar to GSEA, takes significance values from each individual gene and compares the overall distribution of expression rather than a simple binomial approach.


Haemoglobin content has a significant association with cognitive function tests

Using the UK Biobank dataset, five cognitive function tests were compared to the complete blood cell indices set (Table 1). There was a significant association for red blood cell distribution width (RDW) and mean corpuscular haemoglobin (MCH) with outcomes on four cognitive tests including reaction time and verbal–numeric reasoning (Fig. 1a). Although reaction time was associated with white cell count and neutrophil number, associations with red cell indices were considerably more extensive.

Table 1 Associations between blood traits and cognitive function tests as revealed by linear modelling
Fig. 1
figure 1

Cognitive tests have a significant effect on red blood cell measures. a There is a significant association between red blood cell measures and the reaction time, reasoning, numeric and prospective cognitive function tests. b Increased MCH and related indices have a positive effect on verbal–numeric reasoning, prospective and numeric memory (red squares). Reaction time is increased as haemoglobin decreases due to inverse nature of the reaction time test (blue squares). See Abbreviations for blood indices’ acronyms

Performance on the reasoning test was positively correlated with red blood cell haemoglobin (Fig. 1b). Haemoglobin concentration (HGB), MCH and mean corpuscular haemoglobin concentration (MCHC) were higher in participants with higher reasoning scores (beta = 0.04, 0.04, 0.05 and p value = 2.26 × 10−7, 1.92 × 10−28, 7.33 × 10−12 respectively). The same correlation trend is seen in the numeric and prospective memory tests. Reaction time was inversely associated with HGB, MCH and MCHC measures (beta = − 0.009, − 0.003, − 0.002 and p value = 6.67 × 10−46, 7.94 × 10−8, 8.45 × 10−8 respectively); reflecting the same direction of change as with other cognition measures as increased reaction time is reflective of relatively worse cognition. We found that RDW was inversely correlated with four tests of cognitive function (beta between − 0.053 and − 0.008, p value from 1.71 × 10−14 to 0.003).

Interestingly, the reticulocyte (RET) measures, although highly variable, show the largest significant beta scores (beta between − 1.34 and − 1.310 with p values from 0.025 to 9.4 × 10−5). As these sets of measures are used clinically to diagnose iron-deficient anaemia, we estimated the proportion of participants with anaemia according to NICE guidelines and repeated the analysis. Participants with anaemia, so defined, had a significant reduction in performance on cognitive tests for three measures (prospective, numeric and reasoning) and increased reaction time score (p < 0.0005, Additional file 2: Figure S1).

Mean corpuscular haemoglobin and red blood cell distribution width have a causative relationship with verbal–numeric reasoning

Using UK Biobank to estimate a direction of effect, we applied a single-sample MR model where the cognitive test was the outcome variable, the blood measure the mediating exposure variable, and SNPs known to be related to the blood measure were used as instruments (Fig. 2a). In all cases, we used three alternative MR methods to discount the possibility of pleiotropy among SNPs (Table 2) as well as plots to assess SNP beta scores (Fig. 2ce). This approach identified a significant effect on the numeric and reasoning traits from the MCH measure (Fig. 2b). The effect between MCH and reasoning traits was replicated in an analysis using in the interim release of the full UK Biobank genetic data where we were able to reproduce the same direction of effect (Additional file 1: Table S2). In addition, two-sample MR was used to analyse the association in an alternative sample set (Additional file 2: Figure S2). The UK Biobank cognitive reasoning was used as the outcome, and MCH beta scores from the MRBase library were introduced as the new exposure to duplicate the significant results shown in our main one-sample results (p values < 0.05 for all three MR methods).

Fig. 2
figure 2

MCH has a significant effect on the reasoning cognition in multiple MR analysis approaches. a Mendelian randomisation model used for analysis. b p values are significant (> 0.005) in multiple MR methods for the MCH measure (exposure) in the reasoning and numeric traits. Significance in more than one test method is important to rule out pleiotropy among instruments. c MCH instrument (SNP) causal estimates for the reasoning (outcome) show symmetry about 0 indicating a robust analysis (without pleiotropy). d MCH instrument causal estimates for the numeric trait. e Instrument causal estimates for the reasoning trait compared to RDW

Table 2 Associations from MCH and RDW to cognitive tests as revealed by MR

RDW also showed significant effects in several of the MR tests for the reasoning and numeric traits (Table 2). Beta scores were negative suggesting an inverse relationship whereby RDW decreases as the cognition improves (Fig. 2e). Given the relationship between haemoglobin measures and cognitive tests, red blood cell indices were selected based on GLM results (Table 1), and their unique derivation source, to fit independent testing assumptions. MCH and RDW were the best candidates based on results from analyses with cognitive tests and imply that both haemoglobin levels and red blood cells themselves have a potentially causative effect on cognition (Table 2). RET was included as it is an independent measure with strong beta scores but was not significant (Additional file 1: Table S3).

Changes in red blood indices are also associated with Alzheimer’s disease

UK Biobank participants gave consent for linkage to medical records and using Hospital Episodes Statistics data a subset of participants with a recorded clinical diagnosis of AD or other dementia was identified using ICD10 codes. This subset was then age and gender matched to a control group (n = 1170). Using this sub-cohort anaemia was found to be significantly increased in people with AD (beta = 0.26, p value = 2.3 × 10−6) and the RBC and HGB indices were all decreased in the AD participant set (beta = − 0.66 and − 0.18 respectively, adjusted p values < 0.05; Additional file 1: Table S4).

Replication of the red blood cell association in an independent cohort

We then turned to the AddNeuroMed cohort to replicate these findings using complementary analyses. We determined rate of change measures per participant to incorporate multiple visit data when the participant made at least three visits between age of patient at visit (years) and each cell count measure (Additional file 2: Figure S3). These rate of change values were not correlated to the mean statistic (rho = − 0.031, Fig. 3a) suggesting that they provide additional information over and above the mean. We found a significant difference between AD case and normal cognition control subjects in five red blood cell rate of change measures (p value < 0.05, Table 3). A decline in rate of change was shown in the AD cases compared to control patients, with Fig. 3b, c showing the difference in distributions between RBC (p value = 2.21 × 10− 4) and mean corpuscular volume (MCV, p value = 1.95 × 10−3). The test was repeated using the MMSE scores per patient as an assessment of cognition. Using the highest and lowest scores (± 20%) and despite the low sample numbers (n = 53) and therefore lack of power, a significant difference remained (adjusted p value < 0.005) in three red blood cell rate of change measures between low and high MMSE (Fig. 3d). Finally, using MMSE as a continuous measure in a linear model, significant association was shown between MMSE score and four red blood cell measures including MCH (Table 3).

Fig. 3
figure 3

Rate of change in red blood cells emphasises differences in AD case–control samples. a Rate of change per patient is not correlated with mean per patient. b The distribution of RBC is significantly decreased in AD compared to controls. c The distribution of MCV, a haemoglobin measure, is significantly decreased in AD patients. d RBC rate of change is significantly different for high and low MMSE scores

Table 3 Significant Differences for red blood cell measures in an independent sample set

Pathway enrichment analysis indicates changes in MCH may have an impact on haematological gene expression

As the AddNeuroMed cohort also contained whole blood whole genome transcript data, we were able to use this dataset to explore, using several approaches, the gene expression patterns and hence KEGG pathways, linked to both blood traits and to AD. Initially, we used all subjects with both expression and rate of change in MCH data in a fold change analysis to look for significantly associated genes (37 patients), finding an enrichment for the glycosylphosphatidylinositol (GPI) anchor biosynthesis pathway (p value = 0.0107) in those with greatest rate of change in MCH. Defects in this pathway cause paroxysmal nocturnal haemoglobinuria, a genetic disorder whereby the immune system destroys red blood cells. We then focussed in on the AD group with complete data as above (n = 22) to look for correlation between rate of decline in blood indices and gene expression. Using this filtered approach, we detected an enrichment for haematopoietic cell linage pathway correlating with MCH rate of decline (p value = 0.0088, Additional file 1: Table S5). In both cases, we found weaker p values at the initial analysis stage, which is to be expected given the sample size.


Recently, increasing attention is being paid, with considerable justification, to environmental factors that might influence the development of dementia. As pharmacological strategies for prevention have not yet yielded success and as the number of people with dementia continues to rise then modifying environmental factors to reduce incidence of dementia is an increasingly attractive prospect. Supportive evidence for such an approach has come from multiple lines of evidence that despite increasing prevalence, the incidence of dementia might be falling; an observation that might be due to improved modification of cardiovascular risk factors. However, other non-genetic risk factors derived from observational study cannot be assumed to be causative and because of this, modification may not prove to be successful in reducing further the incidence of dementia. It becomes therefore, of paramount importance to determine causality, including through the use of MR techniques. However, previously, this approach has offered relatively little support to the hypothesis that modification of environmental risk factors such as LDL cholesterol, glycemic traits, diabetes, body mass index or education would reduce incidence of dementia [34]. In fact, counter intuitively, Ostergaard et al. [35] find higher systolic blood pressure to be associated with decreased risk of dementia, suggesting either that blood pressure has opposite effects on risk of dementia and of cardiovascular disease or that another factor associated with hypertension, most obviously anti-hypertensive medication, has a protective effect. There is therefore an evidence gap at present between observational studies proposing risk factors for modification and robust proof of concept for such modification including causality. Without this evidence, the only approach is to perform an interventional study of environmental modification, a challenge given the difficulties and costs of such public health measures. Evidence from approaches such as MR for causality would add considerably to the justification for such interventional studies.

We present evidence here for a primary causative association between indices indicative of relatively poor red cell function and cognitive function and, using MR with genetic loci previously found to have robust relationship with red cell phenotypes, findings that strongly suggest that lower haemoglobin has a causal impact on cognitive performance. Moreover, secondary analyses are in line with previous findings showing an association between anaemia and meeting operationalised criteria is a risk factor for dementia as well as lower cognition. Specifically, in UK Biobank data, we find lower MCH and RDW to be associated with relatively lower verbal–numeric reasoning and numeric memory and that measures indicative of anaemia, or a clinical diagnosis of anaemia, are associated with decreased cognitive function. This result replicates findings in a larger healthy population (n > 37,000) compared to previous studies [11, 14]. In complementary analyses in AddNeuroMed, a cohort study of dementia, we similarly find that red blood cell indices including red cell count, PCV and HGB are associated with AD and with decline in cognitive function measures. Using genetic loci strongly associated with these blood traits, we find associations with poorer cognitive function strongly suggesting a causative relationship with cognitive performance and by implication with dementia. Finally, pathway analysis of gene expression in blood in the AddNeuroMed cohort finds genes known to be linked with anaemia and the pathway of haematopoietic cell linage to be associated with changes in red cell indices adding further to the weight of evidence suggesting that these observations are indicative of true biological association.

The RBC indices we observe to be most strongly associated with cognitive outcomes are MCH and RDW, measures commonly associated with iron deficiency anaemia [36] indicating a possible deficit in haem synthesis or iron metabolism as an underlying trait. A possible relationship between neurodegeneration and iron has been investigated in other MR studies. Pichler et al. [37] used MR with three SNP instruments to find that increased iron reduces the risk of Parkinson’s disease and implying that there may well be a causal association in other similar diseases. However, Lupton et al. [38] used genetic determinants of the serum iron measures transferrin and ferritin in a reanalysis of large-scale GWAS data but found no association with AD. One possible explanation for this apparent discrepancy is the use of MCH in the present study, reportedly a more reliable measure of haemoglobin not influenced by sample storage conditions or cell counter methods [36]. Another potential explanation is the difference in instrument choice available from comprehensive GWAS studies of the blood indices [39]. By approaching the problem from the opposite direction using known genetic blood traits, we were able to detect a significant link not seen using AD genetics. The complexities of relationship between iron and AD have been shown using other experimental methods. For example, iron metabolism is disrupted in cortical neurons and the beta-amyloid protein precursor has ferroxidase activity in mouse models [40]. Telling et al. [41] have described a correlation between iron biochemistry and amyloid beta. These results show the relationship at the molecular level and may indicate a potential mechanism for iron within AD. The relevance of blood indices to the iron deposition has been shown in other UK Biobank based studies. Miller et al. [42] showed a correlation between the blood indices and T2* image derived phenotypes from the brain scans (which reflects iron deposition). In addition, a recent GWAS study showed significant associations between the T2* subcortical regions and genes related to iron transport such as TF, HFE and SLC25A37 [43].

We recognise that there are limitations to this study. The five cognitive tests were generally in agreement; however, there was some discrepancy in the visual memory task. The task itself involved matching of pairs and although the mismatched score was used to improve reliability of the testing measure there are still weaknesses in this data set. Other studies have shown the measure has a low reliability score of 0.15 [16] and potential weaknesses of test method may be impacting on our own analysis results. The main inference for the MR analysis is use of cognitive performance as a proxy representative for AD. An alternative would have been to use the AD phenotype as the mediating exposure, but the low number of AD patients recorded in UK Biobank seriously limits the statistical sensitivity of this approach. Additionally, this only had borderline significance in other studies [44].

Pleiotropy of instruments is a common limitation of MR approaches. We used a number of tests to check for pleiotropy effects on the results including Egger methods and confirmational plots.

Using the rate of change statistic from the blood measures, we were able to determine a difference between AD patients and controls. This is not a standard approach, possibly due to limited data available for multiple visits; however, it was very informative. We found differences that were reproduced in a larger set that were not detected otherwise. Using the same dataset but taking a mean statistic per patient, rather than time decline, we detected a difference in white blood cell measure for basophils [45]. Given the known effects of AD on blood measures, it seems likely that both blood types are affected. Nonetheless, both methods warrant replication in a larger, independent dataset. We have also presented some interesting pathway enrichment results yielding pathways which warrant replication in an independent sample set with the goal of identifying related genes.


We have presented here further evidence for the association between red blood cell measures normally indicative of anaemia and measures of both poor cognitive performance and of dementia. Using a robust MR approach, we are able to determine that this relationship is one of causality and not consequence suggesting that reversing these changes might slow or prevent the onset of dementia. These findings require replication in other datasets but already derive from one very large and one very detailed cohort study. If they are replicated then the implications are considerable. As our findings apply to people with decreased cognitive function within the normal range as well as to people with established dementia then the implication is that the causal relationship between decreased red cell function and anaemia are an early, preclinical influence on disease that continues through to the dementia syndrome. It follows that measures to reduce or reverse poor red cell function might be both preventative and therapeutic at least in part. If this was proven in interventional studies then such screening measures, already in widespread use in the population, might be used to identify people for these and indeed for other secondary prevention interventions as they become available.



Alzheimer’s disease


Basophill count


Basophill percentage


Eosinophill count


Eosinophill percentage


Haematocrit percentage


Haemoglobin concentration


Immature reticulocyte fraction


Lymphocyte count


Lymphocyte percentage


Mean corpuscular haemoglobin


Mean corpuscular haemoglobin concentration


Mean corpuscular volume


Monocyte count


Monocyte percentage


Mean platelet volume


Mendelian randomisation


Mean reticulocyte volume


Mean sphered cell volume


Neutrophill count


Neutrophill percentage


Nucleated red blood cell count


Nucleated red blood cell percentage


Platelet crit


Packed cell volume


Platelet distribution width


Platelet count


Red blood cell count


Red blood cell distribution width


Reticulocyte count


Reticulocyte percentage


White blood cell count


  1. Cacace R, Sleegers K, Van Broeckhoven C. Molecular genetics of early-onset Alzheimer’s disease revisited. Alzheimers Dement. 2016;12:733–48.

    Article  PubMed  Google Scholar 

  2. Ferrari R, Wang Y, Vandrovcova J, Guelfi S, Witeolar A, Karch CM, et al. Genetic architecture of sporadic frontotemporal dementia and overlap with Alzheimer’s and Parkinson’s diseases. J Neurol Neurosurg Psychiatry. 2017;88:152–64.

    Article  PubMed  Google Scholar 

  3. Lambert J-C, Ibrahim-Verbaas CA, Harold D, Naj AC, Sims R, Bellenguez C, et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat Genet. 2013;45:1452–8.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  4. Livingston G, Sommerlad A, Orgeta V, Costafreda SG, Huntley J, Ames D, et al. Dementia prevention, intervention, and care. Lancet. 2017 [cited 2017 Aug 7];0. Available from:

  5. Mirza SS, Wolters FJ, Swanson SA, Koudstaal PJ, Hofman A, Tiemeier H, et al. 10-year trajectories of depressive symptoms and risk of dementia: a population-based study. Lancet Psychiatry. 2016;3:628–35.

    Article  PubMed  Google Scholar 

  6. Ribe EM, Lovestone S. Insulin signalling in Alzheimer’s disease and diabetes: from epidemiology to molecular links. J Intern Med. 2016;280:430–42.

    Article  PubMed  CAS  Google Scholar 

  7. Marciniak E, Leboucher A, Caron E, Ahmed T, Tailleux A, Dumont J, et al. Tau deletion promotes brain insulin resistance. J Exp Med. 2017;

  8. Ruiz HH, Chi T, Shin AC, Lindtner C, Hsieh W, Ehrlich M, et al. Increased susceptibility to metabolic dysregulation in a mouse model of Alzheimer’s disease is associated with impaired hypothalamic insulin signaling and elevated BCAA levels. Alzheimers Dement J Alzheimers Assoc. 2016;12:851–61.

    Article  Google Scholar 

  9. Andro M, Le Squere P, Estivin S, Gentric A. Anaemia and cognitive performances in the elderly: a systematic review. Eur J Neurol. 2013;20:1234–40.

    Article  PubMed  CAS  Google Scholar 

  10. Peters R, Burch L, Warner J, Beckett N, Poulter R, Bulpitt C. Haemoglobin, anaemia, dementia and cognitive decline in the elderly, a systematic review. BMC Geriatr. 2008;8:18.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  11. Faux NG, Rembach A, Wiley J, Ellis KA, Ames D, Fowler CJ, et al. An anemia of Alzheimer’s disease. Mol Psychiatry. 2014;19:1227–34.

    Article  PubMed  CAS  Google Scholar 

  12. Ferrer I, Gómez A, Carmona M, Huesa G, Porta S, Riera-Codina M, et al. Neuronal hemoglobin is reduced in Alzheimer’s disease, argyrophilic grain disease, Parkinson’s disease, and dementia with Lewy bodies. J Alzheimers Dis. 2011;23:537–50.

    Article  PubMed  CAS  Google Scholar 

  13. Shah RC, Buchman AS, Wilson RS, Leurgans SE, Bennett DA. Hemoglobin level in older persons and incident Alzheimer disease prospective cohort analysis. Neurology. 2011;77:219–26.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  14. Shah RC, Schneider JA, Leurgans S, Bennett DA. Association of lower hemoglobin level and neuropathology in community-dwelling older persons. J Alzheimers Dis. 2012;32:579–86.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  15. UK Biobank Data Catalogue [Internet]. Available from:

  16. Lyall DM, Cullen B, Allerhand M, Smith DJ, Mackay D, Evans J, et al. Cognitive test scores in UK Biobank: data reduction in 480,416 participants and longitudinal stability in 20,346 participants. PLoS One. 2016;11:e0154222.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  17. Lovestone S, Francis P, Kloszewska I, Mecocci P, Simmons A, Soininen H, et al. AddNeuroMed–the European collaboration for the discovery of novel biomarkers for Alzheimer’s disease. Ann N Y Acad Sci. 2009;1180:36–46.

    Article  PubMed  CAS  Google Scholar 

  18. Simmons A, Westman E, Muehlboeck S, Mecocci P, Vellas B, Tsolaki M, et al. MRI measures of Alzheimer’s disease and the AddNeuroMed study. Ann N Y Acad Sci. 2009;1180:47–55.

    Article  PubMed  Google Scholar 

  19. Lunnon K, Sattlecker M, Furney SJ, Coppola G, Simmons A, Proitsi P, et al. A blood gene expression marker of early Alzheimer’s disease. J Alzheimers Dis. 2013;33:737–53.

    Article  PubMed  CAS  Google Scholar 

  20. Nevado-Holgado AJ, Kim C-H, Winchester L, Gallacher J, Lovestone S. Commonly prescribed drugs associate with cognitive function: a cross-sectional study in UK biobank. BMJ Open. 2016;6:e012177.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Deary IJ, Johnson W. Intelligence and education: causal perceptions drive analytic processes and therefore conclusions. Int J Epidemiol. 2010;39:1362–9.

    Article  PubMed  Google Scholar 

  22. Sharp ES, The GM. Relationship between education and dementia. An Updated Systematic Review Alzheimer Dis Assoc Disord. 2011;25:289–304.

    Article  PubMed  Google Scholar 

  23. Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, et al. Genome-wide genetic data on ~500,000 UK Biobank participants. bioRxiv. 2017;166298.

  24. Chami N, Chen M-H, Slater AJ, Eicher JD, Evangelou E, Tajuddin SM, et al. Exome genotyping identifies pleiotropic variants associated with red blood cell traits. Am J Hum Genet. 2016;99:8–21.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  25. van der Harst P, Zhang W, Leach IM, Rendon A, Verweij N, Sehmi J, et al. Seventy-five genetic loci influencing the human red blood cell. Nature. 2012;492:369–75.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  26. Staley JR, Blackshaw J, Kamat MA, Ellis S, Surendran P, Sun BB, et al. PhenoScanner: a database of human genotype–phenotype associations. Bioinformatics. 2016;32:3207–9.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  27. Marchini J, Howie B, Myers S, McVean G, Donnelly P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet. 2007;39:906–13.

    Article  PubMed  CAS  Google Scholar 

  28. Yavorska OO, Burgess S. MendelianRandomization: an R package for performing Mendelian randomization analyses using summarized data. Int J Epidemiol. 2017;46:1734–9.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Angrist JD, Imbens GW. Two-stage least squares estimation of average causal effects in models with variable treatment intensity. J Am Stat Assoc. 1995;90:431–42.

    Article  Google Scholar 

  30. Rosenblad A. Introduction to meta-analysis by Michael Borenstein, Larry V. Hedges, Julian P.T. Higgins, Hannah R. Rothstein. Int Stat Rev. 2009;77:478–9.

    Article  Google Scholar 

  31. Bowden J, Smith GD, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol. 2015;44:512–25.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Hemani G, Zheng J, Wade KH, Laurin C, Elsworth B, Burgess S, et al. MR-Base: a platform for systematic causal inference across the phenome using billions of genetic associations bioRxiv. 2016;078972.

  33. Nevado-Holgado AJ, Lovestone S. Determining the molecular pathways underlying the protective effect of non-steroidal anti-inflammatory drugs for Alzheimer’s disease: a bioinformatics approach. Comput Struct Biotechnol J. 2017;15:1–7.

    Article  PubMed  CAS  Google Scholar 

  34. Benn M, Nordestgaard BG, Frikke-Schmidt R, Tybjærg-Hansen A. Low LDL cholesterol, PCSK9 and HMGCR genetic variation, and risk of Alzheimer’s disease and Parkinson’s disease: Mendelian randomisation study. BMJ. 2017;357:j1648.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Østergaard SD, Mukherjee S, Sharp SJ, Proitsi P, Lotta LA, Day F, et al. Associations between potentially modifiable risk factors and Alzheimer disease: a Mendelian randomization study. PLoS Med. 2015;12:e1001841.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  36. Goddard AF, James MW, McIntyre AS, Scott BB. Guidelines for the management of iron deficiency anaemia. Gut. 2011;60:1309–16.

    Article  PubMed  CAS  Google Scholar 

  37. Pichler I, M FDG, Gögele M, Lill CM, Bertram L, Do CB, et al. Serum iron levels and the risk of Parkinson disease: a Mendelian randomization study. PLoS Med. 2013;10:e1001462.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  38. Lupton MK, Benyamin B, Proitsi P, Nyholt DR, Ferreira MA, Montgomery GW, et al. No genetic overlap between circulating Iron levels and Alzheimer’s disease. J Alzheimers Dis. 2017;59:85–99.

    Article  PubMed  CAS  Google Scholar 

  39. Andrews NC. Genes determining blood cell traits. Nat Genet. 2009;41:1161–2.

    Article  PubMed  CAS  Google Scholar 

  40. Duce JA, Tsatsanis A, Cater MA, James SA, Robb E, Wikhe K, et al. Iron-export ferroxidase activity of β-amyloid precursor protein is inhibited by zinc in Alzheimer’s disease. Cell. 2010;142:857–67.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  41. Telling ND, Everett J, Collingwood JF, Dobson J, van der Laan G, Gallagher JJ, et al. Iron biochemistry is correlated with amyloid plaque morphology in an established mouse model of Alzheimer’s disease. Cell Chem Biol. 2017;24:1205–1215.e3.

    Article  PubMed  CAS  Google Scholar 

  42. Miller KL, Alfaro-Almagro F, Bangerter NK, Thomas DL, Yacoub E, Xu J, et al. Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nat Neurosci. 2016;19:1523–36.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  43. Elliott L, Sharp K, Alfaro-Almagro F, Douaud G, Miller K, Marchini J, et al. The genetic basis of human Brain Struct Funct: 1,262 genome-wide associations found from 3,144 GWAS of multimodal brain imaging phenotypes from 9,707 UK Biobank participants. bioRxiv. 2017;178806.

  44. Astle WJ, Elding H, Jiang T, Allen D, Ruklisa D, Mann AL, et al. The allelic landscape of human blood cell trait variation and links to common complex disease. Cell. 2016;167:1415–29. e19

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  45. Lunnon K, Ibrahim Z, Proitsi P, Lourdusamy A, Newhouse S, Sattlecker M, et al. Mitochondrial dysfunction and immune activation are detectable in early Alzheimer’s disease blood. J Alzheimers Dis. 2012;30:685–710.

    Article  PubMed  CAS  Google Scholar 

Download references


The authors would like to thank Chi-Hun Kim for his assistance with data acquisition and discussions and Danielle Newby for helpful discussion and comment on the manuscript. This study was conducted using the UK Biobank resource, and the authors would like to thank the UK Biobank participants.


We received funding for this analysis through the Medical Research Council Dementias Platform UK (MR/L015382/1) and the European Union’s Horizon 2020 research and innovation programme 2014–2020 under Grant Agreement No. 634143 to the MedBioInformatics programme.

Availability of data and materials

The data used in this study are available via a direct application to UK Biobank ( Data is open access, after a registration process and approval from the access committee to confirm reasonable request. AddNeuroMed data is available for download from GSE63063.

Author information

Authors and Affiliations



AJN-H, LW and SL contributed to study design. LW conducted data processing and statistical analyses. LW, SL, JP and AJN-H contributed to the writing of the paper. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Laura M. Winchester.

Ethics declarations

Ethics approval and consent to participate

This research has been conducted using the UK Biobank Resource under Application Number 15181. UK Biobank has received ethics approval from the Research Ethics Committee (ref. 11/NW/0382). Full details for the AddNeuroMed study are available in the primary publications [17, 18]. The study conformed to the Declaration of Helsinki.

Consent for publication

Not applicable

Competing interests

The authors declare they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Table S1. Sample set description. Table S2. MR replication in release 2 of the UK Biobank data. Table S3. MR results for RET trait. Table S4. General linear model results show an association between AD patients and red blood indices in UK Biobank. Table S5. Significantly enriched KEGG pathways for MCH decline correlated genes. (XLSX 18 kb)

Additional file 2:

Figure S1. Anaemia has a significant effect on four cognitive test measures. Figure S2. Two sample MR results replicating the direction of effect for MCH on the verbal–numeric reasoning outcome. Figure S3. Rate of change as a measure of red blood cell count. (PDF 371 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Winchester, L.M., Powell, J., Lovestone, S. et al. Red blood cell indices and anaemia as causative factors for cognitive function deficits and for Alzheimer’s disease. Genome Med 10, 51 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: