Skip to main content

Prediction models for dementia and neuropathology in the oldest old: the Vantaa 85+ cohort study



We developed multifactorial models for predicting incident dementia and brain pathology in the oldest old using the Vantaa 85+ cohort.


We included participants without dementia at baseline and at least 2 years of follow-up (N = 245) for dementia prediction or with autopsy data (N = 163) for pathology. A supervised machine learning method was used for model development, considering sociodemographic, cognitive, clinical, vascular, and lifestyle factors, as well as APOE genotype. Neuropathological assessments included β-amyloid, neurofibrillary tangles and neuritic plaques, cerebral amyloid angiopathy (CAA), macro- and microscopic infarcts, α-synuclein pathology, hippocampal sclerosis, and TDP-43.


Prediction model performance was evaluated using AUC for 10 × 10-fold cross-validation. Overall AUCs were 0.73 for dementia, 0.64–0.68 for Alzheimer’s disease (AD)- or amyloid-related pathologies, 0.72 for macroinfarcts, and 0.61 for microinfarcts. Predictors for dementia were different from those in previous reports of younger populations; for example, age, sex, and vascular and lifestyle factors were not predictive. Predictors for dementia versus pathology were also different, because cognition and education predicted dementia but not AD- or amyloid-related pathologies. APOE genotype was most consistently present across all models. APOE alleles had a different impact: ε4 did not predict dementia, but it did predict all AD- or amyloid-related pathologies; ε2 predicted dementia, but it was protective against amyloid and neuropathological AD; and ε3ε3 was protective against dementia, neurofibrillary tangles, and CAA. Very few other factors were predictive of pathology.


Differences between predictors for dementia in younger old versus oldest old populations, as well as for dementia versus pathology, should be considered more carefully in future studies.


The oldest old constitute the largest and fastest growing population with dementia [1], but they are less often the focus of dementia prevention studies. Cohort studies with participants aged 85+ years [2,3,4,5,6,7] have investigated individual risk factors in association with dementia, but the predictive value of more complex multifactorial risk profiles in the oldest old is still unclear. Several dementia risk scores have been developed in younger populations, but they tend to perform poorly for predicting dementia in the oldest old age groups [8, 9]. The association of vascular and lifestyle-related factors with dementia risk, for example, has been shown to vary with age [10], and risk profiles predictive of subsequent dementia can differ between midlife and older age [9].

While most multifactorial prediction models or risk scores have focused on dementia, less is known about longitudinal prediction of neuropathology in people without dementia. In the oldest old, multiple dementia-related pathologies are common [11], but the association with a dementia diagnosis may be less straightforward than in younger age groups [10]. In this context, it becomes particularly important to investigate potential differences between predictors for dementia and for specific types of neuropathologies.

The main aims of the present study based on the Vantaa 85+ cohort are to develop multifactorial models for (1) predicting incident dementia in the oldest old, considering sociodemographic, cognitive, clinical, lifestyle, and apolipoprotein E (APOE) genotype data; and (2) predicting dementia-related neuropathologies at death in the oldest old, including Alzheimer’s disease (AD)-related pathology (amyloid plaques and neurofibrillary tangles), cerebral amyloid angiopathy (CAA), cerebral macro- and microinfarcts, and Lewy body pathology (α-synuclein).


Study population

The Vantaa 85+ study has been described in detail previously [4, 12]. In brief, the study focused on residents in the City of Vantaa in southern Finland who were at least 85 years old in 1991. Of the 601 people invited to participate, 11 refused, 1 could not be reached, and 1 died, leaving 588 (98%) participants who gave informed consent to participate in the study. Additionally, 35 people died before the baseline clinical examination, which was done for 553 participants. At baseline, 214 participants were diagnosed with dementia, and 339 did not have dementia. Clinical reexaminations were conducted in 1994, 1996, 1999, and 2001. At the time of death, additionally 101 participants had been diagnosed with dementia. Postmortem examination was conducted for 288 participants who attended the baseline clinical examination and 16 who had died before baseline. The Vantaa 85+ study was approved by the ethics committee of the Health Centre of the City of Vantaa. Written consent for the autopsies was given by the nearest relatives of the deceased.

To reduce the effects of mortality, of the 339 without dementia at baseline, 94 participants who died within the first 2 years of follow-up were excluded from the dementia prediction model. This eliminated significant time-to-death differences between individuals who died with and without dementia. Of the 245 remaining participants without baseline dementia and who were included in the model development (Fig. 1), 97 subsequently developed dementia.

Fig. 1
figure 1

Study design flowchart

The study population used for neuropathological prediction model development included 163 participants who attended the baseline examination, did not have dementia at baseline, and had available autopsy data. Participants with baseline dementia were excluded to enable comparison with the dementia prediction model.

Assessment of factors included in prediction models

Factors included in prediction models were assessed at the baseline clinical evaluation, when participants were examined by a physician and interviewed on their health, health-related behavior, and medication by a trained nurse. Medical history was additionally verified using primary health care records. Sociodemographic factors included age, sex, years of formal education, and social class [13]. Cognition was assessed with the Mini Mental State Examination (MMSE) [14] and the Short Portable Mental Status Questionnaire (SPMSQ; recorded number of errors) [15]. Participants self-reported subjective memory decline on a scale of no/a little/yes. Functional abilities were evaluated using activities of daily living (ADL) and the Instrumental Activities of Daily Living Scale (IADL) [16, 17]. Competence in daily activities was also assessed in the interview with a single question with six answer options ranging from “independent” (1 point) to “needs help in all activities” (6 points). The Zung Self-Rating Depression Scale was administered to assess depressive symptoms [18]. Comorbidities included in prediction models were diabetes, cardiovascular conditions (angina pectoris, heart infarction, atrial fibrillation, heart failure, arteriosclerosis obliterans, or hypertension) and cerebrovascular conditions (stroke or transient ischemic attack). Other vascular and lifestyle-related factors were systolic and diastolic blood pressure, body mass index (BMI), alcohol use, and smoking.

Total cholesterol as well as high-density lipoprotein (HDL) and low-density lipoprotein (LDL) cholesterol were quantified from baseline blood samples using enzymatic methods [4]. APOE genotyping was done with a combination of DNA minisequencing [19] and DNA amplification through PCR followed by restriction enzyme digestion with HhaI [20].

Dementia diagnosis

Dementia was diagnosed according to the revised criteria of the Diagnostic and Statistical Manual of Mental Disorders, Third Edition [21]. AD and vascular dementia were diagnosed using the National Institute of Neurological and Communicative Disorders–Alzheimer’s Disease and Related Disorders Association [22] and National Institute of Neurological Disorders and Stroke–Association Internationale pour la Recherche en l’Enseignement en Neurosciences [23] criteria. Diagnosis was based on a broad range of information, including interviews, health examinations, cognitive and functional assessments, and health and social work records (e.g., information on home services or other social care services provided to participants based on diminished functional or cognitive capacity). Diagnoses were made by consensus of two neurologists.

Incident dementia cases were identified from medical and social work records, as well as from the information collected at the study follow-up visits using examinations and interviews with participants and their relatives or caregivers [24]. Although clinicians were not blinded to cognitive/functional assessments during these visits (e.g., MMSE, SPMSQ, ADL, IADL), diagnoses relied primarily on overall clinical judgment based on all available information.

Neuropathological assessment

Neuropathological assessments have been described in detail previously [12, 24,25,26,27,28]. In brief, brains obtained at autopsy were fixed in phosphate-buffered 4% formaldehyde for at least 2 weeks and examined independently of clinical data. For AD-related pathology, the Consortium to Establish a Registry for Alzheimer’s disease (CERAD) protocol was followed [29]. Methenamine silver staining was used for β-amyloid [30], and the modified Bielschowsky method was used for neurofibrillary tangles and neuritic plaques [31]. As described previously, β-amyloid load was determined as the average fraction of cortical area covered by methenamine silver-stained plaques in four neocortical samples [12]. The average number of neurofibrillary tangles was also determined in the four samples [25]. The CERAD scores and Braak stages were defined as originally described [29, 32]. CAA was analyzed in six brain regions (frontal, parietal, temporal and occipital lobes, hippocampus, and cerebellum) based on Congo red staining and confirmed using IHC against β-amyloid peptide [26]. Macroscopic infarcts (cavitary lesions or solid cerebral infarcts visible to the naked eye) were identified from sliced cerebral hemispheres, brainstem, and cerebellum. All lesions were histologically ascertained to be infarcts. Cortical microinfarcts were analyzed in the H&E-stained tissue sections in the same six brain regions as CAA [26]. They were focal lesions smaller than 2 mm invisible to the naked eye with neuronal loss, glial cell and macrophage reaction, and/or cystic tissue necrosis. Sections of substantia nigra stained with the H&E method and sections of substantia nigra and hippocampus stained with antibodies against α-synuclein were used to screen for Lewy body-related pathology [27].

Hippocampal sclerosis (HS) and TDP-43 (transactive response binding protein 43) immunopositivity in the granular cell layer of the hippocampus were assessed as previously described [28]. In summary, HS and hemispheric symmetry/asymmetry were determined on H&E staining by estimating the severity of neuronal loss. The density of neurons was assessed semiquantitatively by three observers. For TDP-43 immunostaining, right-sided hippocampus tissue blocks were cut into 4-μm-thick sections and stained with the Lab Vision immunostainer (Thermo Fisher Scientific, Waltham, MA, USA), and polymer-based kits were used for detection.

Disease State Index

The Disease State Index (DSI) is a supervised machine learning method designed for predicting disease outcomes and differential diagnostics as a clinical decision-making tool. A detailed description has been published previously [33]. Compared with traditional methods for developing dementia risk scores, DSI is able to deal with larger amounts of heterogeneous data, to handle missing data, and to use unprocessed data without prespecified cutoffs for predictors. Conceptually related factors are structured into groups, such as combining all cognitive tests. This is useful for filtering noise and preventing strongly correlated factors from being multiplied. DSI thus provides detailed information about predictive performance on multiple levels simultaneously: the independent performance of each factor, the combined performance of a group of similar factors, and the overall performance of the entire model. DSI has accuracy comparable to that of methods such as logistic regression, support vector machines, and Bayes inference [33] and has previously been used for predicting dementia [34], progression of mild cognitive impairment [33, 35, 36], and differential diagnosis of neurodegenerative diseases [37].

DSI builds a model from the distributions of data using a population with known outcomes. For tested individuals, DSI gives index values ranging from 0 to 1, describing similarity of the data to the distributions in the model. A value close to 0 indicates that the data are similar to controls (no subsequent dementia or pathology), while a value close to 1 shows similarity to cases (subsequent dementia or pathology). The data used for the predictions can be dichotomous, continuous, or categorical.

First, a fitness function is calculated for each factor. Fitness function f(x) is the share of false-negative errors divided by the sum of false-negative and false-positive errors, using measurement value x as a threshold for classification. It goes through the distribution using each point as a classification threshold to evaluate the shares of false-negatives and false-positives, assigning 0 to values unique to controls and 1 to values unique to cases.

To complement fitness, a relevance value is calculated for each measure. The relevance value ranges from 0 to 1 and indicates the ability, based on the data, to differentiate between cases and controls in general. Relevance is defined as the sum of sensitivity and specificity minus 1, also known as the Youden index. Two data distributions that are completely overlapping will receive a relevance of 0, while two distributions with no overlap will get a relevance of 1.

Conceptually related factors are structured into groups to combine the effect of possibly correlating factors to a single predictor. Individual factors are combined into a group DSI value through a weighted average based on their relevance values. This process is then repeated recursively for all groups to obtain a total DSI value. Any missing values are ignored as part of the model, and the total score is calculated only from the available values.

Data analysis

We built DSI models for predicting dementia and the different neuropathologies. AUCs with 95% CIs for a 10 × 10-fold cross-validation were calculated to evaluate model performance. The dataset was divided into ten random subgroups, where nine were combined to form the training group and one acted as the test group. This process was completed for each subgroup, and the cross-validation itself was repeated ten times. Thus, we show mean AUCs and 95% CIs resulting from the 10 × 10 cross-validation process. Factor selection was conducted before the model building; that is, only factors that were significantly different (p < 0.05) between the groups with and without the outcome of interest were included in the final models. The initial list of factor groups and individual factors included sociodemographics (age, sex, education, social class), cognition (MMSE total score, MMSE orientation, MMSE word list - sum of registration and recall tasks, MMSE calculation, MMSE other tasks, and SPMSQ), functioning (sum of ADL and IADL, competence in daily activities question, and subjective memory decline), APOE genotype (binary variables: ε2 carrier versus noncarrier, ε4 carrier versus noncarrier, genotype ε3ε3 versus others; and a categorical variable: all genotypes [ε2ε2, ε2ε3, ε2ε4, ε3ε3, ε3ε4, or ε4ε4]), comorbidities (cardiovascular, cerebrovascular, and diabetes), cholesterol (total, LDL, and HDL), blood pressure (systolic and diastolic), lifestyle (BMI, alcohol use, and smoking), and depressive symptoms (Zung Self-Rating Depression Scale).

The following neuropathological outcomes were dichotomized as present versus absent: β-amyloid load (average fraction of cortical area covered by methenamine silver-stained plaques > 0), tangle count (average number of neurofibrillary tangles > 0), CAA (average percentage of blood vessels with CAA > 0), cerebral macroinfarcts (total number > 0), microinfarcts (number > 0), α-synuclein (brainstem, limbic, or diffuse neocortical pathology present versus absent), HS (severe marked/total loss versus no/minor loss of pyramidal neurons in the CA1 and subiculum), and TDP-43 (immunopositivity in the granular cell layer present versus absent). Neuropathological AD was defined on the basis of National Institute on Aging–Alzheimer’s Association criteria [38] using the combination of Braak and CERAD scores, and it was dichotomized as present (intermediate or high likelihood of AD) versus absent (low likelihood of AD).

To investigate relationships between neuropathological variables and how they predict dementia, we conducted principal component analysis (PCA) in 159 participants without dementia at baseline and who had complete neuropathological data. The pca function in MATLAB R2015b (MathWorks, Natick, MA, USA) was used. The pathology variables were centered but not weighted by variance. The principal component (PC) scores of each participant were used as predictors for dementia, and predictive performance for each PC was assessed using AUC values.


Predicting dementia

Characteristics for the dementia prediction population are shown in Table 1. The population consisted of 245 participants without dementia at baseline who were alive for at least 2 more years. Mean follow-up was 5.6 years, and 97 (40%) of the participants developed dementia before death. Education, performance in MMSE, SPMSQ, and competence in daily activities were significantly lower in people with subsequent dementia. Differences were also detected for APOE genotype. Other baseline characteristics were not significantly different between groups (Table 1) and were excluded from the prediction model.

Table 1 Baseline characteristics of the dementia prediction population

Cross-validation results for the DSI model for predicting dementia development are shown in Table 2. AUC was 0.73 for the entire model. According to AUC values for the groups of predictors, cognition including SPMSQ and both MMSE total score and its four subcategories (orientation, calculation, word list, and other tasks), were the most important predictors of dementia, followed by functioning (competence in daily activities), sociodemographics (education), and APOE status. APOEε2 carrier status predicted dementia development before death, while the ε3ε3 genotype was protective against dementia development, although AUCs were relatively low. The impact of APOEε2 and other major predictors was similar in further analyses considering clinical diagnosis of AD and vascular dementia separately (results not shown).

Table 2 Dementia prediction model

Predicting pathology

For predicting pathology, we included the 163 participants with no dementia at baseline and available autopsy data. This population had a mean age of 88.7 years, a follow-up time of 4.1 years, and 4.3 years of education. Thirty-one (19%) of these participants were male, 33 (21%) of them were APOEε4 carriers, and 26 (17%) were ε2 carriers. Fifty-nine (36%) had dementia at death.

Cross-validation results (AUCs) of the DSI pathology prediction models are shown in Table 3. Sensitivities and specificities are shown in Additional file 1: Table S1. The total AUCs for AD- or amyloid-related pathologies were 0.66 for amyloid load, 0.64 for tangle count, 0.68 for neuropathological AD, and 0.66 for CAA. APOE genotype had the highest AUCs for all these pathologies, but there were differences in the impact of different alleles. APOEε4 carrier status was predictive for all four pathology outcomes, while APOEε2 carrier status was protective against β-amyloid load and neuropathological AD. The ε3ε3 genotype was protective against tangle count and CAA, but it was not related to amyloid load or neuropathological AD.

Table 3 Neuropathology prediction models

Very few other factors had predictive value (Table 3): poorer competence in daily activities for β-amyloid load, higher total and LDL cholesterol, and subjective memory decline for tangle count; lower social class and subjective memory decline for neuropathological AD; and absence of cardiovascular comorbidity and male sex for CAA.

The model for cerebral macroinfarcts had the best predictive performance with total AUC of 0.72 (Table 3). The predictors in descending order of AUC values were history of cerebrovascular conditions, poorer MMSE score (total and word list learning and recall tasks), higher BMI, and poorer competence in daily activities. We also modeled the two most common subtypes of cerebral macroinfarcts: cortical and white matter. Cortical macroinfarcts (AUC of 0.71) were predicted by cerebrovascular comorbidity and APOE genotype. APOEε4 carriers were more likely to develop cortical macroinfarcts, while genotype ε3ε3 was protective. White matter macroinfarcts (AUC of 0.76) were predicted by cholesterol, cerebrovascular comorbidity, and APOEε2 carrier status. The AUC for the cerebral microinfarcts model was 0.61, with education as the only predictor.

HS (AUC of 0.78) was predicted by cognition (MMSE total score, word list learning and recall, and other tasks). Current smokers were also more likely to have HS (Table 3). TDP-43 was only predicted by less pronounced depressive symptoms (AUC of 0.69). There were no significant predictors found for α-synuclein.

Overall, APOE genotype was the predictor that emerged most consistently across all models. The impact of APOE on dementia versus pathology is summarized in Fig. 2.

Fig. 2
figure 2

Impact of apolipoprotein E (APOE) genotype on dementia versus neuropathology. Red indicates alleles that predicted dementia/pathology. Blue indicates alleles that were protective against dementia/pathology. White indicates alleles with no significant impact

Associations between pathology and dementia at death in participants without dementia at baseline are shown in Additional file 1: Table S2, and results of the PCA analysis are provided in Additional file 1: Table S3. The first three components of PCA explained 56% of the variance in autopsy findings and reflected three mostly independent pathological processes: AD-/amyloid-related pathology, including β-amyloid, neurofibrillary tangles, and CAA (PC1); vascular-type pathology, including primarily macroscopic infarcts (PC2); and Lewy body-type pathology, including α-synuclein (PC3). PC1 was most predictive of dementia (AUC of 0.71), followed by PC2 (AUC of 0.60). The other PCs did not predict dementia.


Predicting dementia in the oldest old

The DSI model performance for predicting dementia onset before death, on average 6 years later, in people aged 85+ years was close to the 10-year DSI dementia prediction model in a younger-old population [34] and was in the upper range of reported performance for previous dementia risk scores in younger populations [9]. Similarly to other dementia risk scores [9, 34], cognition was the main predictor, followed by functioning and education levels. However, there were several important differences compared with younger populations. Age, sex, and vascular and lifestyle factors were not predictive of dementia in the present study, although they are usually important predictors in midlife. The age range for 85+ populations is inherently smaller than for younger cohorts, potentially limiting the predictive value of age. Individuals who survive to the age of 85 years without dementia are also a selected group. While mechanisms are not fully clear, associations of vascular and lifestyle factors with dementia have been reported to differ in midlife versus late life [10].

APOE genotype was related to incident dementia, but the pattern was different from that in younger populations, where the ε4 allele increases dementia risk, while the ε2 allele seems protective ( In the present study, APOEε4 carrier status was not important for dementia prediction, in line with previous findings in the oldest old [39, 40]. The ε3ε3 genotype was protective, while the ε2 allele was predictive of subsequent dementia. Compared with younger populations, a lower proportion of ε4 carriers and a higher proportion of ε2 carriers have been reported in the oldest old [10, 40], including the Vantaa 85+ cohort [41]. Three previous population-based studies with shorter follow-up than Vantaa 85+ reported no protective effect of the ε2 allele against incident dementia after the age of 85 years [40, 42, 43]. Additionally, the ε2 allele increased the risk of incident vascular dementia in one study [42]. Previous reports on lower risk of dementia among the oldest old APOEε2 carriers have come from cross-sectional studies of dementia prevalence at death [44], and this may not necessarily apply longitudinally to dementia incidence after the age of 85 years.

Predicting dementia versus predicting neuropathology

The APOE genotype consistently predicted AD- or amyloid-related pathologies at death on average 6 years later, but with a different pattern than for incident dementia. The ε4 allele predicted all these pathologies. ε3ε3 genotype was protective against neurofibrillary tangles and CAA. The ε2 allele was protective against β-amyloid load and neuropathological AD. This pattern is closer to findings derived from younger-old populations, where the ε4 allele increases the risk and ε2 allele decreases the risk of subsequent AD-related pathology [45]. A conflicting finding was reported in the 90+ Study, where ε2 carriers had increased CERAD scores in cross-sectional analyses at death [44]. However, further analyses showed lower cortical β-amyloid percentage areas in ε2 carriers [46].

While APOE was not related to cerebral macroinfarcts in general in the present study, the ε4 allele predicted cortical macroinfarcts, and the ε2 allele predicted white matter macroinfarcts. A meta-analysis of studies in younger populations has linked both the ε4 and ε2 alleles to increasing burden in magnetic resonance imaging markers of cerebrovascular disease, including white matter hyperintensities [47]. However, longitudinal associations of APOE genotype with subsequent cerebrovascular lesions in the oldest old are still unclear. In the Vantaa 85+ study, while white matter infarcts alone were not significantly related to dementia diagnosis at death, they may suggest a potential explanation for the predictive effect of APOEε2 on incident dementia.

Very few other factors besides APOE predicted neuropathology. Vascular and lifestyle factors did not predict β-amyloid load or neuropathological AD. It is still debated whether vascular and lifestyle risk factors for dementia are actually related to amyloid pathology and whether such relationships may be age-dependent. Our finding that higher LDL and total cholesterol predicted tangle count needs to be verified in other 85+ cohorts.

Cognitive performance was not predictive of AD- or amyloid-related pathologies, although it predicted dementia, cerebral macroinfarcts, and HS. Of the included sociodemographic factors, only lower social class predicted neuropathological AD.

Predictors for HS and TDP-43 pathology are still unclear. While current smoking was related to HS and less pronounced depressive symptoms were related to TDP-43, the number of participants with these pathologies was very small in this study, and these findings require verification in other cohorts.

Overall, predictive performance of the models (AUC, sensitivity, specificity) was not very high. While study-specific limitations may have contributed to this, it is also possible that neither incident dementia nor specific neuropathologies can be predicted with very high accuracy in the oldest old using predictors commonly emphasized in younger-old populations. This is also suggested by the failure of previous dementia risk scores when extrapolated from younger-old to oldest-old populations. Different approaches may be needed that better account for the heterogeneity and multipathology often existing within the 85+ age group.

Strengths and limitations of the present study

The main strength of the present study is the prospective population-based design with a high autopsy rate over 10 years, the inclusion of participants aged > 85 years, and the multicomponent longitudinal prediction models for both dementia and specific neuropathologies. However, the developed prediction models are applicable only to a highly selected group of individuals who survive to the age of 85 years without developing dementia. External validation in other oldest-old cohorts will also be needed. The Vantaa 85+ population may differ from populations that are currently 85+ years old (e.g., for relatively low education). Health-related measures prior to the age of 85 years were not available. Sample size may have limited statistical power, especially for pathological outcomes with smaller numbers of participants.

Participants with autopsy were more likely to have incident dementia and lower MMSE at baseline than those without autopsy, which may have affected the pathology models. Participants with dementia at baseline were excluded from pathology models, but owing to their old age, some pathology may have been present at baseline. Quantitative, systematic methods were used for neuropathological assessments, but findings based on traditional silver-staining methods may be somewhat different from IHC methods for AD pathology. Other pathologies, such as aging-related tau astrogliopathy, could not be included owing to lack of data.


This is the first study combining longer-term dementia and neuropathology multicomponent prediction models among the oldest old. The dementia risk profile in this age group was very different from risk profiles previously described at younger ages. Predictors of dementia did not necessarily predict pathology. APOE genotype was the most consistent predictor across all models, but with different impact for different alleles.

The predictive models in the present study were developed for early identification of individuals with elevated risk of subsequent dementia. Longitudinal prediction models in the oldest old are more complex than in younger-old populations, and multifactorial risk profiles including both genetic and nongenetic factors need to be further investigated.



Alzheimer’s disease


Activities of daily living


Apolipoprotein E


Body mass index


Cerebral amyloid angiopathy


Consortium to Establish a Registry for Alzheimer’s Disease


Disease State Index


High-density lipoprotein


Hippocampal sclerosis


Instrumental Activities of Daily Living Scale


Low-density lipoprotein


Mini Mental State Examination


Principal component analysis


Short Portable Mental Status Questionnaire


Transactive response binding protein 43


  1. Prince M, Bryce R, Albanese E, Wimo A, Ribeiro W, Ferri CP. The global prevalence of dementia: a systematic review and metaanalysis. Alzheimers Dement. 2013;9(1):63–75.

    Article  PubMed  Google Scholar 

  2. Corrada MM, Brookmeyer R, Berlau D, Paganini-Hill A, Kawas CH. Prevalence of dementia after age 90: results from the 90+ study. Neurology. 2008;71(5):337–43.

    Article  CAS  PubMed  Google Scholar 

  3. Heeren TJ, Lagaay AM, Hijmans W, Rooymans HG. Prevalence of dementia in the ‘oldest old’ of a Dutch community. J Am Geriatr Soc. 1991;39(8):755–9.

    Article  CAS  PubMed  Google Scholar 

  4. Rastas S, Pirttilä T, Mattila K, et al. Vascular risk factors and dementia in the general population aged >85 years: prospective population-based study. Neurobiol Aging. 2010;31(1):1–7.

    Article  PubMed  Google Scholar 

  5. von Strauss E, Viitanen M, De Ronchi D, Winblad B, Fratiglioni L. Aging and the occurrence of dementia: findings from a population-based cohort with a large sample of nonagenarians. Arch Neurol. 1999;56(5):587–92.

    Article  Google Scholar 

  6. Tschanz JT, Treiber K, Norton MC, et al. A population study of Alzheimer’s disease: findings from the Cache County Study on Memory, Health, and Aging. Care Manag J. 2005;6(2):107–14.

    Article  PubMed  Google Scholar 

  7. Ebly EM, Parhad IM, Hogan DB, Fung TS. Prevalence and types of dementia in the very old: results from the Canadian Study of Health and Aging. Neurology. 1994;44(9):1593–600.

    Article  CAS  PubMed  Google Scholar 

  8. SJB V, MPJ VB, OJG S, et al. Modifiable risk factors for prevention of dementia in midlife, late life and the oldest-old: validation of the LIBRA index. J Alzheimers Dis. 2017;58(2):537–47.

    Article  Google Scholar 

  9. EYH T, Harrison SL, Errington L, et al. Current developments in dementia risk prediction modelling: an updated systematic review. PLoS One. 2015;10(9):e0136181.

    Article  Google Scholar 

  10. Gardner RC, Valcour V, Yaffe K. Dementia in the oldest old: a multi-factorial and growing public health issue. Alzheimers Res Ther. 2013;5(4):27.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Kawas CH, Kim RC, Sonnen JA, Bullain SS, Trieu T, Corrada MM. Multiple pathologies are common and related to dementia in the oldest-old: the 90+ Study. Neurology. 2015;85(6):535–42.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Polvikoski T, Sulkava R, Haltia M, et al. Apolipoprotein E, dementia, and cortical deposition of beta-amyloid protein. N Engl J Med. 1995;333(19):1242–7.

    Article  CAS  PubMed  Google Scholar 

  13. Rauhala U. The social stratification of Finnish society [in Finnish]. Doctoral thesis. Porvoo: Werner Söderström Osakeyhtiö; 1966.

    Google Scholar 

  14. Folstein MF, Folstein SE, McHugh PR. “Mini Mental State”: a practical method for grading the cognitive state of patients for clinician. J Psychiatr Res. 1975;12:189–98.

    Article  CAS  PubMed  Google Scholar 

  15. Pfeiffer E. A short portable mental status questionnaire for the assessment of organic brain deficit in elderly patients. J Am Geriatr Soc. 1975;23:433–41.

    Article  CAS  PubMed  Google Scholar 

  16. Katz S, Ford AB, Moscowitch RW, et al. Studies of illness in the aged. JAMA. 1963;185:914–9.

    Article  CAS  PubMed  Google Scholar 

  17. Lawton MP, Brody EM. Assessment of older people: self maintaining and instrumental activities of daily living. Gerontologist. 1969;9:179–86.

    Article  CAS  PubMed  Google Scholar 

  18. Zung WW, Richards CB, Short MJ. Self-rating depression scale in an outpatient clinic. Arch Gen Psychiatry. 1965;13:508–15.

    Article  CAS  PubMed  Google Scholar 

  19. Syvänen AC, Sajantila A, Lukka M. Identification of individuals by analysis of biallelic DNA markers, using PCR and solid-phase minisequencing. Am J Hum Genet. 1993;52:46–59.

    PubMed  PubMed Central  Google Scholar 

  20. Hixson JE, Vernier DTJ. Restriction isotyping of human apolipoprotein E by gene amplification and cleavage with HhaI. J Lipid Res. 1990;31:545–8.

    CAS  PubMed  Google Scholar 

  21. American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 3rd ed. Washington, DC: American Psychiatric Association; 1987.

    Google Scholar 

  22. McKhann G, Drachman D, Folstein M, Katzman R, Price D, Stadlan EM. Clinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s Disease. Neurology. 1984;34(7):939–44.

    Article  CAS  PubMed  Google Scholar 

  23. Román GC, Tatemichi TK, Erkinjuntti T, et al. Vascular dementia: diagnostic criteria for research studies: report of the NINDS-AIREN International Workshop. Neurology. 1993;43(2):250–60.

    Article  PubMed  Google Scholar 

  24. Ahtiluoto S, Polvikoski T, Peltonen M, et al. Diabetes, Alzheimer disease, and vascular dementia: a population-based neuropathologic study. Neurology. 2010;75(13):1195–202.

    Article  CAS  PubMed  Google Scholar 

  25. Myllykangas L, Polvikoski T, Sulkava R, Verkkoniemi A, Crook R, Tienari PJ, Pusa AK, Niinistö L, O’Brien P, Kontula K, Hardy J, Haltia M, Pérez-Tur J. Genetic association of α2-macroglobulin with Alzheimer’s disease in a Finnish elderly population. Ann Neurol. 1999;46(3):382–90.

    Article  CAS  PubMed  Google Scholar 

  26. Tanskanen M, Mäkelä M, Myllykangas L, Rastas S, Sulkava R, Paetau A. Intracerebral hemorrhage in the oldest old: a population-based study (Vantaa 85+). Front Neurol. 2012;3:103.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Oinas M, Polvikoski T, Sulkava R, et al. Neuropathologic findings of dementia with Lewy bodies (DLB) in a population-based Vantaa 85+ study. J Alzheimers Dis. 2009;18(3):677–89.

    Article  PubMed  Google Scholar 

  28. Kero M, Raunio A, Polvikoski T, Tienari PJ, Paetau A, Myllykangas L. Hippocampal sclerosis in the oldest old: a Finnish population-based study. J Alzheimers Dis. 2018;63(1):263–72.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Mirra SS, Heyman A, McKeel D, et al. The Consortium to Establish a Registry for Alzheimer’s Disease (CERAD): II. Standardization of the neuropathologic assessment of Alzheimer’s disease. Neurology. 1991;41:479–86.

    Article  CAS  PubMed  Google Scholar 

  30. Yamaguchi H, Haga C, Hirai S, Nakazato Y, Kosaka K. Distinctive, rapid, and easy labeling of diffuse plaques in the Alzheimer brains by a new methenamine silver stain. Acta Neuropathol. 1990;79:569–72.

    Article  CAS  PubMed  Google Scholar 

  31. Yamamoto T, Hirano A. A comparative study of modified Bielschowsky, Bodian and thioflavin S stains on Alzheimer’s neurofibrillary tangles. Neuropathol Appl Neurobiol. 1986;12:3–9.

    Article  CAS  PubMed  Google Scholar 

  32. Braak H, Braak E. Neuropathological stageing of Alzheimer-related changes. Acta Neuropathol (Berl). 1991;82:239–59.

    Article  CAS  Google Scholar 

  33. Mattila J, Koikkalainen J, Virkki A, et al. A disease state fingerprint for evaluation of Alzheimer’s disease. J Alzheimers Dis. 2011;27(1):163–76.

    Article  PubMed  Google Scholar 

  34. Pekkala T, Hall A, Lötjönen J, et al. Development of a late-life dementia prediction index with supervised machine learning in the population-based CAIDE study. J Alzheimers Dis. 2017;55(3):1055–67.

    Article  PubMed  Google Scholar 

  35. Hall A, Mattila J, Koikkalainen J, et al. Predicting progression from cognitive impairment to Alzheimer’s disease with the Disease State Index. Curr Alzheimer Res. 2015;12(1):69–79.

    Article  CAS  PubMed  Google Scholar 

  36. Hall A, Muñoz-Ruiz M, Mattila J, et al. Generalizability of the Disease State Index prediction model for identifying patients progressing from mild cognitive impairment to Alzheimer’s disease. J Alzheimers Dis. 2015;44(1):79–92.

    Article  PubMed  Google Scholar 

  37. Koikkalainen J, Rhodius-Meester H, Tolonen A, et al. Differential diagnosis of neurodegenerative diseases using structural MRI data. Neuroimage Clin. 2016;11:435–49.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Hyman BT, Phelps CH, Beach TG, et al. National Institute on Aging-Alzheimer’s Association guidelines for the neuropathologic assessment of Alzheimer’s disease. Alzheimers Dement. 2012;8(1):1–13.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Juva K, Verkkoniemi A, Viramo P, et al. APOEε4 does not predict mortality, cognitive decline, or dementia in the oldest old. Neurology. 2000;54(2):412–5.

    Article  CAS  PubMed  Google Scholar 

  40. Corrada MM, Paganini-Hill A, Berlau DJ, Kawas CH. Apolipoprotein E genotype, dementia, and mortality in the oldest old: the 90+ Study. Alzheimers Dement. 2013;9(1):12–8.

    Article  PubMed  Google Scholar 

  41. Sulkava R, Kainulainen K, Verkkoniemi A, et al. APOE alleles in Alzheimer’s disease and vascular dementia in a population aged 85+. Neurobiol Aging. 1996;17(3):373–6.

    Article  CAS  PubMed  Google Scholar 

  42. Skoog I, Hesse C, Aevarsson O, et al. A population study of apoE genotype at the age of 85: relation to dementia, cerebrovascular disease, and mortality. J Neurol Neurosurg Psychiatry. 1998;64(1):37–43.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Qiu C, Kivipelto M, Agüero-Torres H, Winblad B, Fratiglioni L. Risk and protective effects of the APOE gene towards Alzheimer’s disease in the Kungsholmen project: variation by age and sex. J Neurol Neurosurg Psychiatry. 2004;75(6):828–33.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Berlau DJ, Corrada MM, Head E, Kawas CH. APOEε2 is associated with intact cognition but increased Alzheimer pathology in the oldest old. Neurology. 2009;72(9):829–34.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Nicoll JA, Savva GM, Stewart J, Matthews FE, Brayne C, Ince P. Medical Research Council Cognitive Function and Ageing Study. Association between APOE genotype, neuropathology and dementia in the older population of England and Wales. Neuropathol Appl Neurobiol. 2011;37(3):285–94.

    Article  CAS  PubMed  Google Scholar 

  46. Berlau DJ, Corrada MM, Robinson JL, et al. Neocortical β-amyloid area is associated with dementia and APOE in the oldest-old. Alzheimers Dement. 2013;9(6):699–705.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Schilling S, DeStefano AL, Sachdev PS, et al. APOE genotype and MRI markers of cerebrovascular disease: systematic review and meta-analysis. Neurology. 2013;81(3):292–300.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references


Not applicable.


This study was funded by the European Union 7th Framework Program for research, technological development, and demonstration VPH-DARE@IT (601055); MIND-AD Academy of Finland 291803 and Swedish Research Council 529-2014-7503 (EU Joint Programme - Neurodegenerative Disease Research, JPND); strategic funding for UEF-BRAIN from the University of Eastern Finland; VTR funding from Kuopio University Hospital; the Academy of Finland (287490, 294061, 278457, 319318); Center for Innovative Medicine (CIMED) at Karolinska Institutet Sweden; Stiftelsen Stockholms sjukhem Sweden; the Knut and Alice Wallenberg Foundation (Sweden); Konung Gustaf V:s och Drottning Victorias Frimurarstiftelse Sweden; Alzheimerfonden Sweden; Swedish Research Council 2017-06105; and the Stockholm County Council (ALF 20150589, 20170304). The study was supported by UEF Bioinformatics computing infrastructure and HUS ERVA fund. The funding sources had no involvement in study design; in the collection, analysis, and interpretation of data; in the writing of the report; or in the decision to submit the article for publication.

Availability of data and materials

The datasets generated and/or analyzed during the current study are not publicly available, owing to ethics rules and legislation in Finland. For more information, please contact LM (

Author information

Authors and Affiliations



All authors took part in drafting or revising the manuscript for content. The study concept and design and the analysis and interpretation of data were done by AH, TPe, and AS. JL and JM provided the DSI software used for analysis. TPo, LM, MKe, MM, MO, AP, and MT took part in the acquisition of data. AH, TPe, MvG, and AS contributed to the statistical analysis. AH, JL, and AS took part in study supervision and coordination. MKi, HS, and AS obtained funding for the study. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Alina Solomon.

Ethics declarations

Ethics approval and consent to participate

The Vantaa 85+ study was approved by the ethics committee of the Health Centre of the City of Vantaa, and all patients provided informed consent. Written consent for the autopsies was given by the nearest relatives of the deceased.

Consent for publication

Not applicable.

Competing interests

JM and JL are shareholders and cofounders of Combinostics Ltd. Combinostics Ltd. owns the following intellectual property rights related to the paper: A method for inferring the state of a system (US7,840,510 B2, PCT/FI2007/050277) (JL) and State inference in a heterogeneous system (PCT/FI2010/050545. FI20125177) (JL and JM).

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional file

Additional file 1:

Table S1. Sensitivity and specificity of neuropathology prediction models. Table S2. Neuropathology characteristics by dementia status at death for participants without dementia at baseline. Table S3. The first three components of PCA for autopsy findings and prediction of dementia at death for participants with complete autopsy data and no dementia at baseline. (PDF 40 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hall, A., Pekkala, T., Polvikoski, T. et al. Prediction models for dementia and neuropathology in the oldest old: the Vantaa 85+ cohort study. Alz Res Therapy 11, 11 (2019).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: