Skip to main content

Enhanced diagnostic accuracy for neurocognitive disorders: a revised cut-off approach for the Montreal Cognitive Assessment



The Montreal Cognitive Assessment (MoCA) has good sensitivity for mild cognitive impairment, but specificity is low when the original cut-off (25/26) is used. We aim to revise the cut-off on the German MoCA for its use in clinical routine.


Data were analyzed from 496 Memory Clinic outpatients (447 individuals with a neurocognitive disorder; 49 with cognitive normal findings) and from 283 normal controls. Cut-offs were identified based on (a) Youden’s index and (b) the 10th percentile of the control group.


A cut-off of 23/24 on the MoCA had better correct classification rates than the MMSE and the original MoCA cut-off. Compared to the original MoCA cut-off, the cut-off of 23/24 points had higher specificity (92% vs 63%), but lower sensitivity (65% vs 86%). Introducing two separate cut-offs increased diagnostic accuracies with 92% specificity (23/24 points) and 91% sensitivity (26/27 points). Scores between these two cut-offs require further examinations.


Using two separate cut-offs for the MoCA combined with scores in an indecisive area enhances the accuracy of cognitive screening.


A steep increase in the prevalence of dementia is expected [1], associated with social, economic, and societal challenges. Early detection of dementia is crucial for an implementation of therapeutic strategies in the earliest disease stages [2], and reliable cognitive screening tools play an important role in this process of identifying individuals with cognitive impairment [3]. In the context of clinical research, accurate cognitive assessment tools are needed for an adequate selection of participants, since erroneous inclusion or exclusion of individuals may bias study findings [4, 5].

The Montreal Cognitive Assessment (MoCA) [6] has gained popularity for cognitive screening. It correlates well with extensive neuropsychological test batteries [7, 8] and covers most of the cognitive domains outlined in the Diagnostic and Statistical Manual, 5th Edition (DSM-5) [9]. However, while the initially proposed cut-off (25/26 points) [6] has shown good sensitivity for mild cognitive impairment (MCI) (i.e., ≥ 83%) [6, 10, 11], this cut-off was found to have a specificity of 66% or less in various different studies, implying a potentially unacceptably high number of false-positive classifications [7, 10,11,12,13]. Consequently, new cut-offs have been proposed for various patient populations and languages (see [13] for an overview). However, the psychometric properties of any screening test are not fixed characteristics, but depend on the clinical context [14], limiting the transferability of these cut-offs to other settings. Moreover, most previous authors defined “optimal cut-offs,” which aim at finding the best balance between sensitivity and specificity, as opposed to conventional cut-offs that are based on clinical standards (e.g., test performance 1–2 SD below the normative mean [9, 15]). Optimal cut-offs are likely to be sample-dependent and specific to the individual study [15, 16] and should therefore be validated in independent samples. Furthermore, in most validation studies, a rather homogenous patient sample was recruited (e.g., only patients with probable Alzheimer’s disease (AD) according to McKhann criteria [17], exclusion of patients with medical comorbidities), which does not reflect the clinical reality, where patient populations are typically heterogeneous, and medical comorbidities are frequent. In addition, excluding patients who are difficult to diagnose induces several forms of bias and may lead to an overestimation of diagnostic accuracy [18, 19]. Heterogeneous samples reflect the clinical reality more accurately as healthcare professionals face the challenge to identify truly impaired patients from a pool of individuals with a suspected neurocognitive disorder (NCD), irrespective of the underlying cause.

In the present study, we aim to address these limitations and therefore estimate the diagnostic accuracy of the original MoCA cut-off in a sample of consecutively referred Memory Clinic outpatients (MC sample). The objective of the study was to differentiate normal findings (NF; i.e., neurocognitive results were within normal limits) from patients with mild and major NCD (labeled Mild+Major NCD in the following) [9]. Since the MoCA was developed to identify individuals with MCI, subgroup analyses are performed for patients diagnosed with mild NCD (labeled Mild NCD in the following). Given the high rate of false-positive classifications that is associated with the original MoCA cut-off, we aimed at finding a new cut-off with higher specificity. In this context, we introduce an approach to determine a conventional cut-off solely based on a sample of cognitively healthy normal controls (NCs), which we then validated in the MC sample. We then compare this conventional cut-off to an optimal cut-off approach.

In sub-analyses, we investigate the differences in diagnostic accuracy in relation to demographic adjustments by comparing the original MoCA score with recently established demographically corrected MoCA z-scores [7] (an Excel file for the calculation of the z-score is provided in the Supplementary material). Finally, during our analyses, we noticed that information is lost when a continuous variable like the MoCA is dichotomized [20, 21], and a traditional binary cut-off is used. We therefore propose a revised approach to evaluate cognitive performance on the MoCA using two separate cut-offs in combination with an indecisive area between these scores.



We retrospectively assessed data from 1307 consecutive outpatients of the Memory Clinic, University Department of Geriatric Medicine FELIX PLATTER, Basel, Switzerland, undergoing neuropsychological assessment between March 6, 2017, and October 12, 2018. Data from patients meeting the following inclusion criteria were considered for the analysis: (a) age ≥ 65 years, (b) education ≥ 7 years, (c) fluency in the German language, and (d) availability of a neuropsychological assessment including the Mini Mental Status Examination (MMSE) [22] and the German version of the MoCA. Exclusion criteria were (a) severe sensory or motor impairment interfering with cognitive testing, (b) repeated testing with the MoCA due to follow-up examinations, and (c) documented refusal of the use of personal health-related data for research purposes. An overview of the clinical diagnoses is provided in Supplementary Table 1. The demographic inclusion criteria were selected to match the NC group from a previous normative study on the German version of the MoCA (see [7] for details).

The NC group was recruited from an existing Registry of Individuals Interested to Participate in Research established by the Memory Clinic in 2013 with approval from the local ethics committee (no. EKBB 280/12). From this registry, potential participants with the required demographic characteristics (age, education, sex) were identified and invited to participate in a normative study on the German verison of the MoCA [7]. During the recruitment process, a stratification of sex (female and male) and age (groups: 65–69, 70–74, 75–79, and > 79 years) was applied. Inclusion criteria were (a) age ≥ 65 years, (b) education ≥ 7 years, (c) fluent German speaking, and (d) provided written informed consent. Subjects who met one of the following criteria were excluded: (a) cognitive impairment (i.e., MMSE < 27/30 and/or Consortium to Establish a Registry for Alzheimer’s Disease-Neuropsychological Assessment Battery (CERAD-NAB) < 85.89 [23], any diagnosis of cognitive impairment), (b) diagnosis and/or symptoms of depression (i.e., Geriatric Depression Scale (GDS) > 5/15 [24]), (c) severe sensory or motor impairment interfering with cognitive testing, (d) serious somatic disease, (e) any disease or events affecting the central nervous system, (f) cerebrovascular disease, (g) current medication with psychoactive drugs except for benzodiazepines, and (h) participation in a cognitive study within the last 3 months (to avoid practice effects).


Patients were assessed in the following order: (a) detailed patient and medical history, (b) neuropsychological screening including the MMSE and the clock drawing test, (c) German version of the MoCA, (d) assessment of symptoms of depression (15-item GDS or Beck Depression Inventory) [25], and (e) extensive neuropsychological examination. Neuropsychological assessments were performed by board-certified neuropsychologists and by psychologists with a master’s degree in psychology. Neuropsychological test results were interpreted based on demographically corrected (i.e., age, sex, and education) z-scores and were used to inform diagnostic deliberations. The patients were medically examined by a neurologist or a geriatrician. Imaging (i.e., structural magnetic resonance imaging, computed tomography, and/or positron emission tomography with 18F-fluorodeoxyglucose) was performed, and in some patients, cerebrospinal fluid was collected to assess for protein deposition. Diagnostic consensus was reached in weekly interdisciplinary meetings by neuropsychologists, neurologists, neuroradiologists, nuclear medicine specialists, geriatricians, psychiatrists, and a neuropathologist. MoCA results were not considered in the diagnostic process.

The detailed procedures for the NC group are described elsewhere [7]. Briefly, study eligibility in the NC group was assessed by the German versions of the MMSE and the 15-item GDS questionnaire. All individuals were then assessed with the German version of the MoCA, followed by the German version of the CERAD-NAB. Subjects meeting any exclusion criteria were omitted from the main analysis.

The study protocol (no. EKNZ 2018-00737) was approved by the regional research ethics board (Ethikkommission Nordwest- und Zentralschweiz [EKNZ]) on May 22, 2018. The study was conducted in respect of the most recent version of the Declaration of Helsinki and was registered on (NCT03581643). The need for informed consent was waived by the EKNZ.

Statistical analyses

Demographical characteristics and test scores were compared pairwise using the non-parametric Wilcoxon rank sum test for between-group comparisons. Differences in sex were analyzed using the chi-squared test. All statistical analyses were performed using R, version 3.5.0 (R Foundation, Vienna, Austria) and RStudio Desktop (RStudio, Boston, MA, USA). Data are presented as mean (SD), and the education-corrected MoCA score (+ 1 point for < 12 years of education) was used, unless stated otherwise. There were no missing data in any of the analyses.

Different cut-off approaches for the MoCA

Diagnostic accuracies were calculated in the MC sample (i.e., Mild+Major NCD vs. NF, and Mild NCD vs. NF), for cut-offs on (a) the MoCA score and (b) the MoCA z-score. The different approaches to calculate the cut-offs are outlined in the following sections: “Original MoCA cut-off,” “Balanced cut-off,” “Youden’s index,” and “10th percentile in NCs.” For the MMSE, cut-offs were calculated in the MC sample (again Mild+Major NCD vs. NF, and Mild NCD vs. NF), using Youden’s index (see the section “Youden’s index”).

Original MoCA cut-off

The original MoCA cut-off (25/26 points) proposed by Nasreddine et al. [6] was applied to the MC sample, and diagnostic accuracies were calculated for (a) Mild+Major NCD vs. NF and (b) Mild NCD vs. NF.

Balanced cut-off

A balanced cut-off was calculated by choosing the scores where values of sensitivity and specificity are as equal as possible. Again, this approach was applied to the MC sample and diagnostic accuracies were derived for the MoCA score and the MoCA z-score for (a) Mild+Major NCD vs. NF and (b) Mild NCD vs. NF.

Youden’s index

Using the Optimal Cutpoints Package in R [26], Youden’s index (sensitivity + specificity − 1) [27] was applied to define the optimal cut-offs in the MC sample (Mild+Major NCD vs. NF and Mild NCD vs. NF) for the MoCA score, the MoCA z-score, and the MMSE. Youden’s index is calculated for every potential cut-point on the MoCA/MMSE; the cut-off where Youden’s index reaches its maximum value (i.e., the highest possible Youden’s index would be sensitivity = 100 + specificity = 100 – 1 = 199) is selected as the optimal cut-off. This approach considers false-positive classifications as undesirable as false-negative ones and aims at finding the cut-off with the overall highest sensitivity and specificity [27].

10th percentile in NCs

For this approach, MoCA cut-offs were derived solely based on the NC group. The cut-off was selected by pre-defining a desired specificity of approximately 90% in the NC group (i.e., maximum 10% false-positive classifications in a group of cognitively healthy individuals). This was achieved by choosing the MoCA score and the MoCA z-score that split the NC sample at the 10th percentile. In this approach, normality is defined as a reference range based on the distribution of scores in cognitively healthy individuals, and scores below the 10th percentile were considered pathological. The resulting cut-offs were then validated in the MC sample to differentiate Mild+Major NCD vs. NF and Mild NCD vs. NF.

Determination of the overall diagnostic accuracy

The discriminative power of the MoCA score, the MoCA z-score, and the MMSE score was estimated in terms of area under the curve (AUC) in the MC sample. Receiver operating characteristic (ROC) curves were calculated using the pROC package in R [28]. The AUCs of the MoCA score vs. MoCA z-score, the MoCA score vs. MMSE score, and the MoCA z-score vs. MMSE score were compared with a bootstrap two-sided significance test for correlated ROC curves. The correct classification rates of the newly derived MoCA cut-offs were compared to the original MoCA cut-off and the optimal cut-offs on the MMSE using McNemar’s test. Results were corrected for multiple comparisons according to Bonferroni-Holm.

Definition of two cut-offs and an indecisive area for the MoCA

During our analyses, we realized that the presented approaches to calculate cut-offs were not satisfying, since either specificity, sensitivity, or both were low. Therefore we considered to introduce a multiple cut-off approach. First, we created a plot to visualize the relationship between MoCA scores and rates of sensitivity and specificity to better understand which cut-offs should be selected. We aimed at finding cut-offs with approximately 90% sensitivity and 90% specificity. This plot is thereafter referred to as “Two cut-offs and an indecisive area.” For this purpose, cumulative frequencies were calculated separately for Mild NCD and for NC for each MoCA score. Thus, the proportion of individuals who performed equally or below a given score was determined for each score and expressed in percent of the whole sample. The cumulative frequency for a given score in Mild NCD corresponds to the sensitivity. Specificity is represented by the complementary sum (1 − cumulative frequency) in NC.


Descriptive analysis

Four hundred and forty-seven patients (Mild+Major NCD), 49 normal findings (NF), and 283 normal controls (NC) were included in the final analysis. Demographic characteristics are displayed in Table 1. There were no differences between NC and NF. Compared to NF, the patients (i.e., Mild+Major NCD, Mild NCD) were older (P value < .001), had fewer years of formal education (P value < .001), and had lower test scores (MMSE: P value < .001; MoCA: P value < .001, MoCA z-score: P value < .001). There were no sex differences between the groups.

Table 1 Demographic characteristics

Diagnostic accuracies

ROC curves for the MC sample are displayed in Supplementary Fig. 1. The AUC of the MoCA scores appears larger than that of the MMSE. However, with the application of the Bonferroni-Holm procedure, the AUC neither differed significantly between MoCA and MMSE scores (MoCA [AUC = 0.94] vs. MMSE [AUC = 0.84]: P value = .051; MoCA z-score [AUC = 0.94] vs. MMSE: P value = .074) nor between the uncorrected MoCA and the MoCA z-score (P value = 1.0).

Cut-offs and the corresponding diagnostic properties for the MoCA score, the MoCA z-score, and the MMSE are provided in Table 2. A MoCA score of 23/24 points was the optimal cut-off according to the 10th percentile method as well as according to Youden’s index in all patient groups. This cut-off had better correct classification rates than the original MoCA cut-off (25/26 points; P value < .001) and the MMSE score (P value < .001) in both patient samples. Specificity for the cut-off of 23/24 points was high with 92%, and it had good sensitivity for Mild+Major NCD (84%). However, sensitivity was low for Mild NCD (65%). The original MoCA cut-off (25/26 points) had high sensitivity for Mild+Major NCD (94%) and for Mild NCD (86%), but poor specificity (63%). For Mild NCD, an intermediate (i.e., balanced) cut-off (24/25 points) had neither good sensitivity (74%) nor good specificity (74%). We, therefore, aimed at obviating this trade-off between sensitivity and specificity by defining two separate cut-offs. This is illustrated by the example of Mild NCD vs. NC in the “Two separate cut-offs and an indecisive area” section.

Table 2 Cut-offs and diagnostic accuracy for the MoCA score, the MoCA z-score, and the MMSE

Two separate cut-offs and an indecisive area

In Fig. 1, sensitivity based on Mild NCD is plotted against specificity based on NC. Specificity increases with lower scores, while sensitivity increases with higher scores. At 23/24 points, specificity is 88%, indicating that only 12% of the NC scored ≤ 23 points. At 26/27 points, sensitivity is 91%, so only 9% of patients with Mild NCD achieved scores > 26 points. Consequently, pathological and normal cognition may be defined using two separate cut-offs. Analogous to the concept of z-scores, a distribution of scores is assumed, and extreme values are considered improbable for a specific population. Among those who are cognitively healthy, values below a given cut-off (i.e., 23 points) are rare, suggesting that an individual scoring ≤ 23 points is probably not healthy. This statement was accurate in 88% of the NC group (= specificity). Values above a given cut-off (i.e., > 26 points) are uncommon in Mild NCD. Therefore, an individual who attains > 26 points on the MoCA probably does not suffer from an NCD. This statement was accurate in 91% of Mild NCD patients (= sensitivity). Scores between these two cut-offs (24, 25, and 26 points) constitute an indecisive area. This indecisive area may be greater or smaller, depending on the desired accuracies (i.e., for sensitivity of 95% and specificity of 95%, the indecisive area would encompass MoCA scores from 23 to 26 points). In Fig. 2, we illustrate these two cut-offs together with an indecisive area to provide clinicians a means to determine which cut-offs are most appropriate with their patients. The corresponding positive predictive values (PPV) and negative predictive values (NPV) are plotted in Supplementary Fig. 2.

Fig. 1
figure 1

Two separate cut-offs and an indecisive area. The percentage of patients with Mild NCD who were correctly classified as patients (sensitivity, red line) and the percentage of normal controls that were correctly classified as normal controls (specificity, blue line) are illustrated. Two cut-offs are highlighted by the dashed lines: one cut-off for not-healthy results (23/24; with 88% specificity) and one cut-off for not-pathological results (26/27; with 91% sensitivity). Scores between these two cut-offs constitute an indecisive area (in orange), where information from further examinations is required. The + 1 adjustment for individuals with education < 12 years proposed by Nasreddine et al. was applied to calculate the MoCA score [6]

Fig. 2
figure 2

Decision tree for clinical evaluations of the MoCA score and the MoCA z-score. This decision tree may be used to determine which cut-offs for the MoCA score and the MoCA z-score are most appropriate. The cut-offs proposed here were based on the classification of normal controls (NC) vs. mild neurocognitive disorder (Mild NCD)


The German MoCA showed good AUC, sensitivity, and specificity for the classification of patients with mild and major NCD versus cognitively healthy normal findings when applied in a heterogeneous group of individuals referred to a university-affiliated Memory Clinic. In the present study, a MoCA score of 23/24 points was established as the optimal cut-off across different patient groups based on two methods. This finding is in line with a recent meta-analysis including seven validation studies on the MoCA [13]. The new MoCA cut-off had an improved correct classification rate compared to both, the original MoCA cut-off and the MMSE. Further, differences in diagnostic accuracy depending on the severity of cognitive impairment (Mild vs. Major NCD) were revealed. While a cut-off of 23/24 points had high sensitivity for all patients (Mild+Major NCD [84%]), sensitivity was low for Mild NCD (65%). When applying a higher cut-off (e.g., the originally proposed 25/26 points), sensitivity for Mild NCD increased to 91%; however, specificity to detect NF was low (59%). If both measures are balanced, neither of them is sufficiently high. Indeed, most screening tools for MCI lack either sensitivity or specificity [29].

In the current situation, it is difficult to favor one above the other. Specificity should be high to avoid falsely classifying healthy individuals as cognitively impaired and potentially provoke distress and insecurities in the affected individuals and their families. This is especially important in the context of AD and other neurodegenerative diseases, which cannot be reversed at this point of time. However, one may also argue in favor of prioritizing sensitivity, since some of the underlying etiologies of mild NCD (e.g., sleep apnea) may be easily treated and potentially even reversed. Moreover, in the context of mild NCD—regardless of the underlying etiology―various options to compensate for the presence of mild NCD exists and early diagnosis of cognitive decline may improve an individuals and his/her relatives’ quality of life. Nevertheless, ideally sensitivity and specificity should be balanced. Based on the findings of this study, we therefore propose a revised method to evaluate cognitive performance, taking the MoCA as an example. This builds up on previous efforts of other authors having developed demographic adjustments for the MoCA as a reaction of the too stringent nature of the original MoCA cut-off (e.g., [30, 31], and [7] for an overview of previously published normative studies). Instead of applying a single cut-off, two separate cut-offs may be used. One cut-off for results that are unlikely within the normal range and one cut-off for scores that are rarely seen in patients. MoCA scores > 26 points may be considered as not pathological with very high accuracy, while scores ≤ 23 points are very likely not healthy. Between these scores, we have defined an indecisive area. When an individual scores between 24 and 26 points (i.e., within the indecisive area), the clinician should start a more comprehensive neuropsychological assessment (e.g., an in-depth assessment according to DSM-5) [9] or reassess the individual with the MoCA in approximately 6 to 12 months.

Choice of normative samples and patient characteristics

It has been argued that a restrictive cognitively healthy normative group may not be entirely comparable to the population, which is typically screened with the MoCA. This may artificially boost specificity of a test and lead to an overestimation in diagnostic accuracy [16, 19]. We addressed this issue by analyzing two groups of cognitively healthy individuals: one that was purposely recruited for a previous normative study (NC) and one that was formed by consecutively referred patients with a cognitive normal finding (NF). In our study, there were no differences between the NC and the NF group, neither in demographic characteristics nor in cognitive performance. Furthermore, the optimal MoCA cut-offs were identical in these two groups. This suggests that the healthy controls in our study are representative for individuals with cognitive normal findings in the clinical routine. While this is reassuring, longitudinal data from individuals, who remained healthy for several years, should be analyzed in future studies.

Influence of demographic adjustments on diagnostic accuracy

In our study, individuals with mild and major NCD were older and had less years of formal education when compared to the NF group. We deliberately chose not to match the groups on demographic characteristics to reflect the true nature of this clinical routine sample. Moreover, while some authors (including our group) have suggested that correcting for demographical effects may increase diagnostic accuracy when evaluating cognitive performance [7, 13], others have questioned the utility of demographical adjustments [32], since age and education are per se risk factors of cognitive decline. In the current study, we addressed this subject by comparing demographically adjusted MoCA z-scores (accounting for age, education, and sex) with the MoCA raw total score (without any demographic corrections). Conversely, we found no difference between demographically corrected and uncorrected MoCA scores in the overall diagnostic accuracy measured by the AUC. However, a difference emerged in the balance of sensitivity and specificity. When considering the effects of age, education, and sex (z-scores), the MoCA gained specificity, while the uncorrected MoCA score showed increased sensitivity. The education-corrected MoCA score was located in between, with higher sensitivity but lower specificity compared to the MoCA z-score, and lower sensitivity but higher specificity compared to the uncorrected MoCA score. This result is in line with previous findings from a simulation [33]. Whether to rely on a demographically adjusted score or on an uncorrected raw score may depend on the setting. For instance, when the MoCA is applied to identify cognitively healthy participants in clinical research, high sensitivity might be more important to avoid the inclusion of patients with false-negative test results. In contrast, if the aim is to include cognitively impaired patients in a clinical trial, high specificity should be favored over sensitivity to avoid including healthy individuals with false-positive results. Indeed, the erroneous inclusion of cognitively healthy individuals as patients may mask possible treatment effects in clinical trials [4]. When a general practitioner should decide whether to refer a patient to a specialized Memory Clinic based on cognitive screening, false-positive results should be minimized to reduce healthcare costs and discomfort for the individual. On the other hand, false-negative results may deprive a patient of the early implementation of therapeutic strategies. In this situation, we suggest relying on our new system with two separate cut-offs and an indecisive area.


Sensitivity, specificity, and the AUC give an indication of the quality of the test under observation by classifying the test performance with respect to a reference standard (i.e., an individual will be classified as a patient on the MoCA as well as according to a complete Memory Clinic diagnostic workup). However, these measures do not inform about the probability whether a tested individual has a specific disease [15, 34]. Predictive values―which are influenced by prevalence rates―reflect this information. In the current study, the MoCA had very high PPV across all patient groups and most MoCA scores (see Supplementary Fig. 2). However, the PPV will be lower in a setting with low prevalence of a specific disease (e.g., when screening for cognitive impairment at the general practitioner’s office). Likewise, in most MoCA studies reporting PPV and NPV, the prevalence of MCI was greater than in the general population [10]. Ideally, the diagnostic accuracy of a test should be evaluated in the same setting where it is clinically applied [35]. We did not have access to any data from first step screening processes (i.e., from a general practitioner’s office). Thus, our findings inform about how well the MoCA classifies individuals as healthy or cognitively impaired compared to a more extensive, multi-dimensional, diagnostic process, as performed in our Memory Clinic (described in the “Procedures” section). Additionally, we can provide the probability for a Memory Clinic patient to be affected by a mild or major NCD, when the MoCA performance is below the cut-off (PPV), as well as the probability that the patient is cognitively healthy, when the performance lies above the cut-off (NPV) [34]. This should be kept in mind, when applying our findings to other settings than a Memory Clinic. We refer to the excellent recent publication by Trevethan [34] for a better understanding on the informative value of sensitivity, specificity, PPV, and NPV.

There may be a selection bias in the NC and MC groups. The NC group was recruited from a Registry of Individuals Interested to Participate in Research, which may exhibit a greater motivation to perform well in the neuropsychological assessment. The MC group consisted of patients referred to the Memory Clinic from external medical professionals (e.g., general practitioners, hospitals) and may differ from individuals not seeking advice from a medical professional.


In the present study, the diagnostic properties of the German MoCA were evaluated in an outpatient sample referred to a university-affiliated Memory Clinic. The originally proposed MoCA cut-off (25/26 points) had good sensitivity for mild and major NCD, but specificity was poor. As an alternative, a cut-off of 23/24 points on the MoCA improved specificity. However, the sensitivity to detect mild NCD was low using this cut-off. Thus, both cut-offs lead to a trade-off in either sensitivity (23/24 points) or specificity (25/26 points). In this context, we propose a new method to guide clinical decision-making by relying on two separate cut-offs combined with an indecisive area. Adding an indecisive area will increase both, sensitivity and specificity. Moreover, the presence of an indecisive area highlights the difficulties related to the early detection of cognitive impairment and mirrors the clinical reality quite accurately.

Availability of data and materials

The datasets analyzed during the study are available from the corresponding author on reasonable request.



Alzheimer’s disease


Area under the curve


Consortium to Establish a Registry for Alzheimer’s Disease-Neuropsychological Assessment Battery


Diagnostic and Statistical Manual, 5th Edition


Ethikkommission Nordwest- und Zentralschweiz (i.e., regional research ethics board)


Geriatric Depression Scale


Mild cognitive impairment


Mini Mental Status Examination


Montreal Cognitive Assessment


Normal controls


Neurocognitive disorder


Normal findings


Negative predictive values


Positive predictive values


Receiver operating characteristic


  1. Prince M, Comas-Herrera A, Knapp M, Guerchet M, Karagiannidou M. World Alzheimer Report 2016. Improving healthcare for people living with dementia: coverage, quality and costs now and in the future. 2016 [Available from:

  2. Yaffe K. Modifiable risk factors and prevention of dementia: what is the latest evidence? JAMA Intern Med. 2018;178(2):281–2.

    Article  PubMed  Google Scholar 

  3. Ehrensperger MM, Taylor KI, Berres M, Foldi NS, Dellenbach M, Bopp I, et al. BrainCheck - a very brief tool to detect incipient cognitive decline: optimized case-finding combining patient- and informant-based data. Alzheimers Res Ther. 2014;6(9):69.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Edmonds EC, Ard MC, Edland SD, Galasko DR, Salmon DP, Bondi MW. Unmasking the benefits of donepezil via psychometrically precise identification of mild cognitive impairment: a secondary analysis of the ADCS vitamin E and donepezil in MCI study. Alzheimers Dement (N Y). 2018;4:11–8.

    Google Scholar 

  5. Edmonds EC, Delano-Wood L, Jak AJ, Galasko DR, Salmon DP, Bondi MW, et al. “Missed” mild cognitive impairment: high false-negative error rate based on conventional diagnostic criteria. J Alzheimers Dis. 2016;52(2):685–91.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Nasreddine ZS, Phillips NA, Bedirian V, Charbonneau S, Whitehead V, Collin I, et al. The Montreal Cognitive Assessment, MoCA: a brief screening tool for mild cognitive impairment. J Am Geriatr Soc. 2005;53(4):695–9.

    Article  PubMed  Google Scholar 

  7. Thomann AE, Goettel N, Monsch RJ, Berres M, Jahn T, Steiner LA, et al. The Montreal Cognitive Assessment: normative data from a German-speaking cohort and comparison with international normative samples. J Alzheimers Dis. 2018;64(2):643–55.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Lam B, Middleton LE, Masellis M, Stuss DT, Harry RD, Kiss A, et al. Criterion and convergent validity of the Montreal cognitive assessment with screening and standardized neuropsychological testing. J Am Geriatr Soc. 2013;61(12):2181–5.

    Article  PubMed  Google Scholar 

  9. American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 5th ed. Arlington: American Psychiatric Association; 2013.

  10. Ozer S, Young J, Champ C, Burke M. A systematic review of the diagnostic test accuracy of brief cognitive tests to detect amnestic mild cognitive impairment. Int J Geriatr Psychiatry. 2016;31(11):1139–50.

    Article  PubMed  Google Scholar 

  11. Smith T, Gildeh N, Holmes C. The Montreal Cognitive Assessment: validity and utility in a memory clinic setting. Can J Psychiatr. 2007;52(5):329–32.

    Article  Google Scholar 

  12. Davis DH, Creavin ST, Yip JL, Noel-Storr AH, Brayne C, Cullum S. Montreal Cognitive Assessment for the diagnosis of Alzheimer’s disease and other dementias. Cochrane Database Syst Rev. 2015;10:CD010775.

    Google Scholar 

  13. Carson N, Leach L, Murphy KJ. A re-examination of Montreal Cognitive Assessment (MoCA) cutoff scores. Int J Geriatr Psychiatry. 2018;33(2):379–88.

    Article  PubMed  Google Scholar 

  14. Florkowski CM. Sensitivity, specificity, receiver-operating characteristic (ROC) curves and likelihood ratios: communicating the performance of diagnostic tests. Clin Biochem Rev. 2008;29(Suppl 1):S83–7.

    PubMed  PubMed Central  Google Scholar 

  15. Weissberger GH, Strong JV, Stefanidis KB, Summers MJ, Bondi MW, Stricker NH. Diagnostic accuracy of memory measures in Alzheimer’s dementia and mild cognitive impairment: a systematic review and meta-analysis. Neuropsychol Rev. 2017;27(4):354–88.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Martin PK, Schroeder RW, Baade LE. A tale of two norms: the impact of normative sample selection criteria on standardized scores in older adults. Clin Neuropsychol. 2017;31(6–7):1204–18.

    Article  PubMed  Google Scholar 

  17. McKhann GM, Knopman DS, Chertkow H, Hyman BT, Jack CR Jr, Kawas CH, et al. The diagnosis of dementia due to Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer's Dementia. 2011;7(3):263–9.

    Article  PubMed  Google Scholar 

  18. Lijmer JG, Mol BW, Heisterkamp S, Bonsel GJ, Prins MH, van der Meulen JH, et al. Empirical evidence of design-related bias in studies of diagnostic tests. JAMA. 1999;282(11):1061–6.

    Article  CAS  PubMed  Google Scholar 

  19. Noel-Storr AH, McCleery JM, Richard E, Ritchie CW, Flicker L, Cullum SJ, et al. Reporting standards for studies of diagnostic test accuracy in dementia: the STARDdem Initiative. Neurology. 2014;83(4):364–73.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Brown MD, Reeves MJ. Evidence-based emergency medicine/skills for evidence-based emergency care. Interval likelihood ratios: another advantage for the evidence-based diagnostician. Ann Emerg Med. 2003;42(2):292–7.

    Article  PubMed  Google Scholar 

  21. Vermeersch P, Vercammen M, Holvoet A, Vande Broek I, Delforge M, Bossuyt X. Use of interval-specific likelihood ratios improves clinical interpretation of serum FLC results for the diagnosis of malignant plasma cell disorders. Clin Chim Acta. 2009;410(1–2):54–8.

    Article  CAS  PubMed  Google Scholar 

  22. Folstein MF, Folstein SE, McHugh PR. “Minimental state”. A practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res. 1975;12:189–98.

    Article  CAS  PubMed  Google Scholar 

  23. Ehrensperger MM, Berres M, Taylor KI, Monsch AU. Early detection of Alzheimer’s disease with a total score of the German CERAD. J Int Neuropsychol Soc. 2010;16:910–20.

    Article  PubMed  Google Scholar 

  24. Yesavage JA, Sheikh JI. Geriatric depression scale (GDS). Clin Gerontol. 1986;5:165–73.

    Article  Google Scholar 

  25. Beck AT, Ward CH, Mendelson M, Mock J, Erbaugh J. An inventory for measuring depression. Arch Gen Psychiatry. 1961;4:561–71.

    Article  CAS  PubMed  Google Scholar 

  26. López-Ratón M, Rodríguez-Álvarez MX, Cadarso-Suárez C, Gude-Sampedro F. Optimal cutpoints: an R package for selecting optimal cutpoints in diagnostic tests. 2014. 2014;61(8):36.

  27. Youden WJ. Index for rating diagnostic tests. Cancer. 1950;3(1):32–5.

    Article  CAS  PubMed  Google Scholar 

  28. Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez JC, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12:77.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Summers MJ, Bondi MW. Editorial introduction to the special issue on neuropsychological assessment in aging. Neuropsychol Rev. 2017;27(4):303–4.

    Article  PubMed  Google Scholar 

  30. Rossetti HC, Lacritz LH, Cullum CM, Weiner MF. Normative data for the Montreal Cognitive Assessment (MoCA) in a population-based sample. Neurology. 2011;77(13):1272–5.

    Article  PubMed  Google Scholar 

  31. Rossetti HC, Lacritz LH, Hynan LS, Cullum CM, Van Wright A, Weiner MF. Montreal Cognitive Assessment performance among community-dwelling African Americans. Arch Clin Neuropsychol. 2017;32(2):238–44.

    PubMed  Google Scholar 

  32. Strauss E, Sherman EMS, Spreen O. A compendium of neuropsychological tests. Administration, norms, and commentary. Oxford: Oxford University Press; 2006.

    Google Scholar 

  33. O'Connell ME, Tuokko H. Age corrections and dementia classification accuracy. Arch Clin Neuropsychol. 2010;25(2):126–38.

    Article  PubMed  Google Scholar 

  34. Trevethan R. Sensitivity, specificity, and predictive values: foundations, pliabilities, and pitfalls in research and practice. Front Public Health. 2017;5:307.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Habibzadeh F, Habibzadeh P, Yadollahie M. On determining the most appropriate test cut-off value: the case of tests with continuous results. Biochem Med (Zagreb). 2016;26(3):297–307.

    Article  Google Scholar 

Download references


We gratefully acknowledge the help of the staff of the Memory Clinic, University Department of Geriatric Medicine FELIX PLATTER, Basel, Switzerland, notably Michael M. Ehrensperger, PhD, and Marc Sollberger, MD, for providing advice during study conduction, and Ursi Kunze, MSc, for the support in database management. We thank Christian Brauchli, PhD, for the methodological input and commenting on the manuscript.


Financial support for this research was provided from internal sources of the Memory Clinic, University Department of Geriatric Medicine FELIX PLATTER, Basel, Switzerland, and of the Department of Anesthesia, University Hospital Basel, Switzerland.

Author information

Authors and Affiliations



AET and MB conducted the statistical analyses. All authors were involved in the data interpretation and the writing of the manuscript. The authors read and approved the final manuscript.

Corresponding author

Correspondence to Andreas U. Monsch.

Ethics declarations

Ethics approval and consent to participate

The study protocol (no. EKNZ 2018-00737) was approved by the regional research ethics board (Ethikkommission Nordwest- und Zentralschweiz [EKNZ]) on May 22, 2018. The study was conducted in respect of the most recent version of the Declaration of Helsinki and was registered on (NCT03581643). The need for informed consent was waived by the EKNZ.

Consent for publication

Not applicable.

Competing interests

There are no conflicts of interest. Dr. Monsch reports personal fees from AbbVie Pahrma, from Vifor Pharma, from Merz Pharma, and from Roche Pharma outside the submitted work. Dr. Thomann reports personal fees from Vifor Pharma outside the submitted work. Dr. Thomann is a full-time employee of F. Hoffmann-La Roche Ltd., starting after the main time of her contribution to the submitted work.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Supplementary Table 1.

Diagnoses in the patient sample. Supplementary Figure 1. ROC curves. Supplementary Figure 2. Positive and negative predictive values. Excel file for automatic calculation of demographically corrected z-score for the German version of the MocA. MoCA z-score calculation

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Thomann, A.E., Berres, M., Goettel, N. et al. Enhanced diagnostic accuracy for neurocognitive disorders: a revised cut-off approach for the Montreal Cognitive Assessment. Alz Res Therapy 12, 39 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: