Using Machine Learning to Predict Cognitive Impairment Among Middle-Aged and Older Chinese: A Longitudinal Study

Liu, Haihong; Zhang, Xiaolei; Liu, Haining; Chong, Sheau Tsuey

doi:10.3389/ijph.2023.1605322

ORIGINAL ARTICLE

Int. J. Public Health, 19 January 2023

Volume 68 - 2023 | https://doi.org/10.3389/ijph.2023.1605322

Using Machine Learning to Predict Cognitive Impairment Among Middle-Aged and Older Chinese: A Longitudinal Study

Haihong Liu ^1,2

Xiaolei Zhang ^3,4

Haining Liu ^2,5,6^*

Sheau Tsuey Chong ^1,7^*

1. Centre for Research in Psychology and Human Well-being, Faculty of Social Sciences and Humanities, Universiti Kebangsaan Malaysia, Bangi, Malaysia
2. Department of Psychology, Chengde Medical University, Chengde, China
3. Department of Biomedical Engineering, Chengde Medical University, Chengde, China
4. Faculty of Engineering, Universiti Putra Malaysia, Serdang, Malaysia
5. Hebei Key Laboratory of Nerve Injury and Repair, Chengde Medical University, Chengde, China
6. Hebei International Research Center of Medical Engineering, Chengde Medical University, Chengde, China
7. Counselling Psychology Programme, Secretariat of Postgraduate Studies, Faculty of Social Sciences and Humanities, Universiti Kebangsaan Malaysia, Bangi, Malaysia

Article metrics

Citations

5,5k

Views

2,8k

Downloads

Abstract

Objective: To explore the predictive value of machine learning in cognitive impairment, and identify important factors for cognitive impairment.

Methods: A total of 2,326 middle-aged and elderly people completed questionnaire, and physical examination evaluation at baseline, Year 2, and Year 4 follow-ups. A random forest machine learning (ML) model was used to predict the cognitive impairment at Year 2 and Year 4 longitudinally. Based on Year 4 cross-sectional data, the same method was applied to establish a prediction model and verify its longitudinal prediction accuracy for cognitive impairment. Meanwhile, the ability of random forest and traditional logistic regression model to longitudinally predict 2-year and 4-year cognitive impairment was compared.

Results: Random forest models showed high accuracy for all outcomes at Year 2, Year 4, and cross-sectional Year 4 [AUC = 0.81, 0.79, 0.80] compared with logistic regression [AUC = 0.61, 0.62, 0.70]. Baseline physical examination (e.g., BMI, Blood pressure), biomarkers (e.g., cholesterol), functioning (e.g., functional limitations), demography (e.g., age), and emotional status (e.g., depression) characteristics were identified as the top ten important predictors of cognitive impairment.

Conclusion: ML algorithms could enhance the prediction of cognitive impairment among the middle-aged and older Chinese for 4 years and identify essential risk markers.

Introduction

With the current rapidly aging global population, the burden of dementia in low-income countries is expected to increase dramatically in the coming decades [1]. Currently, 47.47 million people worldwide have been diagnosed with dementia but by 2050, this number is expected to double [2]. To identify the population with the highest risk of dementia, focusing on the early stages of the pathological process is a viable strategy for prevention. Cognitive impairment is characterized by decreased memory, attention and language, and deterioration in other cognitive functions, including mild cognitive impairment and dementia [3]. At present, neuropsychological assessment is an important method for screening and diagnosing cognitive impairment. For example, neuropsychological examinations such as Mini-mental State Examination (MMSE) and Montreal Cognitive Assessment (MoCA) are useful evaluation tests for cognitive function [5, 6]. Considering the shortage of community professionals and time, telephone interview for cognitive status (TICS) with fewer items has demonstrated its value as an effective screening tool for cognitive impairment in the community compared with other methods such as MMSE [4]. This method complements commonly used cognitive function evaluation tools, by identifying potential risk factors for cognitive impairment. Early screening of individual cognitive impairment is crucial to preventing cognitive decline, and progression to dementia through immediate effective treatment and management strategies.

At present, a fundamental strategy to prevent or minimize cognitive decline is through early detection of the risk factors for cognitive impairment, which benefits preventive intervention [5]. Therefore, the prediction of cognitive impairment plays an important role in mitigating and preventing cognitive impairment. However, predicting cognitive impairment is a challenging process. Few empirical studies have compared the predictors of cognitive impairment, and these methods mainly used meta-analysis methods and traditional regression methods. For example, a meta-analysis reported in the Lancet in 2020, revealed that approximately 35% of dementia was contributed by nine factors, including previous education, high blood pressure, middle-aged obesity, and late-life depression [5]. Researchers have used traditional regression statistical methods to identify the common predictors of cognitive impairment outcomes including demographic characteristics and general health information. However, these studies did not point out the importance ranking of influencing factors.

Notably, in these studies, the most commonly used analysis method was regression-based inferential statistics. Yet, the predictors produced by these surveys were insensitive [6]. Several issues associated with conventional statistics limit their robust prediction of complex neurodegenerative processes. Traditional regression-related approaches can only accommodate a restricted number of predictors and cannot process the complex multi-class characteristic variables [7]. In addition, these methods are based on linear assumptions and may not be able to effectively manage more complex patterns including non-linear and higher dimensional [8].

Emerging computing methods using machine learning can optimize the prediction of cognitive impairment, to overcome the shortcomings of traditional methods. Machine learning(ML) has been used for clinical classification and prediction based on extracted high-dimensional features from data [9]. The random forest is a typical ML technique with high predictive performance and robustness as regards to its accuracy and ease of implementation [10, 11]. This method has a high level of predictive ability. It creates multiple decision trees by implementing random sampling in the same data set, combining them and finally predicting the target variable [12, 13]. Importantly, random forests also have excellent predictive ability to discover the correlation between explanatory variables and diseases, while preventing over-fitting when multiple explanatory variables are applied to the model [14, 15]. Compared with other methods, random forest demonstrated the highest accuracy compared with other methods in predicting cognitive impairment [16, 17].

In recent years, machine learning, has been increasingly used in research to predict cognitive impairment. These studies mainly have two characteristics. First, most of these studies use expensive sample data sources, such as MRI, PET and other medical imaging methods [18–20]. Second, predictor variables in some studies that predict cognitive impairment or other diseases, are only self-reported variables [16, 21, 22].

Cognitive impairment is an age-related condition caused by Alzheimer’s disease, vascular dementia, mixed dementia or other related types of dementias with no cure [23, 24]. Therefore, ML technology has high potential value in evaluation of risk factors of cognitive impairment. Conventional regression-based approaches have been used to effectively identify key risk factors, including demographic (e.g., education, venerable age, gender) [25], physical condition (e.g., body mass index (BMI), hearing difficulty) [26, 27], lifestyle activities (e.g., smoking, drinking and instrumental activities of daily living (IADL) [28], poor psychological wellbeing (e.g., depression) [29–31], and chronic diseases (e.g., chronic lower back pain) [32]. However, few studies have examined the effect of these independent risk factors on cognitive impairment among older Chinese using a prospective design.

Given the potentially severe consequences of cognitive impairment, an improvement in the predictions for middle-age and older people is important. To solve this problem and fill gaps in the previous literature, this study aims to examine the predictive power of the ML model for cognitive impairment using China Health and Retirement Longitudinal Survey database. In addition, the study compared the prediction accuracy of random forest in ML with traditional inferential statistical method (logistic regression). First, a large representative elderly sample, which consisted of middle-aged and elderly people, participated in a 3-year survey. Second, random forest and logistic regression were used to longitudinally predict cognitive impairment and identify ten most important predictors, including biological factors and psychological at 2-year and 4-year follow-ups from 44 baseline predictors. These baseline predictors included demographic (e.g., age, education level), health status and functioning (e.g., physical functions, biomarkers), emotional status (e.g., depression), lifestyle and behavior (e.g., smoking, drinking, sleeping habits). Finally, a 4-year follow-up cross-sectional data was applied to construct a model to verify the accuracy of the longitudinal prediction model for previous 2 years.

It was assumed that the random forest model of ML could predict long-term cognitive impairment. In addition, compared with the logistic regression model, the random forest model could improve the prediction of long-term cognitive impairment outcomes. ML model could also screen out the risk factors with the most significant impact on cognitive impairment. Overall, the study results provide insights into the practicability of this innovative computational method, which has potential diagnostic value in cognitive decline. More importantly, the model provided a ranking of predictors of cognitive impairment which is invaluable for identifying the risk various factors (simple and easily available variables) of cognitive impairment in daily life, for effective prevention and intervention to promote healthy aging [23].

Methods

Dataset and Participants

The data for this study were obtained from the CHARLS from 2011 to 2015. This longitudinal survey covered 450 villages or communities in 150 counties/districts, of which 52.67% comprised rural areas, and 47.33% was urban areas [33]. The CHARLS survey aims to build high-quality public databases of individuals and families of middle-aged and older persons aged 45 and above across the country. The national baseline survey was conducted in 2011, whereas the second and third data surveys were carried out in 2013and 2015, respectively. A total of 19,817 respondents were involved in the 2011 baseline survey. In 2013, a total of 18,605 respondents participated in the assessment of cognitive ability. In 2015, a total of 21,095 respondents participated in all the same surveys as the baseline survey. First, the 2011, 2013, and 2015 data were merged according to the principle of ID and household ID matching, and a total of 4,043 participants participated in the survey from 2011 to 2015. Then, missed follow-up data and interviewees answered by others, and variables missing >20% of patient data were excluded from the analysis. Finally, respondents who participated in all three surveys and without the characteristics in the aforementioned exclusion criteria were included in the analysis. The final sample size was 2,326. (The data preprocessing process is shown in Supplementary Figure S1). Specifically, basic information module, health status and functioning module, physical examination and blood-based biomarkers data from 2011 to 2015, and cognitive module data from 2013 were used in this study. The study protocol of the CHARLS was approved by the Peking University Biomedical Ethics Committee, which conformed to the standards set by the latest revision of the Declaration of Helsinki (IRB00001052-11015) (http://charls.ccer.edu.cn/charls/, https://opendata.pku.edu.cn/dataverse/CHARLS).

Patient and Public Involvement

In this study, we used data from open database CHARLS, which is a nationally longitudinal survey. Therefore, no direct patient was involved and contacted.

Cognitive Function

To measure the cognitive status of the research population, several measurements of the telephone interview of cognitive status (TICS-10) in the CHARLS data were used [34, 35]. These included date, week, and season among others (orientation and attention), which scored with 5 points; 100 minus 7 calculation series scored with 5 points; the recall, delayed word recall and episodic memory for 10 words was scored with 20 points; and the drawing of two repeated five-sided graphs (visual spatial abilities) was scored with 1 point. The total cognitive function score was 31 points. In general, a higher score indicated better cognitive function of middle-aged and elderly people.

Based on the previous studies, the results of cognitive classification and baseline demographics (e.g., age, education level, marital status, type of residence), health status and functioning (ADL, IADL, functional limitations, life expectancy, eyes, hearing, oral cavity, pain, physical examination, and blood indicators), emotional status (depression CES-D), lifestyle and behavior (sleeping, physical activity, social interaction, smoking, and drinking) were related. Therefore, 44 baseline features in the model were selected to reflect the previously determined predictors in the dataset.

Demographics

Demographic variables included gender, age, education (no formal education illiterate, does not finish primary school but capable of reading or writing, sishu, elementary school, middle school, vocational school, two/three-year college/associate degree, four-year college/bachelor’s degree, Post-graduate, Master’s degree), household registration (agricultural household registration, non-agricultural household registration and unified residence household registration), marital status (being married and not being married) among others. Six options were present in the questionnaire regarding marital status, namely “married and living with spouse,” “married but not living with a spouse for a temporary period due to work and other reasons,” “separated,” “divorced,” “widowed,” and “never married.” The first two types of marital status were defined as “being married,” while the remaining two types were defined as “not being married.”

Health Status and Functioning

In this section, to assess health status and functioning, predictive variables mainly related to health and function were used, including eyesight (close) and eyesight (distant objects), hearing problem, tooth loss, chronic disease (participants self-reported whether they had a chronic disease diagnosed by a doctor), pain (are you often troubled with body pains?), physical functions (height, weight, BMI, and respiratory function, blood pressure), blood indicators, activities of daily living (ADL) and instrumental activities of daily living (IADL) [36]. ADL includes dressing, bathing, eating, getting into and out of bed, using the toilet, controlling urination and defecation and IADL includes doing household chores, preparing hot meals, shopping, money management, taking medicine). Are there any difficulties in these daily routines? There are 4 options for all questions: 1. No difficulty, 2. Difficulty but achievable, 3. Difficulty and need help, 4. Unable to complete. A higher score indicates lower quality of acting.

Functional limitations, included “running 1 km,” “walking 100 m,” “sedentary standing up,” “climbing stairs,” and “picking up coins” and nine questions regarding the difficulty of nine basic activities, with a total score range within 0–27 points. A higher score indicated deteriorating body function Following that life expectancy was assessed through the possibility of an individual’s assumption about living until the expected age. Accordingly, the questions comprised a rating of 1–5, which indicated from “nearly impossible” to “very certain.”

Venous blood samples (biomarkers) comprised: high-sensitivity C-reactive protein (hsCRP), glycosylated hemoglobin (HbA1c), total cholesterol, high density lipoprotein (HDL) cholesterol, low-density lipoprotein (LDL) cholesterol, triglycerides, glucose, blood urea nitrogen (BUN), creatinine, uric acid, and cystatin C [37, 38].

Emotional Status

The CHARLS database employed the Center for Epidemiologic Studies Depression Scale-10 (CESD-10) to investigate the depression risk among middle-aged and elderly people, with depression scores ranging from 0 to 30 points. Subsequently, higher score indicated higher susceptibility to depression. A score of 10 points or higher indicates high risk of depression [39].

Lifestyle and Behavior

This study identified lifestyle and behavior as the main factors affecting cognitive function. The other factors identified included sleeping habits (night sleep time and nap time), eating habits (number of meals a day), smoking, drinking, physical activity (amount of exercise per person), energy expenditure (total number of high and low activities of each person * weight), and social interaction (entertainment activities, service activities, other activities, and whether to participate in social activities).

Outcome Variables

During the 2013 and 2015 follow-ups, the outcome variables of the survey were consistent with the cognitive function test in 2011 variables such as date, week, and season (orientation and attention), and scored 5 points; 100 minus 7 calculation series, scored 5 points, recall and delayed recall word recall episodic memory 10 words, scored 20 points, and drawing two repeated five-sided graphs (visual spatial abilities) scored 1 point. The total cognitive score was 31 points. When the total score of participants exceeded 1 standard deviation lower than the standard of the corresponding age group, the participants were classified as cognitively impaired. Meanwhile, other participants were defined as having normal cognition [34, 40].

Performance Evaluation and Data Analyses

All data were analyzed using SPSS version 26.0 version and Python version 3.8. The random forest and logistic regression model were employed to predict the cognitive function of middle-aged and elderly people in 2013 and 2015. Subsequently, 2,326 data sets were divided into a training set (70%, N = 1,628) and test set (30%, N = 698). Some missing values were imputed using the method of nearest neighbor imputation. A flow chart of the random forest data analysis process is shown in Supplementary Figure S1. The parameters set of the random model were as follows: maximum depth of the forest = 6, and maximum number of leaves = 90(to alleviate over-fitting). The learning rate was set to 0.001, and the training evaluation index was 100 iterations of AUC training. When the number of iterations exceeded 5 times, no further increase in the AUC value was observed, and the training was stopped to prevent over-fitting. To validate the model, the 10-fold cross-validation method was used.

In the logistic regression model, the input data were standardized in order to speed up the gradient descent to find the optimal solution. The regularization parameter was selected as “L2,” the number of cross-validation was set to 10 folds, the loss function was optimized by the second derivative matrix of the loss function, the regularization coefficient was set to 20 equal parts from −2 to 2, and the error range of the iteration termination criterion was 0.01.

Descriptive Analysis

Results were expressed as the mean (± standard deviation) of continuous variables or the percentage of subjects in categorical variables. The AUC value in the Receiver Operator Characteristic curve (ROC) was the area under the ROC curve, which reflected the performance of the model. The AUC >0.9 was considered very good, 0.8–0.90 was considered good, 0.7–0.8 was regarded as fair, and <0.7 was regarded as poor [41].

Results

Sample Characteristics

The baseline characteristics of this study in 2011 are presented in Table 1. In the first year (baseline, 2011), there were 2,326 participants, including 1,318 people aged 45–59 (middle-aged) and 1,008 people aged 60 and above (elderly people). The average score of cognitive function was 12.94 (±5.95). The specific score for each baseline or variable was as follows: demographic (5), health status (24), functioning (3), emotional status (2), and lifestyle and behavior (10).

TABLE 1

Variable	M(SD) or N(%)
Age	58.64 (8.69)
45–59 years old(N = 1,318)	52.37(4.41)
60+ years old(N = 1,008)	66.83(5.54)
Gender
Male	1,083 (46.60%)
Female	1,243 (53.4%)
Education
No formal education illiterate	664 (28.50%)
Did not finish primary school but capable of reading or writing	532 (22.90%)
Elementary school	569 (24.50%)
Middle school	381 (16.40%)
High school and above	180 (7.70%)
Hukou
Agricultural	2011 (86.50%)
Non-agricultural	307 (13.20%)
Unified residence	8 (0.30%)
Marital status
Married	2083 (89.60%)
Not married	243 (10.40%)
Health status and functioning
Physical examination
BMI	23.23 (3.88)
Central obesity
Yes	520 (22.40%)
No	1806 (77.60%)
Breath (vital capacity)	257.24 (109.81)
Systolic	128.97 (21.32)
Diastolic	75.40 (12.33)
Pulse	72.01 (10.26)
Vision
Eyesight (close)	3.85 (0.87)
Eyesight (distance)	3.80 (0.94)
Hearing	3.62 (0.89)
Tooth loss
Yes	188 (8.10%)
No	2,138 (91.90%)
Chronic diseases
Yes	1,633 (70.20%)
No	693 (29.80%)
Pain
Yes	854 (36.70)
No	1,472 (63.30%)
Biomarkers
HsCRP	2.59 (6.58)
HbA1c	5.25 (0.73)
Total cholesterol	192.47 (37.43)
HDL cholesterol	52.12 (15.10)
LDL cholesterol	114.36 (34.16)
Triglycerides	125.54 (90.27)
Glucose	106.90 (32.12)
Blood urea nitrogen (BUN)	15.85 (4.73)
Uric acid	4.51 (1.22)
Cystatin C	1.02 (0.24)
Creatinine	0.78 (0.18)
Life expectancy	2.96 (1.03)
Functioning
Functional limitations	10.90 (4.13)
ADL	4.57 (3.20)
IADL	5.59 (1.72)
Emotional status
Depression (CESD-10)	8.86 (6.42)
High depression risk
Yes	821 (35.30%)
No	1,505 (64.70%)
Lifestyle and behavior
Night sleep (Hour)	6.28 (1.938)
Nap time (Minute)	31.27 (44.50)
Eating habits	3.14 (0.40)
Smoking
Yes	876 (37.70%)
No	1,450 (62.30%)
Drinking
Yes	764 (32.80%)
No	1,562 (67.20%)
Physical activity	2,589.33 (4578.68)
Energy consumption	145495.67 (260370.71)
Social interaction
Entertainment	0.55 (0.72)
Service activities	0.10 (0.33)
Other activities	0.02 (0.5)
Cognitive function
2011Cognitive total score	12.94 (5.95)

Baseline sample characteristics (China, 2011).

Model Performance

Table 2 shows the comparison of AUC between random forest and logistic regression. The results show that random forest performs better than logistic regression. Table 3 illustrates the model performance indicators of random forest models. The results for the 2nd and 4th year follow-up model indicators show that the AUC of the 3 ML models (model a¹, model a², model b) were 0.81, 0.79, and 0.80, respectively (see Supplementary Figures S2–S4). The models fit was good and fair.

TABLE 2

		Mean AUC	95%CI	Model fit
Random models	Model a¹	0.81	0.79–0.83	Good
	Model a²	0.79	0.76–0.82	Fair
	Model b	0.80	0.77–0.83	Good
Logistic regression	Model a¹	0.61	0.58–0.64	Poor
	Model a²	0.62	0.59–0.65	Poor
	Model b	0.70	0.67–0.73	Fair

AUC of random forest and logistic regression (China, 2022).

TABLE 3

	Model a¹	Model a²	Model b
Precision	0.69	0.62	0.84
Recall	0.83	0.79	0.79
F-score	0.75	0.69	0.72
Accuracy	0.82	0.79	0.84

Model performance (China, 2022).

Note: 1) Model a¹refers to the model indicators of the predictive model constructed by the independent variables in 2011 to predict the cognitive impairment in 2013.

2) Model a² refers to the model indicators of the predictive model constructed by the independent variables in 2011 to predict the cognitive impairment in 2015.

3) Model b refers to the model indicators of the prediction model constructed by the cross-sectional data in 2015 and the independent variables to predict the cognitive impairment in 2015.

Predictor Variables Importance

Table 4 presents the predictor variable ranking results by importance of 10 feature selection methods that were performed on the dataset. The cognitive classification prediction model for 2013 ranked the variables from the most important to the lest important as follows: “education,” “triglycerides,” “age,” “BMI,” “LDL cholesterol,” “uric acid,” “functional limitations,” “pulse,” “HDL cholesterol,” and “life expectancy.” For 2015, the prediction of cognitive classification model’s variable ranking was in the following order: “BMI,” “depression,” “pulse,” “systolic,” “education,” “total cholesterol,” “blood urea nitrogen (BUN),” “HDL,” “cholesterol,” “Uric acid,” and “HsCRP.” The important factors of the forecast model for the verification of horizontal data were organized in the following order: “BMI,” “pulse,” “depression,” “breath,” “creatinine,” “age,” “night sleep time,” “education,” “triglycerides,” and “total cholesterol.”

TABLE 4

	Model a¹	Importance	Model a²		Model b	Importance
1	Education	0.0702	BMI	0.0809	BMI	0.0883
2	Triglycerides	0.0694	Depression	0.0583	Pulse	0.0523
3	Age	0.0673	Pulse	0.0508	Depression	0.0521
4	BMI	0.0651	Systolic	0.0503	Breath	0.0481
5	LDL cholesterol	0.0563	Education	0.0462	Creatinine	0.0462
6	Uric acid	0.0561	Total cholesterol	0.0439	Age	0.0407
7	Functional limitations	0.0525	Blood urea nitrogen (BUN)	0.0405	Night time	0.0399
8	Pulse	0.0454	HDL cholesterol	0.0398	Education	0.0396
9	HDL cholesterol	0.0444	Uric acid	0.0384	Triglycerides	0.0392
10	Life expectancy	0.0407	HsCRP	0.0362	Total cholesterol	0.0382

Importance of variables of predictive models and validated predictive models (China,2022).

Note: Model a¹, Model a² and model b have the same meaning as the above table.

The importance of variables in this study is based on the contribution of features in each decision tree. The average contribution of all decision trees is the importance of the feature. The importance of the features in the decision tree depends on the change of the Gini coefficient of the nodes of the decision tree. The top ten features were screened and their relative importance was re-stated.

Discussion

Advantages of Machine Learning Methods

With the advances in the electronic age, the application of machine learning to clinical disease diagnosis, differential diagnosis, and disease prediction is increasing [42, 43]. This study provides a new method of machine learning to predict cognitive impairment in middle-aged and elderly people. The method of mutual verification of the combination of longitudinal and cross-sectional data was adopted to improve the effectiveness of the model. Good results were achieved, with the best model producing an AUC of 0.81 [41] compared with logistic regression [22, 42].

Indeed, logistic regression was previously considered a standard method for binary classification. Compared with machine learning methods, logistic regression is limited by assumptions of normality and linear relationships and may not evaluate the non-linear and complex relationships between physiological and social data. As a non-parametric technique, random forest in machine learning can overcome the shortcomings of under-fitting in traditional regression methods, and at the same time prevent over-fitting [11, 44]. It is thus considered a more flexible method for assessing complex interactions between variables. This study also corroborates previous findings that random forest is more suitable for the application of high-dimensional variables and large-scale data [45]. At present, it has become an alternative standard classification method to logistic regression [12]. Couronné et al. used 243 real data sets to conduct systematic large-scale comparative study, which showed that the average prediction performance of random forest is better than that of logistic regression [46]. In view of the fact that there are many variable dimensions involved in this study, the sample size was large. Predictive models for cognitive impairment in middle-aged and elderly people are lacking. Therefore, the present study established a predictive model of cognitive impairment in middle-aged and elderly people using the random forest method. The results indicated that the use of sociodemographic characteristics, health status and functioning, and emotional state could accurately predict cognitive impairment.

Impact of Demographic Variables on Cognitive Impairment

From the results, the demographic variables of age and education level are important predictors, which is similar to previous research results [22, 47]. Previous studies showed that the prevalence of dementia was higher among the people aged 65 years old or older [47]. Education level could reduce the risk of cognitive impairment and dementia [48, 49]. Compared with people with no education, less education was associated with a lower risk of cognitive impairment. Notably, education contributes to cognitive reserve [50, 51].

Impact of Health Status on Cognitive Impairment

Blood predictors including non-invasive markers were used in the current research to predict cognitive impairment. Non-invasive markers help predict patients with normal cognitive status or cognitive impairment, which may also contribute to better preventive measures [52, 53]. The results of this study indicated that Biomarkers in health status are important risk factors for predicting cognitive impairment. This finding is consistent with previous studies. We also found that uric acid, HsCRP, creatinine, LDL cholesterol, HDL cholesterol, total cholesterol, triglycerides, and BUN in venous blood samples were important predictors of the cognitive impairment. This finding is consistent with previous studies showing that low uric acid is a risk factor for cognitive impairment. When at an appropriate level, uric acid could reduce the occurrence and development of cognitive impairment [54]. Decreased creatinine concentration may indicate the occurrence of cognitive decline [55].

The results of epidemiological studies indicated the presence of a preliminary correlation between inflammation and cognitive impairment [56]. Noble, Manly [57] reported that the elderly with higher C-reactive protein levels are at a higher risk of memory impairment. Therefore, evidence supports the important role of this biomarker as a vascular risk factor for cognitive decline [58]. Furthermore, a 31-year longitudinal study concluded that the HsCRP changes during the middle age could reflect the underlying process of aging-related cognitive decline [59]. Longitudinal studies also found that higher baseline Triglycerides and LDL-C concentrations were associated with a higher rate of cognitive decline, but the effect of Triglycerides was not significant [60, 61]. Overall, our findings were slightly different from those of previous studies. Notably, HDL cholesterol and triglycerides showed an important predictive effect on cognitive impairment. The latest meta-analysis experiments have concluded that triglycerides potentially affect cognition [62–64].

For the indicators of physical examination, the findings of this study were in line with the previous findings that physiology is correlated with cognitive function was consistent. A useful indicator of physical health status is BMI, which screens for human weight. Furthermore, BMI ranked higher in terms of the important features for both regardless of it being the longitudinal prediction model and horizontal verification model. A longitudinal study conducted in South Korea found that obesity or weight loss in the later stages of life did not affect the risk of cognitive impairment [65]. Another study found that obesity at the middle age was an important predictor of the development of cognitive impairment in the later life phase [66]. In the latest research, it was proposed that being underweight was possibly an important risk factor for cognitive impairment among the elderly in China [26]. The results of a machine learning study also show that BMI ranked among the top 10 predictors of mild cognitive impairment (MCI) [21, 67]. Therefore, interventions for cognitive function among the elderly should target weight management.

The importance of blood pressure was identified. Similarly, previous studies highlighted a positive association between elevated diastolic or systolic blood pressure and the risk of cognitive impairment [68]. This condition played a crucial role in the guidance of routine clinical practice. To illustrate, effective control of blood pressure could reduce the risk of cognitive impairment, which was in line with previous research results [69]. Furthermore, in the present study systolic pressure was an important predictor of cognitive impairment, and has been established to cause cerebrovascular diseases and subsequently reduce cognitive ability [70, 71]. Pulse is also an important predictor. Previous studies also that showed the combination of higher pulse speed and age contributed to a gradual decrease in cognitive ability [72].

In a meta-analysis, a negative correlation was found between pulse wave velocity and cognition, particularly executive function, memory, and overall cognition. However, this association was independent of demographic, clinical, and evaluation characteristics [73]. In all horizontally verified prediction models, the feature of the breath (vital capacity) function was ranked fourth. Previous studies have also verified the relationship between lung function and cognitive function. A recent systematic review reported that although some research has shown a correlation between lung function and cognition, the result of the current study indicated several limitations and used a single measurement method [74]. Similarly, the present study, used only one type of expiratory volume as an indicator of lung function. Thus, further study is required to unravel the complex relationship among these factors.

Functional limitations have been proved as important predictors of cognitive impairment in previous studies. Multiple horizontal and longitudinal studies have concluded that poorer dysfunction is associated with more severe cognitive impairment [75, 76]. In recent years, functional limitations have been added to the cognitive impairment screening test [77, 78]. This proves that the negative effect of functional limitation in cognitive dysfunction is adequate.

Effect of Depression on Cognitive Impairment

The present study demonstrated that depression is an important predictor of cognitive impairment. In the predictive model, depression was ranked first in importance. Furthermore, the negative effect harm of depression has been proven in multiple studies, including accelerating individual cognitive impairment [29–31, 79, 80]. Depressive symptoms reflect the individual’s emotional state, and the 2-year predictive model demonstrated that life expectancy is also an important factor in the predictive model. Our results showed that life expectancy is one of the important factors affecting cognitive impairment. In fact, life expectancy refers to the possibility for middle-aged and elderly people to envision living until the expected age is reached, a higher life expectancy reflects a positive emotional state. Previous research found that positive emotions are more likely to protect against cognitive decline, whereas negative emotions are associated with a higher risk of mild cognitive impairment and dementia [81]. Therefore, relevant literature has suggested that dementia interventions based on established positive psychology principles could help elderly people cope with their diseases [82, 83]. The concept of improving cognitive function through psychosocial interventions is now also gaining acceptance [84].

Influence of Lifestyle and Behavior on Cognitive Function

In this study, night time sleep, as in lifestyle and behavior factor, played an important role in the horizontal data prediction model. Previous studies demonstrated that poor sleep quality was associated with poor cognitive function [85, 86]. As a solution, behavioral intervention to improve sleep could effectively improve the cognitive ability of patients with cognitive impairment [86], proving that night time sleep is important for the cognitive function of middle-aged and elderly people.

Conclusion

In conclusion, this study has proved the accuracy of ML using vertical and cross-sectional data. Importantly, this study added the measure of non-invasive marker blood. This survey has proved the utility of ML in long-term predictions of persistent cognitive impairment and identifications risk markers, supporting testing of new ML algorithms to predict disease progression. Furthermore, the availability of these variables suggests their potential use in screening for future cognitive impairment among the elderly in the community. The risk factors discovered could also be used by clinical staff to treat cognitive impairment and develop intervention programs. More importantly, the predictors of cognitive impairment found in this study through predictive models can be subjects of public education to reduce their effect on cognitive impairment in daily life. This is of great significance for preventing the deterioration of the quality of life of middle-aged and elderly people and promoting the healthy aging of society.

Limitation

There are some limitations in this study. First, this study only used the method of random forest in comparison with logistic regression. There is a lack of comparison with other methods, such as the support vector machine method. Although the advantages of random forest have been demonstrated in other studies, further comparison with other methods is needed. Second, although our model screened the important risk factors for cognitive impairment, the specific relationship between these risk factors and cognitive impairment, whether being positive or negative, remained unclear. Therefore, further research is required to elucidate the value of these variables in cognitive impairment. Third, the measure of cognitive impairment in this study was self-reported by division respondents rather than derived from clinical judgment. While this is a good way for the community to implement measurements, there is a risk of pseudocognitive impairment. Therefore, future research should incorporate clinical reporting to refine the findings.

Statements

Ethics statement

The studies involving human participants were reviewed and approved by the Peking University Biomedical Ethics Committee (IRB00001052-11015). The patients/participants provided their written informed consent to participate in this study.

Author contributions

HhL was responsible for the collection of data. XZ conducted the statistical analysis. HhL, HnL, XZ, and SC interpreted the data. HhL and HnL wrote the manuscript and manuscript preparation. HnL was responsible for data management. Overall, all authors provided suggestions during the preparation of the manuscript and approved the final version submitted for publication.

Funding

This work was funded by the Fundamental Research Funds for Chengde Medical University (KY202113), the Humanities and Social Science Research Project of Hebei Education Department (SQ2023137), the Hebei Natural Science Foundation (C2022406010), the Open Project of Hebei Key Laboratory of Nerve Injury and Repair (NJKF202105) and University-Level Scientific Research Project in CDMC (202307).

Acknowledgments

We thank all participants in the China Health and Retirement Longitudinal Study. We thank the CHARLS research team for providing the data. We acknowledge the contribution of Technology Innovation Guidance Project Science and Technology Work Conference of Hebei Provincial Department of Science and Technology supported this research.

Conflict of interest

The authors declare that they do not have any conflicts of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.ssph-journal.org/articles/10.3389/ijph.2023.1605322/full#supplementary-material

Supplementary Figure S1

Flowchart of data preprocessing and random forest data analysis experiment (China, 2022).

Supplementary Figure S2

Receiver operating characteristic curve of 2-year (2013). The area under the receiver operating characteristics curve (AUC) is 0.81 (China, 2022).

Supplementary Figure S3

Receiver operating characteristic curve of 4-year (2015). The area under the receiver operating characteristics curve (AUC) is 0.79 (China, 2022).

Supplementary Figure S4

Receiver operating characteristic curve of cross-sectional verification data (2015). The area under the receiver operating characteristics curve (AUC) is 0.80 (China, 2022).

Abbreviations

ADL, activities of daily living; CHARLS, China health and retirement longitudinal study; IADL, instrumental activities of daily living; ML, machine learning; TICS, telephone interview of cognitive status.

References

1.
PrinceMGuerchetMPrinaM. Policy Brief for Heads of Government: The Global Impact of Dementia 2013-2050. London: Alzheimer's Disease International (2013). p. 14.
- Google Scholar
2.
PrinceMJWimoAGuerchetMMAliGCWuYTPrinaM. World Alzheimer Report 2015 - the Global Impact of Dementia. London: Alzheimer's Disease International (2015). p. 89.
- Google Scholar
3.
PatnodeCDPerdueLARossomRCRushkinMCRedmondNThomasRGet alU.S. Preventive Services Task Force Evidence Syntheses, Formerly Systematic Evidence Reviews. Rockville, MD, USA: Agency for Healthcare Research and Quality (2020).
- Google Scholar
4.
EspelandMARappSRKatulaJAAndrewsLAFeltonDGaussoinSAet alTelephone Interview for Cognitive Status (TICS) Screening for Clinical Trials of Physical Activity and Cognitive Training: The Seniors Health and Activity Research Program Pilot (SHARP-P) Study. Int J Geriatr Psychiatry (2011) 26(2):135–43. 10.1002/gps.2503
- CrossRef
- Google Scholar
5.
LivingstonGHuntleyJSommerladAAmesDBallardCBanerjeeSet alDementia Prevention, Intervention, and Care: 2020 Report of the Lancet Commission. The Lancet (2020) 396(10248):413–46. 10.1016/s0140-6736(20)30367-6
- CrossRef
- Google Scholar
6.
WalshCGRibeiroJDFranklinJC. Predicting Suicide Attempts in Adolescents with Longitudinal Clinical Data and Machine Learning. J Child Psychol Psychiatry (2018) 59(12):1261–70. 10.1111/jcpp.12916
- CrossRef
- Google Scholar
7.
BurkeTAAmmermanBAJacobucciR. The Use of Machine Learning in the Study of Suicidal and Non-suicidal Self-Injurious Thoughts and Behaviors: A Systematic Review. J Affect Disord (2019) 245:869–84. 10.1016/j.jad.2018.11.073
- CrossRef
- Google Scholar
8.
van EedenWALuoCvan HemertAMCarlierIVEPenninxBWWardenaarKJet alPredicting the 9-year Course of Mood and Anxiety Disorders with Automated Machine Learning: A Comparison between Auto-Sklearn, Naïve Bayes Classifier, and Traditional Logistic Regression. Psychiatry Res (2021) 299:113823. 10.1016/j.psychres.2021.113823
- CrossRef
- Google Scholar
9.
KoppeGMeyer-LindenbergADurstewitzD. Deep Learning for Small and Big Data in Psychiatry. Neuropsychopharmacology (2021) 46(1):176–90. 10.1038/s41386-020-0767-z
- CrossRef
- Google Scholar
10.
FutomaJMorrisJLucasJ. A Comparison of Models for Predicting Early Hospital Readmissions. J Biomed Inform (2015) 56:229–38. 10.1016/j.jbi.2015.05.016
- CrossRef
- Google Scholar
11.
KesslerRCvan LooHMWardenaarKJBossarteRMBrennerLACaiTet alTesting a Machine-Learning Algorithm to Predict the Persistence and Severity of Major Depressive Disorder from Baseline Self-Reports. Mol Psychiatry (2016) 21(10):1366–71. 10.1038/mp.2015.198
- CrossRef
- Google Scholar
12.
BreimanL. Random Forests. Machine Learn (2001) 45(1):5–32. 10.1023/a:1010933404324
- CrossRef
- Google Scholar
13.
FathimaASManimeglaiD. Analysis of Significant Factors for Dengue Infection Prognosis Using the Random forest Classifier. Int J Adv Comput Sci Appl (2015) 6(2):240–5.
- Google Scholar
14.
LunettaKLHaywardLBSegalJVan EerdeweghP. Screening Large-Scale Association Study Data: Exploiting Interactions Using Random Forests. BMC Genet (2004) 5:32. 10.1186/1471-2156-5-32
- CrossRef
- Google Scholar
15.
KurokiY. Risk Factors for Suicidal Behaviors Among Filipino Americans: A Data Mining Approach. Am J Orthopsychiatry (2015) 85(1):34–42. 10.1037/ort0000018
- CrossRef
- Google Scholar
16.
ByeonH. A Prediction Model for Mild Cognitive Impairment Using Random Forests. J Int J Adv Comput Sci App (2015) 6(12):8–12. 10.14569/ijacsa.2015.061202
- CrossRef
- Google Scholar
17.
VelazquezMLeeYAlzheimer’s Disease Neuroimaging Initiative. Random forest Model for Feature-Based Alzheimer's Disease Conversion Prediction from Early Mild Cognitive Impairment Subjects. PLoS One (2021) 16(4):e0244773. 10.1371/journal.pone.0244773
- CrossRef
- Google Scholar
18.
AnwarSMMajidMQayyumAAwaisMAlnowamiMKhanMK. Medical Image Analysis Using Convolutional Neural Networks: A Review. J Med Syst (2018) 42(11):226. 10.1007/s10916-018-1088-1
- CrossRef
- Google Scholar
19.
LebedevAVWestmanEVan WestenGJKrambergerMGLundervoldAAarslandDet alRandom forest Ensembles for Detection and Prediction of Alzheimer's Disease with a Good Between-Cohort Robustness. Neuroimage Clin (2014) 6:115–25. 10.1016/j.nicl.2014.08.023
- CrossRef
- Google Scholar
20.
LiHLiuYGongPZhangCYeJAlzheimers Disease Neuroimaging Initiative. Hierarchical Interactions Model for Predicting Mild Cognitive Impairment (MCI) to Alzheimer's Disease (AD) Conversion. PLoS One (2014) 9(1):e82450. 10.1371/journal.pone.0082450
- CrossRef
- Google Scholar
21.
Gomez-RamirezJAvila-VillanuevaMFernandez-BlazquezMA. Selecting the Most Important Self-Assessed Features for Predicting Conversion to Mild Cognitive Impairment with Random forest and Permutation-Based Methods. Sci Rep (2020) 10(1):20630. 10.1038/s41598-020-77296-4
- CrossRef
- Google Scholar
22.
NaKS. Prediction of Future Cognitive Impairment Among the Community Elderly: A Machine-Learning Based Approach. Sci Rep (2019) 9(1):3335. 10.1038/s41598-019-39478-7
- CrossRef
- Google Scholar
23.
Abd-El MohsenSAAlgameelMMHawashMAbd ElrahmanSWafikW. Predicting Cognitive Impairment Among Geriatric Patients at Asir central Hospital, saudi arabia. Saudi J Biol Sci (2021) 28(10):5781–5. 10.1016/j.sjbs.2021.06.023
- CrossRef
- Google Scholar
24.
CostanzaAXekardakiAKovariEGoldGBourasCGiannakopoulosP. Microvascular burden and Alzheimer-type Lesions across the Age Spectrum. J Alzheimers Dis (2012) 32(3):643–52. 10.3233/JAD-2012-120835
- CrossRef
- Google Scholar
25.
PerezAManningKJPowellWBarryLC. Cognitive Impairment in Older Incarcerated Males: Education and Race Considerations. Am J Geriatr Psychiatry (2021) 29(10):1062–73. 10.1016/j.jagp.2021.05.014
- CrossRef
- Google Scholar
26.
RenZLiYLiXShiHZhaoHHeMet alAssociations of Body Mass index, Waist Circumference and Waist-To-Height Ratio with Cognitive Impairment Among Chinese Older Adults: Based on the Clhls. J Affect Disord (2021) 295:463–70. 10.1016/j.jad.2021.08.093
- CrossRef
- Google Scholar
27.
ChenL. Leisure Activities and Psychological Wellbeing Reduce the Risk of Cognitive Impairment Among Older Adults with Hearing Difficulty: A Longitudinal Study in china. Maturitas (2021) 148:7–13. 10.1016/j.maturitas.2021.03.011
- CrossRef
- Google Scholar
28.
KatayamaOLeeSBaeSMakinoKShinkaiYChibaIet alRelationship between Instrumental Activities of Daily Living Performance and Incidence of Mild Cognitive Impairment Among Older Adults: A 48-month Follow-Up Study. Arch Gerontol Geriatr (2021) 94:104034. 10.1016/j.archger.2020.104034
- CrossRef
- Google Scholar
29.
FangFGaoYSchulzPESelvarajSZhangY. Brain Controllability Distinctiveness between Depression and Cognitive Impairment. J Affect Disord (2021) 294:847–56. 10.1016/j.jad.2021.07.106
- CrossRef
- Google Scholar
30.
CostanzaAAmerioAAgugliaAEscelsiorASerafiniGBerardelliIet alWhen Sick Brain and Hopelessness Meet: Some Aspects of Suicidality in the Neurological Patient. CNS Neurol Disord Drug Targets (2020) 19(4):257–63. 10.2174/1871527319666200611130804
- CrossRef
- Google Scholar
31.
CostanzaABaertschiMWeberKCanutoACanutoA. Neurological Diseases and Suicide: From Neurobiology to Hopelessness. Rev Med Suisse (2015) 11(461):402–5.
- Google Scholar
32.
LuHLiuJGuGLiXYinSCuiD. Nonlinear Phase Synchronization Analysis of Eeg Signals in Amnesic Mild Cognitive Impairment with Type 2 Diabetes Mellitus. Neuroscience (2021) 472:25–34. 10.1016/j.neuroscience.2021.07.022
- CrossRef
- Google Scholar
33.
ZhaoYHuYSmithJPStraussJYangG. Cohort Profile: The china Health and Retirement Longitudinal Study (Charls). Int J Epidemiol (2014) 43(1):61–8. 10.1093/ije/dys203
- CrossRef
- Google Scholar
34.
ChenCParkJWuCXueQAgogoGHanLet alCognitive Frailty in Relation to Adverse Health Outcomes Independent of Multimorbidity: Results from the china Health and Retirement Longitudinal Study. Aging (2020) 12(22):23129–45. 10.18632/aging.104078
- CrossRef
- Google Scholar
35.
PangmanVCSloanJGuseL. An Examination of Psychometric Properties of the Mini-Mental State Examination and the Standardized Mini-Mental State Examination: Implications for Clinical Practice. Appl Nurs Res (2000) 13(4):209–13. 10.1053/apnr.2000.9231
- CrossRef
- Google Scholar
36.
TroyerAK. Activities of Daily Living (ADL). In: KreutzerJSDeLucaJCaplanB, editors. Encyclopedia of Clinical Neuropsychology. New York, NY: Springer New York (2011). p. 28–30.
- Google Scholar
37.
CrimminsEMSeemanT. Integrating Biology into Demographic Research on Health and Aging (With a Focus on the Macarthur Study of Successful Aging). Cells and Surveys: Should Biological Measures Be Included in Social Science Research?Washington, DC, USA: US National Academies Press (2001).
- Google Scholar
38.
SeemanC. Supplement: Aging, Health, and Public Policy || Integrating Biology into the Study of Health Disparities. Popul Develop Rev (2004) 30:89–107.
- Google Scholar
39.
WilliamsMWLiCYHayCC. Validation of the 10-item center for Epidemiologic Studies Depression Scale post Stroke. J Stroke Cerebrovasc Dis (2020) 29(12):105334. 10.1016/j.jstrokecerebrovasdis.2020.105334
- CrossRef
- Google Scholar
40.
JakAJBondiMWDelano-WoodLWierengaCCorey-BloomJSalmonDPet alQuantification of Five Neuropsychological Approaches to Defining Mild Cognitive Impairment. Am J Geriatr Psychiatry (2009) 17(5):368–75. 10.1097/JGP.0b013e31819431d5
- CrossRef
- Google Scholar
41.
ŠimundićA-M. Measures of Diagnostic Accuracy: Basic Definitions. EJIFCC (2009) 19(4):203–11.
- Google Scholar
42.
HaynosAFWangSBLipsonSPetersonCBMitchellJEHalmiKAet alMachine Learning Enhances Prediction of Illness Course: A Longitudinal Study in Eating Disorders. Psychol Med (2020) 51:1392–402. 10.1017/S0033291720000227
- CrossRef
- Google Scholar
43.
NagarajSDuongTQ. Deep Learning and Risk Score Classification of Mild Cognitive Impairment and Alzheimer’s Disease. J Alzheimer's Dis (2021) 80(3):1079–90. 10.3233/jad-201438
- CrossRef
- Google Scholar
44.
AnCLimHKimDWChangJHChoiYJKimSW. Machine Learning Prediction for Mortality of Patients Diagnosed with Covid-19: A Nationwide Korean Cohort Study. Sci Rep (2020) 10(1):18716. 10.1038/s41598-020-75767-2
- CrossRef
- Google Scholar
45.
MarocoJSilvaDRodriguesAGuerreiroMSantanaIde MendonçaA. Data Mining Methods in the Prediction of Dementia: A Real-Data Comparison of the Accuracy, Sensitivity and Specificity of Linear Discriminant Analysis, Logistic Regression, Neural Networks, Support Vector Machines, Classification Trees and Random Forests. BMC Res Notes (2011) 4(1):299. 10.1186/1756-0500-4-299
- CrossRef
- Google Scholar
46.
CouronneRProbstPBoulesteixAL. Random forest versus Logistic Regression: A Large-Scale Benchmark experiment. BMC Bioinformatics (2018) 19(1):270. 10.1186/s12859-018-2264-5
- CrossRef
- Google Scholar
47.
YangLJinXYanJJinYXuSXuYet alComparison of Prevalence and Associated Risk Factors of Cognitive Function Status Among Elderly between Nursing Homes and Common Communities of china: A Strobe-Compliant Observational Study. Medicine (2019) 98(49):e18248. 10.1097/MD.0000000000018248
- CrossRef
- Google Scholar
48.
SattlerCToroPSchonknechtPSchroderJ. Cognitive Activity, Education and Socioeconomic Status as Preventive Factors for Mild Cognitive Impairment and Alzheimer's Disease. Psychiatry Res (2012) 196(1):90–5. 10.1016/j.psychres.2011.11.012
- CrossRef
- Google Scholar
49.
VanohDShaharSRazaliRManafZAHamidTA. Influence of Gender Disparity in Predicting Occurrence of Successful Aging, Usual Aging and Mild Cognitive Impairment. J Int J Gerontol (2019) 13(3):207–11. 10.6890/IJGE.201909_13(3).0005
- CrossRef
- Google Scholar
50.
FarfelJMNitriniRSuemotoCKGrinbergLTFerrettiRELeiteREet alVery Low Levels of Education and Cognitive reserve: A Clinicopathologic Study. Neurology (2013) 81(7):650–7. 10.1212/WNL.0b013e3182a08f1b
- CrossRef
- Google Scholar
51.
RoeCMXiongCMillerJPMorrisJC. Education and Alzheimer Disease without Dementia: Support for the Cognitive reserve Hypothesis. Neurology (2007) 68(3):223–8. 10.1212/01.wnl.0000251303.50459.8a
- CrossRef
- Google Scholar
52.
GetsiosDMigliaccio-WalleKCaroJJ. Nice Cost-Effectiveness Appraisal of Cholinesterase Inhibitors. Pharmacoeconomics (2007) 25(12):997–1006. 10.2165/00019053-200725120-00003
- CrossRef
- Google Scholar
53.
Jimenez-BaladoJMaisterraODelgadoP. Non-invasive Markers of Vascular Disease: An Opportunity for Early Diagnosis of Cognitive Impairment. Atherosclerosis (2020) 312:101–3. 10.1016/j.atherosclerosis.2020.10.002
- CrossRef
- Google Scholar
54.
XueLLiuYXueHXueJSunKWuLet alLow Uric Acid Is a Risk Factor in Mild Cognitive Impairment. Neuropsychiatr Dis Treat (2017) 13:2363–7. 10.2147/NDT.S145812
- CrossRef
- Google Scholar
55.
JohansonaCEStopaEGDaielloLDe la MonteSMKeaneMOttBR. Disrupted Blood-Csf Barrier to Urea and Creatinine in Mild Cognitive Impairment and Alzheimer's Disease. J Alzheimer's Dis (2018) 08(2):435. 10.4172/2161-0460.1000435
- CrossRef
- Google Scholar
56.
GorelickPB. Role of Inflammation in Cognitive Impairment: Results of Observational Epidemiological Studies and Clinical Trials. Ann N Y Acad Sci (2010) 1207:155–62. 10.1111/j.1749-6632.2010.05726.x
- CrossRef
- Google Scholar
57.
NobleJMManlyJJSchupfNTangMXMayeuxRLuchsingerJA. Association of C-Reactive Protein with Cognitive Impairment. Arch Neurol (2010) 67(1):87–92. 10.1001/archneurol.2009.308
- CrossRef
- Google Scholar
58.
OoiTCMeramatARajabNFShaharSIsmailISAzamAAet alIntermittent Fasting Enhanced the Cognitive Function in Older Adults with Mild Cognitive Impairment by Inducing Biochemical and Metabolic Changes: A 3-year Progressive Study. Nutrients (2020) 12(9):2644. 10.3390/nu12092644
- CrossRef
- Google Scholar
59.
LaurinDDavid CurbJMasakiKHWhiteLRLaunerLJ. Midlife C-Reactive Protein and Risk of Cognitive Decline: A 31-year Follow-Up. Neurobiol Aging (2009) 30(11):1724–7. 10.1016/j.neurobiolaging.2008.01.008
- CrossRef
- Google Scholar
60.
MaCYinZZhuPLuoJShiXGaoX. Blood Cholesterol in Late-Life and Cognitive Decline: A Longitudinal Study of the Chinese Elderly. Mol Neurodegener (2017) 12(1):24. 10.1186/s13024-017-0167-y
- CrossRef
- Google Scholar
61.
ParthasarathyVFrazierDTBettcherBMJastrzabLChaoLReedBet alTriglycerides Are Negatively Correlated with Cognitive Function in Nondemented Aging Adults. Neuropsychology (2017) 31(6):682–8. 10.1037/neu0000335
- CrossRef
- Google Scholar
62.
AvgerinosKIEganJMMattsonMPKapogiannisD. Medium Chain Triglycerides Induce Mild Ketosis and May Improve Cognition in Alzheimer's Disease. A Systematic Review and Meta-Analysis of Human Studies. Ageing Res Rev (2020) 58:101001. 10.1016/j.arr.2019.101001
- CrossRef
- Google Scholar
63.
DimacheAMSalaruDLSascauRStatescuC. The Role of High Triglycerides Level in Predicting Cognitive Impairment: A Review of Current Evidence. Nutrients (2021) 13(6):2118. 10.3390/nu13062118
- CrossRef
- Google Scholar
64.
PondugulaSRMajrashiMAlmaghrabiMAbbottKGovindarajuluMRameshSet alPredictable Hematological Markers Associated with Cognitive Decline in Valid Rodent Models of Cognitive Impairment. Toxicol Mech Methods (2020) 30(6):454–61. 10.1080/15376516.2020.1760984
- CrossRef
- Google Scholar
65.
NohHMHanJKimYJJungJHRohYKSongHJ. Sex Differences in the Relationship between Cognitive Impairment and Overweight or Obesity in Late Life: A 3-year Prospective Study. Medicine (2019) 98(9):e14736. 10.1097/MD.0000000000014736
- CrossRef
- Google Scholar
66.
XuWLAttiARGatzMPedersenNLJohanssonBFratiglioniL. Midlife Overweight and Obesity Increase Late-Life Dementia Risk: A Population-Based Twin Study. Neurology (2011) 76(18):1568–74. 10.1212/WNL.0b013e3182190d09
- CrossRef
- Google Scholar
67.
AschwandenDAicheleSGhislettaPTerraccianoAKliegelMSutinARet alPredicting Cognitive Impairment and Dementia: A Machine Learning Approach. J Alzheimers Dis (2020) 75(3):717–28. 10.3233/JAD-190967
- CrossRef
- Google Scholar
68.
RavagliaGFortiPMaioliFMartelliMServadeiLBrunettiNet alConversion of Mild Cognitive Impairment to Dementia: Predictive Role of Mild Cognitive Impairment Subtypes and Vascular Risk Factors. Dement Geriatr Cogn Disord (2006) 21(1):51–8. 10.1159/000089515
- CrossRef
- Google Scholar
69.
XiaCVonderMSidorenkovGOudkerkMde GrootJCvan der HarstPet alThe Relationship of Coronary Artery Calcium and Clinical Coronary Artery Disease with Cognitive Function: A Systematic Review and Meta-Analysis. J Atheroscler Thromb (2020) 27(9):934–58. 10.5551/jat.52928
- CrossRef
- Google Scholar
70.
StreitSPoortvlietRKEElzenWBlomJWGusseklooJ. Systolic Blood Pressure and Cognitive Decline in Older Adults with Hypertension. Ann Fam Med (2019) 17(2):100–7. 10.1370/afm.2367
- CrossRef
- Google Scholar
71.
de MontgolfierOPouliotPGillisMAFerlandGLesageFThorin-TrescasesNet alSystolic Hypertension-Induced Neurovascular Unit Disruption Magnifies Vascular Cognitive Impairment in Middle-Age Atherosclerotic ldlr(-/-):Hapob(+/+) Mice. Geroscience (2019) 41(5):511–32. 10.1007/s11357-019-00070-6
- CrossRef
- Google Scholar
72.
EliasMFRobbinsMABudgeMMAbhayaratnaWPDoreGAEliasPK. Arterial Pulse Wave Velocity and Cognition with Advancing Age. Hypertension (2009) 53(4):668–73. 10.1161/HYPERTENSIONAHA.108.126342
- CrossRef
- Google Scholar
73.
Alvarez-BuenoCCunhaPGMartinez-VizcainoVPozuelo-CarrascosaDPVisier-AlfonsoMEJimenez-LopezEet alArterial Stiffness and Cognition Among Adults: A Systematic Review and Meta-Analysis of Observational and Longitudinal Studies. J Am Heart Assoc (2020) 9(5):e014621. 10.1161/JAHA.119.014621
- CrossRef
- Google Scholar
74.
DugganECGrahamRBPiccininAMJenkinsNDCloustonSMuniz-TerreraGet alSystematic Review of Pulmonary Function and Cognition in Aging. J Gerontol B Psychol Sci Soc Sci (2020) 75(5):937–52. 10.1093/geronb/gby128
- CrossRef
- Google Scholar
75.
LeeHHHongCTWuDChiWCYenCFLiaoHFet alAssociation between Ambulatory Status and Functional Disability in Elderly People with Dementia. Int J Environ Res Public Health (2019) 16(12):2168. 10.3390/ijerph16122168
- CrossRef
- Google Scholar
76.
FariasSTLauKHarveyDDennyKGBarbaCMeffordAN. Early Functional Limitations in Cognitively normal Older Adults Predict Diagnostic Conversion to Mild Cognitive Impairment. J Am Geriatr Soc (2017) 65(6):1152–8. 10.1111/jgs.14835
- CrossRef
- Google Scholar
77.
Roebuck-SpencerTMGlenTPuenteAEDenneyRLRuffRMHostetterGet alCognitive Screening Tests versus Comprehensive Neuropsychological Test Batteries: A National Academy of Neuropsychology Education Paper†. Arch Clin Neuropsychol (2017) 32(4):491–8. 10.1093/arclin/acx021
- CrossRef
- Google Scholar
78.
Md FadzilNHShaharSRajikanRSinghDKAMat LudinAFSubramaniamPet alA Scoping Review for Usage of Telerehabilitation Among Older Adults with Mild Cognitive Impairment or Cognitive Frailty. Int J Environ Res Public Health (2022) 19(7):4000. 10.3390/ijerph19074000
- CrossRef
- Google Scholar
79.
LiewTM. Depression, Subjective Cognitive Decline, and the Risk of Neurocognitive Disorders. Alzheimers Res Ther (2019) 11(1):70. 10.1186/s13195-019-0527-7
- CrossRef
- Google Scholar
80.
ZlatarZZMunizMCEspinozaSGGratianneRGollanTHGalaskoDet alSubjective Cognitive Decline, Objective Cognition, and Depression in Older Hispanics Screened for Memory Impairment. J Alzheimers Dis (2018) 63(3):949–56. 10.3233/jad-170865
- CrossRef
- Google Scholar
81.
Dos SantosSBRochaGPFernandezLLde PaduaACReppoldCT. Association of Lower Spiritual Well-Being, Social Support, Self-Esteem, Subjective Well-Being, Optimism and hope Scores with Mild Cognitive Impairment and Mild Dementia. Front Psychol (2018) 9:371. 10.3389/fpsyg.2018.00371
- CrossRef
- Google Scholar
82.
KorthauerLEGoveasJEspelandMAShumakerSAGarciaKRTindleHet alNegative Affect Is Associated with Higher Risk of Incident Cognitive Impairment in Nondepressed Postmenopausal Women. J Gerontol A Biol Sci Med Sci (2018) 73(4):506–12. 10.1093/gerona/glx175
- CrossRef
- Google Scholar
83.
AdamDRamliAShaharS. Effectiveness of a Combined Dance and Relaxation Intervention on Reducing Anxiety and Depression and Improving Quality of Life Among the Cognitively Impaired Elderly. Sultan Qaboos Univ Med J (2016) 16(1):e47–53. 10.18295/squmj.2016.16.01.009
- CrossRef
- Google Scholar
84.
Mohd SafienAIbrahimNSubramaniamPShaharSDinNCIsmailAet alRandomized Controlled Trials of a Psychosocial Intervention for Improving the Cognitive Function Among Older Adults: A Scoping Review. Gerontol Geriatr Med (2021) 7:23337214211025167. 10.1177/23337214211025167
- CrossRef
- Google Scholar
85.
DzierzewskiJMDautovichNRavytsS. Sleep and Cognition in Older Adults. Sleep Med Clin (2018) 13(1):93–106. 10.1016/j.jsmc.2017.09.009
- CrossRef
- Google Scholar
86.
BubuOMAndradeAGUmasabor-BubuOQHoganMMTurnerADde LeonMJet alObstructive Sleep Apnea, Cognition and Alzheimer's Disease: A Systematic Review Integrating Three Decades of Multidisciplinary Research. Sleep Med Rev (2020) 50:101250. 10.1016/j.smrv.2019.101250
- CrossRef
- Google Scholar

Summary

Keywords

longitudinal study, machine learning, random forest, middle-aged and older Chinese, cognitive impairment, dementia

Citation

Liu H, Zhang X, Liu H and Chong ST (2023) Using Machine Learning to Predict Cognitive Impairment Among Middle-Aged and Older Chinese: A Longitudinal Study. Int J Public Health 68:1605322. doi: 10.3389/ijph.2023.1605322

Received

15 August 2022

Accepted

09 January 2023

Published

19 January 2023

Volume

68 - 2023

Edited by

Gabriel Gulis, University of Southern Denmark, Denmark

Reviewed by

Alessandra Costanza, University of Geneva, Switzerland

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Haining Liu, liuhn0401@sina.com; Sheau Tsuey Chong, stchong@ukm.edu.my

This Original Article is part of the IJPH Special Issue “Public Health and Primary Care, is 1+1=1?”

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

ORIGINAL ARTICLE

Using Machine Learning to Predict Cognitive Impairment Among Middle-Aged and Older Chinese: A Longitudinal Study

Abstract

Introduction

Methods

Dataset and Participants

Patient and Public Involvement

Cognitive Function

Demographics

Health Status and Functioning

Emotional Status

Lifestyle and Behavior

Outcome Variables

Performance Evaluation and Data Analyses

Descriptive Analysis

Results

Sample Characteristics

Model Performance

Predictor Variables Importance

Discussion

Advantages of Machine Learning Methods

Impact of Demographic Variables on Cognitive Impairment

Impact of Health Status on Cognitive Impairment

Effect of Depression on Cognitive Impairment

Influence of Lifestyle and Behavior on Cognitive Function

Conclusion

Limitation

Statements

Ethics statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Supplementary material

Abbreviations

References

Summary

Outline

Cite article

ORIGINAL ARTICLE

Using Machine Learning to Predict Cognitive Impairment Among Middle-Aged and Older Chinese: A Longitudinal Study

Abstract

Introduction

Methods

Dataset and Participants

Patient and Public Involvement

Cognitive Function

Demographics

Health Status and Functioning

Emotional Status

Lifestyle and Behavior

Outcome Variables

Performance Evaluation and Data Analyses

Descriptive Analysis

Results

Sample Characteristics

Model Performance

Predictor Variables Importance

Discussion

Advantages of Machine Learning Methods

Impact of Demographic Variables on Cognitive Impairment

Impact of Health Status on Cognitive Impairment

Effect of Depression on Cognitive Impairment

Influence of Lifestyle and Behavior on Cognitive Function

Conclusion

Limitation

Statements

Ethics statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Supplementary material

Abbreviations

References

Summary

Outline

Cite article

Share article