Journal of Shandong University (Health Sciences) ›› 2024, Vol. 62 ›› Issue (11): 73-84.doi: 10.6040/j.issn.1671-7554.0.2024.0723

• Public Health and Preventive Medicine • Previous Articles    

Development of the Bayesian network-based screening model for ischemic stroke

ZHANG Botao1,2,3, ZHANG Shuaijie1,2,3, SUN Shuangshuang1,2,3, YUAN Ying1,2,3, HU Xifeng1,2,3, JIA Xiaofeng4, YU Yuanyuan2,5, XUE Fuzhong1,2,3   

  1. 1. Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan 250012, Shandong, China;
    2. Healthcare Big Data Research Institute, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan 250003, Shandong, China;
    3. Qilu Hospital of Shandong University, Jinan 250012, Shandong, China;
    4. Health and Wellness Assurance Center Network Information Office of Boxing County, Binzhou 256500, Shandong, China;
    5. Data Science Institute, Shandong University, Jinan 250100, Shandong, China
  • Published:2024-11-25

Abstract: Objective To develop a screening model for ischemic stroke by relying on large-scale electronic health records and combining the advantages of Bayesian network uncertainty inference. Methods The screening model derivation cohort was derived from the Cheeloo Lifespan Electronic Health Research Data-library(Cheeloo LEAD)and divided into training and testing sets in a 7∶3 ratio. The external validation cohort was sourced from the Boxing Collaboration Center Database of the National Healthcare Big Data Research Institute(Boxing Database). The univariate Logistic regression analysis was used to screen for factors significantly associated with the ischemic stroke. These associated screening factors were used to develop the Bayesian network. The tabu search algorithm was employed for structure learning, while Bayesian estimation algorithm was used for parameter learning, ultimately leading to the development of the ischemic stroke screening model. The performance of the model was evaluated in terms of both discrimination and calibration abilities, and compared with the traditional Logistic regression model in screening for ischemic stroke. Results The derivation cohort included 1,067,609 individuals, among whom 31,019 suffered from ischemic stroke. The external validation cohort included 386,773 individuals, among whom 13,393 suffered from ischemic stroke. After the univariate screening, 67 screening factors were identified. The final Bayesian network model included 68 nodes and 440 directed edges. The parent nodes of the ischemic stroke node included age, hypertensive diseases, ischemic heart diseases, chronic lower respiratory diseases, other cerebrovascular diseases, episodic and paroxysmal disorders, and the symptoms and signs involved cognition, perception, emotional state and behavior. The AUC for the training set, testing set, and external validation cohort were 0.840(95%CI: 0.838-0.843), 0.839(95%CI: 0.836-0.843), and 0.811(95%CI: 0.808-0.814), respectively, indicating good discrimination ability, and calibration ability also performed well. Our newly developed screening model continued to outperform the traditional Logistic regression screening model, even in the presence of missing data. Conclusion This study developed the ischemic stroke screening model with the advantage of Bayesian network uncertainty inference. The model has good discrimination and calibration abilities, providing a convenient and efficient method for early ischemic stroke screening.

Key words: Electronic health records, Bayesian network, Logistic regression, Ischemic stroke, Screening model

CLC Number: 

  • R743.3
[1] Li XM, Bian D, Yu JH, et al. Using machine learning models to improve stroke risk level classification methods of China national stroke screening[J]. BMC Med Inform Decis Mak, 2019, 19(1): 261. doi:10.1186/s12911-019-0998-2.
[2] GBD Stroke Collaborators. Global, regional, and national burden of stroke and its risk factors, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019[J]. Lancet Neurol, 2021, 20(10): 795-820.
[3] 《中国脑卒中防治报告 2021》编写组. 《中国脑卒中防治报告2021》概要[J]. 中国脑血管病杂志, 2023, 20(11): 783-793. Report on Stroke Prevention and Treatment in China Writing Group. Brief report on stroke prevention and treatment in China, 2021 [J]. Chinese Journal of Cerebrovascular Diseases, 2023, 20(11): 783-793.
[4] Wolf PA, DAgostino RB, Belanger AJ, et al. Probability of stroke: a risk profile from the Framingham Study[J]. Stroke, 1991, 22(3): 312-318.
[5] DAgostino RB, Wolf PA, Belanger AJ, et al. Stroke risk profile: adjustment for antihypertensive medication. The Framingham Study[J]. Stroke, 1994, 25(1): 40-43.
[6] Andrus B, Lacaille D. 2013 ACC/AHA guideline on the assessment of cardiovascular risk[J]. J Am Coll Cardiol, 2014, 63(25): 2886. doi:10.1016/j.jacc.2014.02.606.
[7] Parmar P, Krishnamurthi R, Ikram MA, et al. The Stroke Riskometer(TM)App: validation of a data collection tool and stroke risk predictor[J]. Int J Stroke, 2015, 10(2): 231-244.
[8] Vartiainen E, Laatikainen T, Peltonen M, et al. Predicting coronary heart disease and stroke: the FINRISK calculator[J]. Glob Heart, 2016, 11(2): 213-216.
[9] 国家“十五” 攻关“冠心病、脑卒中综合危险度评估及干预方案的研究”课题组. 国人缺血性心血管病发病危险的评估方法及简易评估工具的开发研究[J]. 中华心血管病杂志, 2003, 31(12): 893-901. The Collaborative Research Group of the National 10th Five Year Plan Projiect: a Study on Evaluation and Intervention of the Coronary Heart Disease and Stroke Integrated Risk. A study on evaluation of the risk of ischemic cardiovascular diseases in Chinese and the development of simplified tools for the evaluation[J]. Chinese Journal of Cardiology, 2003, 31(12): 893-901.
[10] Yang XL, Li JX, Hu DS, et al. Predicting the 10-year risks of atherosclerotic cardiovascular disease in Chinese population: the china-PAR project(prediction for ASCVD risk in China)[J]. Circulation, 2016, 134(19): 1430-1440.
[11] Carroll RJ, Eyler AE, Denny JC. Intelligent use and clinical benefits of electronic health records in rheumatoid arthritis[J]. Expert Rev Clin Immunol, 2015, 11(3): 329-337.
[12] Liao WQ, Coupland CAC, Burchardt J, et al. Predicting the future risk of lung cancer: development, and internal and external validation of the CanPredict(lung)model in 19.67 million people and evaluation of model performance against seven other risk prediction models[J]. Lancet Respir Med, 2023, 11(8): 685-697.
[13] Han YT, Zhu X, Hu YZ, et al. Electronic health record-based absolute risk prediction model for esophageal cancer in the Chinese population: model development and external validation[J]. JMIR Public Health Surveill, 2023, 9: e43725. doi:10.2196/43725.
[14] Kyburg HE, Pearl J. Probabilistic reasoning in intelligent systems: networks of plausible inference[J]. J Philos, 1991, 88(8): 434. doi:10.2307/2026705.
[15] Yuan BY, Wang CB, Fan ZX, et al. A Bayesian network-based approach for identifying risk factors and predicting ischemic stroke in infective endocarditis patients[J]. Front Cardiovasc Med, 2023, 10: 1294229. doi:10.3389/fcvm.2023.1294229.
[16] Fan ZX, Wang CB, Fang LB, et al. Risk factors and a Bayesian network model to predict ischemic stroke in patients with dilated cardiomyopathy[J]. Front Neurosci, 2022, 16: 1043922. doi:10.3389/fnins.2022.1043922.
[17] Larrañaga P, Karshenas H, Bielza C, et al. A review on evolutionary algorithms in Bayesian network learning and inference tasks[J]. Inf Sci, 2013, 233: 109-125. doi:10.1016/j.ins.2012.12.051.
[18] Heckerman D, Geiger D, Chickering DM. Learning Bayesian networks: the combination of knowledge and statistical data[J]. Mach Learn, 1995, 20(3): 197-243.
[19] Glover F. Artificial intelligence, heuristic frameworks and tabu search[J]. Manag Decis Econ, 1990, 11(5): 365-375.
[20] Friedman N, Goldszmidt M, Wyner A. Data analysis with Bayesian networks: a bootstrap approach[EB/OL].(2013-01-23)[2024-07-01]. 2013, arXiv: 1301.6695. http://arxiv.org/abs/1301.6695.
[21] Peng Y, Zhang SY, Pan R. Bayesian network reasoning with uncertain evidences[J]. Int J Unc Fuzz Knowl Based Syst, 2010, 18(5): 539-564.
[22] Fung R, Chang KC. Weighing and integrating evidence for stochastic simulation in Bayesian networks[J]. Machine Intelligence and Pattern Recognition, 1990, 10: 209-219. doi:10.1016/B978-0-444-88738-2.50023-3.
[23] Shachter RD, Peot MA. Simulation approaches to general probabilistic inference on belief networks[J]. Machine Intelligence and Pattern Recognition, 1990, 10: 221-231. doi:http://dx.doi.org/.
[24] 鞠柠蔚, 蒋红, 宋润琛,等. 脑卒中患病影响因素的研究现状[J]. 中外医学研究, 2024, 22(14): 165-171. JU Ningwei, JIANG Hong, SONG Runchen, et al. Research status of influencing factors of stroke [J]. 2024, 22(14): 165-171.
[25] Kelly-Hayes M. Influence of age and health behaviors on stroke risk: lessons from longitudinal studies[J]. J Am Geriatr Soc, 2010, 58(Suppl 2): S325-S328.
[26] ODonnell MJ, Xavier D, Liu LS, et al. Risk factors for ischaemic and intracerebral haemorrhagic stroke in 22 countries(the INTERSTROKE study): a case-control study[J]. Lancet, 2010, 376(9735): 112-123.
[27] Sundbøll J, Horváth-Puhó E, Schmidt M, et al. Long-term risk of stroke in myocardial infarction survivors thirty-year population-based cohort study[J]. Stroke, 2016, 47(7): 1727-1733.
[28] 李丹. 老年慢性阻塞性肺疾病合并卒中的临床分析[J]. 中国卒中杂志, 2023, 18(10): 1216.
[29] Wen LY, Ni H, Li KS, et al. Asthma and risk of stroke: a systematic review and meta-analysis[J]. J Stroke Cerebrovasc Dis, 2016, 25(3): 497-503.
[30] Chang RW, Tucker LY, Rothenberg KA, et al. Incidence of ischemic stroke in patients with asymptomatic severe carotid stenosis without surgical intervention[J]. JAMA, 2022, 327(20): 1974-1982.
[31] Holmstedt CA, Turan TN, Chimowitz MI. Atherosclerotic intracranial arterial stenosis: risk factors, diagnosis, and treatment[J]. Lancet Neurol, 2013, 12(11): 1106-1114.
[32] Johnston SC, Gress DR, Browner WS, et al. Short-term prognosis after emergency department diagnosis of TIA[J]. JAMA, 2000, 284(22): 2901-2906.
[33] 潘飞豹, 周冀英, 谭戈. 偏头痛与脑卒中:复杂的关联[J]. 重庆医学, 2013, 42(22): 2665-2667.
[34] 张亦舒, 李晓宁, 苗晨欣. 睡眠障碍与卒中风险的因果关系:孟德尔随机化研究[J]. 现代医学, 2023, 51(11): 1559-1565. ZHANG Yishu, LI Xiaoning, MIAO Chenxin. Sleep disorder and stroke risk: a Mendelian randomization study [J]. Modern Medical Journal, 2023, 51(11): 1559-1565.
[1] ZHONG Lu, XUE Fuzhong. A Lung cancer risk prediction model based on Bayesian network uncertainty inference [J]. Journal of Shandong University (Health Sciences), 2023, 61(4): 86-94.
[2] JIANG Zhen, SUN Jing, ZOU Wen, WANG Changchang, GAO Qi. A comparison study of suicidal behavior predictive models of bipolar disorder patients based on two machine learning algorithms [J]. Journal of Shandong University (Health Sciences), 2022, 60(1): 101-108.
[3] YANG Jiulong, YU Tao, XUE Fuzhong. Distribution of cerebrovascular stenosis and construction of a screening model [J]. Journal of Shandong University (Health Sciences), 2021, 59(11): 114-119.
[4] WANG Xiaoxuan, ZHU Gaopei, SUN Na, FENG Jianing, XIAO Yufei, SHI Fuyan, WANG Suzhen. Analysis on the health status and influencing factors of poor people in Sanchunji County of Dong ming [J]. Journal of Shandong University (Health Sciences), 2021, 59(1): 108-114.
[5] WANG Yuyu, GAO Li, CHEN Shaohua. Association between cognitive impairment and thyroid function after acute ischemic stroke in 94 patients with type 2 diabetes mellitus [J]. Journal of Shandong University (Health Sciences), 2020, 58(5): 56-61.
[6] ZHANG Ning, YANG Yan, LI Rui, YIN Yunhong, LI Hao, QU Yiqing. Analysis of risk factors and drug resistance of Acinetobacter baumannii in patients with chronic obstructive pulmonary disease [J]. Journal of Shandong University (Health Sciences), 2019, 57(9): 88-96.
[7] CHEN Xi, LIU Xinyu, FAN Lixia, TIAN Yonghao, YUAN Suomao. Clinical value of carotid ultrasound in evaluating ischemic stroke risk in elderly patients with spinal diseases [J]. Journal of Shandong University (Health Sciences), 2019, 57(5): 48-55.
[8] GONG Zhuo, ZHANG Minmin, WANG Zhiping. Influence of abortion and family heredity history on the risk of uterine leiomyomas [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2017, 55(9): 100-104.
[9] LI Xiaoying, LIU Fang, CHE Haijie, ZHANG Jinhui. Establishment and preliminary evaluation of a model for predicting the malignant probability of solitary pulmonary nodule with tumor markers [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2017, 55(4): 60-64.
[10] WANG Yongqiang, ZHANG Weiquan, SUN Qifeng, DONG Xiaopeng, PENG Chuanliang, ZHANG Yuan, ZHAO Xiaogang. Expression and significance of CK14 in non-small cell lung cancer [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2017, 55(3): 83-87.
[11] LI Fan, SHANG Wei. Clinical predictors of remote lesions for patients with acute ischemic stroke [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2016, 54(6): 61-64.
[12] WANG Yihua, MA Lin, KAN Yanmin, WANG Wentao. Early ultrasound observation of uterine incision defect after cesarean section and analysis of related factors [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2016, 54(5): 39-44.
[13] QU Lixin, SHI Xinghua, DU Yifeng. Relationship between circulating platelets microparticles and endothelial microparticls and prognosis in patients with acute ischemic stroke [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2016, 54(12): 32-36.
[14] CHEN Haili, GU Jiaoyang, ZHANG Wenjing, YUAN Linran, ZHENG Juan, YUAN Zhongrui. Involvement of canonical Wnt pathway in postischemic angiogenesis in a rat model of ischemic stroke [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2015, 53(4): 31-36.
[15] DING Juan, ZHENG Guixi, DU Lutao, YIN Zuohua, ZHANG Jian, WANG Chuanxin. Evaluation of combined detection of sHLA-G, PGR and GPR in the diagnosis of gastric cancer [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2015, 53(11): 59-63.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!