Journal of Shandong University (Health Sciences) ›› 2020, Vol. 58 ›› Issue (1): 20-25.doi: 10.6040/j.issn.1671-7554.0.2019.1009

Previous Articles    

Value of blood routine data in distinguishing myelodysplastic syndrome and acute myeloid leukemia

DAI Xiaoyu1,2, LU Yuan1,2, WANG Zhiheng1,2, LI Mingzhuo1,2, SI Shucheng1,2, LI Jiqing1,2, JING Ming2, XUE Fuzhong1,2   

  1. 1. Department of Biostatistics, School of Public Health, Shandong University, Jinan 250012, Shandong, China;
    2. Institute for Medical Detaology, Shandong University, Jinan 250002, Shandong, China
  • Published:2022-09-27

Abstract: Objective To establish a model to identify myelodysplastic syndrome(MDS)and acute myeloid leukemia(AML)based on blood routine data. Methods The data of 1 681 patients from Shandong Multi-Center Health Medical Big Data Platform were randomly divided into the training set(70%)and testing set(30%). MDS and AML were identified with random forest model. The discriminatory ability of the model was determined with the area under the receiver operating characteristic curve(AUC)and the stability of the model was tested with ten-fold cross validation. Results Both of the random forest model and support vector machine model were able to identify MDS and AML, but the former had better performance. It showed the estimated AUC for male was 0.874(95%CI: 0.815-0.932), sensitivity was 81.1%, and specificity was 81.9%; the estimated AUC for female was 0.831(95%CI: 0.752-0.911), sensitivity was 77.8%, and specificity was 74.3%. The ten-fold cross validation showed the estimated AUC was 0.884(95%CI: 0.854-0.913)for male and 0.842(95%CI: 0.802-0.883)for female. Conclusion The discriminant model is capable of identifying MDS and AML based on blood routine data.

Key words: Blood routine, Random forest, Myelodysplastic syndrome, Acute myeloid leukemia, ROC curve

CLC Number: 

  • R551.3
[1] Sultan C, Imbert M, Jouault H, et al. Myelodysplastic Syndromes[J]. Acta Haematologica, 1987, 78(1): 91-93.
[2] Steensma DP, Bennett JM. The Myelodysplastic Syndromes: Diagnosis and Treatment[J]. Mayo Clinic Proceedings, 2006, 81(1): 104-130.
[3] Miyazato A. Identification of myelodysplastic syndrome-specific genes by DNA microarray analysis with purified hematopoietic stem cell fraction[J]. Blood, 2001, 98(2): 422-427.
[4] Shukron O, Vainstein V, Kündgen A, et al. Analyzing transformation of myelodysplastic syndrome to secondary acute myeloid leukemia using a large patient database[J]. Am J Hematol, 2012, 87(9): 853-860.
[5] Song X, Peng Y, Wang X, et al. Incidence, survival, and risk factors for adults with acute myeloid leukemia not otherwise specified and acute myeloid leukemia with recurrent genetic abnormalities: analysis of the surveillance, epidemiology, and end results(SEER)database, 2001—2013[J]. Acta Haematologica, 2018, 139(2): 115-127.
[6] 王英俏. 急性髓系白血病发病因素及慢性髓性白血病TKIs早期疗效评估的探讨[D]. 济南: 山东大学, 2016.
[7] 陈影. 流式细胞术检测急性髓细胞白血病免疫表型及其临床意义[D]. 合肥: 安徽医科大学, 2014.
[8] 李国, 宋英儒, 李新文, 等. 骨髓异常增生综合征与急性髓性白血病的MRI表现[J]. 中国医学影像技术, 2010, 26(7): 1296-1299. LI Guo, SONG Yingru, LI Xinwen, et al. Bone marrow MRI of myelodysplastic syndromes and acute myeloid leukemia[J]. Chinese Journal of Imaging Technology, 2010, 26(7): 1296-1299.
[9] Mills KI, Kohlmann A, Williams PM, et al. Microarray-based classifiers and prognosis models identify subgroups with distinct clinical outcomes and high risk of AML transformation of myelodysplastic syndrome[J]. Blood, 2009, 114(5): 1063-1072.
[10] 中华医学会血液学分会白血病淋巴瘤学组. 成人急性髓系白血病(非急性早幼粒细胞白血病)中国诊疗指南(2017年版)[J]. 中华血液学杂志, 2017, 38(3): 177-182. Leukemia & Lymphoma Group, Chinese Society of Hematology, Chinese Medical Association. Chinese guidelines for diagnosis and treatment of adult acute myeloid leukemia(not APL)(2017)[J]. Chinese Journal of Hematology, 2017, 38(3): 177-182.
[11] Arber DA, Orazi A, Hasserjian R, et al. The 2016 revision to the World Health Organization classification of myeloid neoplasms and acute leukemia[J]. Blood, 2016, 127(20): 2391-2405.
[12] 吴德沛, 肖志坚, 黄晓军. 骨髓增生异常综合征中国诊断与治疗指南(2019年版)[J]. 中华血液学杂志, 2019, 40(2): 89-97. WU Depei, XIAO Zhijian, HUANG Xiaojun. Chinese guidelines for diagnosis and treatment of myelodysplastic syndromes(2019)[J]. Chinese Journal of Hematology, 2019, 40(2): 89-97.
[13] 詹曙, 姚尧, 高贺. 基于随机森林的脑磁共振图像分类[J]. 电子测量与仪器学报, 2013, 27(11): 1067-1072. ZHAN Shu, YAO Yao, GAO He. Magnetic resonance image classification of brain based on random forest[J]. Journal of Electronic Measurement and Instrument, 2013, 27(11): 1067-1072.
[14] Cerrada M, Zurita G, Cabrera D, et al. Fault diagnosis in spur gears based on genetic algorithm and random forest[J]. Mech Syst Signal Process, 2016, 70-71: 87-103. doi:10.1016/j.ymssp.
[15] Hariharan S, Tirodkar S, Bhattacharya A. Polarimetric SAR decomposition parameter subset selection and their optimal dynamic range evaluation for urban area classification using Random Forest[J]. Int J Appl Earth Obs Geoinfor, 2016, 44: 144-158. doi: 10.1016/j.jag.
[16] Hu J, Li Y, Yang JY, et al. GPCR-drug interactions prediction using random forest with drug-association-matrix-based post-processing procedure[J]. Comput Biol Chem, 2016, 60: 59-71. doi: 10.1016/j.compbiolchem.
[17] Breiman L. Random Forests[J]. Machine Learning, 2001, 45(1): 5-32.
[18] Johnson RW. An Introduction to the Bootstrap[J]. Teaching Statistics, 2010, 23(2): 49-54.
[19] 方匡南, 吴见彬, 朱建平, 等. 随机森林方法研究综述[J]. 统计与信息论坛, 2011, 26(3): 32-38.
[20] 董师师, 黄哲学. 随机森林理论浅析[J]. 集成技术, 2013, 2(1): 1-7. DONG Shishi, HUANG Zhexue. A brief theoretical overview of random forests[J]. Journal of integration technology, 2013, 2(1): 1-7.
[21] Vapnik VN. The Nature of Statistical Learning Theory[M]. Berlin: Springer, 2000.
[22] 李磊, 黄水平. 支持向量机原理及其在医学分类中的应用[J]. 中国卫生统计, 2009, 26(1): 22-25.
[23] Metz CE. Basic Principles of ROC analysis[J]. Semin Nucl Med, 1978, 8(4): 283-298.
[24] Robin X, Turck N, Hainard A, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves[J]. Bmc Bioinformatics, 2011, 12(1): 1-8.
[25] Fan RE, Chen PH, Lin CJ, et al. Working set selection using second order information for training support vector machines[J]. J Mach Learn Res, 2005, 6(4): 1889-1918.
[26] Wang W, Wang H, Wang XQ, et al. First report of incidence of adult myelodysplastic syndrome in China[J]. Ann Hematol, 2012, 91(8): 1321-1322.
[27] Shysh AC, Nguyen LT, Guo M, et al. The incidence of acute myeloid leukemia in Calgary, Alberta, Canada: a retrospective cohort study[J]. BMC Public Health, 2018, 18(1): 94.
[28] Heaney ML, Golde DW. Myelodysplasia[J]. N Engl J Med, 1999, 340(21): 1649-1660.
[1] ZHANG Xiaohui, LUO Jianmin, SUO Xiaohui, SUN Guofeng, LI Jing. Relationship between methylation of tumor suppressor genes SOCS1 and SOCS3 and remission and prognosis of 80 acute myeloid leukemia patients [J]. Journal of Shandong University (Health Sciences), 2021, 59(3): 67-73.
[2] WANG Yan, ZHANG Yuhui, HU Naibo, TENG Guangshuai, ZHOU Yuan, BAI Jie. Characteristics of bone marrow immune microenvironment in patients with acute myeloid leukemia based on single-cell RNA sequencing [J]. Journal of Shandong University (Health Sciences), 2021, 59(10): 30-38.
[3] WANG Jianxiang, GU Runxia. Advances in targeted therapy for acute myeloid leukemia [J]. Journal of Shandong University (Health Sciences), 2019, 57(7): 6-12.
[4] DOU Chunhui, SHAO Jianhua, DONG Xuebin, ZHANG Ling, CHEN Ping, ZHAO Hongyu, GU Linping, SUN Lin, XIE Jie, WANG Min, WANG Juan, LI Na, LI Fan, LI Daqi. Effect of gene mutation on the clinical efficacy of decitabine for patients with myelodysplastic syndrome [J]. Journal of Shandong University (Health Sciences), 2019, 57(3): 42-48.
[5] GONG Xiaoyun, SHEN Xiaotao, XU Jing, ZHANG Tao, ZHU Zhengjiang, XUE Fuzhong. Influence of normality of metabolomics data on the classification accuracy of diseases [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2016, 54(4): 89-93.
[6] ZHANG Zhiqiang, WU Yanhua, WANG Maoshui, WANG Xinfeng, WANG Yunshan. Diagnostic value of combination of PCT and NSE in serum for detecting small cell lung cancer [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2016, 54(11): 36-39.
[7] YANG Juan, QIU Zongjian, SONG Qiang. Impacts of Simvastatin on DNA methyltransferase of cell line NB4 in acute myeloid leukemia [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2015, 53(6): 28-32.
[8] LIU Yingjun, ZHANG Tao, WANG Lu, LIU Jia, CHANG Xuerun, ZHANG Jingxuan, XUE Fuzhong. Serum metabolic profiling of schizophrenia based on random forest [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2015, 53(2): 92-96.
[9] DING Juan, ZHENG Guixi, DU Lutao, YIN Zuohua, ZHANG Jian, WANG Chuanxin. Evaluation of combined detection of sHLA-G, PGR and GPR in the diagnosis of gastric cancer [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2015, 53(11): 59-63.
[10] LIU Xin-nong, ZHAN Han-xiang, HU San-yuan, ZHANG Guang-yong, WANG Lei . The role of serum miR-146a in early diagnosis of sepsis [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2013, 51(9): 92-94.
[11] XU Jia, SONG Qiang. Deletion of the RASSF1A gene and promoter methylation in the bone marrow of patients with myelodysplastic syndromes [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2013, 51(2): 65-69.
[12] HAN Yan-xin, CONG Ya-qin, WANG Zhi-min. Immuno-suppressive effects mediated by mesenchymal stem cells of
myelodysplastic syndrome patients
[J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2012, 50(6): 97-.
[13] YIN Dong-mei1, XU Hong-zhi1, ZHANG Jing-yao1, SUI Xiao-hui1, CUI Bin2, MA Chun-yan2, ZHEN Chang-qing1. Changes and clinical significances of CD4+T cell subsets in aplastic anemia,
myelodysplastic syndromes and acute myeloid leukemia patients
[J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2012, 50(3): 66-70.
[14] WU Qing-zhong1, CHE Feng-yuan2, XUE Fu-zhong1. A predictive model of epileptic
seizures based on unbalanced data
[J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2012, 50(2): 141-.
[15] KONG Hai-li, MA Dao-xin, ZHANG Jingjing, WANG Hui-jun, LIU Chuan-fang. Expression and significance of PDCD4 and TGF-β1 in patients with acute myeloid leukemia [J]. JOURNAL OF SHANDONG UNIVERSITY (HEALTH SCIENCES), 2011, 49(6): 110-114.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!