Background Various combinations of ultrasonographic (US) characteristics are increasingly utilized to classify thyroid nodules. But they lack theories, and heavily depend on radiologists' experience, and cannot correctly classify thyroid nodules. Hence, our main purpose of this manuscript is to select the US characteristics significantly associated with malignancy and to develop an efficient scoring system for facilitating ultrasonic clinicians to correctly identify thyroid malignancy. Methods A logistic regression (LR) model is utilized to identify the potential thyroid malignancy, and the least absolute shrinkage and selection operator (LASSO) method is adopted to simultaneously select US characteristics significantly associated with malignancy and estimate parameters in LR model. Based on the selected US characteristics, we calculate the probability for each of thyroid nodules via random forest (RF) and extreme learning machine (ELM), and develop a scoring system to classify thyroid nodules. For comparison, we also consider eight state-of-the-art methods such as support vector machine (SVM), neural network (NET), etc. The area under the receiver operating characteristic curve (AUC) is employed to measure the accuracy of various classifiers. Results The US characteristics: nodule size, AP/T >= 1, solid component, micro-calcifications, hackly border, hypoechogenicity, presence of halo, unclear border, irregular margin, and central vascularity are selected as the significant predictors associated with thyroid malignancy via the LASSO LR (LLR). Using the developed scoring system, thyroid nodules are classified into the following four categories: benign, low suspicion, intermediate suspicion, and high suspicion, whose rates of malignancy correctly identified for RF (ELM) method on the testing dataset are 0.0% (4.3%), 14.3% (50.0%), 58.1% (59.1%) and 96.1% (97.7%), respectively. Conclusion LLR together with RF performs better than other methods in identifying malignancy, especially for abnormal nodules, in terms of risk scores. The developed scoring system can well predict the risk of malignancy and guide medical doctors to make management decisions for reducing the number of unnecessary biopsies for benign nodules.
第一作者机构:[1]Yunnan Univ, Yunnan Key Lab Stat Modeling & Data Anal, Kunming 650091, Yunnan, Peoples R China
通讯作者:
通讯机构:[1]Yunnan Univ, Yunnan Key Lab Stat Modeling & Data Anal, Kunming 650091, Yunnan, Peoples R China[2]Kunming Med Univ, Dept Ultrasound, Affiliated Hosp 1, Kunming 650032, Yunnan, Peoples R China[3]Yunnan Agr Univ, Coll Sci, Kunming 650201, Yunnan, Peoples R China
推荐引用方式(GB/T 7714):
Chen Dan,Hu Jun,Zhu Mei,et al.Diagnosis of thyroid nodules for ultrasonographic characteristics indicative of malignancy using random forest[J].BIODATA MINING.2020,13(1):doi:10.1186/s13040-020-00223-w.
APA:
Chen, Dan,Hu, Jun,Zhu, Mei,Tang, Niansheng,Yang, Yang&Feng, Yuran.(2020).Diagnosis of thyroid nodules for ultrasonographic characteristics indicative of malignancy using random forest.BIODATA MINING,13,(1)
MLA:
Chen, Dan,et al."Diagnosis of thyroid nodules for ultrasonographic characteristics indicative of malignancy using random forest".BIODATA MINING 13..1(2020)