融合中医神情特征的抑郁症机器学习识别模型

尧明慧; 朱蓉蓉; 钱鹏; 刘慧琳; 孙喜蓉; 高利民; 李福凤

doi:10.1016/j.dcmed.2026.02.006

融合中医神情特征的抑郁症机器学习识别模型

A machine learning-based depression recognition model integrating spirit-expression features from traditional Chinese medicine

摘要

摘要:
目的本研究旨在融合中医“神与表情”诊断框架与机器学习算法，构建一个抑郁症识别模型。该模型致力于建立一种融合中医理论的早期抑郁筛查工具，从而将传统诊断原则与现代计算方法相连接。
方法研究纳入2022年10月1日至2023年10月1日在上海市浦东新区精神卫生中心就诊的抑郁症患者以及同期的上海中医药大学学生和老师作为健康对照组，采用Xiaomi Pad 5拍摄3 − 10 s视频，通过中医专家判读中医神与表情（5名专家中至少3名专家判读一致以确定中医神与表情类别）。通过便携式中医智能分析与诊断设备采集基本信息、面部图像和问诊信息，运用Open CV计算机视觉库技术提取面诊特征。应用参数检验和非参数检验等统计分析方法对两组人群的基线资料和中医神与表情及面诊特征参数进行分析，比较中医神与表情及面部特征差异。运用极值梯度提升（XGBoost）、决策树（DT）、伯努利朴素贝叶斯（BernoulliNB）、支持向量机（SVM）和k-近邻分类（KNN）五种机器学习算法，构建基于中医“望神与表情”特征融合的抑郁症识别模型，以准确率、精准率和受试者工作特征（ROC）曲线下面积（AUC）等指标评估模型性能。使用夏普利加性解释（SHAP）来解释模型结果。
结果本研究最终纳入93例抑郁症患者及87例健康人群，两组人群的基线资料无统计学差异（P > 0.05）。两组人群的中医神与表情特征及面部特征差异显示：（1）抑郁症组以面部少神、少光泽为主，相较于正常人群，抑郁症组面部画像向悲伤表情、面红、面黑及唇红偏移。（2）抑郁症患者面色L值、唇色L值、唇色a值、有光泽指数均小于正常人，面色a值、面色b值、唇色b值、少光泽指数、无光泽指数均大于正常人（P < 0.05）；（3）多模型结果显示，由XGBoost算法构建的联合中医“望神与表情”特征的抑郁症识别模型总体准确率达98.61%，优于其他四种机器学习算法；（4）SHAP可视化结果显示，在由XGBoost算法构建的识别模型中，面色b值、神的类别、有光泽指数、无光泽指数、表情与纹理特征对模型的贡献度较大。
结论本研究通过机器学习算法构建了基于中医“望神与表情”特征融合的抑郁症识别模型，该模型具有较高的识别准确率，为临床辅助诊断抑郁症提供了新的思路和方法。

Abstract:
Objective To develop a depression recognition model by integrating the spirit-expression diagnostic framework of traditional Chinese medicine (TCM) with machine learning algorithms. The proposed model seeks to establish a TCM-informed tool for early depression screening, thereby bridging traditional diagnostic principles with modern computational approaches.
Methods The study included patients with depression who visited the Shanghai Pudong New Area Mental Health Center from October 1, 2022 to October 1, 2023, as well as students and teachers from Shanghai University of Traditional Chinese Medicine during the same period as the healthy control group. Videos of 3 – 10 s were captured using a Xiaomi Pad 5, and the TCM spirit and expressions were determined by TCM experts (at least 3 out of 5 experts agreed to determine the category of TCM spirit and expressions). Basic information, facial images, and interview information were collected through a portable TCM intelligent analysis and diagnosis device, and facial diagnosis features were extracted using the Open CV computer vision library technology. Statistical analysis methods such as parametric and non-parametric tests were used to analyze the baseline data, TCM spirit and expression features, and facial diagnosis feature parameters of the two groups, to compare the differences in TCM spirit and expression and facial features. Five machine learning algorithms, including extreme gradient boosting (XGBoost), decision tree (DT), Bernoulli naive Bayes (BernoulliNB), support vector machine (SVM), and k-nearest neighbor (KNN) classification, were used to construct a depression recognition model based on the fusion of TCM spirit and expression features. The performance of the model was evaluated using metrics such as accuracy, precision, and the area under the receiver operating characteristic (ROC) curve (AUC). The model results were explained using the Shapley Additive exPlanations (SHAP).
Results A total of 93 depression patients and 87 healthy individuals were ultimately included in this study. There was no statistically significant difference in the baseline characteristics between the two groups (P > 0.05). The differences in the characteristics of the spirit and expressions in TCM and facial features between the two groups were shown as follows. (i) Quantispirit facial analysis revealed that depression patients exhibited significantly reduced facial spirit and luminance compared with healthy controls (P < 0.05), with characteristic features such as sad expressions, facial erythema, and changes in the lip color ranging from erythematous to cyanotic. (ii) Depressed patients exhibited significantly lower values in facial complexion L, lip L, and a values, and gloss index, but higher values in facial complexion a and b, lip b, low gloss index, and matte index (all P < 0.05). (iii) The results of multiple models show that the XGBoost-based depression recognition model, integrating the TCM “spirit-expression” diagnostic framework, achieved an accuracy of 98.61% and significantly outperformed four benchmark algorithms—DT, BernoulliNB, SVM, and KNN (P < 0.01). (iv) The SHAP visualization results show that in the recognition model constructed by the XGBoost algorithm, the complexion b value, categories of facial spirit, high gloss index, low gloss index, categories of facial expression and texture features have significant contribution to the model.
Conclusion This study demonstrates that integrating TCM spirit-expression diagnostic features with machine learning enables the construction of a high-precision depression detection model, offering a novel paradigm for objective depression diagnosis.

HTML全文

参考文献(49)

施引文献

资源附件(0)