阻塞性冠状动脉疾病中医舌象的客观量化研究: 基于多模态特征融合的计算分析方法

Quantitative research on tongue diagnosis in traditional Chinese medicine for obstructive coronary artery disease: a computational analysis based on multimodal feature fusion

  • 摘要:
    目的 本研究旨在通过计算机辅助舌象图像分析,探究阻塞性与非阻塞性冠状动脉疾病(CAD)患者在舌象形态学上的差异,并识别与中医辨证相关的色值与纹理特征。
    方法 这项研究前瞻性纳入2024年5月1日至2025年8月7日期间在辽宁中医药大学附属医院接受冠状动脉电脑断层血管摄影术(CTA)检查的患者。依据CTA结果将患者分为阻塞性CAD和非阻塞性CAD两组。使用专用的移动应用程序(中医舌象辅助诊断系统)采集标准化舌象,并对整体舌面及三种宏观特征(齿痕、裂纹、点刺)进行分析,提取高维纹理和颜色参数。采用空间域拉普拉斯金字塔和频域离散小波变换两种多尺度分解方法,采用空间域拉普拉斯金字塔和频域小波变换方法提取多尺度纹理特征。应用主成分分析(PCA)和随机森林算法(5折交叉验证)进行降维和特征选择,同时使用Hodges-Lehmann估计量和Cliff’s δ值评估特征稳定性。开发一个多视角XGBoost模型以区分两组患者,并在一个时间独立的验证集上使用准确率和受试者工作特征曲线下面积(AUC)评估其性能。最后,采用夏普利加性解释(SHAP)分析来阐释模型决策。
    结果 本研究共纳入了373名CAD患者,根据CTA诊断结果,将其分为167名阻塞性CAD和206名非阻塞性CAD患者。所有患者按时间顺序划分为训练集(n = 316,阻塞性 : 非阻塞性 = 142 : 174)和验证集(n = 57,阻塞性 : 非阻塞性 = 25 : 32),组间基线特征均无显著差异(P > 0.05)。宏观舌象特征显示,阻塞性CAD患者的齿痕比值比(OR)= 0.43, P < 0.05和点刺(OR = 0.46, P < 0.05)出现频率较低。高维颜色分析提示了组间存在显著色值差异,最明显的是阻塞性CAD患者在色调、饱和度和强度(HSI)色彩空间中的色调值显著降低(Cliff’s δ = – 0.31, P = 2.72 × 10–6;Hodges-Lehmann估计量: – 0.31)。PCA结果提示舌面特征可能解释最高比例的方差(48.2%)。随机森林算法在所有分析的舌区域中筛选出77个稳定特征,其中经小波变换后的纹理特征显示出主导特征的重要性。多视角XGBoost融合模型在独立验证集中达到了75%的准确率和0.779的AUC。SHAP分析确定LHL_glszm_ZoneVariance为最重要的预测因子,其贡献占模型40.6%的决策方差,并进一步提示了模型85.3%的预测能力来源于小波纹理特征。
    结论 本研究通过人工智能驱动的图像分析,识别出阻塞性CAD患者独特的舌象形态与色值模式,为中医“舌为心之苗”理论提供了客观证据。计算模型所展现的良好性能提示,舌象分析有望成为CAD评估的一种无创辅助工具。

     

    Abstract:
    Objective To investigate morphological differences between obstructive and non-obstructive coronary artery disease (CAD) patients using computer-aided image analysis, and identify color and texture features for traditional Chinese medicine (TCM) syndrome differentiation.
    Methods This prospective study enrolled patients undergoing coronary computed tomography angiography (CTA) at the Affiliated Hospital of Liaoning University of Traditional Chinese Medicine between May 1, 2024 and August 7, 2025. Based on CTA results, patients were categorized into obstructive CAD and non-obstructive CAD groups. Standardized tongue images were acquired using a dedicated mobile application (Traditional Chinese Medicine Tongue Image-Assisted Diagnosis System) and analyzed for the overall tongue surface and three macroscopic features (tooth marks, fissures, and red dots) from which high-dimensional color and texture parameters were extracted. Multi-scale texture features were derived using spatial-domain Laplacian pyramid and frequency-domain wavelet transform methods. Dimensionality reduction and feature selection were performed using principal component analysis (PCA) and random forest with 5-fold cross-validation. Feature stability was assessed using Hodges-Lehmann estimator and Cliff’s δ. A multi-view XGBoost model was developed to differentiate the two groups and evaluated on a temporally independent validation set using accuracy and the area under the receiver operating characteristic curve (AUC). SHapley Additive exPlanations (SHAP) analysis was applied to interpret model decisions.
    Results This study analyzed 373 CAD patients, including 167 with obstructive CAD and 206 with non-obstructive CAD according to CTA results. The whole cohort was divided into training set (n = 316, obstructive : non-obstructive = 142 : 174 ) and validation set (n = 57, obstructive : non-obstructive = 25 : 32), with balanced baseline characteristics (P > 0.05). Macroscopic tongue analysis revealed that patients with obstructive CAD had fewer tooth marks odds ratio (OR) = 0.43, P < 0.05 and red dots (OR = 0.46, P < 0.05). High-dimensional color analysis identified pronounced intergroup differences, most notably a reduction in hue values in the hue-saturation-intensity (HSI) color space among obstructive CAD patients (Cliff’s δ = – 0.31, P = 2.72 × 10–6; Hodges-Lehmann estimator: – 0.31). PCA results suggested that tongue surface features explained the highest proportion of variance (48.2%). Random forest screening identified 77 stable features across all tongue regions, with wavelet-transformed texture features demonstrating the highest importance. The multi-view XGBoost fusion model achieved an accuracy of 75% and an AUC of 0.779 in the independent validation set. SHAP analysis identified the wavelet-based feature—left-handed lower-level gray-level size zone matrix zone variance (LHL_glszm_ZoneVariance) as the top predictor, accounting for 40.6% of the model's decision variance, and indicated that 85.3% of the predictive power came from wavelet-based texture features.
    Conclusion This study has provided objective evidence for the TCM concept that “the tongue reflects the heart” by identifying distinct morphological and colorimetric tongue patterns in patients with obstructive CAD through artificial intelligence (AI)-driven image analysis, and the promising performance of the computational model suggests its potential as a non-invasive adjunctive tool for CAD assessment.

     

/

返回文章
返回