基于双分支深度学习的眼底疾病多标签分类:一种受中医五轮理论启发的智能诊断框架

Multi-label fundus disease classification using dual-branch deep learning: an intelligent diagnosis framework inspired by traditional Chinese medicine Five Wheels theory

  • 摘要:
    目的 针对现有自动化诊断方法在互补特征提取不足及跨模态特征融合能力欠缺等关键问题,开发一种用于眼底疾病精准多标签分类的双分支深度学习框架。
    方法 通过整合ODIR和RFMiD数据集中的互补样本,构建了包含6 936幅眼底图像、涵盖12类视网膜病变的FMLC-12数据集,并在FMLC-12和ODIR两个数据集上对所提框架进行验证。受中医眼科五轮学说整体多区域观察原则的启发,本研究开发了双分支多标签网络(DBMNet),该框架将互补视觉特征提取与病理关联建模相结合。网络架构采用TransNeXt作为骨干网络,设计了双分支结构:一条分支处理红绿蓝(RGB)图像以捕获血管形态、病灶结构等色彩依赖性特征,另一条分支处理灰度转换图像以增强细微纹理细节和对比度变化。特征交互模块(FIM)有效融合了两条分支的多尺度特征。通过全面的消融实验评估双分支架构和FIM的贡献。将DBMNet与四种先进方法进行了性能比较,包括EfficientNet集成方法、基于迁移学习的卷积神经网络(CNN)、BFENet和EyeDeep-Net,评估指标包括平均精度均值(mAP)、F1分数和Cohen's kappa系数。
    结果 与单分支 TransNeXt 基线模型相比,引入双分支结构后,模型的 mAP 提升了 15.44 个百分点,由 34.41% 提高至 44.24%;在此基础上进一步引入 FIM 模块,使 mAP 进一步提升至 49.85%。在 FMLC-12 数据集上,DBMNet 的 mAP 达到 49.85%,Cohen’s kappa 系数为 62.14%,F1 值为 70.21%。与 BFENet(mAP:45.42%,kappa:46.64%,F1 值:71.34%)相比,DBMNet 在 mAP 和 kappa 指标上分别提高了 4.43 和 15.50 个百分点,但 BFENet 在 F1 值上略高。 在 ODIR 数据集上,DBMNet 的 F1 值达到 85.50%,与当前先进方法的性能相当。
    结论 DBMNet通过双分支架构有效整合RGB和灰度视觉模态,显著提升了眼底疾病多标签分类的性能。该框架不仅解决了现有方法中缺乏有效特征融合的问题,还展现了卓越的跨疾病类别平衡性,特别是在对常见和罕见疾病的均衡检测方面,为智能化、标准化的眼底疾病分类提供了一条具有临床应用前景的技术路径。

     

    Abstract:
    Objective To develop a dual-branch deep learning framework for accurate multi-label classification of fundus diseases, addressing the key limitations of insufficient complementary feature extraction and inadequate cross-modal feature fusion in existing automated diagnostic methods.
    Methods The fundus multi-label classification dataset with 12 disease categories (FMLC-12) dataset was constructed by integrating complementary samples from Ocular Disease Intelligent Recognition (ODIR) and Retinal Fundus Multi-Disease Image Dataset (RFMiD), yielding 6 936 fundus images across 12 retinal pathology categories, and the framework was validated on both FMLC-12 and ODIR. Inspired by the holistic multi-regional assessment principle of the Five Wheels theory in traditional Chinese medicine (TCM) ophthalmology, the dual-branch multi-label network (DBMNet) was developed as a novel framework integrating complementary visual feature extraction with pathological correlation modeling. The architecture employed a TransNeXt backbone within a dual-branch design: one branch processed red-green-blue (RGB) images to capture color-dependent features, such as vascular patterns and lesion morphology, while the other processed grayscale-converted images to enhance subtle textural details and contrast variations. A feature interaction module (FIM) effectively integrated the multi-scale features from both branches. Comprehensive ablation studies were conducted to evaluate the contributions of the dual-branch architecture and the FIM. The performance of DBMNet was compared against four state-of-the-art methods, including EfficientNet Ensemble, transfer learning-based convolutional neural network (CNN), BFENet, and EyeDeep-Net, using mean average precision (mAP), F1-score, and Cohen's kappa coefficient.
    Results The dual-branch architecture improved mAP by 15.44 percentage points over the single-branch TransNeXt baseline, increasing from 34.41% to 44.24%, and the addition of FIM further boosted mAP to 49.85%. On FMLC-12, DBMNet achieved an mAP of 49.85%, a Cohen’s kappa coefficient of 62.14%, and an F1-score of 70.21%. Compared with BFENet (mAP: 45.42%, kappa: 46.64%, F1-score: 71.34%), DBMNet outperformed it by 4.43 percentage points in mAP and 15.50 percentage points in kappa, while BFENet achieved a marginally higher F1-score. On ODIR, DBMNet achieved an F1-score of 85.50%, comparable to state-of-the-art methods.
    Conclusion DBMNet effectively integrates RGB and grayscale visual modalities through a dual-branch architecture, significantly improving multi-label fundus disease classification. The framework not only addresses the issue of insufficient feature fusion in existing methods but also demonstrates outstanding performance in balancing detection across both common and rare diseases, providing a promising and clinically applicable pathway for standardized, intelligent fundus disease classification.

     

/

返回文章
返回