• School of Information Science and Engineering, Yunnan University, Kunming 650500, P. R. China;
ZHANG Junhua, Email: jhzhang@ynu.edu.cn

Skin cancer is a significant public health issue, and computer-aided diagnosis can help alleviate this burden. Accurate identification of skin lesion types is crucial to such diagnosis. This study proposes a multi-level attention cascaded fusion model based on Swin-T and ConvNeXt. The model uses hierarchical Swin-T and ConvNeXt branches to extract global and local features, respectively, and introduces residual channel attention and spatial attention modules for further feature extraction. Multi-level attention mechanisms process the multi-scale global and local features. Because shallow features lie far from the classifier and tend to be lost, a hierarchical inverted residual fusion module is proposed to dynamically adjust the extracted feature information. Balanced sampling strategies and focal loss are used to handle the class imbalance among skin lesion categories. On the ISIC2018 dataset, the model achieved accuracy, precision, recall, and F1-score of 96.01%, 93.67%, 92.65%, and 93.11%, respectively; on the ISIC2019 dataset, the corresponding values were 92.79%, 91.52%, 88.90%, and 90.15%. Compared with Swin-T, the proposed method improved accuracy by 3.60% and 1.66% on ISIC2018 and ISIC2019, respectively; compared with ConvNeXt, it improved accuracy by 2.87% and 3.45%. The experiments demonstrate that the proposed method classifies skin lesion images accurately, providing a new solution for skin cancer diagnosis.
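The abstract names focal loss as one of the measures against class imbalance but does not give its settings. As a rough illustration only, the standard multi-class focal loss for a single sample can be sketched as below. The function names, the focusing parameter `gamma=2.0`, and the optional per-class weights `alpha` are assumptions made for this sketch; they are not taken from the paper and this is not the authors' implementation.

```python
import math


def softmax(logits):
    """Convert raw class scores to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def focal_loss(logits, target, gamma=2.0, alpha=None):
    """Focal loss for one sample: -alpha_t * (1 - p_t)^gamma * log(p_t).

    logits: list of raw class scores.
    target: index of the true class.
    gamma:  focusing parameter (assumed value; gamma=0 gives cross-entropy).
    alpha:  optional list of per-class weights (hypothetical; None means 1.0).
    """
    p_t = softmax(logits)[target]
    weight = 1.0 if alpha is None else alpha[target]
    # (1 - p_t)^gamma shrinks the loss of well-classified samples,
    # so hard or rare-class samples dominate the training signal.
    return -weight * (1.0 - p_t) ** gamma * math.log(p_t)
```

With `gamma=0` and no `alpha`, the function reduces to ordinary cross-entropy. Larger `gamma` values down-weight samples the model already classifies confidently, which is why the loss is commonly paired with imbalanced datasets.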

Citation: WANG Zetong, ZHANG Junhua, WANG Xiao. Skin lesion classification with multi-level fusion of Swin-T and ConvNeXt. Journal of Biomedical Engineering, 2024, 41(3): 544-551. doi: 10.7507/1001-5515.202305025

Copyright © the editorial department of Journal of Biomedical Engineering of West China Medical Publisher. All rights reserved
