Fundus Image Classification Research Based on Ensemble Convolutional Neural Network and Vision Transformer

Yuan Yuan; Minghui Chen; Shuting Ke; Teng Wang; Longxi He; Linjie Lü; Hao Sun; Jiannan Liu

doi:10.3788/CJL202249.2007205

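| Stage | Operator | Stride | Number of channels | Number of layers |
|---|---|---|---|---|
| | Conv 3×3 | | | |
| | Fused-MBConv1, k 3×3 | | | |
| | Fused-MBConv4, k 3×3 | | | |
| | MBConv4, k 3×3, SimAM | | 128 | |
| | MBConv6, k 3×3, SimAM | | 160 | |
| | MBConv6, k 3×3, SimAM | | 272 | |
| | Conv 1×1 & Pooling & FC | - | 1792 | |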
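The MBConv stages in the architecture table carry a SimAM attention module. As a rough illustration of what such a module computes, the sketch below applies the commonly published parameter-free SimAM formulation to a single channel in pure Python. The function name `simam_channel`, the default `e_lambda`, and the list-of-lists layout are illustrative assumptions; the source does not state how SimAM is configured in this network.

```python
import math


def simam_channel(feature_map, e_lambda=1e-4):
    """Apply SimAM-style gating to one channel given as a 2-D list of floats.

    Hypothetical helper for illustration only; e_lambda is an assumed default.
    """
    values = [v for row in feature_map for v in row]
    n = len(values) - 1  # number of "other" positions in the channel
    if n < 1:
        raise ValueError("channel needs at least two spatial positions")

    mean = sum(values) / len(values)
    sq_dev = [[(v - mean) ** 2 for v in row] for row in feature_map]
    variance = sum(d for row in sq_dev for d in row) / n

    out = []
    for row, dev_row in zip(feature_map, sq_dev):
        out_row = []
        for v, d in zip(row, dev_row):
            # Inverse energy: larger for positions that deviate from the channel mean.
            inv_energy = d / (4.0 * (variance + e_lambda)) + 0.5
            gate = 1.0 / (1.0 + math.exp(-inv_energy))  # sigmoid
            out_row.append(v * gate)
        out.append(out_row)
    return out


# A channel with one position that stands out from the rest.
weighted = simam_channel([[0.0, 0.0], [0.0, 4.0]])
```

The module adds no learnable parameters: each position is rescaled by a gate derived only from the channel's own mean and variance, so positions far from the mean receive a gate closer to 1.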

| Degree of illness | Number of training images | Number of testing images | Total number of images |
|---|---|---|---|
| Normal | 2818 | 409 | 3227 |
| | 1389 | 203 | 1592 |
| ARMD | 147 | | 196 |
| Myopia | 234 | | 269 |
| Cataract | 262 | | 305 |
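In the rows of the dataset table that list all three counts, training plus testing equals the total (2818 + 409 = 3227 and 1389 + 203 = 1592). The sketch below uses that relation to check a row and to derive a count that is not listed. It assumes that in rows showing only two numbers the first is the training count and the second is the total; the helper name `complete_row` is hypothetical.

```python
def complete_row(train, test, total):
    """Fill one missing count using train + test = total and verify consistency.

    Pass None for the count that is not available. Hypothetical helper.
    """
    known = [v for v in (train, test, total) if v is not None]
    if len(known) < 2:
        raise ValueError("need at least two of the three counts")
    if train is None:
        train = total - test
    elif test is None:
        test = total - train
    elif total is None:
        total = train + test
    if train + test != total or min(train, test) < 0:
        raise ValueError("counts are inconsistent")
    return train, test, total


# Assumption: two-number rows are read as (training, total).
rows = {
    "Normal": (2818, 409, 3227),
    "ARMD": (147, None, 196),
    "Myopia": (234, None, 269),
    "Cataract": (262, None, 305),
}
completed = {name: complete_row(*counts) for name, counts in rows.items()}
```

Under that reading the derived testing counts are 49, 35, and 43 for ARMD, Myopia, and Cataract respectively.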

| Model | Accuracy /% | Precision /% | Specificity /% | Training time /h |
|---|---|---|---|---|
| Vit | 91.1 | 86.4 | 97.2 | 11.0 |
| EfficientNetV2-S | 92.2 | 87.6 | 97.5 | 9.2 |
| EfficientNet-Vit | 92.7 | 88.3 | 98.1 | - |
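Accuracy, precision, and specificity of the kind reported above can all be read off a multi-class confusion matrix. The following is a minimal sketch under two assumptions that the source does not confirm: rows index the true class and columns the predicted class, and the per-class precision and specificity are macro-averaged. The function names and the toy matrix are illustrative and are not the paper's data.

```python
def per_class_counts(cm, k):
    """Return (tp, fp, fn, tn) for class k; cm[i][j] = samples of true class i predicted as j."""
    total = sum(sum(row) for row in cm)
    tp = cm[k][k]
    fn = sum(cm[k]) - tp
    fp = sum(row[k] for row in cm) - tp
    tn = total - tp - fn - fp
    return tp, fp, fn, tn


def summarize(cm):
    """Overall accuracy plus macro-averaged precision and specificity."""
    n = len(cm)
    total = sum(sum(row) for row in cm)
    accuracy = sum(cm[k][k] for k in range(n)) / total
    precisions, specificities = [], []
    for k in range(n):
        tp, fp, fn, tn = per_class_counts(cm, k)
        precisions.append(tp / (tp + fp) if tp + fp else 0.0)
        specificities.append(tn / (tn + fp) if tn + fp else 0.0)
    return {
        "accuracy": accuracy,
        "precision": sum(precisions) / n,
        "specificity": sum(specificities) / n,
    }


# Toy two-class matrix, made up for illustration.
toy_cm = [[8, 2],
          [1, 9]]
metrics = summarize(toy_cm)
```

Specificity is computed one-vs-rest here, which is why it tends to sit well above precision when one class dominates the test set.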

| Model | Accuracy /% |
|---|---|
| Resnet50 | 87.3 |
| Densenet121 | 89.5 |
| ResNeSt-101 | 90.7 |
| EfficientNet-B0 | 91.3 |
| TNT-B | 91.1 |
| EfficientNet-Vit | 92.7 |

| Weighted factor | Accuracy /% |
|---|---|
| 0.3, 0.7 | 92.0 |
| 0.4, 0.6 | 92.7 |
| 0.5, 0.5 | 91.6 |
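The weighted factors compared above suggest that the ensemble combines the outputs of its two branches with a pair of weights summing to 1. A minimal sketch of one such scheme follows: a weighted average of class probabilities. Whether the paper fuses probabilities or logits, and which branch receives which weight, is not stated in the source, so both are assumptions here; the function names and the example probabilities are made up for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    s = sum(exps)
    return [e / s for e in exps]


def fuse(probs_a, probs_b, w_a, w_b):
    """Weighted average of two probability vectors (assumed fusion rule)."""
    if abs(w_a + w_b - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    if len(probs_a) != len(probs_b):
        raise ValueError("both models must score the same classes")
    return [w_a * pa + w_b * pb for pa, pb in zip(probs_a, probs_b)]


def predict(probs):
    """Index of the highest-scoring class."""
    return max(range(len(probs)), key=probs.__getitem__)


# Made-up outputs of two models that disagree on a two-class example.
probs_a = [0.70, 0.30]
probs_b = [0.35, 0.65]
label_equal = predict(fuse(probs_a, probs_b, 0.5, 0.5))
label_skewed = predict(fuse(probs_a, probs_b, 0.4, 0.6))
```

In this toy case equal weights follow the first model while the 0.4, 0.6 split follows the second, which illustrates how the choice of weights can change predictions on samples where the two branches disagree.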

Yuan Yuan, Minghui Chen, Shuting Ke, Teng Wang, Longxi He, Linjie Lü, Hao Sun, Jiannan Liu. Fundus Image Classification Research Based on Ensemble Convolutional Neural Network and Vision Transformer[J]. Chinese Journal of Lasers, 2022, 49(20): 2007205
