Dual-Stream Feature Aggregation Network for Unmanned Aerial Vehicle Aerial Images Semantic Segmentation

Runzeng Li; Zaifeng Shi; Fanning Kong; Xiangyang Zhao; Tao Luo

doi:10.3788/LOP230955

Laser & Optoelectronics Progress
Vol. 60, Issue 24, 2428005 (2023)

Runzeng Li¹, Zaifeng Shi¹,³,*, Fanning Kong¹, Xiangyang Zhao¹, and Tao Luo²

Author Affiliations

¹School of Microelectronics, Tianjin University, Tianjin 300072, China

²College of Intelligence and Computing, Tianjin University, Tianjin 300072, China

³Tianjin Key Laboratory of Imaging and Sensing Microelectronic Technology, Tianjin 300072, China

Runzeng Li, Zaifeng Shi, Fanning Kong, Xiangyang Zhao, Tao Luo. Dual-Stream Feature Aggregation Network for Unmanned Aerial Vehicle Aerial Images Semantic Segmentation[J]. Laser & Optoelectronics Progress, 2023, 60(24): 2428005.

Fig. 1. Network architecture and selected modules. (a) Overall network architecture; (b) ConvNeXt block; (c) coordinate attention block

Fig. 2. BGA module

Fig. 3. Comparison of prediction maps from different models on the AeroScapes dataset. (a) Image 002001_049; (b) image 038032_032; (c) image 045002_049; (d) image 310019_016; (e) image 311000_004

Fig. 4. Comparison of prediction maps from different models on the Semantic Drone dataset. (a) Image 002; (b) image 056; (c) image 119; (d) image 311; (e) image 412

Method	Backbone	mIoU /%	mPA /%
FCN [3]	VGG-16 [29]	67.59	74.53
U-Net [4]	ResNet-50 [20]	75.84	83.31
PSPNet [7]	MobileNetV3 [30]	58.15	63.86
PSPNet [7]	ResNet-50 [20]	60.57	66.72
RefineNet [8, 31]	ResNet-101 [20]	63.09	70.82
DeepLabV3+ [10]	MobileNetV3 [30]	78.01	84.30
DeepLabV3+ [10]	Xception [32]	77.49	85.03
DADA [27, 31]	DeepLabV2 [33]	81.53	88.75
DSRL [28, 31]	ResNet-101 [20]	82.48	89.72
Proposed	Xception [32]	83.16	90.75

Table 1. Comparison of evaluation results of different models on the AeroScapes dataset

Location	mIoU /%	mPA /%
4	79.35	87.29
5	79.41	87.53
6	79.55	87.67
4, 6	79.71	87.90
1, 2, 3, 4	78.86	87.68
1, 2, 3, 4, 6	79.62	87.88

Table 2. Comparison of evaluation results with the coordinate attention block placed at different locations in CA-ASPP

Method	mIoU /%	mPA /%
None	77.49	85.03
CA-ASPP	79.71	87.90
ConvBranch	78.96	86.84
BGAModule	79.45	87.38
ConvBranch + BGAModule	80.85	88.04
CA-ASPP + ConvBranch + BGAModule	81.53	88.96
ConvBranch + BGAModule + Multi-loss	82.84	90.31
CA-ASPP + ConvBranch + BGAModule + Multi-loss	83.16	90.75

Table 3. Evaluation results of ablation experiments with different combinations of the proposed improvements

Method	Backbone	mIoU /%	mPA /%
FCN [3]	VGG-16 [29]	54.61	63.63
U-Net [4]	ResNet-50 [20]	57.38	68.45
PSPNet [7]	MobileNetV3 [30]	45.43	54.08
PSPNet [7]	ResNet-50 [20]	42.81	51.55
DeepLabV3+ [10]	MobileNetV3 [30]	55.31	64.56
DeepLabV3+ [10]	Xception [32]	55.48	64.00
Proposed	Xception [32]	72.09	80.34

Table 4. Comparison of evaluation results of different models on the Semantic Drone dataset
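
The tables above report mIoU (mean intersection over union) and mPA (mean pixel accuracy) as percentages. This page does not give the authors' exact formulas, so the following is only a minimal sketch under the common confusion-matrix definitions: per-class IoU = TP / (TP + FP + FN) and per-class pixel accuracy = TP / (TP + FN), each averaged over classes. The function names `confusion_matrix` and `mean_iou_and_mpa`, the choice to skip classes absent from the ground truth, and the toy labels are illustrative assumptions, not the paper's evaluation code.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Build a confusion matrix from flat label sequences.

    Rows index the ground-truth class, columns the predicted class.
    """
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def mean_iou_and_mpa(cm):
    """Return (mIoU, mPA) in percent from a square confusion matrix.

    Assumption: classes with no ground-truth pixels are skipped
    rather than counted as zero.
    """
    n = len(cm)
    ious, accs = [], []
    for c in range(n):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                        # missed pixels of class c
        fp = sum(cm[r][c] for r in range(n)) - tp   # pixels wrongly labeled c
        if tp + fn == 0:
            continue
        ious.append(tp / (tp + fp + fn))
        accs.append(tp / (tp + fn))
    if not ious:
        raise ValueError("no class has ground-truth pixels")
    return 100 * sum(ious) / len(ious), 100 * sum(accs) / len(accs)


if __name__ == "__main__":
    # Toy example: six pixels, three classes (made-up labels).
    y_true = [0, 0, 1, 1, 2, 2]
    y_pred = [0, 1, 1, 1, 2, 0]
    cm = confusion_matrix(y_true, y_pred, num_classes=3)
    miou, mpa = mean_iou_and_mpa(cm)
    print(f"mIoU = {miou:.2f} %, mPA = {mpa:.2f} %")  # mIoU = 50.00 %, mPA = 66.67 %
```

In practice the confusion matrix would be accumulated over every pixel of every test image before the two averages are taken, so that rare classes are scored on the whole dataset rather than per image.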
