[3] Minaee S, Boykov Y Y, Porikli F, et al. Image segmentation using deep learning: A survey[J]. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2022, 44(7): 3523-3542.
[9] He K, Gkioxari G, Dollr P, et al. Mask R-CNN[C]// Proc. of the IEEE Inter. Conf. on Computer Vision, 2017: 2961-2969.
[10] Ren S, He K, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Trans. on Pattern Analysis & Machine Intelligence, 2017, 39(6): 1137-1149.
[11] Liu S, Qi L, Qin H, et al. Path aggregation network for instance segmentation[C]// Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, 2018: 8759-8768.
[12] Huang Z, Huang L, Gong Y, et al. Mask scoring R-CNN[C]// Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition, 2019: 6409-6418.
[13] Cheng T, Wang X, Huang L, et al. Boundary-preserving mask R-CNN[C]// European Conf. on Computer Vision. Springer, Cham, 2020: 660-676.
[14] Zhang G, Lu X, Tan J, et al. Refinemask: Towards high-quality instance segmentation with fine-grained features[C]// Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition, 2021: 6861-6869.
[15] Bolya D, Zhou C, Xiao F, et al. Yolact: Real-time instance segmentation[C]// Proc. of the IEEE/CVF Inter. Conf. on Computer Vision, 2019: 9157-9166.
[16] Tian Z, Shen C, Chen H. Conditional convolutions for instance segmentation[C]// European Conf. on Computer Vision. Springer, Cham, 2020: 282-298.
[17] Tian Z, Shen C, Chen H, et al. Fcos: Fully convolutional one-stage object detection[C]// Proc. of the IEEE/CVF Inter. Conf. on Computer Vision, 2019: 9627-9636.
[18] Wang X, Kong T, Shen C, et al. Solo: Segmenting objects by locations[C]// European Conf. on Computer Vision. Springer, Cham, 2020: 649-665.
[19] Qi L, Wang Y, Chen Y, et al. Pointins: Point-based instance segmentation[J]. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2021, 44(10): 6377-6392.
[20] Wang Y, Xu Z, Shen H, et al. Centermask: single shot instance segmentation with point representation[C]// Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition, 2020: 9313-9321.
[21] Chen H, Sun K, Tian Z, et al. Blendmask: Top-down meets bottom-up for instance segmentation[C]// Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition, 2020: 8573-8581.
[22] Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[J]. Adv. in Neural Information Processing Systems, 2017, 30: 6000-6010.
[23] Parmar N, Vaswani A, Uszkoreit J, et al. Image transformer[C]// Inter. Conf. on Machine Learning, PMLR, 2018: 4055-4064.
[24] Carion N, Massa F, Synnaeve G, et al. End-to-end object detection with transformers[C]// European Conf. on Computer Vision. Springer, Cham, 2020: 213-229.
[25] Hu J, Cao L, Lu Y, et al. ISTR: End-to-end instance segmentation with transformers[DB/OL]. 2021-5-6.
[27] He K, Zhang X, Ren S, et al. Deep residual learning for image recognition[C]// Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, 2016: 770-778. http://www.xueshufan.com/publication/3159833358.
[28] Lin T Y, Dollr P, Girshick R, et al. Feature pyramid networks for object detection[C]// Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, 2017: 2117-2125.