Research on Instance Segmentation Algorithm Based on YOLACT and Transformer

ZHAO Jingwei; LIN Shanling; MEI Ting; LIN Zhixian; GUO Tailiang

doi:10.16818/j.issn1001-5868.2022110201

[3] Minaee S, Boykov Y Y, Porikli F, et al. Image segmentation using deep learning: A survey［J］. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2022, 44(7): 3523-3542.

[9] He K, Gkioxari G, Dollár P, et al. Mask R-CNN［C］// Proc. of the IEEE Inter. Conf. on Computer Vision, 2017: 2961-2969.

[10] Ren S, He K, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks［J］. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149.

[11] Liu S, Qi L, Qin H, et al. Path aggregation network for instance segmentation［C］// Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, 2018: 8759-8768.

[12] Huang Z, Huang L, Gong Y, et al. Mask scoring R-CNN［C］// Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition, 2019: 6409-6418.

[13] Cheng T, Wang X, Huang L, et al. Boundary-preserving Mask R-CNN［C］// European Conf. on Computer Vision. Springer, Cham, 2020: 660-676.

[14] Zhang G, Lu X, Tan J, et al. RefineMask: Towards high-quality instance segmentation with fine-grained features［C］// Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition, 2021: 6861-6869.

[15] Bolya D, Zhou C, Xiao F, et al. YOLACT: Real-time instance segmentation［C］// Proc. of the IEEE/CVF Inter. Conf. on Computer Vision, 2019: 9157-9166.

[16] Tian Z, Shen C, Chen H. Conditional convolutions for instance segmentation［C］// European Conf. on Computer Vision. Springer, Cham, 2020: 282-298.

[17] Tian Z, Shen C, Chen H, et al. FCOS: Fully convolutional one-stage object detection［C］// Proc. of the IEEE/CVF Inter. Conf. on Computer Vision, 2019: 9627-9636.

[18] Wang X, Kong T, Shen C, et al. SOLO: Segmenting objects by locations［C］// European Conf. on Computer Vision. Springer, Cham, 2020: 649-665.

[19] Qi L, Wang Y, Chen Y, et al. PointINS: Point-based instance segmentation［J］. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2021, 44(10): 6377-6392.

[20] Wang Y, Xu Z, Shen H, et al. CenterMask: Single shot instance segmentation with point representation［C］// Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition, 2020: 9313-9321.

[21] Chen H, Sun K, Tian Z, et al. BlendMask: Top-down meets bottom-up for instance segmentation［C］// Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition, 2020: 8573-8581.

[22] Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need［J］. Adv. in Neural Information Processing Systems, 2017, 30: 6000-6010.

[23] Parmar N, Vaswani A, Uszkoreit J, et al. Image transformer［C］// Inter. Conf. on Machine Learning, PMLR, 2018: 4055-4064.

[24] Carion N, Massa F, Synnaeve G, et al. End-to-end object detection with transformers［C］// European Conf. on Computer Vision. Springer, Cham, 2020: 213-229.

[25] Hu J, Cao L, Lu Y, et al. ISTR: End-to-end instance segmentation with transformers［DB/OL］. 2021-5-6.

[27] He K, Zhang X, Ren S, et al. Deep residual learning for image recognition［C］// Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, 2016: 770-778. http://www.xueshufan.com/publication/3159833358.

[28] Lin T Y, Dollár P, Girshick R, et al. Feature pyramid networks for object detection［C］// Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, 2017: 2117-2125.
