• Laser & Optoelectronics Progress
  • Vol. 58, Issue 20, 2020001 (2021)
Yue Wang, Hansong Su, and Gaohua Liu*
Author Affiliations
  • School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
  • show less
    DOI: 10.3788/LOP202158.2020001 Cite this Article Set citation alerts
    Yue Wang, Hansong Su, Gaohua Liu. Improved Encoder-Decoder Temporal Action Detection Algorithm[J]. Laser & Optoelectronics Progress, 2021, 58(20): 2020001 Copy Citation Text show less
    Structure of the improved encoder-decoder temporal convolutional neural network
    Fig. 1. Structure of the improved encoder-decoder temporal convolutional neural network
    Structure of the residual module
    Fig. 2. Structure of the residual module
    Different feature fusion methods
    Fig. 3. Different feature fusion methods
    Schematic diagram of traditional upsampling and improved upsampling. (a) Traditional upsampling; (b)improved upsampling
    Fig. 4. Schematic diagram of traditional upsampling and improved upsampling. (a) Traditional upsampling; (b)improved upsampling
    Detection example of MERL Shopping dataset
    Fig. 5. Detection example of MERL Shopping dataset
    Detection example of GTEA dataset
    Fig. 6. Detection example of GTEA dataset
    BlockKernel sizeNumber of channels
    Conv17×764
    Conv2_x1×13×31×1×36464256×3
    Conv3_x1×13×31×1×4128128512×4
    Conv4_x1×13×31×1×62562561024×6
    Conv5_x1×13×31×1×35125122048×3
    Table 1. Parameters of feature extraction network
    DatasetActionAccuracy /%
    MERL ShoppingReach to shelf77.8
    Retract from shelf79.3
    Hand in shelf81.6
    Inspect the product80.4
    Inspect the shelf81.2
    Table 2. Recognition accuracy rate of each action
    DatasetVggNet16ResNet50ED-TCNImproved ED-TCNmAP /%
    MERL Shopping24.3
    MERL Shopping25.6
    MERL Shopping29.3
    GTEA25.8
    GTEA27.2
    GTEA30.2
    Table 3. Effectiveness of various module on the algorithm
    DatasetED-TCNImproved ED-TCNSeg-F1@10Seg-F1@25Seg-F1@50
    MERL Shopping86.785.172.9
    MERL Shopping89.287.474.8
    GTEA72.269.356.0
    GTEA76.871.958.5
    Table 4. Seg-F1 of different algorithms on different datasets
    AlgorithmAccuracy /%mAP /%Seg-F1@10Seg-F1@25Seg-F1@50
    MSN Det64.629.546.442.625.6
    MSN Seg76.324.280.078.365.4
    Dilated TCN76.426.379.978.067.5
    ED-TCN79.025.586.785.172.9
    Improved ED-TCN82.429.389.287.474.8
    Table 5. Comparison of results of different algorithms on the MERL Shopping dataset
    Yue Wang, Hansong Su, Gaohua Liu. Improved Encoder-Decoder Temporal Action Detection Algorithm[J]. Laser & Optoelectronics Progress, 2021, 58(20): 2020001
    Download Citation