• Laser & Optoelectronics Progress
  • Vol. 58, Issue 6, 610005 (2021)
Shen Hao1、2、3, Meng Qinghao1、2、3, and Liu Yinbo1、2、3、*
Author Affiliations
  • 1School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
  • 2Institute of Robotics and Autonomous Systems, Tianjin University, Tianjin 300072, China
  • 3Tianjin Key Laboratory of Process Detection and Control, Tianjin 300072, China
  • show less
    DOI: 10.3788/LOP202158.0610005 Cite this Article Set citation alerts
    Shen Hao, Meng Qinghao, Liu Yinbo. Facial Expression Recognition by Merging Multilayer Features of Lightweight Convolutional Networks[J]. Laser & Optoelectronics Progress, 2021, 58(6): 610005 Copy Citation Text show less
    Structure of ms_model_v1 model
    Fig. 1. Structure of ms_model_v1 model
    Bottleneck_M structure
    Fig. 2. Bottleneck_M structure
    Process of feature selection module
    Fig. 3. Process of feature selection module
    Confusion matrix of RAF-DB dataset using the proposed method
    Fig. 4. Confusion matrix of RAF-DB dataset using the proposed method
    Confusion matrix of AffectNet dataset using the proposed method
    Fig. 5. Confusion matrix of AffectNet dataset using the proposed method
    Layer namecstOutput size
    Conv2D 1 (3×3)16256×56×16
    Bottleneck_M1161156×56×16
    Bottleneck_M2242528×28×24
    Bottleneck_M3241528×28×24
    Bottleneck_M3_1321528×28×32
    Bottleneck_M3_2321528×28×32
    Feature selection module 132
    Bottleneck_M4322514×14×32
    Bottleneck_M5321514×14×32
    Feature selection module 232
    Bottleneck_M6401514×14×40
    Bottleneck_M7401514×14×40
    Feature selection module 340
    Bottleneck_M8401514×14×40
    Bottleneck_M948257×7×48
    Bottleneck_M1064157×7×64
    Conv2D 2 (1×1)6417×7×64
    Global average pooling64
    Concat168
    Reshape 11×1×168
    Dropout1×1×168
    Conv2D 3 (1×1)k1×1×k
    Softmax1×1×k
    Reshape 2k
    Table 1. Parameters of the CNN
    ClassNeutralHappySadSurpriseFearDisgustAngryContemptTotal
    Train set5978597959665963637838035979375043796
    Test set5005005005005005005005004000
    Table 2. Number of categories in AffectNet dataset after random screening
    Model nameNumber of parametersModel size/MbitAcc in RAF-DB/%Acc in AffectNet/%
    base_model_R1951592.983.6456.93
    base_model_M1951592.984.2357.30
    ms_model_fully329930340.285.4557.37
    ms_model_v1_R1939273.084.8157.43
    ms_model_v1_M1939273.085.4957.70
    Table 3. Performance of different models
    MethodAcc in RAF-DB/%Acc in AffectNet/%
    Boosting-POOF[24]73.19
    VGG16[25]80.9651.11
    DLP-CNN[22]84.13
    pACNN[26]83.0555.33
    gACNN[26]85.0758.78
    E2-CapsNet[27]85.24
    ms_model_v1_M85.4957.70
    Table 4. Accuracy of different methods in RAF-DB and AffectNet datasets
    Shen Hao, Meng Qinghao, Liu Yinbo. Facial Expression Recognition by Merging Multilayer Features of Lightweight Convolutional Networks[J]. Laser & Optoelectronics Progress, 2021, 58(6): 610005
    Download Citation