• Laser & Optoelectronics Progress
  • Vol. 57, Issue 18, 181702 (2020)
Kailong Ren, Yi Wang*, Xiaodong Chen, and Huaiyu Cai
Author Affiliations
  • School of Precision Instruments and Optoelectronics Engineering, Tianjin University, Tianjin 300072, China
  • show less
    DOI: 10.3788/LOP57.181702 Cite this Article Set citation alerts
    Kailong Ren, Yi Wang, Xiaodong Chen, Huaiyu Cai. Speaker-Dependent Speech Recognition Algorithm for Laparoscopic Supporter Control[J]. Laser & Optoelectronics Progress, 2020, 57(18): 181702 Copy Citation Text show less
    Diagram of simple RNN and its expansion
    Fig. 1. Diagram of simple RNN and its expansion
    Diagram of the unit of LSTM recurrent neural network hidden layer
    Fig. 2. Diagram of the unit of LSTM recurrent neural network hidden layer
    Diagram of BiLSTM RNN structure
    Fig. 3. Diagram of BiLSTM RNN structure
    Diagram of LSTM recurrent neural network model with i-vector feature
    Fig. 4. Diagram of LSTM recurrent neural network model with i-vector feature
    Diagrams of i-vector parameter fusion and adding rejection identification unit. (a) Parameter fusion of i-vector; (b) adding rejection identification unit
    Fig. 5. Diagrams of i-vector parameter fusion and adding rejection identification unit. (a) Parameter fusion of i-vector; (b) adding rejection identification unit
    Layer IDNameNumberof unitsActivationfunction
    1Input layer--
    2FC164ReLU
    3FC264ReLU
    4FC364ReLU
    5BiLSTM64-
    6FC464ReLU
    7FC564ReLU
    8Output layer-Softmax
    Table 1. LSTM recurrent neural network model structure with i-vector feature
    Word IDDTWGMM-HMMLSTM RNN with i-vector
    TotalCorrectErrorTotalCorrectErrorTotalCorrectError
    FRFAFRFA
    FRFA
    1605046605316605910
    2605424605424606000
    3605424605613606000
    4605325605514605910
    5605235605415606000
    6605433605532606000
    7605262605217606000
    8605424605415606000
    Sum4804232433480433113648047820
    Table 2. Recognition results of surgeon speech by three models
    Word IDDTWGMM-HMMLSTM RNN with i-vector
    TotalRejectionFATotalRejectionFATotalRejectionFA
    1605376054660600
    2605556056460600
    3605826057360600
    4605466055560600
    5605646056460600
    6605376052860600
    7605556056460600
    8605646057360600
    Sum48044040480443374804800
    Table 3. Recognition results of assistant doctors speech by three models
    Word IDDTWGMM-HMMLSTM RNN with i-vector
    ToatlRejectionFATotalRejectionFATotalRejectionFA
    18072880701080800
    2807288075580773
    3807558076480782
    4807378072880764
    5807378074680773
    6807468075580782
    7807558075580791
    8807378072880791
    Sum640587536405895164062416
    Table 4. Recognition results of interference speech by three models
    Kailong Ren, Yi Wang, Xiaodong Chen, Huaiyu Cai. Speaker-Dependent Speech Recognition Algorithm for Laparoscopic Supporter Control[J]. Laser & Optoelectronics Progress, 2020, 57(18): 181702
    Download Citation