• Laser & Optoelectronics Progress
  • Vol. 59, Issue 10, 1007001 (2022)
Weikang Tang, Yubin Shao*, and Hua Long
Author Affiliations
  • School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, Yunnan , China
  • show less
    DOI: 10.3788/LOP202259.1007001 Cite this Article Set citation alerts
    Weikang Tang, Yubin Shao, Hua Long. Chinese Syllable Segmentation Algorithm with Logarithmic Envelope[J]. Laser & Optoelectronics Progress, 2022, 59(10): 1007001 Copy Citation Text show less
    Steps of syllable segmentation algorithm
    Fig. 1. Steps of syllable segmentation algorithm
    Waveform and its upper envelope of speech signal
    Fig. 2. Waveform and its upper envelope of speech signal
    Some envelopes formed by various interpolation methods
    Fig. 3. Some envelopes formed by various interpolation methods
    Original speech waveform and envelope after interpolation. (a) Original speech waveform; (b) envelope after interpolation
    Fig. 4. Original speech waveform and envelope after interpolation. (a) Original speech waveform; (b) envelope after interpolation
    Filtered envelope and logarithmic envelope. (a) Filtered envelope; (b) logarithmic envelope
    Fig. 5. Filtered envelope and logarithmic envelope. (a) Filtered envelope; (b) logarithmic envelope
    Adjustment of extreme points on logarithmic envelope. (a) Extreme points on logarithmic envelope; (b) extreme points after single threshold processing; (c) extreme points after thresholding
    Fig. 6. Adjustment of extreme points on logarithmic envelope. (a) Extreme points on logarithmic envelope; (b) extreme points after single threshold processing; (c) extreme points after thresholding
    Chinese phonetic syllable segmentation under noisy environment. (a) Original speech; (b) distribution of extreme points on logarithmic envelope of speech without noise; (c) speech with noises (SNR is 5 dB); (d) distribution of extreme points on logarithmic envelope of speech with noise
    Fig. 7. Chinese phonetic syllable segmentation under noisy environment. (a) Original speech; (b) distribution of extreme points on logarithmic envelope of speech without noise; (c) speech with noises (SNR is 5 dB); (d) distribution of extreme points on logarithmic envelope of speech with noise
    Filter typeSNR /dB
    2015105
    Chebyshev filtering79.678.377.169.4
    Bessel filtering81.480.679.374.8
    Butterworth filtering91.791.090.687.5
    Table 1. Segmentation accuracy of different filters for different signal-to-noise ratio
    Segmentation algorithmAccuracy /%
    Singularity index1526.9
    Local singularity1350.3
    Short-term energy1276.9
    Fractal dimension1482.3
    Method of this article92.1
    Table 2. Segmentation accuracy of different algorithms without noise
    Segmentation algorithmSNR /dB
    1050-5
    Singular index1524.720.618.716.9
    Local singularity1349.347.742.639.1
    Short-term energy1269.660.358.454.2
    Fractal dimension1483.480.179.771.2
    Method of this article90.687.576.370.6
    Table 3. Speech segmentation accuracy of different algorithms under different signal-to-noise ratio
    Segmentation algorithmVoice duration /s
    46810
    Singular index151.32.23.95.1
    Local singularity132.94.15.77.4
    Short-term energy123.14.65.97.6
    Fractal dimension141.92.74.56.0
    Method of this article2.53.75.46.9
    Table 4. Segmentation time of different algorithms under different durations
    Weikang Tang, Yubin Shao, Hua Long. Chinese Syllable Segmentation Algorithm with Logarithmic Envelope[J]. Laser & Optoelectronics Progress, 2022, 59(10): 1007001
    Download Citation