.

ISSN 2063-5346
For urgent queries please contact : +918130348310

CLASSIFICATION OF SPEECH FLUENCY THROUGH TRANSFER LEARNING ON SPECTROGRAM

Main Article Content

S. Mohamed Mansoor Roomi1*, K. Priya2 , V. Natchu3 , Faazelah Mohamed Farook4
» doi: 10.48047/ecb/2023.12.si5a.0272

Abstract

Fluency recognition from speech signals plays a vital role in computer-assisted voice analysis. The proposed work presents a computational framework using an audio processing system capable of classifying the fluency of speech such as fluency, non-fluency pause, and non-fluency stammer. The proposed model comprises preprocessing, spectrogram generation, and classification of speech fluency by the VGG16 pre-trained model. This model consists of convolutional layers and these layers extract discriminative features from spectrogram images of the speech signal. In this work, speech datasets such as Libri Speech, Crosslinguistic Corpus of Hesitation Phenomena (CCHP) English, and University College London’s Archive of Stuttered Speech (UCLASS) were used to find speech fluency. The performance of the proposed model was compared with the existing pre-trained network and state of art methods.

Article Details