Review on human speech Recognition Techniques

2022 ◽  
Author(s):  
Ravikiran R
Author(s):  
PHILIPPE MORIN ◽  
JEAN-PAUL HATON ◽  
JEAN-MARIE PIERREL ◽  
GUENTHER RUSKE ◽  
WALTER WEIGEL

In the framework of man-machine communication, oral dialogue has a particular place since human speech presents several advantages when used either alone or in multimedia interfaces. The last decade has witnessed a proliferation of research into speech recognition and understanding, but few systems have been defined with a view to managing and understanding an actual man-machine dialogue. The PARTNER system that we describe in this paper proposes a solution in the case of task oriented dialogue with the use of artificial languages. A description of the essential characteristics of dialogue systems is followed by a presentation of the architecture and the principles of the PARTNER system. Finally, we present the most recent results obtained in the oral management of electronic mail in French and German.


2018 ◽  
Author(s):  
Dave F Kleinschmidt

One of the persistent puzzles in understanding human speech perception is how listeners cope with talker variability. One thing that might help listeners is structure in talker variability: rather than varying randomly, talkers of the same gender, dialect, age, etc. tend to produce language in similar ways. Sociolinguistic research has shown that listeners are sensitive to this covariation between linguistic variation and socio-indexical variables. In this paper I present new techniques based on ideal observer models to quantify 1) the amount and type of structure in talker variation, and 2) how useful such structure can be for robust speech recognition in the face of talker variability. I demonstrate these techniques in two phonetic domains---word-initial stop voicing and vowel identity---and show that these domains have different amounts and types of talker variability, consistent with previous, impressionistic findings. An `R` package accompanies this paper, enabling researchers to apply these techniques to their own data.


2020 ◽  
Vol 44 (4) ◽  
Author(s):  
James S. Magnuson ◽  
Heejo You ◽  
Sahil Luthra ◽  
Monica Li ◽  
Hosung Nam ◽  
...  

2020 ◽  
Vol 13 (4) ◽  
pp. 650-656
Author(s):  
Somayeh Khajehasani ◽  
Louiza Dehyadegari

Background: Today, the automatic intelligent system requirement has caused an increasing consideration on the interactive modern techniques between human being and machine. These techniques generally consist of two types: audio and visual methods. Meanwhile, the need for developing the algorithms that enable the human speech recognition by machine is of high importance and frequently studied by the researchers. Objective: Using artificial intelligence methods has led to better results in human speech recognition, but the basic problem is the lack of an appropriate strategy to select the recognition data among the huge amount of speech information that practically makes it impossible for the available algorithms to work. Method: In this article, to solve the problem, the linear predictive coding coefficients extraction method is used to sum up the data related to the English digits pronunciation. After extracting the database, it is utilized to an Elman neural network to recognize the relation between the linear coding coefficients of an audio file with the pronounced digit. Results: The results show that this method has a good performance compared to other methods. According to the experiments, the obtained results of network training (99% recognition accuracy) indicate that the network still has better performance than RBF despite many errors. Conclusion: The results of the experiments showed that the Elman memory neural network has had an acceptable performance in recognizing the speech signal compared to the other algorithms. The use of the linear predictive coding coefficients along with the Elman neural network has led to higher recognition accuracy and improved the speech recognition system.


2020 ◽  
Vol 40 (34) ◽  
pp. 6613-6623
Author(s):  
Miriam I. Marrufo-Pérez ◽  
Dora del Pilar Sturla-Carreto ◽  
Almudena Eustaquio-Martín ◽  
Enrique A. Lopez-Poveda

Sign in / Sign up

Export Citation Format

Share Document