Method and apparatus for speech generation from phonetic codes

Richard T. Gagnon

doi:10.1121/1.416057

Real-Time Bangla Sign Language Detection with Sentence and Speech Generation

2020 23rd International Conference on Computer and Information Technology (ICCIT) ◽

10.1109/iccit51783.2020.9392693 ◽

2020 ◽

Author(s):

Dipon Talukder ◽

Fatima Jahara

Keyword(s):

Real Time ◽

Sign Language ◽

Speech Generation ◽

Language Detection

Download Full-text

End-to-end Image-to-speech Generation for Untranscribed Unknown Languages

IEEE Access ◽

10.1109/access.2021.3071541 ◽

2021 ◽

pp. 1-1

Author(s):

Johanes Effendi ◽

Sakriani Sakti ◽

Satoshi Nakamura

Keyword(s):

Speech Generation ◽

End To End

Download Full-text

Synthetic speech detection through short-term and long-term prediction traces

EURASIP Journal on Information Security ◽

10.1186/s13635-021-00116-3 ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

Clara Borrelli ◽

Paolo Bestagini ◽

Fabio Antonacci ◽

Augusto Sarti ◽

Stefano Tubaro

Keyword(s):

Deep Learning ◽

Speech Processing ◽

Synthetic Speech ◽

Opinion Formation ◽

Closed Set ◽

Speech Detection ◽

Open Set ◽

Technological Advances ◽

Speech Generation ◽

Long Term Prediction

AbstractSeveral methods for synthetic audio speech generation have been developed in the literature through the years. With the great technological advances brought by deep learning, many novel synthetic speech techniques achieving incredible realistic results have been recently proposed. As these methods generate convincing fake human voices, they can be used in a malicious way to negatively impact on today’s society (e.g., people impersonation, fake news spreading, opinion formation). For this reason, the ability of detecting whether a speech recording is synthetic or pristine is becoming an urgent necessity. In this work, we develop a synthetic speech detector. This takes as input an audio recording, extracts a series of hand-crafted features motivated by the speech-processing literature, and classify them in either closed-set or open-set. The proposed detector is validated on a publicly available dataset consisting of 17 synthetic speech generation algorithms ranging from old fashioned vocoders to modern deep learning solutions. Results show that the proposed method outperforms recently proposed detectors in the forensics literature.

Download Full-text

New approach to the polyglot speech generation by means of an HMM-based speaker adaptable synthesizer

Speech Communication ◽

10.1016/j.specom.2006.05.003 ◽

2006 ◽

Vol 48 (10) ◽

pp. 1227-1242 ◽

Cited By ~ 30

Author(s):

Javier Latorre ◽

Koji Iwano ◽

Sadaoki Furui

Keyword(s):

New Approach ◽

Speech Generation

Download Full-text

Expressive Visual Speech Generation

Data-Driven 3D Facial Animation ◽

10.1007/978-1-84628-907-1_2 ◽

2007 ◽

pp. 29-59

Author(s):

Thomas Di Giacomo ◽

Stephane Garchery ◽

Nadia Magnenat-Thalmann

Keyword(s):

Visual Speech ◽

Speech Generation

Download Full-text

Development of Speech Generation Device Program for Student with Cerebral Palsy

The Journal of the Korea Contents Association ◽

10.5392/jkca.2009.9.12.448 ◽

2009 ◽

Vol 9 (12) ◽

pp. 448-458

Author(s):

Jin-Bok Koh ◽

Byung-Un Jeon

Keyword(s):

Cerebral Palsy ◽

Speech Generation

Download Full-text

Thought as word dynamics

Behavioral and Brain Sciences ◽

10.1017/s0140525x99361823 ◽

1999 ◽

Vol 22 (2) ◽

pp. 295-295

Author(s):

Paul J. M. Jorion

Keyword(s):

Parts Of Speech ◽

Functional Relationships ◽

Gradient Model ◽

Speech Generation

A Hebbian model for speech generation opens a number of paths. A cross-linguistic scheme of functional relationships (inspired by Aristotle) dispenses with distraction by the “parts of speech” distinctions, while bridging the gap between “content” and “structure” words. A gradient model identifies emotional and rational dynamics and shows speech generation as a process where a speaker's dissatisfaction gets minimized.

Download Full-text

Speech Generation in Mobile Phones

Human Factors and Voice Interactive Systems - Signals and Communication Technology ◽

10.1007/978-0-387-68439-0_6 ◽

2007 ◽

pp. 163-191 ◽

Cited By ~ 3

Author(s):

Géza Németh ◽

Géza Kiss ◽

Csaba Zainkó ◽

Gábor Olaszy ◽

Bálint Tóth

Keyword(s):

Mobile Phones ◽

Speech Generation

Download Full-text

SPEECH-TO-SPEECH GENERATION SYSTEM AND METHOD

The Journal of the Acoustical Society of America ◽

10.1121/1.4707514 ◽

2012 ◽

Vol 131 (4) ◽

pp. 3203

Author(s):

Shen Liqin

Keyword(s):

Generation System ◽

Speech Generation

Download Full-text

Concept‐to‐speech conversion for reply speech generation in a spoken dialogue system for road guidance and its prosodic control

The Journal of the Acoustical Society of America ◽

10.1121/1.4787191 ◽

2006 ◽

Vol 120 (5) ◽

pp. 3038-3038

Author(s):

Yuji Yagi ◽

Seiya Takada ◽

Keikichi Hirose ◽

Nobuaki Minematsu

Keyword(s):

Dialogue System ◽

Spoken Dialogue ◽

Spoken Dialogue System ◽

Speech Generation

Download Full-text