voice output
Recently Published Documents


TOTAL DOCUMENTS: 93 (FIVE YEARS: 2)

H-INDEX: 18 (FIVE YEARS: 0)

Author(s):  
Daniela Stier ◽  
Katherine Munro ◽  
Ulrich Heid ◽  
Wolfgang Minker
Keyword(s):  

2020 ◽ Vol 8 (6) ◽ pp. 2924-2927

Applications of science and technology have made human life much easier. Vision plays a very important role in a person's life, yet disease, accidents, or other causes may lead to its loss. Navigation becomes a major problem for people with complete or partial blindness. This paper aims to provide navigation guidance for the visually impaired. We have designed a model that gives instructions enabling visionless people to navigate freely. A NoIR camera captures the scene around the person and the objects in it are identified; the detected objects are then announced as voice output through earphones. The model comprises a Raspberry Pi 3 processor, which collects the objects in the surroundings and converts them into a voice message; a NoIR camera to capture the scene; a power bank to supply power; and earphones to deliver the output message. The TensorFlow object detection API, an open-source software library, is used for object detection and classification, and it can detect multiple objects in a single frame. eSpeak, a text-to-speech (TTS) synthesizer, converts the detected object labels into speech. Thus, the video captured by the NoIR camera is converted into voice output that guides the user around the detected objects. Using the COCO model, 90 commonly occurring object classes are identified, such as person, table and book.
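
The detect-and-announce pipeline described in this abstract can be outlined in a short script. The sketch below is illustrative only: it assumes a TensorFlow object-detection SavedModel trained on COCO (the model path and the abbreviated label map are placeholders) and the espeak command-line tool installed on the Raspberry Pi; it is not the authors' implementation.

# Minimal sketch of the detect-then-speak loop: detect objects in one frame
# with a COCO-trained TensorFlow model, then announce them through eSpeak.
# MODEL_DIR and COCO_LABELS are assumptions, not values from the paper.
import subprocess

import numpy as np
import tensorflow as tf

MODEL_DIR = "ssd_mobilenet_v2_coco/saved_model"        # hypothetical path
COCO_LABELS = {1: "person", 62: "chair", 84: "book"}   # abbreviated label map

def detect_objects(frame, detector, score_threshold=0.5):
    """Run the detector on one RGB frame and return the label names found."""
    # Object-detection SavedModels expect a uint8 batch of shape [1, H, W, 3].
    tensor = tf.convert_to_tensor(frame[np.newaxis, ...], dtype=tf.uint8)
    outputs = detector(tensor)
    classes = outputs["detection_classes"][0].numpy().astype(int)
    scores = outputs["detection_scores"][0].numpy()
    return [COCO_LABELS.get(c, "object")
            for c, s in zip(classes, scores) if s >= score_threshold]

def speak(labels):
    """Announce the detected objects through eSpeak over the earphones."""
    if labels:
        subprocess.run(["espeak", "Detected " + ", ".join(labels)], check=True)

if __name__ == "__main__":
    detector = tf.saved_model.load(MODEL_DIR)
    # A real deployment would grab frames from the NoIR camera (e.g. via
    # picamera); here a random image stands in for one captured frame.
    frame = np.random.randint(0, 255, (480, 640, 3), dtype=np.uint8)
    speak(detect_objects(frame, detector))

In the deployed system this loop would run continuously on camera frames, with the score threshold and announcement rate tuned so the voice output stays intelligible for the user.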


Author(s):  
Marc Freixes ◽  
Francesc Alías ◽  
Joan Claudi Socoró

Text-to-speech (TTS) synthesis systems have been widely used in general-purpose applications based on the generation of speech. Nonetheless, there are some domains, such as storytelling or voice output aid devices, which may also require singing. To enable a corpus-based TTS system to sing, a supplementary singing database should be recorded. This solution, however, might be too costly for eventual singing needs, or even unfeasible if the original speaker is unavailable or unable to sing properly. This work introduces a unit selection-based text-to-speech-and-singing (US-TTS&S) synthesis framework, which integrates speech-to-singing (STS) conversion to enable the generation of both speech and singing from an input text and a score, respectively, using the same neutral speech corpus. The viability of the proposal is evaluated considering three vocal ranges and two tempos on a proof-of-concept implementation using a 2.6-h Spanish neutral speech corpus. The experiments show that challenging STS transformation factors are required to sing beyond the corpus vocal range and/or with notes longer than 150 ms. While score-driven US configurations allow the reduction of pitch-scale factors, time-scale factors are not reduced due to the short length of the spoken vowels. Moreover, in the MUSHRA test, text-driven and score-driven US configurations obtain similar naturalness rates of around 40 for all the analysed scenarios. Although these naturalness scores are far from those of Vocaloid, the singing scores of around 60 validate that the framework can reasonably address eventual singing needs.
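
To make the role of the STS transformation factors mentioned above concrete, the following sketch shows how a pitch-scale and a time-scale factor could be derived for one spoken vowel and one target note. The function names and example values are assumptions for illustration, not part of the published framework.

# Illustrative derivation of speech-to-singing transformation factors:
# the pitch-scale factor maps a spoken vowel's pitch to the target note,
# and the time-scale factor stretches the vowel to the note's duration.

MIDI_A4 = 69
FREQ_A4 = 440.0

def midi_to_hz(note: int) -> float:
    """Convert a MIDI note number to its frequency in Hz."""
    return FREQ_A4 * 2.0 ** ((note - MIDI_A4) / 12.0)

def sts_factors(spoken_f0_hz: float, spoken_dur_ms: float,
                target_note: int, target_dur_ms: float):
    """Return (pitch_scale, time_scale) for one vowel/note pair."""
    pitch_scale = midi_to_hz(target_note) / spoken_f0_hz
    time_scale = target_dur_ms / spoken_dur_ms
    return pitch_scale, time_scale

# Example: a 120 Hz, 80 ms spoken vowel sung as a 300 ms A3 (220 Hz)
# gives pitch_scale of about 1.83 and time_scale of 3.75; long notes over
# short spoken vowels are what keeps time-scale factors high, as the
# abstract notes.
print(sts_factors(120.0, 80.0, 57, 300.0))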

