scholarly journals Development of speech technologies to support hearing through mobile terminal users

Author(s):  
Taro Togawa ◽  
Takeshi Otani ◽  
Kaori Suzuki ◽  
Tomohiko Taniguchi

Mobile terminals have become the most familiar communication tool we use, and various types of people have come to use mobile terminals in various environments. Accordingly, situations in which we talk over the telephone in noisy environments or with someone who speaks fast have increased. However, it is sometimes difficult to hear a person's voice in these cases. To make the voice received through mobile terminals easy to hear, authors have developed two technologies. One is a voice enhancement technology that emphasizes a caller's voice according to the noise surrounding the recipient, and the other is a speech rate conversion technology that slows speech while maintaining voice quality. In this paper, we explain the trends and the features of these technologies and discuss ways to implement their algorithms on mobile terminals.

1976 ◽  
Vol 19 (1) ◽  
pp. 168-180 ◽  
Author(s):  
Ralph O. Coleman

Comparisons were made between the contributions of the fundamental frequency (F 0 ) on one hand, and vocal tract resonances on the other, to a perception of maleness and femaleness in the adult voice. In the first of two experiments, the F 0 of natural voice was found to be very highly correlated with the degree of maleness and femalenesss in the voice. The vocal tract resonances were less highly correlated and it is apparent that in the presence of the natural laryngeal tone, these perceptions are based on the frequency of the F 0 . In the second experiment, a tone produced by a laryngeal vibrator was substituted for the normal glottal tone at simulated F 0 's representing both males (120 Hz) and females (240 Hz). When listeners were asked to identify the sex of the speakers some inconsistency with the findings of the first experiment was seen. The female F 0 was a weak indicator of female voice quality when combined with male vocal tract resonance although the male F 0 retained the perceptual prominence seen in the first experiment. This finding may be indicative of some basic difference in the normal glottal characteristics of males and females.


2021 ◽  
pp. 22-30
Author(s):  
Anna V. Dunashova ◽  

This paper aims to study phonostylistic variation of prosodic characteristics of a linguistic persona. The new aspect brought to the field is the focus not only on pitch and speech rate but also on voice quality prosodic aspects of a linguistic persona. The subject was a world-famous British linguist David Crystal whose recordings of lecture and interview were used as the material for this study. The data suggest wide variability of practically every prosodic feature. Among them, pitch minimum, pitch range, loudness median and shimmer values proved to be most constant features of the linguistic persona. The other prosodic values underwent changes due to the shift from the modal voice in the interview to a more arduous voice in the lecture thus reflecting different pragmatics goals of commuication. Prosodic variation range of the linguistic persona in question turned out wider than the average one.


2021 ◽  
Vol 11 (23) ◽  
pp. 11320
Author(s):  
Giorgia Cistola ◽  
Alex Peiró-Lilja ◽  
Guillermo Cámbara ◽  
Ineke van der Meulen ◽  
Mireia Farrús

Text-to-speech (TTS) systems provide fundamental reading support for people with aphasia and reading difficulties. However, artificial voices are more difficult to process than natural voices. The current study is an extended analysis of the results of a clinical experiment investigating which, among three artificial voices and a digitised human voice, is more suitable for people with aphasia and reading impairments. Such results show that the voice synthesised with Ogmios TTS, a concatenative speech synthesis system, caused significantly slower reaction times than the other three voices used in the experiment. The present study explores whether and what voice quality metrics are linked to delayed reaction times. For this purpose, the voices were analysed using an automatic assessment of intelligibility, naturalness, and jitter and shimmer voice quality parameters. This analysis revealed that Ogmios TTS, in general, performed worse than the other voices in all parameters. These observations could explain the significantly delayed reaction times in people with aphasia and reading impairments when listening to Ogmios TTS and could open up consideration about which TTS to choose for compensative devices for these patients based on the voice analysis of these parameters.


TEKNOSASTIK ◽  
2018 ◽  
Vol 14 (2) ◽  
pp. 1
Author(s):  
Dina Amelia

There are two most inevitable issues on national literature, in this case Indonesian literature. First is the translation and the second is the standard of world literature. Can one speak for the other as a representative? Why is this representation matter? Does translation embody the voice of the represented? Without translation Indonesian literature cannot gain its recognition in world literature, yet, translation conveys the voice of other. In the case of production, publication, or distribution of Indonesian Literature to the world, translation works can be very beneficial. The position of Indonesian literature is as a part of world literature. The concept that the Western world should be the one who represent the subaltern can be overcome as long as the subaltern performs as the active speaker. If the subaltern remains silent then it means it allows the “representation” by the Western.


Metahumaniora ◽  
2017 ◽  
Vol 7 (3) ◽  
pp. 411
Author(s):  
Abu Bakar Ramadhan Muhamad

AbstrakHegemoni kolonialisme dalam budaya poskolonial merupakan alasan penelitian inikemudian mengkaji wacana kolonial dalam novel Max Havellar (MH) khususnya dampakditimbulkannya. Dampak dimaksud adalah posisi keberpihakan pemikiran tersirat darikarya tersebut. Hasil pembahasan menunjukkan, secara temporal maupun permanen MHmenyuarakan ketidakadilan dalam kondisi-kondisi kolonial menyangkut penindasan sangpenjajah terhadap terjajah. Hanya saja, upaya mengatasnamakan atau mewakili suarakaum terjajah terbukti mengimplikasikan ciri ideologis statis kerangka kolonialisme(orientalisme); yakni cara pandang Eropasentris, di mana “Barat” sebagai self adalah superior,dan “Timur” sebagai other adalah inferior. Dalam konteks poskolonialisme, MH dengan sifatkritisnya yang berupaya “menyuarakan” nasib pribumi terjajah, justru menampilkan stigmapenguatan kolonialitas itu sendiri secara hegemonik. Artinya, “menyuarakan” nasib pribumidimaknai sebagai keberpihankan kolonial yang kontradiktif, di mana stigma penguatankolonialitas justru lebih terasa, ujung-ujungnya melanggengkan hegemoni kolonial. Tidakmembela yang terjajah, tetapi memperhalus cara kerja mesin kolonial.AbstractThe hegemony of colonialism in the culture of postcolonial society is the reason this studythen examines the colonial discourse in the novel Max Havellar (MH) in particular the impactit brings. The impact in question is the implied position of thought in the work. The resultsof the discussion show that, temporarily or permanently, MH voiced injustice in the colonialconditions regarding the oppression of the colonist against the colonized. However, the effort toname or represent the voice of the colonized has proven to imply a static ideological characterin the framework of colonialism (orientalism); ie Eropacentric point of view, in which “West” asself is superior, and “East” as the other is the inferior. In the context of postcolonialism, MH withits critical nature that seeks to “voice” the fate of the colonized natives, actually presents thestigma of strengthening coloniality itself hegemonicly. That is, “voicing” the fate of the pribumiis interpreted as a contradictory colonial flare, where the stigma of strengthening colonialityis more pronounced, which ultimately perpetuates the hegemony of colonialism. No longerdefending the colonized, but refining the workings of the colonial machinery.


Author(s):  
Celine Parreñas Shimizu

Transnational films representing intimacy and inequality disrupt and disgust Western spectators. When wounded bodies within poverty entangle with healthy wealthy bodies in sex, romance and care, fear and hatred combine with desire and fetishism. Works from the Philippines, South Korea, and independents from the United States and France may not be made for the West and may not make use of Hollywood traditions. Rather, they demand recognition for the knowledge they produce beyond our existing frames. They challenge us to go beyond passive consumption, or introspection of ourselves as spectators, for they represent new ways of world-making we cannot unsee, unhear, or unfeel. The spectator is redirected to go beyond the rapture of consuming the other to the rupture that arises from witnessing pain and suffering. Self-displacement is what proximity to intimate inequality in cinema ultimately compels and demands so as to establish an ethical way of relating to others. In undoing the spectator, the voice of the transnational filmmaker emerges. Not only do we need to listen to filmmakers from outside Hollywood who unflinchingly engage the inexpressibility of difference, we need to make room for critics and theorists who prioritize the subjectivities of others. When the demographics of filmmakers and film scholars are not as diverse as its spectators, films narrow our worldviews. To recognize our culpability in the denigration of others unleashes the power of cinema. The unbearability of stories we don’t want to watch and don’t want to feel must be borne.


1790 ◽  
Vol 2 (2) ◽  
pp. 111-153
Author(s):  
Andrew Dalzel

The power of pronouncing articulate sounds is one of the most obvious marks which distinguish man from the other animals. No philosophical investigation is necessary for pointing it out, and therefore it has not escaped the notice of the poets, the most ancient of all authors. In the works of Homer and Hesiod, we often meet with the expression μέϱοπες ἄνθϱωποι, men having an articulate voice; the word μέϱοψ being evidently compounded of μείϱω, to divide, and ὄψ, the voice.


2021 ◽  
pp. 1-26
Author(s):  
Teresa Pratt

Abstract This article argues for a focus on affect in sociolinguistic style. I integrate recent scholarship on affective practice (Wetherell 2015) and the circulation of affective value (Ahmed 2004b) in order to situate the linguistic and bodily semiotics of affect as components of stylistic practice. At a Bay Area public arts high school, ideologically distinct affects of chill or high-energy are co-constructed across signs and subjects. I analyze a group of cisgender young men's use of creaky voice quality, speech rate, and bodily hexis in enacting and circulating these affective values. Crucially, affect co-constructs students’ positioning within the high school political economy (as college-bound or not, artistically driven or not), highlighting the ideological motivations of stylistic practice. Building on recent scholarship, I propose that a more thorough consideration of affect can deepen our understanding of meaning-making as it occurs in everyday interaction in institutional settings. (Affect, political economy, embodiment, bricolage, voice quality, speech rate, high school)


Sign in / Sign up

Export Citation Format

Share Document