Development of Parameters towards Voice Bifurcations

Takeshi Ikuma; Andrew J. McWhorter; Lacey Adkins; Melda Kunduk

doi:10.3390/app11125469

Development of Parameters towards Voice Bifurcations

Applied Sciences ◽

10.3390/app11125469 ◽

2021 ◽

Vol 11 (12) ◽

pp. 5469

Author(s):

Takeshi Ikuma ◽

Andrew J. McWhorter ◽

Lacey Adkins ◽

Melda Kunduk

Keyword(s):

Vocal Folds ◽

Selection Algorithm ◽

Voice Source ◽

Frequency Selection ◽

Wide Range ◽

Disturbance Factor ◽

Subglottal Pressure ◽

Glottal Source ◽

Fast Classification

Pathological vocal folds are known to exhibit multiple oscillation patterns, depending on tissue imbalance, subglottal pressure level, and other factors. This includes mid-phonation changes due to bifurcations in the underlying voice source system. Knowledge of when changes in oscillation patterns occur is helpful in the assessments of voice disorders, and the knowledge could be transformed into useful objective measures. Mid-phonation bifurcations can occur in rapid succession; hence, a fast classification of oscillation pattern is critical to minimize the averaging of data across bifurcations. This paper proposes frequency-ratio based short-term measures, named harmonic disturbance factor (HDF) and biphonic index (BI), towards the detection of the bifurcations. For the evaluation of HDF and BI, a frequency selection algorithm for glottal source signals is devised, and its efficacy is demonstrated with the glottal area waveforms of four cases, representing the wide range of oscillatory behaviors. The HDF and BI exhibit clear transitions when the voice bifurcations are apparent in the spectrograms. The presented proof-of-concept experiment’s outcomes warrant a larger scale study to formalize the parameters of the frequency selection algorithm.

Download Full-text

The Singing Voice

The Oxford Handbook of Voice Perception ◽

10.1093/oxfordhb/9780198743187.013.6 ◽

2018 ◽

pp. 116-142

Author(s):

Johan Sundberg

Keyword(s):

Mechanical Properties ◽

Vocal Tract ◽

Voice Quality ◽

Vocal Folds ◽

Sound Level ◽

Vowel Sound ◽

Formant Frequency ◽

Voice Source ◽

Subglottal Pressure ◽

The Voice

The sound quality of singing is determined by three basic factors—the air pressure under the vocal folds (or the subglottal pressure), the mechanical properties of the vocal folds, and the resonance properties of the vocal tract. Subglottal pressure is controlled by the respiratory apparatus. It regulates vocal loudness and is varied with pitch in singing. Together with the mechanical properties of the folds, which are controlled by laryngeal muscles, it has a decisive influence on vocal fold vibrationswhich convert the tracheal airstream to a pulsating airflow, the voice source. The voice source determines pitch, vibrato, and register, and also the overall slope of the spectrum. The sound of the voice source is filtered by the resonances of the vocal tract, or the formants, of which the two lowest determine the vowel quality and the higher ones the personal voice quality. Timing is crucial for creating emotional expressivity; it uses an acoustic code that shows striking similarities to that used in speech. The perceived loudness of a vowel sound seems more closely related to the subglottal pressure with which it was produced than with the acoustical sound level. Some investigations of acoustical correlates of tone placement and variation of larynx height are described, as are properties that affect the perceived naturalness of synthesized singing. Finally, subglottal pressure, voice source, and formant-frequency characteristics of some non-classical styles of singing are discussed.

Download Full-text

A Quantitative Output-Cost Ratio in Voice Production

Journal of Speech Language and Hearing Research ◽

10.1044/1092-4388(2001/003) ◽

2001 ◽

Vol 44 (1) ◽

pp. 29-37 ◽

Cited By ~ 53

Author(s):

David A. Berry ◽

Katherine Verdolini ◽

Douglas W. Montequin ◽

Markus M. Hess ◽

Roger W. Chan ◽

...

Keyword(s):

Vocal Tract ◽

Human Subjects ◽

Vocal Folds ◽

Cost Ratio ◽

Voice Production ◽

Wide Range ◽

Subglottal Pressure ◽

Potential Clinical Utility ◽

Acoustic Output ◽

Excised Larynx

A quantitative output-cost ratio (OCR) is proposed for objective use in voice production and is defined as the ratio of the acoustic output intensity to the collision intensity of the vocal folds. Measurement of the OCR is demonstrated in a laboratory experiment using 5 excised larynges and a transducer designed for use on human subjects. Data were gathered at constant fundamental frequency (150 Hz). Subglottal pressure was varied from 1.0 to 1.6 kPa, and glottal width at the vocal processes was varied from a pressed condition to a 2-mm gap. The OCR was plotted as a function of glottal width. With no vocal tract, the excised larynx experiments yielded a broad maxima in the OCR curves, across all subglottal pressure conditions, at about 0.6 mm. Computer simulations indicate that sharper maxima may occur when the influence of the vocal tract is taken into account. The potential clinical utility of the OCR is discussed for treatment of a wide range of voice disorders, including those involving both hyper- and hypoadduction.

Download Full-text

Subglottal pressure oscillations in anechoic and resonant conditions and their influence on excised larynx phonations

Scientific Reports ◽

10.1038/s41598-020-79265-3 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Hugo Lehoux ◽

Vít Hampala ◽

Jan G. Švec

Keyword(s):

Vocal Fold ◽

Computational Models ◽

Vocal Folds ◽

Voice Source ◽

Resonance Frequencies ◽

Subglottal Pressure ◽

Radiated Sound ◽

Acoustic Resonances ◽

The Impact ◽

Excised Larynx

AbstractExcised larynges serve as natural models for studying behavior of the voice source. Acoustic resonances inside the air-supplying tubes below the larynx (i.e., subglottal space), however, interact with the vibratory behavior of the larynges and obscure their inherent vibration properties. Here, we explore a newly designed anechoic subglottal space which allows removing its acoustic resonances. We performed excised larynx experiments using both anechoic and resonant subglottal spaces in order to analyze and compare, for the very first time, the corresponding subglottal pressures, electroglottographic and radiated acoustic waveforms. In contrast to the resonant conditions, the anechoic subglottal pressure waveforms showed negligible oscillations during the vocal fold contact phase, as expected. When inverted, these waveforms closely matched the inverse filtered radiated sound waveforms. Subglottal resonances modified also the radiated sound pressures (Level 1 interactions). Furthermore, they changed the fundamental frequency (fo) of the vocal fold oscillations and offset phonation threshold pressures (Level 2 interactions), even for subglottal resonance frequencies 4–10 times higher than fo. The obtained data offer the basis for better understanding the inherent vibratory properties of the vocal folds, for studying the impact of structure-acoustic interactions on voice, and for validation of computational models of voice production.

Download Full-text

Modes of Interaction in Naturally Occurring Medical Encounters With General Practitioners: The “One in a Million” Study

Qualitative Health Research ◽

10.1177/1049732321993790 ◽

2021 ◽

pp. 104973232199379

Author(s):

Olaug S. Lian ◽

Sarah Nettleton ◽

Åge Wifstad ◽

Christopher Dowrick

Keyword(s):

General Practitioners ◽

Narrative Analysis ◽

Mode Competition ◽

General Applicability ◽

Naturally Occurring ◽

Wide Range ◽

The One ◽

Narrative Mode ◽

Modes Of Interaction

In this article, we qualitatively explore the manner and style in which medical encounters between patients and general practitioners (GPs) are mutually conducted, as exhibited in situ in 10 consultations sourced from the One in a Million: Primary Care Consultations Archive in England. Our main objectives are to identify interactional modes, to develop a classification of these modes, and to uncover how modes emerge and shift both within and between consultations. Deploying an interactional perspective and a thematic and narrative analysis of consultation transcripts, we identified five distinctive interactional modes: question and answer (Q&A) mode, lecture mode, probabilistic mode, competition mode, and narrative mode. Most modes are GP-led. Mode shifts within consultations generally map on to the chronology of the medical encounter. Patient-led narrative modes are initiated by patients themselves, which demonstrates agency. Our classification of modes derives from complete naturally occurring consultations, covering a wide range of symptoms, and may have general applicability.

Download Full-text

Uncertainty-Aware Deep Learning-Based Cardiac Arrhythmias Classification Model of Electrocardiogram Signals

Computers ◽

10.3390/computers10060082 ◽

2021 ◽

Vol 10 (6) ◽

pp. 82

Author(s):

Ahmad O. Aseeri

Keyword(s):

Deep Learning ◽

Cardiac Arrhythmias ◽

Large Scale ◽

Clinical Decision Making ◽

Probabilistic Approach ◽

Classification Model ◽

Gating Mechanism ◽

Uncertainty Estimates ◽

Wide Range

Deep Learning-based methods have emerged to be one of the most effective and practical solutions in a wide range of medical problems, including the diagnosis of cardiac arrhythmias. A critical step to a precocious diagnosis in many heart dysfunctions diseases starts with the accurate detection and classification of cardiac arrhythmias, which can be achieved via electrocardiograms (ECGs). Motivated by the desire to enhance conventional clinical methods in diagnosing cardiac arrhythmias, we introduce an uncertainty-aware deep learning-based predictive model design for accurate large-scale classification of cardiac arrhythmias successfully trained and evaluated using three benchmark medical datasets. In addition, considering that the quantification of uncertainty estimates is vital for clinical decision-making, our method incorporates a probabilistic approach to capture the model’s uncertainty using a Bayesian-based approximation method without introducing additional parameters or significant changes to the network’s architecture. Although many arrhythmias classification solutions with various ECG feature engineering techniques have been reported in the literature, the introduced AI-based probabilistic-enabled method in this paper outperforms the results of existing methods in outstanding multiclass classification results that manifest F1 scores of 98.62% and 96.73% with (MIT-BIH) dataset of 20 annotations, and 99.23% and 96.94% with (INCART) dataset of eight annotations, and 97.25% and 96.73% with (BIDMC) dataset of six annotations, for the deep ensemble and probabilistic mode, respectively. We demonstrate our method’s high-performing and statistical reliability results in numerical experiments on the language modeling using the gating mechanism of Recurrent Neural Networks.

Download Full-text

Fast Classification of MPI Applications Using Lamport's Logical Clocks

2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS) ◽

10.1109/ipdps.2016.40 ◽

2016 ◽

Cited By ~ 1

Author(s):

Zhou Tong ◽

Scott Pakin ◽

Michael Lang ◽

Xin Yuan

Keyword(s):

Mpi Applications ◽

Fast Classification

Download Full-text

Classification of unlabeled online media

Scientific Reports ◽

10.1038/s41598-021-85608-5 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Sakthi Kumar Arul Prakash ◽

Conrad Tucker

Keyword(s):

Social Media ◽

Real World ◽

Graphical Model ◽

Ground Truth ◽

Classification Problem ◽

Machine Learning Algorithms ◽

Social Media Networks ◽

Online Social Media ◽

Wide Range

AbstractThis work investigates the ability to classify misinformation in online social media networks in a manner that avoids the need for ground truth labels. Rather than approach the classification problem as a task for humans or machine learning algorithms, this work leverages user–user and user–media (i.e.,media likes) interactions to infer the type of information (fake vs. authentic) being spread, without needing to know the actual details of the information itself. To study the inception and evolution of user–user and user–media interactions over time, we create an experimental platform that mimics the functionality of real-world social media networks. We develop a graphical model that considers the evolution of this network topology to model the uncertainty (entropy) propagation when fake and authentic media disseminates across the network. The creation of a real-world social media network enables a wide range of hypotheses to be tested pertaining to users, their interactions with other users, and with media content. The discovery that the entropy of user–user and user–media interactions approximate fake and authentic media likes, enables us to classify fake media in an unsupervised learning manner.

Download Full-text

Sequence data from isolated lichen-associated melanized fungi enhance delimitation of two new lineages within Chaetothyriomycetidae

Mycological Progress ◽

10.1007/s11557-021-01706-8 ◽

2021 ◽

Vol 20 (7) ◽

pp. 911-927

Author(s):

Lucia Muggia ◽

Yu Quan ◽

Cécile Gueidan ◽

Abdullah M. S. Al-Hatmi ◽

Martin Grube ◽

...

Keyword(s):

Sequence Data ◽

Single Species ◽

Sister Group ◽

Asexual Propagation ◽

Dna Sequence Data ◽

Wide Range ◽

The Family ◽

Rock Inhabiting Fungi ◽

Stable Habitat

AbstractLichen thalli provide a long-lived and stable habitat for colonization by a wide range of microorganisms. Increased interest in these lichen-associated microbial communities has revealed an impressive diversity of fungi, including several novel lineages which still await formal taxonomic recognition. Among these, members of the Eurotiomycetes and Dothideomycetes usually occur asymptomatically in the lichen thalli, even if they share ancestry with fungi that may be parasitic on their host. Mycelia of the isolates are characterized by melanized cell walls and the fungi display exclusively asexual propagation. Their taxonomic placement requires, therefore, the use of DNA sequence data. Here, we consider recently published sequence data from lichen-associated fungi and characterize and formally describe two new, individually monophyletic lineages at family, genus, and species levels. The Pleostigmataceae fam. nov. and Melanina gen. nov. both comprise rock-inhabiting fungi that associate with epilithic, crust-forming lichens in subalpine habitats. The phylogenetic placement and the monophyly of Pleostigmataceae lack statistical support, but the family was resolved as sister to the order Verrucariales. This family comprises the species Pleostigma alpinum sp. nov., P. frigidum sp. nov., P. jungermannicola, and P. lichenophilum sp. nov. The placement of the genus Melanina is supported as a lineage within the Chaetothyriales. To date, this genus comprises the single species M. gunde-cimermaniae sp. nov. and forms a sister group to a large lineage including Herpotrichiellaceae, Chaetothyriaceae, Cyphellophoraceae, and Trichomeriaceae. The new phylogenetic analysis of the subclass Chaetothyiomycetidae provides new insight into genus and family level delimitation and classification of this ecologically diverse group of fungi.

Download Full-text

A CONTROL LIST FOR THE SYSTEMATIC IDENTIFICATION OF DISTURBANCE FACTORS

Proceedings of the Design Society ◽

10.1017/pds.2021.6 ◽

2021 ◽

Vol 1 ◽

pp. 51-60

Author(s):

Peter Welzbacher ◽

Gunnar Vorwerk-Handing ◽

Eckhard Kirchner

Keyword(s):

Literature Review ◽

Product Development ◽

Model Theory ◽

Development Process ◽

Product Development Process ◽

Key Factors ◽

Starting Point ◽

Disturbance Factor ◽

Systematic Identification

AbstractThe importance of considering disturbance factors in the product development process is often emphasized as one of the key factors to a functional and secure product. However, there is only a small number of tools to support the developer in the identification of disturbance factors and none of them yet ensures that the majority of occurring disturbance factors is considered. Thus, it is the aim of this contribution to provide a tool in form of a control list for the systematic identification of disturbance factors. At the beginning of this contribution, the terms “disturbance factor” and “uncertainty” are defined based on a literature review and different approaches for the classification of uncertainty are presented. Subsequently, the fundamentals of multipole based model theory are outlined. Moreover, a first approach in terms of a control list for a systematic identification of disturbance factors is discussed. Based on the discussed approach and taking the identified weaknesses as a starting point, a control list is presented that combines the existing basic concept of the control list with the fundamentals of multipole based model theory.

Download Full-text

Reinke's Edema: Phonatory Mechanisms and Management Strategies

Annals of Otology Rhinology & Laryngology ◽

10.1177/000348949710600701 ◽

1997 ◽

Vol 106 (7) ◽

pp. 533-543 ◽

Cited By ~ 68

Author(s):

Steven M. Zeitels ◽

Glenn W. Bunting ◽

Robert E. Hillman ◽

Traci Vaughn

Keyword(s):

Lamina Propria ◽

Fundamental Frequency ◽

Vocal Fold ◽

Management Strategies ◽

Vocal Folds ◽

Vocal Fold Vibration ◽

Reinke’S Edema ◽

Subglottal Pressure ◽

Almost All ◽

Superficial Lamina

Reinke's edema (RE) has been associated typically with smoking and sometimes with vocal abuse, but aspects of the pathophysiology of RE remain unclear. To gain new insights into phonatory mechanisms associated with RE pathophysiology, weused an integrated battery of objective vocal function tests to analyze 20 patients (19 women) who underwent phonomicrosurgical resection. Preoperative stroboscopic examinations demonstrated that the superficial lamina propria is distended primarily on the superior vocal fold surface. Acoustically, these individuals have an abnormally low average speaking fundamental frequency (123 Hz), and they generate abnormally high average subglottal pressures (9.7 cm H20). The presence of elevated aerodynamic driving pressures reflects difficulties in producing vocal fold vibration that are most likely the result of mass loading associated with RE, and possibly vocal hyperfunction. Furthermore, it is hypothesized that in the environment of chronic glottal mucositis secondary to smoking and reflux, the cephalad force on the vocal folds by the subglottal driving pressure contributes to the superior distention of the superficial lamina propria. Surgical reduction of the volume of the superficial lamina propria resulted in a significant elevation in fundamental frequency (154 Hz) and improvement in perturbation measures. In almost all instances, both the clinician and the patient perceived the voice as improved. However, these patients continued to generate elevated subglottal pressure (probably a sign of persistent hyperfunction) that was accompanied by visually observed supraglottal strain despite the normalsized vocal folds. This finding suggests that persistent hyperfunctional vocal behaviors may contribute to postsurgical RE recurrence if therapeutic strategies are not instituted to modify such behavior.

Download Full-text