Influence of Analyzed Sequence Length on Parameters in Laryngeal High-Speed Videoendoscopy

Patrick Schlegel; Marion Semmler; Melda Kunduk; Michael Döllinger; Christopher Bohr; Anne Schützenberger

doi:10.3390/app8122666

Influence of Analyzed Sequence Length on Parameters in Laryngeal High-Speed Videoendoscopy

Applied Sciences ◽

10.3390/app8122666 ◽

2018 ◽

Vol 8 (12) ◽

pp. 2666 ◽

Cited By ~ 5

Author(s):

Patrick Schlegel ◽

Marion Semmler ◽

Melda Kunduk ◽

Michael Döllinger ◽

Christopher Bohr ◽

...

Keyword(s):

Vocal Fold ◽

High Speed ◽

Vocal Folds ◽

Frame Rate ◽

Sequence Length ◽

Influence Parameter ◽

Variability Index ◽

Perturbation Parameters ◽

Almost All ◽

Healthy Females

Laryngeal high-speed videoendoscopy (HSV) allows objective quantification of vocal fold vibratory characteristics. However, it is unknown how the analyzed sequence length affects some of the computed parameters. To examine if varying sequence lengths influence parameter calculation, 20 HSV recordings of healthy females during sustained phonation were investigated. The clinical prevalent Photron Fastcam MC2 camera with a frame rate of 4000 fps and a spatial resolution of 512 × 256 pixels was used to collect HSV data. The glottal area waveform (GAW), describing the increase and decrease of the area between the vocal folds during phonation, was extracted. Based on the GAW, 16 perturbation parameters were computed for sequences of 5, 10, 20, 50 and 100 consecutive cycles. Statistical analysis was performed using SPSS Statistics, version 21. Only three parameters (18.8%) were statistically significantly influenced by changing sequence lengths. Of these parameters, one changed until 10 cycles were reached, one until 20 cycles were reached and one, namely Amplitude Variability Index (AVI), changed between almost all groups of different sequence lengths. Moreover, visually observable, but not statistically significant, changes within parameters were observed. These changes were often most prominent between shorter sequence lengths. Hence, we suggest using a minimum sequence length of at least 20 cycles and discarding the parameter AVI.

Download Full-text

Comparative analysis of high-speed videolaryngoscopy images and sound data simultaneously acquired from rigid and flexible laryngoscope: a pilot study

Scientific Reports ◽

10.1038/s41598-021-99948-9 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Wioletta Pietruszewska ◽

Marcin Just ◽

Joanna Morawska ◽

Jakub Malinowski ◽

Joanna Hoffman ◽

...

Keyword(s):

Vocal Fold ◽

High Speed ◽

Acoustic Analysis ◽

Vocal Tract ◽

Vocal Folds ◽

Clinical Settings ◽

Vocal Fold Vibration ◽

Audio Data ◽

Bright Color ◽

Perturbation Parameters

AbstractHigh-Speed Videoendoscopy (HSV) is becoming a robust tool for the assessment of vocal fold vibration in laboratory investigation and clinical practice. We describe the first successful application of flexible High Speed Videoendoscopy with innovative laser light source conducted in clinical settings. The acquired image and simultaneously recorded audio data are compared to the results obtained by means of a rigid endoscope. We demonstrated that the HSV recordings with fiber-optic laryngoscope have enabled obtaining consistently bright, color images suitable for parametrization of vocal fold oscillation similarly as in the case of the HSV data obtained from a rigid laryngoscope. The comparison of period and amplitude perturbation parameters calculated on the basis of image and audio data acquired from flexible and rigid HSV recording objectively confirm that flexible High-Speed Videoendoscopy is a more suitable method for examination of natural phonation. The HSV-based measures generated from this kymographic analysis are arguably a superior representation of the vocal fold vibrations than the acoustic analysis because their quantification is independent of the vocal tract influences. This experimental study has several implications for further research in the field of HSV application in clinical assessment of glottal pathologies nature and its effect on vocal folds vibrations.

Download Full-text

High-Speed Imaging to Study an Auto-Oscillating Vocal Fold Replica for Different Initial Conditions

International Journal of Applied Mechanics ◽

10.1142/s1758825117500648 ◽

2017 ◽

Vol 09 (05) ◽

pp. 1750064 ◽

Cited By ~ 2

Author(s):

A. Van Hirtum ◽

X. Pelorson

Keyword(s):

Vocal Fold ◽

High Speed ◽

Initial Conditions ◽

Vocal Folds ◽

High Speed Imaging ◽

Human Voice ◽

Manual Intervention ◽

Geometrical Features ◽

Upstream Pressure

Experiments on mechanical deformable vocal folds replicas are important in physical studies of human voice production to understand the underlying fluid–structure interaction. At current date, most experiments are performed for constant initial conditions with respect to structural as well as geometrical features. Varying those conditions requires manual intervention, which might affect reproducibility and hence the quality of experimental results. In this work, a setup is described which allows setting elastic and geometrical initial conditions in an automated way for a deformable vocal fold replica. High-speed imaging is integrated in the setup in order to decorrelate elastic and geometrical features. This way, reproducible, accurate and systematic measurements can be performed for prescribed initial conditions of glottal area, mean upstream pressure and vocal fold elasticity. Moreover, quantification of geometrical features during auto-oscillation is shown to contribute to the experimental characterization and understanding.

Download Full-text

Reinke's Edema: Phonatory Mechanisms and Management Strategies

Annals of Otology Rhinology & Laryngology ◽

10.1177/000348949710600701 ◽

1997 ◽

Vol 106 (7) ◽

pp. 533-543 ◽

Cited By ~ 68

Author(s):

Steven M. Zeitels ◽

Glenn W. Bunting ◽

Robert E. Hillman ◽

Traci Vaughn

Keyword(s):

Lamina Propria ◽

Fundamental Frequency ◽

Vocal Fold ◽

Management Strategies ◽

Vocal Folds ◽

Vocal Fold Vibration ◽

Reinke’S Edema ◽

Subglottal Pressure ◽

Almost All ◽

Superficial Lamina

Reinke's edema (RE) has been associated typically with smoking and sometimes with vocal abuse, but aspects of the pathophysiology of RE remain unclear. To gain new insights into phonatory mechanisms associated with RE pathophysiology, weused an integrated battery of objective vocal function tests to analyze 20 patients (19 women) who underwent phonomicrosurgical resection. Preoperative stroboscopic examinations demonstrated that the superficial lamina propria is distended primarily on the superior vocal fold surface. Acoustically, these individuals have an abnormally low average speaking fundamental frequency (123 Hz), and they generate abnormally high average subglottal pressures (9.7 cm H20). The presence of elevated aerodynamic driving pressures reflects difficulties in producing vocal fold vibration that are most likely the result of mass loading associated with RE, and possibly vocal hyperfunction. Furthermore, it is hypothesized that in the environment of chronic glottal mucositis secondary to smoking and reflux, the cephalad force on the vocal folds by the subglottal driving pressure contributes to the superior distention of the superficial lamina propria. Surgical reduction of the volume of the superficial lamina propria resulted in a significant elevation in fundamental frequency (154 Hz) and improvement in perturbation measures. In almost all instances, both the clinician and the patient perceived the voice as improved. However, these patients continued to generate elevated subglottal pressure (probably a sign of persistent hyperfunction) that was accompanied by visually observed supraglottal strain despite the normalsized vocal folds. This finding suggests that persistent hyperfunctional vocal behaviors may contribute to postsurgical RE recurrence if therapeutic strategies are not instituted to modify such behavior.

Download Full-text

A portable smartphone-based laryngoscope system for high-speed vocal cord imaging of patients with throat disorders (Preprint)

10.2196/preprints.25816 ◽

2020 ◽

Author(s):

Jun Ki Kim ◽

Youngkyu Kim ◽

Jungmin Oh ◽

Seung-Ho Choi ◽

Ahra Jung ◽

...

Keyword(s):

Image Processing ◽

Vocal Cord ◽

High Speed ◽

High Performance ◽

Vocal Folds ◽

Frame Rate ◽

Endoscopic Imaging ◽

High Speed Imaging ◽

Underdeveloped Countries ◽

Longitudinal Edge

BACKGROUND Recently, high-speed digital imaging (HSDI), especially HSD endoscopic imaging is being routinely used for the diagnosis of vocal fold disorders. However, high-speed digital endoscopic imaging devices are usually large and costly, which limits access by patients in underdeveloped countries and in regions with inadequate medical infrastructure. Modern smartphones have sufficient functionality to process the complex calculations that are required for processing high-resolution images and videos with a high frame rate. Recently, several attempts have been made to integrate medical endoscopes with smartphones to make them more accessible to underdeveloped countries. OBJECTIVE To develop a smartphone adaptor for endoscopes to reduce the cost of devices, and to demonstrate the possibility of high-speed vocal cord imaging using the high-speed imaging functions of a high-performance smartphone camera. METHODS A customized smartphone adaptor was designed for clinical endoscopy using selective laser melting (SLM)-based 3D printing. Existing laryngoscope was attached to the smartphone adaptor to acquire high-speed vocal cord endoscopic images. Only existing basic functions of the smartphone camera were used for HSDI of the vocal folds. For image processing, segmented glottal areas were calculated from whole HSDI frames, and characteristics such as volume, shape and longitudinal edge length were analyzed. RESULTS High-speed digital smartphone imaging with the smartphone-endoscope adaptor could achieve 940 frames per second, and was used to image the vocal folds of five volunteers. The image processing and analytics demonstrated successful calculation of relevant diagnostic variables from the acquired images. CONCLUSIONS A smartphone-based HSDI endoscope system can function as a point-of-care clinical diagnostic device. Furthermore, this system is suitable for use as an accessible diagnostic method in underdeveloped areas with inadequate medical service infrastructure.

Download Full-text

Dynamic Digital Image Correlation of a Dynamic Physical Model of the Vocal Folds

Advances in Bioengineering ◽

10.1115/imece2005-81457 ◽

2005 ◽

Cited By ~ 5

Author(s):

S. Mantha ◽

L. Mongeau ◽

T. Siegmund

Keyword(s):

Digital Image Correlation ◽

Digital Image ◽

Vocal Fold ◽

High Speed ◽

Vocal Folds ◽

Image Correlation ◽

Strain Component ◽

Medial Surface ◽

Superior Surface ◽

Incomplete Closure

An experimental study of the vibratory deformation of the human vocal folds was conducted. Experiments were performed using model vocal folds [1, 2], Fig. 1, made of silicone rubber implemented into an air supply system, Fig. 2. The material used to cast the model is an isotropic homogeneous material, [3] with a tangent modulus E=5 kPa at ε = 0, i.e. elastic properties similar to those of the human vocal fold cover [4]. The advantages of the use of model larynx systems over the use of excised larynges include easy accessibility to fundamental studies of the vocal fold vibration without invasive testing. Acoustic analysis of voice or electroglottography provide certain insight into voice production processes but optical techniques for the study of vocal fold vibrations have drawn considerable attention. Videoendoscopy, stroboscopy, high-speed photography, and kymography have shown to provide a visual impression of vocal fold dynamics but are limited in providing insight into the fundamental deformation processes of the vocal folds. Quantitative measures of deformation have been conducted through micro-suture techniques but are invasive and allows for measurements of only view image points. Laser triangulation is non-invasive but is limited to only one local measurement point. Here, digital image correlation technique with the software VIC 3D [5] is applied. For the experimental set-up see Fig. 2. The analysis consists of (1) stereo correlation to obtain in-plane displacements and (2) stereo triangulation step to obtain out-of-plane deformation. For the stereo correlation images of the object at two different stages of deformation are compared. A point in the image of the undeformed object is matched with the corresponding point in the deformed stage. “Subsets” of digital images are traced via their gray value distribution from the undeformed reference image to the deformed image. The uniqueness of the matching is enabled by the creation of a speckle pattern on the object’s surface. Here, a white pigment is mixed into the silicone rubber and subsequently black enamel paint is sprayed onto the superior surface of the vocal folds. The stereo triangulation requires two images of the object at each stage of deformation. These are obtained in a single CCD frame by placing a beam splitter in the optical axis between camera and object. These images provide a “left” and “right” view of the model larynx. Thus, the deformed shape of the vocal folds can be obtained. The method allows for noninvasive measurement of the full-field displacement fields. Images of the superior surface of the model larynx are obtained by the use of a high speed digital camera with a frame rate of 3000 frames per second allowing for more than 30 image frames for each vibration cycle. For the 3D digital image correlation analysis two images of the object are obtained for each time instance as a beam splitter is placed in the optical axis between the camera and the model larynx. Phonation frequencies and onset pressure are given in Fig. 3, showing that the model larynx behavior is close to actual physiological data. Figs 4(a) and (b) provide superior views of the model larynx at maximum glottal opening and at glottal closure, respectively. As one example of measured strain fields, Figs 5(a) and (b) depict the distributions of the transverse strain component, on the glottal surface in a contour plot on the deformed superior surface. The knowledge of the distribution of this strain component is relevant to the assessment of the impact of vocal fold collision on potential tissue damage. In the position of maximum opening the vocal folds are deformed by a combination of a bulging-type deformation and the opening movement. At this time instance, the transverse strains at the medial surface are found to be negative, an indication of Poisson’s deformation. During the closing stage, vocal folds collide and simultaneously a mode 3 vibration pattern emerges. Closure of the glottal opening is not complete and two incomplete closure areas are formed during the closure stage. These open areas are located at the anterior and posterior ends of the model larynx, see Fig. 4(b). The finding of this type of incomplete closure is agreement with both actual glottal measurements [6] and 3D finite element simulations of [7]. Transverse strains during that stage are now positive and considerably larger that during the opening stage. Finally, Fig. 6 depicts the time evolution of the out of plane displacements along the medial surface for the closing phase and Fig. 7 depicts the maximum values of the longitudinal strain (at the coronal section of the medial surface) in dependence of the flow rate. These examples of measurements indicate that the DIC method is promising for studies of vocal fold dynamics.

Download Full-text

Electroglottography and Vocal Fold Physiology

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.3302.245 ◽

1990 ◽

Vol 33 (2) ◽

pp. 245-254 ◽

Cited By ~ 76

Author(s):

D. G. Childers ◽

D. M. Hicks ◽

G. P. Moore ◽

L. Eskenazi ◽

A. L. Lalwani

Keyword(s):

Vocal Fold ◽

High Speed ◽

Human Subjects ◽

Vocal Folds ◽

Supporting Evidence ◽

Direct Measurements ◽

Analysis And Synthesis ◽

Vocal Fold Motion ◽

Maximum Opening ◽

Major Hypothesis

The electroglottogram (EGG) is known to be related to vocal fold motion. A major hypothesis undergoing examination in several research centers is that the EGG is related to the area of contact of the vocal folds. This hypothesis is difficult to substantiate with direct measurements using human subjects. However, other supporting evidence can be offered. For this study we made measurements from synchronized ultra high-speed laryngeal films and from EGG waveforms collected from subjects with normal larynges and patients with vocal disorders. We compare certain features of the EGG waveform to (a) the instant of the opening of the glottis, (b) the instant of the closing of the glottis, and (c) the instant of the maximum opening of the glottis. In addition, we compare both the open quotient and the relative average perturbation measured from the glottal area to that estimated from the EGG. All of these comparisons indicate that vocal fold vibratory characteristics are reflected by features of the EGG waveform. This makes the EGG useful for speech analysis and synthesis as well as for modeling laryngeal behavior. The limitations of the EGG are discussed.

Download Full-text

Usefulness of high-speed digital imaging (HSDI) in the diagnosis of oedematous – hypertrophic changes of the larynx in people using voice occupationally

Otolaryngologia Polska ◽

10.5604/01.3001.0010.2244 ◽

2017 ◽

Vol 71 (4) ◽

pp. 19-25 ◽

Cited By ~ 1

Author(s):

Bożena Kosztyła-Hojna ◽

Diana Moskal ◽

Anna Kuryliszyn-Moskal ◽

Anna Andrzejewska ◽

Anna Łobaczuk-Sitnik ◽

...

Keyword(s):

Electron Microscopy ◽

Digital Imaging ◽

Vocal Fold ◽

High Speed ◽

Vocal Folds ◽

Intercellular Spaces ◽

Vacuolar Degeneration ◽

Transmission Electron ◽

Tem Method ◽

Hypertrophic Changes

Introduction. The aim of the study is the evaluation of the usefulness of High-Speed Digital Imaging (HSDI) in the diagnosis of organic dysphonia in a form of oedematous-hypertrophic changes of vocal fold mucosa, morphologically confirmed by Transmission Electron Microscopy (TEM) method in patients working with voice occupationally. Material and methods. The group consisted of 30 patients working with voice occupationally with oedematous-hypertrophic changes of vocal fold mucosa. Parameters of vocal folds vibrations were evaluated using HSDI technique with a digital HS camera, HRES Endocam Richard Wolf GmbH. The image of vocal folds was recorded with a rate of 4000 frames per second. Postoperative material of the larynx was prepared in a routine way and observed in transmission electron microscope OPTON 900–PC. Results. HSDI technique allows to assess the real vibrations of vocal folds and determine many parameters. The results of TEM in the postoperative material showed destruction of epithelial cells with severe vacuolar degeneration, the enlargement of intercellular spaces and a large number of blood vessels in the stroma, which indicates the presence of oedematous-hypertrophic changes of the larynx. Discussion. The ultrastructural assessment confirm the particular usefulness of HSDI method in the diagnosis of organic dysphonia in a form of oedematous-hypertrophic changes. Key words: High-Speed Digital Imaging, oedematous-hypertrophic changes, vocal fold mucosa, larynx

Download Full-text

Normal Voice Production: Computation of Driving Parameters from Endoscopic Digital High Speed Images

Methods of Information in Medicine ◽

10.1055/s-0038-1634360 ◽

2003 ◽

Vol 42 (03) ◽

pp. 271-276 ◽

Cited By ~ 19

Author(s):

T. Braunschweig ◽

J. Lohscheller ◽

U. Eysholdt ◽

U. Hoppe ◽

M. Döllinger

Keyword(s):

Vocal Fold ◽

High Speed ◽

Biomechanical Model ◽

Vocal Folds ◽

Inversion Algorithm ◽

Knowledge Based ◽

Normal Voice ◽

Inversion Procedure ◽

High Speed Glottography

Summary Objectives: A central point for quantitative evaluation of pathological and healthy voices is the analysis of vocal fold oscillations. By means of digital High Speed Glottography (HGG), vocal fold oscillations can be recorded in real time. Recently, a numerical inversion procedure was developed that allows the extraction of physiological parameters from digital high speed videos and a classification of voice disorders. The aim of this work was to validate the inversion procedure and to investigate the applicability to normal voices. Methods: High speed recordings were performed during phonation within a group of five female and five male persons with normal voices. By using knowledge based image processing algorithms, motion curves of the vocal folds were extracted at three different positions (dorsal, medial, ventral). These curves were used to obtain physiological voice parameters, and in particular the degree of symmetry of the vocal folds based upon a biomechanical model of the vocal folds. Results: The highest degree of symmetry was observed for the medial motion curves. While the dor-sally and ventrally extracted motion curves exhibited similar results concerning the degree of symmetry the performance of the algorithm was less stable. Conclusions: The inversion algorithm provides reasonable results for all subjects when applied to the medial motion curves. However, for dorsal and ventral motion curves, correct performance is reduced to 85 %.

Download Full-text

Empirical Eigenfunctions and Medial Surface Dynamics of a Human Vocal Fold

Methods of Information in Medicine ◽

10.1055/s-0038-1633981 ◽

2005 ◽

Vol 44 (03) ◽

pp. 384-391 ◽

Cited By ~ 30

Author(s):

N. Tayama ◽

D. A. Berry ◽

M. Döllinger

Keyword(s):

Vocal Fold ◽

High Speed ◽

Computational Models ◽

Imaging System ◽

Vocal Folds ◽

Sustained Oscillation ◽

Medial Surface ◽

Physical Mechanisms ◽

Vocal Fold Vibration ◽

Modes Of Vibration

Summary Objectives: The purpose of this investigation was to use an excised human larynx to substantiate physical mechanisms of sustained vocal fold oscillation over a variety of phonatory conditions. During sustained, flow-induced oscillation, dynamical data was collected from the medial surface of the vocal fold. The method of Empirical Eigenfunctions was used to analyze the data and to probe physical mechanisms of sustained oscillation. Methods: Thirty microsutures were mounted on the medial margin of a human vocal fold. Across five distinct phonatory conditions, the vocal fold was set into oscillation and imaged with a high-speed digital imaging system. The position coordinates of the sutures were extracted from the images and converted into physical coordinates. Empirical Eigenfunctions were computed from the time-varying physical coordinates, and mechanisms of sustained oscillation were explored. Results: Using the method of Empirical Eigenfunctions, physical mechanisms of sustained vocal fold oscillation were substantiated. In particular, the essential dynamics of vocal fold vibration were captured by two dominant Empirical Eigenfunctions. The largest Eigenfunction primarily captured the alternating convergent/ divergent shape of the medial surface of the vocal fold, while the second largest Eigenfunction primarily captured the lateral vibrations of the vocal fold. Conclusions: The hemi-larynx setup yielded a view of the medial surface of the vocal folds, revealing the tissue vibrations which produced sound. Through the use of Empirical Eigenfunctions, the underlying modes of vibration were computed, disclosing physical mechanisms of sustained vocal fold oscillation. The investigation substantiated previous theoretical analyses and yielded significant data to help evaluate and refine computational models of vocal fold vibration.

Download Full-text

Airflow Characterization Combined With 3D Reconstruction of Rabbit Vocal Cord Based on In-Vivo Phonatory Experiments

Volume 2: Fluid Mechanics; Multiphase Flows ◽

10.1115/fedsm2020-20369 ◽

2020 ◽

Author(s):

Zhipeng Lou ◽

Junshi Wang ◽

James J. Daniero ◽

Haibo Dong ◽

Jinxiang Xi

Keyword(s):

3D Reconstruction ◽

Vocal Cord ◽

Vocal Fold ◽

High Speed ◽

3D Model ◽

Vocal Folds ◽

Computational Effort ◽

Immersed Boundary ◽

Vibration Modes

Abstract In this paper, a numerical approach combined with experiments is employed to characterize the airflow through the vocal cord. Rabbits are used to perform in vivo magnetic resonance imaging (MRI) experiments and the MRI scan data are directly imposed for the three-dimensional (3D) reconstruction of a 3D high-fidelity model. The vibration modes are observed via the in vivo high-speed videoendoscopy (HSVM) technique, and the time-dependent glottal height is evaluated dynamically for the validation of the 3D reconstruction model. 72 sets of rabbit in vivo high-speed recordings are evaluated to achieve the most common vibration mode. The reconstruction is mainly based on MRI data and the HSVM records are supporting and validate the 3D model. A sharp-interface immersed-boundary-method (IBM)-based compressible flow solver is employed to compute the airflow. The primary purpose of the computational effort is to characterize the influence of the vocal folds that applied to the airflow and the airflow-induced phonation. The vocal fold kinematics and the vibration modes are quantified and the vortex structures are analyzed under the influence of vocal folds. The results have shown significant effects of the vocal fold height on the vortex structure, vorticity and velocity. The reconstructed 3D model from this work helps to bring insight into further understanding of the rabbit phonation mechanism. The results provide potential improvement for diagnosis of human vocal fold dysfunction and phonation disorder.

Download Full-text