Timbre Space Learning for Augmentation of Musical Audio Synthesizer Interfaces

Lossy audio codecs compress (and decompress) digital audio streams by removing information that tends to be inaudible in human perception. Under high compression rates, such codecs may introduce a variety of impairments in the audio signal. Many works have tackled the problem of audio enhancement and compression artifact removal using deep-learning techniques. However, only a few works tackle the restoration of heavily compressed audio signals in the musical domain. In such a scenario, there is no unique solution for the restoration of the original signal. Therefore, in this study, we test a stochastic generator of a Generative Adversarial Network (GAN) architecture for this task. Such a stochastic generator, conditioned on highly compressed musical audio signals, could one day generate outputs indistinguishable from high-quality releases. Therefore, the present study may yield insights into more efficient musical data storage and transmission. We train stochastic and deterministic generators on MP3-compressed audio signals with 16, 32, and 64 kbit/s. We perform an extensive evaluation of the different experiments utilizing objective metrics and listening tests. We find that the models can improve the quality of the audio signals over the MP3 versions for 16 and 32 kbit/s and that the stochastic generators are capable of generating outputs that are closer to the original signals than those of the deterministic generators.

Download Full-text

Context-Aware Musical Audio

Encyclopedia of Multimedia ◽

10.1007/0-387-30038-4_41 ◽

2006 ◽

pp. 133-134

Keyword(s):

Context Aware ◽

Musical Audio

Download Full-text

Mid-level Representations of Musical Audio Signals for Music Information Retrieval

Advances in Music Information Retrieval - Studies in Computational Intelligence ◽

10.1007/978-3-642-11674-2_4 ◽

2010 ◽

pp. 65-91 ◽

Cited By ~ 1

Author(s):

Tetsuro Kitahara

Keyword(s):

Information Retrieval ◽

Music Information Retrieval ◽

Audio Signals ◽

Music Information ◽

Musical Audio

Download Full-text

Atypical Lyrics Completion Considering Musical Audio Signals

MultiMedia Modeling - Lecture Notes in Computer Science ◽

10.1007/978-3-030-67832-6_15 ◽

2021 ◽

pp. 174-186

Author(s):

Kento Watanabe ◽

Masataka Goto

Keyword(s):

Audio Signals ◽

Musical Audio

Download Full-text

On the Relations Between Audio Features and Room Acoustic Parameters of Auralizations

Journal of Vibration and Acoustics ◽

10.1115/1.4023835 ◽

2013 ◽

Vol 135 (6) ◽

Cited By ~ 1

Author(s):

Salvador Cerdá ◽

Alicia Giménez ◽

Radha Montell ◽

Arturo Barba ◽

Radu Lacatis ◽

...

Keyword(s):

Room Acoustics ◽

Subjective Perception ◽

Audio Signals ◽

Acoustic Parameters ◽

Acoustic Characteristics ◽

Statistical Correlations ◽

Audio Features ◽

Musical Characteristics ◽

Audio Files ◽

Musical Audio

The usual parameters in room acoustics are used to quantify the acoustic characteristics of rooms and their relation to the subjective perception of transmitted signals. Audio features (calculated with MIRToolbox) have been designed to study the relationships between the characteristics of musical audio files and their subjective perception. Both musical characteristics and acoustic parameters are oriented towards acoustic perception. By using auralizations with calibrated models of auditoriums and tools from the MIRtoolbox it is possible to jointly work with the calculation of audio features and room parameters. In this work, the statistical correlations between C80, STI, D50, EDT, RT and certain audio features have been analyzed. The Pearson r values are higher than 0.8 in all cases. These high correlations enable acoustic parameters to be calculated from the musical characteristics of auralized audio signals.

Download Full-text

Sparse Linear Regression With Structured Priors and Application to Denoising of Musical Audio

IEEE Transactions on Audio Speech and Language Processing ◽

10.1109/tasl.2007.909290 ◽

2008 ◽

Vol 16 (1) ◽

pp. 174-185 ◽

Cited By ~ 36

Author(s):

CÉdric Fevotte ◽

Bruno Torresani ◽

Laurent Daudet ◽

Simon J. Godsill

Keyword(s):

Linear Regression ◽

Musical Audio

Download Full-text

Musical, audio-visual, poetic, and narrative input: A longitudinal case study of French- English bilingual first language acquisition

Cognitive Perspectives on Bilingualism ◽

10.1515/9781614514190-009 ◽

2016 ◽

Cited By ~ 1

Author(s):

Catrin Bellay

Keyword(s):

Language Acquisition ◽

First Language ◽

First Language Acquisition ◽

Longitudinal Case Study ◽

English Bilingual ◽

Musical Audio

Download Full-text

Profiling musical audio processing effects with deep neural networks

The Journal of the Acoustical Society of America ◽

10.1121/1.5067764 ◽

2018 ◽

Vol 144 (3) ◽

pp. 1753-1753

Author(s):

Scott H. Hawley ◽

Benjamin L. Colburn ◽

Stylianos I. Mimilakis

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Audio Processing ◽

Processing Effects ◽

Musical Audio

Download Full-text

A BEAT INDUCTION METHOD FOR MUSICAL AUDIO SIGNALS

Digital Media Processing for Multimedia Interactive Services ◽

10.1142/9789812704337_0051 ◽

2003 ◽

Cited By ~ 8

Author(s):

F. GOUYON ◽

P. HERRERA

Keyword(s):

Audio Signals ◽

Induction Method ◽

Musical Audio ◽

Beat Induction

Download Full-text