Learning Low-Dimensional Embeddings of Audio Shingles for Cross-Version Retrieval of Classical Music

2019 ◽  
Vol 10 (1) ◽  
pp. 19 ◽  
Author(s):  
Frank Zalkow ◽  
Meinard Müller

Cross-version music retrieval aims at identifying all versions of a given piece of music using a short query audio fragment. One previous approach, which is particularly suited for Western classical music, is based on a nearest neighbor search using short sequences of chroma features, also referred to as audio shingles. From the viewpoint of efficiency, indexing and dimensionality reduction are important aspects. In this paper, we extend previous work by adapting two embedding techniques: one based on classical principal component analysis and the other on neural networks with triplet loss. Furthermore, we report on systematically conducted experiments with Western classical music recordings and discuss the trade-off between retrieval quality and embedding dimensionality. As one main result, we show that, using neural networks, one can reduce the audio shingles from 240 to fewer than 8 dimensions with only a moderate loss in retrieval accuracy. In addition, we present extended experiments with databases of different sizes and different query lengths to test the scalability and generalizability of the dimensionality reduction methods. We also provide a more detailed view into the retrieval problem by analyzing the distances that appear in the nearest neighbor search.
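As a rough illustration of the kind of pipeline described above (not the authors' implementation), the following Python sketch reduces 240-dimensional shingles with scikit-learn's PCA and queries them with a nearest neighbor index; the random shingle data, the target dimensionality, and the neighbor count are placeholder assumptions.

import numpy as np
from sklearn.decomposition import PCA
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(0)

# Hypothetical data: each row is one audio shingle, e.g. 20 stacked
# 12-dimensional chroma frames -> 240 dimensions.
database_shingles = rng.random((10_000, 240))
query_shingles = rng.random((5, 240))

# Learn a low-dimensional embedding; the target dimensionality trades
# retrieval quality against index size.
pca = PCA(n_components=8)
database_embedded = pca.fit_transform(database_shingles)
query_embedded = pca.transform(query_shingles)

# Nearest neighbor index over the embedded database.
index = NearestNeighbors(n_neighbors=10, metric="euclidean")
index.fit(database_embedded)
distances, neighbor_ids = index.kneighbors(query_embedded)
print(neighbor_ids.shape)  # (5, 10): 10 candidate shingles per query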

2019 ◽  
Vol 11 (1) ◽  
pp. 168781401881917
Author(s):  
Fang Lv ◽  
Yuliang Wei ◽  
Xixian Han ◽  
Bailing Wang

With the explosive growth of surveillance data, exact match queries become much more difficult because of the data's high dimensionality and volume. Owing to its good balance between retrieval performance and computational cost, hash learning is widely used to solve approximate nearest neighbor search problems. Dimensionality reduction plays a critical role in hash learning, as its goal is to preserve as much of the original information as possible in low-dimensional vectors. However, existing dimensionality reduction methods neglect to unify the diverse sources of information in the original space when learning a reduced subspace. In this article, we propose a numeric and semantic consistency semi-supervised hash learning method, which unifies numeric features and supervised semantic features in a low-dimensional subspace before hash encoding, and improves a multi-table hashing method with a complementary numeric local distribution structure. A consistency-based learning method, which confers semantic meaning on the numeric features during dimensionality reduction, is presented. Experiments are conducted on two public datasets: the web image dataset NUS-WIDE and the text dataset DBLP. The results demonstrate that the semi-supervised hash learning method, with its consistency-based information subspace, preserves useful information for hash encoding more effectively than state-of-the-art methods and achieves high-quality retrieval performance in the multi-table setting.
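The Python sketch below is only a generic stand-in for the kind of pipeline the abstract describes: a linear projection to a low-dimensional subspace followed by binary hash codes stored in several complementary hash tables. The random projection, the random-hyperplane hashing, and all sizes are illustrative assumptions, not the proposed consistency-based method.

import numpy as np

rng = np.random.default_rng(0)
features = rng.standard_normal((5_000, 512))   # hypothetical numeric features
n_tables, n_bits, subspace_dim = 4, 16, 64

# Stand-in for the learned subspace: a random linear projection.
projection = rng.standard_normal((512, subspace_dim))
low_dim = features @ projection

tables = []
for _ in range(n_tables):
    hyperplanes = rng.standard_normal((subspace_dim, n_bits))
    codes = (low_dim @ hyperplanes > 0).astype(np.uint8)  # binary hash codes
    buckets = {}
    for idx, code in enumerate(map(bytes, codes)):
        buckets.setdefault(code, []).append(idx)
    tables.append((hyperplanes, buckets))

def query(vector):
    """Collect candidates from all tables, then rank them by exact distance."""
    candidates = set()
    low = vector @ projection
    for hyperplanes, buckets in tables:
        code = bytes((low @ hyperplanes > 0).astype(np.uint8))
        candidates.update(buckets.get(code, []))
    return sorted(candidates, key=lambda i: np.linalg.norm(low_dim[i] - low))

print(query(features[42])[:5])

Multiple tables are used because a single hash table misses true neighbors that fall just across a hyperplane; complementary tables recover many of them.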


Author(s):  
A. Murat Yagci ◽  
Tevfik Aytekin ◽  
Fikret S. Gurgen

Matrix factorization models often reveal the low-dimensional latent structure in high-dimensional spaces while bringing space efficiency to large-scale collaborative filtering problems. Improving the training and prediction time efficiency of these models is also important, since even an accurate model may raise practical concerns if it is too slow to capture the changing dynamics of the system. For the training task, powerful improvements have been proposed, especially using SGD, ALS, and their parallel versions. In this paper, we focus on the prediction task and combine matrix factorization with approximate nearest neighbor search methods to improve the efficiency of top-N prediction queries. Our efforts result in a meta-algorithm, MMFNN, which can employ various common matrix factorization models, drastically improve their prediction efficiency, and still perform comparably to, or sometimes even better than, standard prediction approaches in terms of predictive power. Using various batch, online, and incremental matrix factorization models, we present detailed empirical analysis results on many large implicit feedback datasets from different application domains.
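A minimal sketch of the general idea (not MMFNN itself): serve top-N queries by indexing pre-trained item factors with a nearest neighbor structure instead of scoring every item. The random factors and the use of cosine distance as a surrogate for the model's inner-product score are assumptions made purely for illustration.

import numpy as np
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(0)
n_users, n_items, k = 1_000, 50_000, 32
user_factors = rng.standard_normal((n_users, k))   # placeholder factors
item_factors = rng.standard_normal((n_items, k))

# Index the item factors once; cosine similarity stands in for the inner
# product here, which is only a rough surrogate unless the factors are
# transformed accordingly.
index = NearestNeighbors(n_neighbors=10, metric="cosine")
index.fit(item_factors)

def top_n(user_id, n=10):
    _, item_ids = index.kneighbors(user_factors[user_id:user_id + 1], n_neighbors=n)
    return item_ids[0]

print(top_n(7))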


Author(s):  
A. MITICHE ◽  
J. K. AGGARWAL

The purpose of this paper is two-fold: to give a synoptic description of favored neural networks and to characterize the potency of these neural networks as pattern classifiers, against the background of the familiar nearest neighbors classification. We limit the study to those neural network structures most commonly used for pattern classification: the multilayer perceptron, the Kohonen associative memory, and the Carpenter–Grossberg clustering network, for which we give a tutorial description with the aim of making the driving concepts apparent. The nearest neighbors rule is presented with improved nearest neighbor search and reference data sample pruning. To gain some familiarity with the classifiers, we expound the sequence of computations implicated in pattern category assignment by each classifier. A characterization of the classifiers is drawn from observed and expected properties and from experiments in automatic target recognition and optical character recognition as summarized in comparative tables of performance. This characterization supports the suggestion that nearest neighbors classification always be considered before endorsing alternative pattern classifiers such as neural networks.
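To make the suggested baseline concrete, here is a short scikit-learn example of the nearest neighbors rule on the bundled digits data, a stand-in for the optical character recognition task; the dataset and parameters are illustrative, not taken from the paper.

from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Load a small handwritten-digit dataset and split it into train/test sets.
X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Plain nearest neighbors rule: assign the majority label of the 3 closest
# training samples.
knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X_train, y_train)
print("k-NN test accuracy:", knn.score(X_test, y_test))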


2020 ◽  
Author(s):  
Cameron Hargreaves ◽  
Matthew Dyer ◽  
Michael Gaultois ◽  
Vitaliy Kurlin ◽  
Matthew J Rosseinsky

It is a core problem in any field to reliably tell how close two objects are to being the same, and once this relation has been established we can use this information to precisely quantify potential relationships, both analytically and with machine learning (ML). For inorganic solids, the chemical composition is a fundamental descriptor, which can be represented as a vector of the ratios of each element in the material. These vectors are a convenient mathematical data structure for measuring similarity, but unfortunately, the standard metric (the Euclidean distance) gives little to no variance in the resultant distances between chemically dissimilar compositions. We present the Earth Mover’s Distance (EMD) for inorganic compositions, a well-defined metric which enables chemical similarity to be measured in an explainable fashion. We compute the EMD between two compositions from the ratio of each of the elements and the absolute distance between the elements on the modified Pettifor scale. This simple metric shows clear strength at distinguishing compounds and is efficient to compute in practice. The resultant distances have greater alignment with chemical understanding than the Euclidean distance, which is demonstrated on the binary compositions of the Inorganic Crystal Structure Database (ICSD). The EMD is a reliable numeric measure of chemical similarity that can be incorporated into automated workflows for a range of ML techniques. We have found that, with no supervision, the use of this metric gives a distinct partitioning of binary compounds into clear trends and families of chemical properties, with future applications for nearest neighbor search queries in chemical database retrieval systems and supervised ML techniques.
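As a rough sketch of the idea (not the authors' code), the EMD between two compositions can be computed with scipy's one-dimensional Wasserstein distance, using each element's position on the modified Pettifor scale as its location and its ratio as its weight. The Pettifor numbers below are placeholders for illustration, not the actual scale values.

from scipy.stats import wasserstein_distance

# Hypothetical positions on the modified Pettifor scale (placeholders only;
# the real values must be looked up from the published scale).
pettifor = {"Na": 10, "K": 11, "Cl": 94, "Br": 93}

def composition_emd(comp_a, comp_b):
    """comp_a, comp_b: dicts mapping element -> fractional amount (summing to 1)."""
    a_positions = [pettifor[el] for el in comp_a]
    b_positions = [pettifor[el] for el in comp_b]
    return wasserstein_distance(a_positions, b_positions,
                                list(comp_a.values()), list(comp_b.values()))

# NaCl vs. KBr: neighbouring alkali metals and halogens, so the distance is small.
print(composition_emd({"Na": 0.5, "Cl": 0.5}, {"K": 0.5, "Br": 0.5}))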

