Gender Recognition by Voice using an Improved Self-Labeled Algorithm

Ioannis Livieris; Emmanuel Pintelas; Panagiotis Pintelas

doi:10.3390/make1010030

Gender Recognition by Voice using an Improved Self-Labeled Algorithm

Machine Learning and Knowledge Extraction ◽

10.3390/make1010030 ◽

2019 ◽

Vol 1 (1) ◽

pp. 492-503 ◽

Cited By ~ 6

Author(s):

Ioannis Livieris ◽

Emmanuel Pintelas ◽

Panagiotis Pintelas

Keyword(s):

Predictive Models ◽

Numerical Experiments ◽

Digital Technologies ◽

Gender Recognition ◽

Learning Framework ◽

Gender Categorization ◽

Learning Techniques ◽

Video Categorization ◽

Telephone Calls ◽

Machine Interaction

Speech recognition has various applications including human to machine interaction, sorting of telephone calls by gender categorization, video categorization with tagging and so on. Currently, machine learning is a popular trend which has been widely utilized in various fields and applications, exploiting the recent development in digital technologies and the advantage of storage capabilities from electronic media. Recently, research focuses on the combination of ensemble learning techniques with the semi-supervised learning framework aiming to build more accurate classifiers. In this paper, we focus on gender recognition by voice utilizing a new ensemble semi-supervised self-labeled algorithm. Our preliminary numerical experiments demonstrate the classification efficiency of the proposed algorithm in terms of accuracy, leading to the development of stable and robust predictive models.

Download Full-text

Mol-BERT: An Effective Molecular Representation with BERT for Molecular Property Prediction

Wireless Communications and Mobile Computing ◽

10.1155/2021/7181815 ◽

2021 ◽

Vol 2021 ◽

pp. 1-7

Author(s):

Juncai Li ◽

Xiaofei Jiang

Keyword(s):

Deep Learning ◽

Language Processing ◽

Large Scale ◽

Molecular Data ◽

Molecular Property ◽

Property Prediction ◽

Learning Framework ◽

Learning Techniques ◽

Potential Benefits ◽

Current Sequence

Molecular property prediction is an essential task in drug discovery. Most computational approaches with deep learning techniques either focus on designing novel molecular representation or combining with some advanced models together. However, researchers pay fewer attention to the potential benefits in massive unlabeled molecular data (e.g., ZINC). This task becomes increasingly challenging owing to the limitation of the scale of labeled data. Motivated by the recent advancements of pretrained models in natural language processing, the drug molecule can be naturally viewed as language to some extent. In this paper, we investigate how to develop the pretrained model BERT to extract useful molecular substructure information for molecular property prediction. We present a novel end-to-end deep learning framework, named Mol-BERT, that combines an effective molecular representation with pretrained BERT model tailored for molecular property prediction. Specifically, a large-scale prediction BERT model is pretrained to generate the embedding of molecular substructures, by using four million unlabeled drug SMILES (i.e., ZINC 15 and ChEMBL 27). Then, the pretrained BERT model can be fine-tuned on various molecular property prediction tasks. To examine the performance of our proposed Mol-BERT, we conduct several experiments on 4 widely used molecular datasets. In comparison to the traditional and state-of-the-art baselines, the results illustrate that our proposed Mol-BERT can outperform the current sequence-based methods and achieve at least 2% improvement on ROC-AUC score on Tox21, SIDER, and ClinTox dataset.

Download Full-text

A rough set theory and deep learning based predictive system for gender recognition using audio speech

10.21203/rs.3.rs-915201/v1 ◽

2021 ◽

Author(s):

Ghazaala Yasmin ◽

ASIT KUMAR DAS ◽

Janmenjoy Nayak ◽

S Vimal ◽

Soumi Dutta

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Set Theory ◽

Rough Set ◽

Gender Discrimination ◽

Rough Set Theory ◽

Recognition System ◽

Gender Recognition ◽

Learning Techniques

Abstract Speech is one of the most delicate medium through which gender of the speakers can easily be identified. Though the related research has shown very good progress in machine learning but recently, deep learning has imparted a very good research area to explore the deficiency of gender discrimination using traditional machine learning techniques. In deep learning techniques, the speech features are automatically generated by the reinforcement learning from the raw data which have more discriminating power than the human generated features. But in some practical situations like gender recognition, it is observed that combination of both types of features sometimes provides comparatively better performance. In the proposed work, we have initially extracted and selected some informative and precise acoustic features relevant to gender recognition using entropy based information theory and Rough Set Theory (RST). Next, the audio speech signals are directly fed into the deep neural network model consists of Convolution Neural Network (CNN) and Gated Recurrent Unit network (GRUN) for extracting features useful for gender recognition. The RST selects precise and informative features, CNN extracts the locally encoded important features, and GRUN reduces the vanishing gradient and exploding gradient problems. Finally, a hybrid gender recognition system is developed combining both generated feature vectors. The developed model has been tested with five bench mark and a simulated dataset to evaluate its performance and it is observed that combined feature vector provides more effective gender recognition system specially when transgender is considered as a gender type together with male and female.

Download Full-text

Cross-property deep transfer learning framework for enhanced predictive analytics on small materials data

Nature Communications ◽

10.1038/s41467-021-26921-5 ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

Vishu Gupta ◽

Kamal Choudhary ◽

Francesca Tavazza ◽

Carelyn Campbell ◽

Wei-keng Liao ◽

...

Keyword(s):

Transfer Learning ◽

Predictive Models ◽

Materials Science ◽

Predictive Analytics ◽

Large Datasets ◽

Small Data ◽

Widespread Application ◽

Learning Framework ◽

Large Databases ◽

Physical Attributes

AbstractArtificial intelligence (AI) and machine learning (ML) have been increasingly used in materials science to build predictive models and accelerate discovery. For selected properties, availability of large databases has also facilitated application of deep learning (DL) and transfer learning (TL). However, unavailability of large datasets for a majority of properties prohibits widespread application of DL/TL. We present a cross-property deep-transfer-learning framework that leverages models trained on large datasets to build models on small datasets of different properties. We test the proposed framework on 39 computational and two experimental datasets and find that the TL models with only elemental fractions as input outperform ML/DL models trained from scratch even when they are allowed to use physical attributes as input, for 27/39 (≈ 69%) computational and both the experimental datasets. We believe that the proposed framework can be widely useful to tackle the small data challenge in applying AI/ML in materials science.

Download Full-text

Tree-Based Modeling Techniques

Advances in Business Information Systems and Analytics - Machine Learning Techniques for Improved Business Analytics ◽

10.4018/978-1-5225-3534-8.ch001 ◽

2019 ◽

pp. 1-18

Author(s):

Dileep Kumar G.

Keyword(s):

Predictive Models ◽

Data Science ◽

Linear Models ◽

High Accuracy ◽

Gradient Boosting ◽

Regression Methods ◽

Data Analyst ◽

Linear Relationships ◽

Learning Techniques ◽

Modeling Techniques

Tree-based learning techniques are considered to be one of the best and most used supervised learning methods. Tree-based methods empower predictive models with high accuracy, stability, and ease of interpretation. Unlike linear models, they map non-linear relationships pretty well. These methods are adaptable at solving any kind of problem at hand (classification or regression). Methods like decision trees, random forest, gradient boosting are being widely used in all kinds of machine learning and data science problems. Hence, for every data analyst, it is important to learn these algorithms and use them for modeling. This chapter guide the learner to learn tree-based modeling techniques from scratch.

Download Full-text

Minimal Linear Networks for Magnetic Resonance Image Reconstruction

Scientific Reports ◽

10.1038/s41598-019-55763-x ◽

2019 ◽

Vol 9 (1) ◽

Cited By ~ 1

Author(s):

Gilad Liberman ◽

Benedikt A. Poser

Keyword(s):

Deep Learning ◽

Magnetic Resonance ◽

Image Reconstruction ◽

Simulated Data ◽

Linear Networks ◽

Learning Framework ◽

Scan Time ◽

Learning Techniques ◽

Ill Posed ◽

Magnetic Resonance Imaging Mri

AbstractModern sequences for Magnetic Resonance Imaging (MRI) trade off scan time with computational challenges, resulting in ill-posed inverse problems and the requirement to account for more elaborated signal models. Various deep learning techniques have shown potential for image reconstruction from reduced data, outperforming compressed sensing, dictionary learning and other advanced techniques based on regularization, by characterization of the image manifold. In this work we suggest a framework for reducing a “neural” network to the bare minimum required by the MR physics, reducing the network depth and removing all non-linearities. The networks performed well both on benchmark simulated data and on arterial spin labeling perfusion imaging, showing clear images while preserving sensitivity to the minute signal changes. The results indicate that the deep learning framework plays a major role in MR image reconstruction, and suggest a concrete approach for probing into the contribution of additional elements.

Download Full-text

Context-Aware Zero-Shot Recognition

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6841 ◽

2020 ◽

Vol 34 (07) ◽

pp. 11709-11716

Author(s):

Ruotian Luo ◽

Ning Zhang ◽

Bohyung Han ◽

Linjie Yang

Keyword(s):

Conditional Random Field ◽

Object Relation ◽

Visual Context ◽

Context Aware ◽

The Novel ◽

Learning Framework ◽

Shot Detection ◽

Learning Techniques ◽

Novel Objects ◽

Problem Setting

We present a novel problem setting in zero-shot learning, zero-shot object recognition and detection in the context. Contrary to the traditional zero-shot learning methods, which simply infers unseen categories by transferring knowledge from the objects belonging to semantically similar seen categories, we aim to understand the identity of the novel objects in an image surrounded by the known objects using the inter-object relation prior. Specifically, we leverage the visual context and the geometric relationships between all pairs of objects in a single image, and capture the information useful to infer unseen categories. We integrate our context-aware zero-shot learning framework into the traditional zero-shot learning techniques seamlessly using a Conditional Random Field (CRF). The proposed algorithm is evaluated on both zero-shot region classification and zero-shot detection tasks. The results on Visual Genome (VG) dataset show that our model significantly boosts performance with the additional visual context compared to traditional methods.

Download Full-text

Perturbation Training for Human-Robot Teams

Journal of Artificial Intelligence Research ◽

10.1613/jair.5390 ◽

2017 ◽

Vol 59 ◽

pp. 495-541 ◽

Cited By ~ 3

Author(s):

Ramya Ramakrishnan ◽

Chongjie Zhang ◽

Julie Shah

Keyword(s):

Transfer Learning ◽

Large Scale ◽

Communication Protocol ◽

Learning Algorithm ◽

Training Strategy ◽

Learning Framework ◽

Perturbation Training ◽

Learning Techniques ◽

High Level ◽

Live Interaction

In this work, we design and evaluate a computational learning model that enables a human-robot team to co-develop joint strategies for performing novel tasks that require coordination. The joint strategies are learned through "perturbation training," a human team-training strategy that requires team members to practice variations of a given task to help their team generalize to new variants of that task. We formally define the problem of human-robot perturbation training and develop and evaluate the first end-to-end framework for such training, which incorporates a multi-agent transfer learning algorithm, human-robot co-learning framework and communication protocol. Our transfer learning algorithm, Adaptive Perturbation Training (AdaPT), is a hybrid of transfer and reinforcement learning techniques that learns quickly and robustly for new task variants. We empirically validate the benefits of AdaPT through comparison to other hybrid reinforcement and transfer learning techniques aimed at transferring knowledge from multiple source tasks to a single target task. We also demonstrate that AdaPT's rapid learning supports live interaction between a person and a robot, during which the human-robot team trains to achieve a high level of performance for new task variants. We augment AdaPT with a co-learning framework and a computational bi-directional communication protocol so that the robot can co-train with a person during live interaction. Results from large-scale human subject experiments (n=48) indicate that AdaPT enables an agent to learn in a manner compatible with a human's own learning process, and that a robot undergoing perturbation training with a human results in a high level of team performance. Finally, we demonstrate that human-robot training using AdaPT in a simulation environment produces effective performance for a team incorporating an embodied robot partner.

Download Full-text

Delivering Precision Medicine to Patients with Spinal Cord Disorders; Insights into Applications of Bioinformatics and Machine Learning from Studies of Degenerative Cervical Myelopathy

10.5772/intechopen.98713 ◽

2021 ◽

Author(s):

Kalum J. Ost ◽

David W. Anderson ◽

David W. Cadotte

Keyword(s):

Machine Learning ◽

Precision Medicine ◽

New Technologies ◽

Machine Learning Techniques ◽

Massive Datasets ◽

Learning Framework ◽

Learning Techniques ◽

Machine Learning Approach ◽

Spinal Cord Disorders ◽

Degenerative Cervical Myelopathy

With the common adoption of electronic health records and new technologies capable of producing an unprecedented scale of data, a shift must occur in how we practice medicine in order to utilize these resources. We are entering an era in which the capacity of even the most clever human doctor simply is insufficient. As such, realizing “personalized” or “precision” medicine requires new methods that can leverage the massive amounts of data now available. Machine learning techniques provide one important toolkit in this venture, as they are fundamentally designed to deal with (and, in fact, benefit from) massive datasets. The clinical applications for such machine learning systems are still in their infancy, however, and the field of medicine presents a unique set of design considerations. In this chapter, we will walk through how we selected and adjusted the “Progressive Learning framework” to account for these considerations in the case of Degenerative Cervical Myeolopathy. We additionally compare a model designed with these techniques to similar static models run in “perfect world” scenarios (free of the clinical issues address), and we use simulated clinical data acquisition scenarios to demonstrate the advantages of our machine learning approach in providing personalized diagnoses.

Download Full-text

Lignin Biorefinery Optimization Through Machine Learning

10.33774/chemrxiv-2021-6r888 ◽

2021 ◽

Author(s):

Joakim Löfgren ◽

Dmitry Tarasov ◽

Taru Koitto ◽

Patrick Rinke ◽

Mikhail Balakshin ◽

...

Keyword(s):

Machine Learning ◽

Predictive Models ◽

Model Building ◽

Paper Industry ◽

2D Nmr ◽

Hydrothermal Pretreatment ◽

Bayesian Optimization ◽

Learning Framework ◽

Point Analysis ◽

Data Points

Lignin is an abundant biomaterial that currently emerges as a low value by-product in the pulp and paper industry but could be repurposed for high-value products as part of the ongoing global transition to a sustainable society. To increase lignins value, rational and efficient approaches to optimizing lignin biorefineries to produce high value bioproducts are required. Here, we report the optimization of the AquaSolv Omni (AqSO) Biorefinery, a newly introduced biorefinery concept based on hydrothermal pretreatment and solvent extraction. We employ a machine-learning framework based on Bayesian optimization, to provide sample-efficient and guided data collection as well as surrogate model building. The surrogate models allow us to map multiple experimental outputs, including the extracted lignin yield and main structural properties obtained by 2D NMR, as functions of the hydrothermal pretreatment reaction severity and temperature. Our results show that with Bayesian optimization, predictive models can be converged with only 21 data points to within a margin of error comparable to the underlying experimental error. By applying a Pareto point analysis, we demonstrate how the predictive models can be used in tandem to identify optimal extraction conditions for concrete applications in lignin valorization.

Download Full-text

FiberNET: An ensemble deep learning framework for clustering white matter fibers

10.1101/141036 ◽

2017 ◽

Author(s):

Vikash Gupta ◽

Sophia I. Thomopoulos ◽

Faisal M. Rashid ◽

Paul M. Thompson

Keyword(s):

Deep Learning ◽

White Matter ◽

Structural Integrity ◽

Region Of Interest ◽

Fiber Bundles ◽

Anatomical Connectivity ◽

Learning Framework ◽

Fiber Clustering ◽

Learning Techniques ◽

The Brain

AbstractWhite matter tracts are commonly analyzed in studies of micro-structural integrity and anatomical connectivity in the brain. Over the last decade, it has been an open problem as to how best to cluster white matter fibers, extracted from whole-brain tractography, into anatomically meaningful groups. Some existing techniques use region of interest (ROI) based clustering, atlas-based labeling, or unsupervised spectral clustering. ROI-based clustering is popular for analyzing anatomical connectivity among a set of ROIs, but it does not always partition the brain into recognizable fiber bundles. Here we propose an approach using convolutional neural networks (CNNs) to learn shape features of the fiber bundles, which are then exploited to cluster white matter fibers. To achieve such clustering, we first need to re-parameterize the fibers in an intrinsic space. The clustering is performed in induced parameterized coordinates. To our knowledge, this is one of the first approaches for fiber clustering using deep learning techniques. The results show strong accuracy - on a par with or better than other state-of-the-art methods.

Download Full-text