scholarly journals Improving Singing Voice Separation Using Curriculum Learning on Recurrent Neural Networks

2020 ◽  
Vol 10 (7) ◽  
pp. 2465
Author(s):  
Seungtae Kang ◽  
Jeong-Sik Park  ◽  
Gil-Jin Jang 

Single-channel singing voice separation has been considered a difficult task, as it requires predicting two different audio sources independently from mixed vocal and instrument sounds recorded by a single microphone. We propose a new singing voice separation approach based on the curriculum learning framework, in which learning is started with only easy examples and then task difficulty is gradually increased. In this study, we regard the data providing obviously dominant characteristics of a single source as an easy case and the other data as a difficult case. To quantify the dominance property between two sources, we define a dominance factor that determines a difficulty level according to relative intensity between vocal sound and instrument sound. If a given data is determined to provide obviously dominant characteristics of a single source according to the factor, it is regarded as an easy case; otherwise, it belongs to a difficult case. Early stages in the learning focus on easy cases, thus allowing rapidly learning overall characteristics of each source. On the other hand, later stages handle difficult cases, allowing more careful and sophisticated learning. In experiments conducted on three song datasets, the proposed approach demonstrated superior performance compared to the conventional approaches.

Microbiome ◽  
2021 ◽  
Vol 9 (1) ◽  
Author(s):  
Yu Li ◽  
Zeling Xu ◽  
Wenkai Han ◽  
Huiluo Cao ◽  
Ramzan Umarov ◽  
...  

Abstract Background The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mitigate the efficacy of antibiotics. Accurately identifying ARGs is thus an indispensable step to understanding the ecology, and transmission of ARGs between environmental and human-associated reservoirs. Unfortunately, the previous computational methods for identifying ARGs are mostly based on sequence alignment, which cannot identify novel ARGs, and their applications are limited by currently incomplete knowledge about ARGs. Results Here, we propose an end-to-end Hierarchical Multi-task Deep learning framework for ARG annotation (HMD-ARG). Taking raw sequence encoding as input, HMD-ARG can identify, without querying against existing sequence databases, multiple ARG properties simultaneously, including if the input protein sequence is an ARG, and if so, what antibiotic family it is resistant to, what resistant mechanism the ARG takes, and if the ARG is an intrinsic one or acquired one. In addition, if the predicted antibiotic family is beta-lactamase, HMD-ARG further predicts the subclass of beta-lactamase that the ARG is resistant to. Comprehensive experiments, including cross-fold validation, third-party dataset validation in human gut microbiota, wet-experimental functional validation, and structural investigation of predicted conserved sites, demonstrate not only the superior performance of our method over the state-of-art methods, but also the effectiveness and robustness of the proposed method. Conclusions We propose a hierarchical multi-task method, HMD-ARG, which is based on deep learning and can provide detailed annotations of ARGs from three important aspects: resistant antibiotic class, resistant mechanism, and gene mobility. We believe that HMD-ARG can serve as a powerful tool to identify antibiotic resistance genes and, therefore mitigate their global threat. Our method and the constructed database are available at http://www.cbrc.kaust.edu.sa/HMDARG/.


1995 ◽  
Vol 18 (2) ◽  
pp. 141-158 ◽  
Author(s):  
Marshall H. Raskind ◽  
Eleanor Higgins

This study investigated the effects of speech synthesis on the proofreading efficiency of postsecondary students with learning disabilities. Subjects proofread self-generated written language samples under three conditions: (a) using a speech synthesis system that simultaneously highlighted and “spoke” words on a computer monitor, (b) having the text read aloud to them by another person, and (c) receiving no assistance. Using the speech synthesis system enabled subjects to detect a significantly higher percentage of total errors than either of the other two proofreading conditions. In addition, subjects were able to locate a significantly higher percentage of capitalization, spelling, usage and typographical errors under the speech synthesis condition. However, having the text read aloud by another person significantly outperformed the other conditions in finding “grammar-mechanical” errors. Results are discussed with regard to underlying reasons for the overall superior performance of the speech synthesis system and the implications of using speech synthesis as a compensatory writing aid for postsecondary students with learning disabilities.


2002 ◽  
Vol 75 (3) ◽  
pp. 459-468 ◽  
Author(s):  
L. E. R. Dawson ◽  
A. F. Carson ◽  
L. O. W. McClinton

AbstractAn experiment was undertaken to compare the productivity of crossbred ewes, produced by crossing Texel sires with Rouge de l’Ouest (Rouge) dams and Rouge sires with Texel dams, relative to purebred Texel and Rouge ewes. The purebred and crossbred ewes were crossed with Rouge and Texel sires. The proportion of productive ewes was similar in the purebred and crossbred ewes with an average figure of 0·92. Irrespective of crossing sire, Rouge ewes produced 0·48 more lambs per ewe lambed than Texel ewes (P < 0·001). The two crossbred ewe types (Texel ✕ Rouge and Rouge ✕ Texel) each produced similar numbers of lambs (on average 1·92 lambs per ewe lambed). Individual heterosis values for ewe fertility and prolificacy were small and not significant (–1·67 for the proportion of productive ewes and –3·14 for the number of lambs born per ewe lambed). Maternal heterosis values were also not significant but were of larger magnitude (6·26 for ewe fertility and 3·12 for prolificacy). Lamb mortality (number of lambs born dead per ewe lambed) at birth was similar for purebred Rouge (0·44) and Texel (0·30) ewes and was significantly reduced by crossbred matings and mating the crossbred ewes (individual heterosis –30·68, P < 0·10; maternal heterosis –80·23, P < 0·001). Individual and maternal heterosis values for lamb growth rate from birth to six weeks were 8 (P < 0·05) and 4 (P > 0·05) respectively. Lamb growth rate from birth to weaning was significantly lower in lambs from Texel ewes compared with those from the other genotypes (P < 0·05). Individual and maternal heterosis values for live-weight gain from birth to weaning were 5 (P < 0·10) and 5 (P < 0·01). The results of the current study demonstrate the superior performance of purebred Rouge ewes compared with purebred Texel ewes in terms of prolificacy and lamb growth rate from birth to weaning. However, both breeds had high lamb mortality at birth. Crossbreeding led to the production of hybrid ewes which had relatively high prolificacy with low levels of dystocia and lamb mortality.


2020 ◽  
Author(s):  
Sam Gelman ◽  
Philip A. Romero ◽  
Anthony Gitter

ABSTRACTThe mapping from protein sequence to function is highly complex, making it challenging to predict how sequence changes will affect a protein’s behavior and properties. We present a supervised deep learning framework to learn the sequence-function mapping from deep mutational scanning data and make predictions for new, uncharacterized sequence variants. We test multiple neural network architectures, including a graph convolutional network that incorporates protein structure, to explore how a network’s internal representation affects its ability to learn the sequence-function mapping. Our supervised learning approach displays superior performance over physics-based and unsupervised prediction methods. We find networks that capture nonlinear interactions and share parameters across sequence positions are important for learning the relationship between sequence and function. Further analysis of the trained models reveals the networks’ ability to learn biologically meaningful information about protein structure and mechanism. Our software is available from https://github.com/gitter-lab/nn4dms.


F1000Research ◽  
2018 ◽  
Vol 7 ◽  
pp. 431
Author(s):  
Mike L. Smith ◽  
Andrzej K. Oleś ◽  
Wolfgang Huber

The Bioconductor Gateway on the F1000Research platform is a channel for peer-reviewed and citable publication of end-to-end data analysis workflows rooted in the Bioconductor ecosystem. In addition to the largely static journal publication, it is hoped that authors will also deposit their workflows as executable documents on Bioconductor, where the benefits of regular code testing and easy updating can be realized. Ideally these two endpoints would be met from a single source document. However, so far this has not been easy, due to lack of a technical solution that meets both the requirements of the F1000Research article submission format and the executable documents on Bioconductor. Submission to the platform requires a LaTeX file, which many authors traditionally have produced by writing an Rnw document for Sweave or knitr. On the other hand, to produce the HTML rendering of the document hosted by Bioconductor, the most straightforward starting point is the R Markdown format. Tools such as pandoc enable conversion between many formats, but typically a high degree of manual intervention used to be required to satisfactorily handle aspects such as floating figures, cross-references, literature references, and author affiliations. The BiocWorkflowTools package aims to solve this problem by enabling authors to work with R Markdown right up until the moment they wish to submit to the platform.


Sign in / Sign up

Export Citation Format

Share Document