Denoising Genome-wide Histone ChIP-seq with Convolutional Neural Networks

2016
Author(s):
Pang Wei Koh
Emma Pierson
Anshul Kundaje

Abstract

Motivation: Chromatin immunoprecipitation sequencing (ChIP-seq) experiments are commonly used to obtain genome-wide profiles of histone modifications associated with different types of functional genomic elements. However, the quality of histone ChIP-seq data is affected by a myriad of experimental parameters such as the amount of input DNA, antibody specificity, ChIP enrichment, and sequencing depth. Making accurate inferences from chromatin profiling experiments that involve diverse experimental parameters is challenging.

Results: We introduce a convolutional denoising algorithm, Coda, that uses convolutional neural networks to learn a mapping from suboptimal to high-quality histone ChIP-seq data. This overcomes various sources of noise and variability, substantially enhancing and recovering signal when applied to low-quality chromatin profiling datasets across individuals, cell types, and species. Our method has the potential to improve data quality at reduced costs. More broadly, this approach – using a high-dimensional discriminative model to encode a generative noise process – is generally applicable to other biological domains where it is easy to generate noisy data but difficult to analytically characterize the noise or underlying data distribution.

Availability: https://github.com/kundajelab/
Contact: [email protected]
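As an illustrative sketch of the denoising idea (not Coda itself, which trains multi-layer CNNs on paired low-/high-quality experiments), the snippet below passes a noisy binned coverage track through a single fixed 1D filter standing in for a learned denoising kernel; the toy signal and all names are invented for illustration:

```python
import numpy as np

def conv1d_same(signal, kernel):
    """1D convolution with zero padding so the output length matches the input."""
    pad = len(kernel) // 2
    padded = np.pad(signal, pad)  # zero-padded borders
    return np.array([np.dot(padded[i:i + len(kernel)], kernel)
                     for i in range(len(signal))])

# Toy "noisy" binned coverage track: a clean peak plus read-count noise.
rng = np.random.default_rng(0)
clean = np.exp(-0.5 * ((np.arange(100) - 50) / 5.0) ** 2)
noisy = clean + rng.normal(0.0, 0.3, size=100)

# Stand-in for a learned denoising filter: a fixed smoothing kernel.
kernel = np.ones(9) / 9.0
denoised = conv1d_same(noisy, kernel)

# The denoised track should be closer to the clean signal than the noisy one.
err_noisy = np.mean((noisy - clean) ** 2)
err_denoised = np.mean((denoised - clean) ** 2)
```

In the actual method the kernel weights are learned from paired suboptimal and high-quality experiments rather than fixed by hand.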

2018
pp. 99-103
Author(s):
D. S. Kolesnikov
D. A. Kuznetsov

State-of-the-art convolutional neural networks achieve high accuracy on a wide range of problems, usually at the cost of significantly increased computational complexity and network parameters represented as single-precision floating-point numbers. However, because of limited resources, deploying such networks in real time on embedded systems and in mobile applications is problematic. One way to address this problem is to reduce the bit depth of the data and use integer arithmetic, which requires quantizing the network parameters. Quantization must be performed with minimal loss of recognition accuracy. The article proposes an optimal uniform quantizer with an adaptive step, where the step depends on the distribution function of the quantized parameters; this reduces the effect of quantization error on recognition accuracy. Approaches to improving the quality of quantization are also described. The proposed quantization method is evaluated on the CIFAR-10 database. It is shown that, with an 8-bit representation of the network parameters, the optimal uniform quantizer achieves the accuracy of the original trained network on CIFAR-10.
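A minimal sketch of one way such an adaptive-step uniform quantizer could look. Here the step is derived from a quantile of the empirical weight distribution so that rare outliers do not inflate it; the quantile rule is an assumption for illustration, not the paper's exact formula:

```python
import numpy as np

def uniform_quantize(weights, n_bits=8, coverage=0.999):
    """Uniform quantizer whose step adapts to the empirical weight distribution.

    The clipping range is a quantile of |w|, so the step reflects where most
    of the parameter mass lies rather than the extreme values."""
    clip = np.quantile(np.abs(weights), coverage)
    levels = 2 ** (n_bits - 1) - 1              # symmetric signed range, e.g. ±127
    step = clip / levels
    q = np.clip(np.round(weights / step), -levels, levels).astype(np.int8)
    return q, step

def dequantize(q, step):
    """Recover approximate float weights from integer codes."""
    return q.astype(np.float32) * step

# Quantize a toy layer's weights and measure the reconstruction error.
rng = np.random.default_rng(1)
w = rng.normal(0.0, 0.05, size=10000).astype(np.float32)
q, step = uniform_quantize(w)
w_hat = dequantize(q, step)
mse = np.mean((w - w_hat) ** 2)
```

Because the step tracks the distribution, narrowly concentrated weight tensors get a fine step while wide ones get a coarse one, keeping the relative quantization error roughly stable across layers.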


Author(s):
Josep Arús-Pous
Simon Johansson
Oleksii Prykhodko
Esben Jannik Bjerrum
Christian Tyrchan
...

Recurrent Neural Networks (RNNs) trained with a set of molecules represented as unique (canonical) SMILES strings have shown the capacity to create large chemical spaces of valid and meaningful structures. Herein we perform an extensive benchmark on models trained with subsets of GDB-13 of different sizes (1 million, 10,000 and 1,000), with different SMILES variants (canonical, randomized and DeepSMILES), with two different recurrent cell types (LSTM and GRU) and with different hyperparameter combinations. To guide the benchmarks, new metrics were developed that characterize the generated chemical space with respect to its uniformity, closedness and completeness. Results show that models using LSTM cells trained with 1 million randomized SMILES, a non-unique molecular string representation, generate larger chemical spaces than the other approaches and represent the target chemical space more accurately. Specifically, a model trained with randomized SMILES was able to generate almost all molecules from GDB-13 with a quasi-uniform probability. Models trained with smaller samples show an even bigger improvement when trained with randomized SMILES. Additionally, models were trained on molecules obtained from ChEMBL and illustrate again that training with randomized SMILES leads to models having a better representation of the drug-like chemical space. Namely, the model trained with randomized SMILES was able to generate at least double the number of unique molecules with the same distribution of properties compared to one trained with canonical SMILES.
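The notions of completeness and uniformity of a generated chemical space can be illustrated with toy definitions: completeness as the fraction of the target set produced at least once, and uniformity as the normalized entropy of the generation counts. These are simplified stand-ins for the paper's metrics, not its exact formulations:

```python
import math
from collections import Counter

def chemical_space_metrics(generated, target):
    """Toy completeness and uniformity scores for a molecular generator.

    completeness: fraction of the target space produced at least once.
    uniformity:   normalized entropy of generation counts over the target;
                  1.0 means every target molecule is sampled equally often."""
    counts = Counter(m for m in generated if m in target)
    completeness = len(counts) / len(target)
    total = sum(counts.values())
    probs = [c / total for c in counts.values()]
    entropy = -sum(p * math.log(p) for p in probs)
    uniformity = entropy / math.log(len(target)) if len(target) > 1 else 1.0
    return completeness, uniformity

# Toy target "chemical space" of four SMILES strings, sampled evenly twice each.
target = {"C", "CC", "CCC", "CCO"}
generated = ["C", "CC", "CCC", "CCO", "C", "CC", "CCC", "CCO"]
comp, unif = chemical_space_metrics(generated, target)
# A perfectly even sample of the full target gives (1.0, 1.0).
```

A model that repeatedly emits a few favourite molecules would keep completeness low and drive uniformity toward zero, which is the failure mode the benchmark is designed to expose.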


Author(s):
H. Albanwan
R. Qin

Abstract. Digital surface model (DSM) fusion is one of the ongoing challenges in enhancing the quality of 3D models, especially for complex regions with variable radiometric and geometric distortions such as satellite datasets. DSM generation using multi-view stereo (MVS) analysis is the most common cost-efficient approach to recovering elevations. Algorithms such as Census-based semi-global matching (SGM) and matching-cost convolutional neural networks (MC-CNN) have been successfully implemented to generate disparities and recover DSMs; however, their performance is limited when matching stereo pair images over regions that are ill-posed, low-texture, densely textured, occluded, or noisy, which can yield missing or incorrect elevation values in addition to fuzzy boundaries. DSM fusion algorithms have proven to tackle such problems, but their performance may vary with the quality of the input and the type of fusion, which can be classified as adaptive or non-adaptive. In this paper, we evaluate the performance of adaptive and non-adaptive fusion methods (median filter, adaptive median filter, K-median clustering fusion, weighted-average fusion, and adaptive spatiotemporal fusion) for DSMs generated using Census and MC-CNN. We perform our evaluation on nine test regions, using stereo pair images from the WorldView-3 satellite to generate DSMs with Census and MC-CNN. Our results show that adaptive fusion algorithms predict elevations more accurately than non-adaptive algorithms, owing to their ability to learn from temporal and contextual information. Our results also show that MC-CNN produces better fusion results, with a lower overall average RMSE, than Census.
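A minimal sketch contrasting two of the non-adaptive fusion strategies named above, per-pixel median fusion and weighted-average fusion, on a toy DSM stack. The array values and weighting scheme are invented for illustration:

```python
import numpy as np

def median_fusion(dsms):
    """Non-adaptive fusion: per-pixel median over a stack of DSMs."""
    return np.nanmedian(dsms, axis=0)

def weighted_average_fusion(dsms, weights):
    """Fusion with one confidence weight per DSM (e.g., from matching cost)."""
    w = np.asarray(weights, dtype=float)[:, None, None]
    return np.nansum(dsms * w, axis=0) / np.sum(w)

# Three toy 2x2 DSMs (elevations in metres); one has a gross blunder at (0, 0).
dsms = np.stack([
    np.array([[10.0, 12.0], [11.0, 13.0]]),
    np.array([[10.2, 12.1], [11.1, 13.2]]),
    np.array([[90.0, 11.9], [10.9, 12.8]]),   # matching blunder at (0, 0)
])
fused = median_fusion(dsms)
# The median suppresses the 90 m blunder, keeping (0, 0) near 10 m,
# whereas an equally weighted average is pulled far off by it.
```

This is also why weighted-average fusion needs informative weights: with equal weights it behaves like a plain mean and inherits every outlier, while down-weighting low-confidence pixels recovers robustness closer to the median's.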


2020
Vol 6
pp. e278
Author(s):
Ghazaleh Khodabandelou
Etienne Routhier
Julien Mozziconacci

The application of deep neural networks is a rapidly expanding field now reaching many disciplines, including genomics. In particular, convolutional neural networks have been exploited to identify the functional role of short genomic sequences. These approaches rely on gathering large sets of sequences with a known functional role, extracted from whole-genome annotations. These sets are then split into training, test and validation sets in order to train the networks. While the resulting networks perform well on validation sets, they often perform poorly when applied to whole genomes, in which the ratio of positive to negative examples can be very different from that in the training set. We address this issue here by assessing the genome-wide performance of networks trained with sets exhibiting different ratios of positive to negative examples. As a case study, we use sequences encompassing gene starts from the RefGene database as positive examples and random genomic sequences as negative examples. We then demonstrate that models trained on data from one organism can predict gene-start sites in a related species, provided the training sets yield good genome-wide performance. This cross-species application of convolutional neural networks provides a new way to annotate any genome from existing high-quality annotations in a related reference species. It also provides a way to determine whether the sequence motifs recognized by chromatin-associated proteins in different species are conserved.
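The core point, that good held-out metrics need not transfer to genome-wide deployment, can be worked through with a short base-rate calculation. The sensitivity and specificity values below are hypothetical:

```python
def genome_wide_precision(sensitivity, specificity, prevalence):
    """Precision of a classifier deployed at a given class prevalence.

    A model with strong held-out metrics can still produce mostly false
    positives genome-wide, where true gene starts are very rare."""
    tp = sensitivity * prevalence                 # true-positive rate mass
    fp = (1 - specificity) * (1 - prevalence)     # false-positive rate mass
    return tp / (tp + fp)

# The same model (95% sensitivity, 99% specificity) at two prevalences:
balanced = genome_wide_precision(0.95, 0.99, 0.5)     # balanced 1:1 test set
genome = genome_wide_precision(0.95, 0.99, 0.001)     # gene starts are rare
# Precision is near 0.99 on the balanced set but collapses genome-wide,
# because false positives from the huge negative class swamp the true hits.
```

This is why the abstract's strategy of tuning the positive-to-negative ratio of the training set against genome-wide performance, rather than validation-set accuracy alone, matters.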


Author(s):
Yao Lu
Guangming Lu
Bob Zhang
Yuanrong Xu
Jinxing Li

To construct small mobile networks without performance loss and to address the over-fitting caused by less abundant training datasets, this paper proposes a novel super sparse convolutional (SSC) kernel; the corresponding network is called SSC-Net. In an SSC kernel, every spatial kernel has only one non-zero parameter, and these non-zero spatial positions are all different. The SSC kernel effectively selects pixels from the feature maps according to its non-zero positions and operates on them. SSC can therefore preserve the general geometric characteristics and the differences between channels, preserving the quality of the retrieved features and meeting general accuracy requirements. Furthermore, SSC can be implemented entirely with “shift” and “group point-wise” convolutional operations, without any spatial kernels (e.g., 3×3). SSC is thus the first method to remove parameter redundancy in both the spatial and the channel extent, greatly decreasing the parameters and FLOPs and further reducing the img2col and col2img operations performed by low-level libraries. Meanwhile, SSC-Net improves sparsity and overcomes over-fitting more effectively than other mobile networks. Comparative experiments were performed on the less abundant CIFAR and low-resolution ImageNet datasets. The results showed that SSC-Nets significantly decrease the parameters and computational FLOPs without any performance loss, and that they better address the over-fitting problem on the more challenging, less abundant datasets.
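A rough numpy sketch of the shift-then-mix decomposition: a spatial kernel with a single non-zero tap per channel is equivalent to shifting each channel by its tap offset and then applying a point-wise (1×1) channel mix. The offsets and weights below are arbitrary stand-ins for learned parameters, and a real SSC layer would use grouped point-wise convolutions:

```python
import numpy as np

def shift_channels(x, offsets):
    """Shift each channel of x (C, H, W) by (dy, dx); zero-fill the border.

    Equivalent to convolving each channel with a kernel that has one
    non-zero tap, placed at a different position per channel."""
    out = np.zeros_like(x)
    _, h, w = x.shape
    for c, (dy, dx) in enumerate(offsets):
        ys0, ys1 = max(dy, 0), h + min(dy, 0)
        xs0, xs1 = max(dx, 0), w + min(dx, 0)
        out[c, ys0:ys1, xs0:xs1] = x[c, ys0 - dy:ys1 - dy, xs0 - dx:xs1 - dx]
    return out

def pointwise_conv(x, w):
    """1x1 convolution: mix channels at every pixel (w is C_out x C_in)."""
    c, h, wd = x.shape
    return (w @ x.reshape(c, -1)).reshape(w.shape[0], h, wd)

x = np.arange(2 * 4 * 4, dtype=float).reshape(2, 4, 4)
offsets = [(0, 1), (1, 0)]                 # one distinct tap per channel
w = np.array([[1.0, 0.0], [0.5, 0.5]])     # stand-in channel-mixing weights
y = pointwise_conv(shift_channels(x, offsets), w)
```

Because the shift carries no parameters and the 1×1 mix has C_out × C_in weights, all spatial-kernel parameters (e.g., the 9 per channel of a 3×3) disappear, which is the source of the parameter and FLOP savings the abstract describes.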


2020
Vol 9 (2)
pp. 1055-1059

Identifying the quality of fresh produce during procurement is a major task involving time and human effort in the retail industry. The main objective of this project is to identify and classify apples as fresh or rotten using convolutional neural networks. Our study achieved 97.92 percent accuracy over the two classes on approximately 5,031 images, first detecting apples using ResNet-50 and then classifying them with the proposed model.


2015
Author(s):
David R. Kelley
Jasper Snoek
John Rinn

Abstract

The complex language of eukaryotic gene expression remains incompletely understood. Despite the importance suggested by the many noncoding variants statistically associated with human disease, nearly all such variants have an unknown mechanism. Here, we address this challenge using an approach based on a recent machine learning advance: deep convolutional neural networks (CNNs). We introduce an open source package, Basset (https://github.com/davek44/Basset), to apply CNNs to learn the functional activity of DNA sequences from genomics data. We trained Basset on a compendium of accessible genomic sites mapped in 164 cell types by DNaseI-seq and demonstrate far greater predictive accuracy than previous methods. Basset's predicted changes in accessibility between variant alleles were far greater for GWAS SNPs that are likely to be causal than for nearby SNPs in linkage disequilibrium with them. With Basset, a researcher can perform a single sequencing assay in their cell type of interest and simultaneously learn that cell's chromatin accessibility code and annotate every mutation in the genome with its influence on present accessibility and latent potential for accessibility. Thus, Basset offers a powerful computational approach to annotate and interpret the noncoding genome.
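A toy sketch of the allele-scoring idea: one-hot encode the sequence around each allele and compare the model's predicted accessibility for the two versions. Here a single hand-made motif filter stands in for the trained CNN, and all sequences are invented for illustration:

```python
import numpy as np

BASES = {"A": 0, "C": 1, "G": 2, "T": 3}

def one_hot(seq):
    """One-hot encode a DNA sequence as a 4 x L matrix."""
    x = np.zeros((4, len(seq)))
    for i, b in enumerate(seq):
        x[BASES[b], i] = 1.0
    return x

def accessibility_score(x, motif):
    """Stand-in for a trained model: best match of one motif filter along
    the sequence (a CNN like Basset stacks many learned filters)."""
    l = motif.shape[1]
    return max(float(np.sum(x[:, i:i + l] * motif))
               for i in range(x.shape[1] - l + 1))

# Score the change in predicted accessibility between two alleles.
motif = one_hot("TATA")                  # hypothetical filter
ref = "GGCTATAGGC"                       # reference allele carries the motif
alt = "GGCTACAGGC"                       # the variant disrupts it
delta = (accessibility_score(one_hot(alt), motif)
         - accessibility_score(one_hot(ref), motif))
# A negative delta flags the variant as reducing predicted accessibility.
```

Ranking variants by the magnitude of such deltas is what allows likely-causal GWAS SNPs to be separated from linked neighbours that do not change the predicted accessibility.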


2021
pp. 1-10
Author(s):
Halime Ergun

Fiber and vessel structures in the cross-section are anatomical features that play an important role in identifying tree species. To determine the microscopic anatomical structure of these cell types, each cell must be accurately segmented. In this study, a segmentation method based on deep convolutional neural networks is proposed for wood cell images. The network, developed by combining two-stage CNN structures, was trained using the Adam optimization algorithm. For evaluation, the method was compared with SegNet and U-Net architectures trained on the same dataset. The trained models were compared using IoU (Intersection over Union), accuracy, and BF-score measurements on the test data. The automatic identification of cells in wood images obtained with a microscope will provide a fast, inexpensive, and reliable tool for those working in this field.
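The IoU measurement used for the comparison above is straightforward to compute for binary segmentation masks; the toy masks below are illustrative:

```python
import numpy as np

def iou(pred, truth):
    """Intersection over Union between two binary segmentation masks."""
    pred, truth = pred.astype(bool), truth.astype(bool)
    inter = np.logical_and(pred, truth).sum()
    union = np.logical_or(pred, truth).sum()
    return inter / union if union else 1.0

truth = np.array([[1, 1, 0], [1, 1, 0], [0, 0, 0]])   # ground-truth cell mask
pred  = np.array([[1, 1, 0], [1, 0, 0], [0, 0, 1]])   # model prediction
# intersection = 3 pixels, union = 5 pixels -> IoU = 0.6
```

Unlike plain pixel accuracy, IoU penalizes both the missed cell pixel and the spurious one, which is why it is the standard headline metric for segmentation benchmarks of this kind.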

