Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study

Rajesh Baliram Singh; Hanqi Zhuang; Jeet Kiran Pawani

doi:10.3390/s21217320

Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study

Sensors ◽

10.3390/s21217320 ◽

2021 ◽

Vol 21 (21) ◽

pp. 7320

Author(s):

Rajesh Baliram Singh ◽

Hanqi Zhuang ◽

Jeet Kiran Pawani

Keyword(s):

Deep Learning ◽

Experimental Studies ◽

Extraction Methods ◽

Classification Model ◽

Computer Algorithms ◽

Mel Frequency Cepstral Coefficients ◽

Plastic Bag ◽

Life Threatening ◽

The Difference

Distinguishing between a dangerous audio event like a gun firing and other non-life-threatening events, such as a plastic bag bursting, can mean the difference between life and death and, therefore, the necessary and unnecessary deployment of public safety personnel. Sounds generated by plastic bag explosions are often confused with real gunshot sounds, by either humans or computer algorithms. As a case study, the research reported in this paper offers insight into sounds of plastic bag explosions and gunshots. An experimental study in this research reveals that a deep learning-based classification model trained with a popular urban sound dataset containing gunshot sounds cannot distinguish plastic bag pop sounds from gunshot sounds. This study further shows that the same deep learning model, if trained with a dataset containing plastic pop sounds, can effectively detect the non-life-threatening sounds. For this purpose, first, a collection of plastic bag-popping sounds was recorded in different environments with varying parameters, such as plastic bag size and distance from the recording microphones. The audio clips’ duration ranged from 400 ms to 600 ms. This collection of data was then used, together with a gunshot sound dataset, to train a classification model based on a convolutional neural network (CNN) to differentiate life-threatening gunshot events from non-life-threatening plastic bag explosion events. A comparison between two feature extraction methods, the Mel-frequency cepstral coefficients (MFCC) and Mel-spectrograms, was also done. Experimental studies conducted in this research show that once the plastic bag pop sounds are injected into model training, the CNN classification model performs well in distinguishing actual gunshot sounds from plastic bag sounds.

Download Full-text

Deep Learning-Based Cattle Vocal Classification Model and Real-Time Livestock Monitoring System with Noise Filtering

Animals ◽

10.3390/ani11020357 ◽

2021 ◽

Vol 11 (2) ◽

pp. 357

Author(s):

Dae-Hyun Jung ◽

Na Yeon Kim ◽

Sang Ho Moon ◽

Changho Jhin ◽

Hak-Jin Kim ◽

...

Keyword(s):

Deep Learning ◽

Real Time ◽

Model Development ◽

Classification Model ◽

Intelligence Analysis ◽

Noise Filtering ◽

Meat Industry ◽

Mel Frequency Cepstral Coefficients ◽

Web Based ◽

The Status

The priority placed on animal welfare in the meat industry is increasing the importance of understanding livestock behavior. In this study, we developed a web-based monitoring and recording system based on artificial intelligence analysis for the classification of cattle sounds. The deep learning classification model of the system is a convolutional neural network (CNN) model that takes voice information converted to Mel-frequency cepstral coefficients (MFCCs) as input. The CNN model first achieved an accuracy of 91.38% in recognizing cattle sounds. Further, short-time Fourier transform-based noise filtering was applied to remove background noise, improving the classification model accuracy to 94.18%. Categorized cattle voices were then classified into four classes, and a total of 897 classification records were acquired for the classification model development. A final accuracy of 81.96% was obtained for the model. Our proposed web-based platform that provides information obtained from a total of 12 sound sensors provides cattle vocalization monitoring in real time, enabling farm owners to determine the status of their cattle.

Download Full-text

EMBER: Multi-label prediction of kinase-substrate phosphorylation events through deep learning

10.1101/2020.02.04.934216 ◽

2020 ◽

Author(s):

Kathryn E. Kirchoff ◽

Shawn M. Gomez

Keyword(s):

Deep Learning ◽

Experimental Studies ◽

Classification Model ◽

Learning Method ◽

Kinase Substrate ◽

Phosphorylation Of Proteins ◽

Separate Model ◽

Label Prediction ◽

Substrate Phosphorylation ◽

Vector Representations

AbstractKinase-catalyzed phosphorylation of proteins forms the backbone of signal transduction within the cell, enabling the coordination of numerous processes such as the cell cycle, apoptosis, and differentiation. While on the order of 105 phosphorylation events have been described, we know the specific kinase performing these functions for less than 5% of cases. The ability to predict which kinases initiate specific individual phosphorylation events has the potential to greatly enhance the design of downstream experimental studies, while simultaneously creating a preliminary map of the broader phosphorylation network that controls cellular signaling. To this end, we describe EMBER, a deep learning method that integrates kinase-phylogeny information and motif-dissimilarity information into a multi-label classification model for the prediction of kinase-motif phosphorylation events. Unlike previous deep learning methods that perform single-label classification, we restate the task of kinase-motif phosphorylation prediction as a multi-label problem, allowing us to train a single unified model rather than a separate model for each of the 134 kinase families. We utilize a Siamese network to generate novel vector representations, or an embedding, of motif sequences, and we compare our novel embedding to a previously proposed peptide embedding. Our motif vector representations are used, along with one-hot encoded motif sequences, as input to a classification network while also leveraging kinase phylogenetic relationships into our model via a kinase phylogeny-based loss function. Results suggest that this approach holds significant promise for improving our map of phosphorylation relations that underlie kinome signaling.Availabilityhttps://github.com/gomezlab/EMBER

Download Full-text

COVID-19 Detection from Chest X-ray Images Using Feature Fusion and Deep Learning

Sensors ◽

10.3390/s21041480 ◽

2021 ◽

Vol 21 (4) ◽

pp. 1480

Author(s):

Nur-A-Alam Alam ◽

Mominul Ahsan ◽

Md. Abdul Based ◽

Julfikar Haider ◽

Marcin Kowalski

Keyword(s):

Deep Learning ◽

Cross Validation ◽

Feature Fusion ◽

Extraction Methods ◽

Classification Performance ◽

Classification Model ◽

Edge Preservation ◽

X Ray ◽

Testing Stage ◽

Chest X Ray

Currently, COVID-19 is considered to be the most dangerous and deadly disease for the human body caused by the novel coronavirus. In December 2019, the coronavirus spread rapidly around the world, thought to be originated from Wuhan in China and is responsible for a large number of deaths. Earlier detection of the COVID-19 through accurate diagnosis, particularly for the cases with no obvious symptoms, may decrease the patient’s death rate. Chest X-ray images are primarily used for the diagnosis of this disease. This research has proposed a machine vision approach to detect COVID-19 from the chest X-ray images. The features extracted by the histogram-oriented gradient (HOG) and convolutional neural network (CNN) from X-ray images were fused to develop the classification model through training by CNN (VGGNet). Modified anisotropic diffusion filtering (MADF) technique was employed for better edge preservation and reduced noise from the images. A watershed segmentation algorithm was used in order to mark the significant fracture region in the input X-ray images. The testing stage considered generalized data for performance evaluation of the model. Cross-validation analysis revealed that a 5-fold strategy could successfully impair the overfitting problem. This proposed feature fusion using the deep learning technique assured a satisfactory performance in terms of identifying COVID-19 compared to the immediate, relevant works with a testing accuracy of 99.49%, specificity of 95.7% and sensitivity of 93.65%. When compared to other classification techniques, such as ANN, KNN, and SVM, the CNN technique used in this study showed better classification performance. K-fold cross-validation demonstrated that the proposed feature fusion technique (98.36%) provided higher accuracy than the individual feature extraction methods, such as HOG (87.34%) or CNN (93.64%).

Download Full-text

Deep Learning Feature Extraction for Brain Tumor Characterization and Detection

IRA-International Journal of Applied Sciences (ISSN 2455-4499) ◽

10.21013/jas.v16.n1.p1 ◽

2021 ◽

Vol 16 (1) ◽

pp. 1

Author(s):

Otman Basir ◽

Kalifa Shantta

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Brain Tumor ◽

Extraction Methods ◽

Daily Lives ◽

Tumor Characterization ◽

Learning Techniques ◽

Wide Range ◽

Attribute Extraction

Deep Learning is a growing field of artificial intelligence that has become an operative research topic in a wide range of disciplines. Today we are witnessing the tangible successes of Deep Learning in our daily lives in various applications, including education, manufacturing, transportation, healthcare, military, and automotive, etc.<strong> </strong>Deep Learning is a subfield of Machine Learning that stems from Artificial Neural Networks, where a cascade of layers is employed to progressively extract higher-level features from the raw input and make predictive guesses about new data. This paper will discuss the effect of attribute extraction profoundly inherent in training<strong> </strong>approaches such as Convolutional Neural Networks (CNN). Furthermore, the paper aims to offer a study on Deep Learning techniques and attribute extraction methods that have appeared in the last few years. As the demand increases, considerable research in the attribute extraction assignment has become even more instrumental. Brain tumor characterization and detection will be used as a case study to demonstrate Deep Learning CNN's ability to achieve effective representational learning and tumor characterization.

Download Full-text

A METHOD FOR ESTIMATING THE NUMBER OF HOUSEHOLDS IN A REGION FROM THE NUMBER OF BUILDINGS ESTIMATED BY DEEP LEARNING WITH THE ADJUSTMENT OF ITS NUMBER USING ANCILLARY DATASETS: CASE STUDY IN DJAKARTA

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliii-b4-2020-673-2020 ◽

2020 ◽

Vol XLIII-B4-2020 ◽

pp. 673-678

Author(s):

R. Ito ◽

K. Hara ◽

Y. Shimazaki ◽

N. Mori ◽

Y. Kani ◽

...

Keyword(s):

Deep Learning ◽

Satellite Images ◽

Census Data ◽

Statistical Data ◽

Aerial Images ◽

Disaster Prevention ◽

Ancillary Data ◽

Small Areas ◽

The Difference

Abstract. The high resolution statistical data such as the number of households in small areas are indispensable for urban planning, disaster prevention and many kinds of business activities. However, it is difficult to obtain the number of households in small areas because census data are usually aggregated in municipal districts. Techniques for automatically analyzing statistical data, e.g., land cover, population density, and the number of households obtained from satellite/aerial images have been continuously studied. In recent years, many methods using deep learning have been proposed in the related literature. In estimating the number of households, the use of buildings, the number of floors and that of rooms are also important information, but it is difficult to obtain such information from only image analysis using deep learning. This study proposes a method for estimating the number of households in 100 meter grid cells from satellite images using deep learning, and adjusting it using ancillary data obtained from a few statistical datasets. The application of this method to Djakarta shows that the difference between the estimated values and the corresponding values of census is less than 10%.

Download Full-text

Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network

Baghdad Science Journal ◽

10.21123/bsj.2021.18.2(suppl.).0925 ◽

2021 ◽

Vol 18 (2(Suppl.)) ◽

pp. 0925

Author(s):

Asroni Asroni ◽

Ku Ruhana Ku-Mahamud ◽

Cahya Damarjati ◽

Hasan Basri Slamat

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolution Neural Network ◽

Classification Model ◽

Mel Frequency Cepstral Coefficients ◽

Speech Classification ◽

Deep Learning Neural Network ◽

Voice Data ◽

The Voice

Deep learning convolution neural network has been widely used to recognize or classify voice. Various techniques have been used together with convolution neural network to prepare voice data before the training process in developing the classification model. However, not all model can produce good classification accuracy as there are many types of voice or speech. Classification of Arabic alphabet pronunciation is a one of the types of voice and accurate pronunciation is required in the learning of the Qur’an reading. Thus, the technique to process the pronunciation and training of the processed data requires specific approach. To overcome this issue, a method based on padding and deep learning convolution neural network is proposed to evaluate the pronunciation of the Arabic alphabet. Voice data from six school children are recorded and used to test the performance of the proposed method. The padding technique has been used to augment the voice data before feeding the data to the CNN structure to developed the classification model. In addition, three other feature extraction techniques have been introduced to enable the comparison of the proposed method which employs padding technique. The performance of the proposed method with padding technique is at par with the spectrogram but better than mel-spectrogram and mel-frequency cepstral coefficients. Results also show that the proposed method was able to distinguish the Arabic alphabets that are difficult to pronounce. The proposed method with padding technique may be extended to address other voice pronunciation ability other than the Arabic alphabets.

Download Full-text

Air-pollution prediction in smart city, deep learning approach

Journal Of Big Data ◽

10.1186/s40537-021-00548-1 ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Abdellatif Bekkar ◽

Badr Hssina ◽

Samira Douzi ◽

Khadija Douzi

Keyword(s):

Air Pollution ◽

Deep Learning ◽

Urban Areas ◽

Historical Data ◽

Meteorological Data ◽

Predictive Performance ◽

The Past ◽

Life Threatening ◽

The Difference ◽

Beijing China

AbstractOver the past few decades, due to human activities, industrialization, and urbanization, air pollution has become a life-threatening factor in many countries around the world. Among air pollutants, Particulate Matter with a diameter of less than $$2.5 \mu m$$ 2.5 μ m ($$PM_{2.5}$$ P M 2.5 ) is a serious health problem. It causes various illnesses such as respiratory tract and cardiovascular diseases. Hence, it is necessary to accurately predict the $$PM_{2.5}$$ P M 2.5 concentrations in order to prevent the citizens from the dangerous impact of air pollution beforehand. The variation of $$PM_{2.5}$$ P M 2.5 depends on a variety of factors, such as meteorology and the concentration of other pollutants in urban areas. In this paper, we implemented a deep learning solution to predict the hourly forecast of $$PM_{2.5}$$ P M 2.5 concentration in Beijing, China, based on CNN-LSTM, with a spatial-temporal feature by combining historical data of pollutants, meteorological data, and $$PM_{2.5}$$ P M 2.5 concentration in the adjacent stations. We examined the difference in performances among Deep learning algorithms such as LSTM, Bi-LSTM, GRU, Bi-GRU, CNN, and a hybrid CNN-LSTM model. Experimental results indicate that our method “hybrid CNN-LSTM multivariate” enables more accurate predictions than all the listed traditional models and performs better in predictive performance.

Download Full-text

Deep learning based analysis of sentiment dynamics in online cancer community forums: An experience

Health Informatics Journal ◽

10.1177/14604582211007537 ◽

2021 ◽

Vol 27 (2) ◽

pp. 146045822110075

Author(s):

Athira Balakrishnan ◽

Sumam Mary Idicula ◽

Josette Jones

Keyword(s):

Deep Learning ◽

Treatment Options ◽

Learning Model ◽

Satisfaction Level ◽

Change Analysis ◽

Community Group ◽

Life Threatening ◽

The Difference ◽

Health Communities ◽

Deep Learning Model

Online health communities (OHC) provide various opportunities for patients with chronic or life-threatening illnesses, especially for cancer patients and survivors. A better understanding of the sentiment dynamics of patients in OHCs can help in the precise formulation of the needs during their treatment. The current study investigated the sentiment dynamics in patients’ narratives in a Breast Cancer community group ( Breastcancer.org ) to identify the changes in emotions, thoughts, stress, and coping mechanisms while undergoing treatment options, particularly chemotherapy, radiation, and surgery. Sentiment dynamics of users’ posts was performed using a deep learning model. A sentiment change analysis was performed to measure change in the satisfaction level of the users. The deep learning model BiLSTM with sentiment embedding features provided a better F1-score of 91.9%. Sentiment dynamics can assess the difference in satisfaction level the users acquire by interacting with other users in the forum. A comparison of the proposed model with existing models revealed the effectiveness of this methodology.

Download Full-text

A New Text Classification Model Based on Contrastive Word Embedding for Detecting Cybersecurity Intelligence in Twitter

Electronics ◽

10.3390/electronics9091527 ◽

2020 ◽

Vol 9 (9) ◽

pp. 1527 ◽

Cited By ~ 1

Author(s):

Han-Sub Shin ◽

Hyuk-Yoon Kwon ◽

Seung-Jin Ryu

Keyword(s):

Deep Learning ◽

Text Classification ◽

Area Under The Curve ◽

Word Embedding ◽

Classification Model ◽

Data Set ◽

Feature Vectors ◽

Model Based ◽

Proposed Model ◽

The Difference

Detecting cybersecurity intelligence (CSI) on social media such as Twitter is crucial because it allows security experts to respond cyber threats in advance. In this paper, we devise a new text classification model based on deep learning to classify CSI-positive and -negative tweets from a collection of tweets. For this, we propose a novel word embedding model, called contrastive word embedding, that enables to maximize the difference between base embedding models. First, we define CSI-positive and -negative corpora, which are used for constructing embedding models. Here, to supplement the imbalance of tweet data sets, we additionally employ the background knowledge for each tweet corpus: (1) CVE data set for CSI-positive corpus and (2) Wikitext data set for CSI-negative corpus. Second, we adopt the deep learning models such as CNN or LSTM to extract adequate feature vectors from the embedding models and integrate the feature vectors into one classifier. To validate the effectiveness of the proposed model, we compare our method with two baseline classification models: (1) a model based on a single embedding model constructed with CSI-positive corpus only and (2) another model with CSI-negative corpus only. As a result, we indicate that the proposed model shows high accuracy, i.e., 0.934 of F1-score and 0.935 of area under the curve (AUC), which improves the baseline models by 1.76∼6.74% of F1-score and by 1.64∼6.98% of AUC.

Download Full-text

EVALUASI PROYEKSI DEMAND BARANG DAN PENUMPANG PELABUHAN DI PROVINSI MALUKU

Jurnal Transportasi ◽

10.26593/jt.v19i2.3467.101-110 ◽

2019 ◽

Vol 19 (2) ◽

pp. 101-110

Author(s):

Adrian Firdaus ◽

M. Dwi Yoga Sutanto ◽

Rajin Sihombing ◽

M. Weldy Hermawan

Keyword(s):

Time Series ◽

Projection Method ◽

Development Plan ◽

Master Plan ◽

Port Development ◽

Loading And Unloading ◽

The Difference ◽

Existing Data

Abstract Every port in Indonesia must have a Port Master Plan that contains an integrated port development plan. This study discusses one important aspect in the preparation of the Port Master Plan, namely the projected movement of goods and passengers, which can be used as a reference in determining the need for facilities at each stage of port development. The case study was conducted at a port located in a district in Maluku Province and aims to evaluate the analysis of projected demand for goods and passengers occurring at the port. The projection method used is time series and econometric projection. The projection results are then compared with the existing data in 2018. The results of this study show that the econometric projection gives adequate results in predicting loading and unloading activities as well as the number of passenger arrival and departure in 2018. This is indicated by the difference in the percentage of projection results towards the existing data, which is smaller than 10%. Whereas for loading and unloading activities, time series projections with logarithmic trends give better results than econometric projections. Keywords: port, port master plan, port development, unloading activities Abstrak Setiap pelabuhan di Indonesia harus memiliki sebuah Rencana Induk Pelabuhan yang memuat rencana pengem-bangan pelabuhan secara terpadu. Studi ini membahas salah satu aspek penting dalam penyusunan Rencana Induk Pelabuhan, yaitu proyeksi pergerakan barang dan penumpang, yang dapat dipakai sebagai acuan dalam penentuan kebutuhan fasilitas di setiap tahap pengembangan pelabuhan. Studi kasus dilakukan pada sebuah pelabuhan yang terletak di sebuah kabupaten di Provinsi Maluku dan bertujuan untuk melakukan evaluasi ter-hadap analisis proyeksi demand barang dan penumpang yang terjadi di pelabuhan tersebut. Metode proyeksi yang dipakai adalah proyeksi deret waktu dan ekonometrik. Hasil proyeksi selanjutnya dibandingkan dengan data eksisting tahun 2018. Hasil studi ini menunjukkan bahwa proyeksi ekonometrik memberikan hasil yang cukup baik dalam memprediksi aktivitas bongkar barang serta jumlah penumpang naik dan turun di tahun 2018. Hal ini diindikasikan dengan selisih persentase hasil proyeksi terhadap data eksisting yang lebih kecil dari 10%. Sedangkan untuk aktivitas muat barang, proyeksi deret waktu dengan tren logaritmik memberikan hasil yang lebih baik daripada proyeksi ekonometrik. Kata-kata kunci: pelabuhan, rencana induk pelabuhan, pengembangan pelauhan, aktivitas bongkar barang

Download Full-text