How to Use Machine Learning to Improve the Discrimination between Signal and Background at Particle Colliders

Xabier Cid Vidal; Lorena Dieste Maroñas; Álvaro Dosil Suárez

doi:10.3390/app112211076

Applied Sciences ◽

10.3390/app112211076 ◽

2021 ◽

Vol 11 (22) ◽

pp. 11076

Author(s):

Xabier Cid Vidal ◽

Lorena Dieste Maroñas ◽

Álvaro Dosil Suárez

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Particle Physics ◽

Hadron Collider ◽

Particle Detectors ◽

Overall Performance ◽

Boosted Decision Trees ◽

High Level ◽

Fully Connected

The popularity of Machine Learning (ML) has been increasing in recent decades in almost every area, most notably in the commercial and scientific fields. In particle physics, ML has proven to be a useful resource for making the most of projects such as the Large Hadron Collider (LHC). The main advantages provided by ML are a reduction in the time and effort required for the measurements carried out by experiments and improvements in performance. With this work we aim to encourage scientists working with particle colliders to use ML and to try the different alternatives that are available, focusing on the separation of signal and background. We assess some of the most-used libraries in the field, such as the Toolkit for Multivariate Data Analysis with ROOT, and also newer and more sophisticated options such as PyTorch and Keras. We also assess the suitability of some of the most common algorithms for signal-background discrimination, such as Boosted Decision Trees, and propose the use of others, namely Neural Networks. We compare the overall performance of different algorithms and libraries on simulated LHC data and produce some guidelines to help analysts deal with different situations, such as the use of low- or high-level features from particle detectors or the amount of statistics available for training the algorithms. Our main conclusion is that the algorithms and libraries used most frequently at LHC collaborations might not always be those that provide the best results for the classification of signal candidates, and that fully connected Neural Networks trained with Keras can improve the performance scores in most of the cases we formulate.
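As a rough illustration of what separating signal from background means in practice, the sketch below applies a cut to classifier scores and reports the signal efficiency and background rejection. The function name, the toy scores, and the 0.5 threshold are assumptions made for this example; they are not taken from the paper, and the abstract does not state which figures of merit the authors used.

```python
def efficiency_and_rejection(scores, labels, threshold):
    """Return (signal efficiency, background rejection) for a score cut.

    labels: 1 marks a signal candidate, 0 a background candidate.
    Candidates with score >= threshold are kept.
    """
    n_sig = sum(1 for y in labels if y == 1)
    n_bkg = sum(1 for y in labels if y == 0)
    if n_sig == 0 or n_bkg == 0:
        raise ValueError("need at least one signal and one background candidate")
    sig_kept = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= threshold)
    bkg_kept = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= threshold)
    # Efficiency: fraction of signal kept. Rejection: fraction of background removed.
    return sig_kept / n_sig, 1.0 - bkg_kept / n_bkg


# Toy classifier outputs (hypothetical values, for illustration only).
scores = [0.9, 0.8, 0.4, 0.7, 0.2, 0.1]
labels = [1, 1, 1, 0, 0, 0]
eff, rej = efficiency_and_rejection(scores, labels, threshold=0.5)
print(f"signal efficiency = {eff:.3f}, background rejection = {rej:.3f}")
```

Scanning the threshold over the full score range traces out the trade-off between the two quantities, which is one common way to compare classifiers trained with different libraries.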

Machine and deep learning applications in particle physics

International Journal of Modern Physics A ◽

10.1142/s0217751x19300199 ◽

2019 ◽

Vol 34 (35) ◽

pp. 1930019 ◽

Cited By ~ 3

Author(s):

Dimitri Bourilkov

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Deep Learning ◽

Decision Trees ◽

Particle Physics ◽

Analysis Techniques ◽

Data Intensive ◽

Future Challenges ◽

Boosted Decision Trees ◽

The Many

The many ways in which machine and deep learning are transforming the analysis and simulation of data in particle physics are reviewed. The main methods based on boosted decision trees and various types of neural networks are introduced, and cutting-edge applications in the experimental and theoretical/phenomenological domains are highlighted. After describing the challenges in the application of these novel analysis techniques, the review concludes by discussing the interactions between physics and machine learning as a two-way street enriching both disciplines and helping to meet the present and future challenges of data-intensive science at the energy and intensity frontiers.

Semi-automated classification of colonial Microcystis by FlowCAM imaging flow cytometry in mesocosm experiment reveals high heterogeneity during seasonal bloom

Scientific Reports ◽

10.1038/s41598-021-88661-2 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Yersultan Mirasbekov ◽

Adina Zhumakhanova ◽

Almira Zhantuyakova ◽

Kuanysh Sarkytbayev ◽

Dmitry V. Malashenkov ◽

...

Keyword(s):

Machine Learning ◽

Flow Cytometry ◽

Spatial Resolution ◽

Mesocosm Experiment ◽

Imaging Flow Cytometry ◽

Leibler Divergence ◽

Temporal And Spatial ◽

High Level ◽

Training Sets

A machine learning approach was employed to detect and quantify Microcystis colonial morphospecies using FlowCAM-based imaging flow cytometry. The system was trained and tested using samples from a long-term mesocosm experiment (LMWE, Central Jutland, Denmark). The statistical validation of the classification approaches was performed using Hellinger distances, Bray–Curtis dissimilarity, and Kullback–Leibler divergence. The semi-automatic classification based on well-balanced training sets from the Microcystis seasonal bloom provided a high level of intergeneric accuracy (96–100%) but relatively low intrageneric accuracy (67–78%). Our results provide a proof of concept of how machine learning approaches can be applied to analyze colonial microalgae. This approach allowed us to evaluate the Microcystis seasonal bloom in individual mesocosms with a high level of temporal and spatial resolution. The observation that some Microcystis morphotypes completely disappeared and re-appeared along the mesocosm experiment timeline supports the hypothesis of the main transition pathways of colonial Microcystis morphoforms. We demonstrated that accurate classification of Microcystis spp. required significant changes in the training sets of colonial images even for time points that differed by only two weeks, owing to the high phenotypic heterogeneity of Microcystis during the bloom. We conclude that automatic methods not only reach the performance level of a human taxonomist, and thus can be a valuable time-saving tool in the routine identification of colonial phytoplankton taxa, but can also be applied to increase the temporal and spatial resolution of the study.
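The three validation measures named in the abstract can be sketched in a few lines of Python using their standard textbook definitions. The function names and the class counts below are hypothetical and chosen only for illustration; the abstract does not say how the authors computed or normalised these quantities.

```python
import math


def normalize(counts):
    """Turn non-negative class counts into a probability distribution."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must sum to a positive value")
    return [c / total for c in counts]


def hellinger(p, q):
    """Hellinger distance between two discrete distributions (0 to 1)."""
    return math.sqrt(0.5 * sum((math.sqrt(a) - math.sqrt(b)) ** 2 for a, b in zip(p, q)))


def bray_curtis(u, v):
    """Bray-Curtis dissimilarity between two vectors of non-negative counts."""
    return sum(abs(a - b) for a, b in zip(u, v)) / sum(a + b for a, b in zip(u, v))


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) in nats."""
    total = 0.0
    for a, b in zip(p, q):
        if a == 0:
            continue  # a zero-probability term contributes nothing
        if b == 0:
            return math.inf  # p puts mass where q has none
        total += a * math.log(a / b)
    return total


# Hypothetical counts per morphospecies: manual labels vs. automatic classifier.
manual = [50, 30, 20]
automatic = [40, 40, 20]
p, q = normalize(manual), normalize(automatic)
print(hellinger(p, q), bray_curtis(manual, automatic), kl_divergence(p, q))
```

All three measures are zero when the manual and automatic class distributions agree exactly, so smaller values indicate closer agreement.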

A primer on deep learning and convolutional neural networks for clinicians

Insights into Imaging ◽

10.1186/s13244-021-01052-z ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

Lara Lloret Iglesias ◽

Pablo Sanz Bellón ◽

Amaia Pérez del Barrio ◽

Pablo Menéndez Fernández-Miranda ◽

David Rodríguez González ◽

...

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Learning Capabilities ◽

Computer Vision Applications ◽

Fully Connected ◽

Imaging Perspective ◽

Adaptive Momentum

Deep learning is nowadays at the forefront of artificial intelligence. More precisely, the use of convolutional neural networks has drastically improved the learning capabilities of computer vision applications, which can now directly consider raw data without any prior feature extraction. Advanced methods in the machine learning field, such as adaptive momentum algorithms or dropout regularization, have dramatically improved the predictive ability of convolutional neural networks, which now outperforms that of conventional fully connected neural networks. This work summarizes, in an intentionally didactic way, the main aspects of these cutting-edge techniques from a medical imaging perspective.

Distributed training and scalability for the particle clustering method UCluster

EPJ Web of Conferences ◽

10.1051/epjconf/202125102054 ◽

2021 ◽

Vol 251 ◽

pp. 02054

Author(s):

Olga Sunneborn Gudnadottir ◽

Daniel Gedon ◽

Colin Desmarais ◽

Karl Bengtsson Bernander ◽

Raazesh Sainudiin ◽

...

Keyword(s):

Particle Physics ◽

Hadron Collider ◽

Large Data ◽

Large Data Sets ◽

Data Sets ◽

Training Time ◽

Distributed Training ◽

Machine Learning Methods ◽

Multi Class Classification

In recent years, machine-learning methods have become increasingly important for the experiments at the Large Hadron Collider (LHC). They are utilised in everything from trigger systems to reconstruction and data analysis. The recent UCluster method is a general model providing unsupervised clustering of particle physics data that can be easily modified to provide solutions for a variety of different decision problems. In the current paper, we improve on the UCluster method by adding the option of training the model in a scalable and distributed fashion, thereby extending its utility to learn from arbitrarily large data sets. UCluster combines a graph-based neural network called ABCnet with a clustering step, using a combined loss function in the training phase. The original code is publicly available in TensorFlow v1.14 and has previously been trained on a single GPU. It shows a clustering accuracy of 81% when applied to the problem of multi-class classification of simulated jet events. Our implementation adds the distributed training functionality by utilising the Horovod distributed training framework, which necessitated a migration of the code to TensorFlow v2. Together with the use of parquet files for splitting data up between different compute nodes, the distributed training makes the model scalable to any amount of input data, something that will be essential for use with real LHC data sets. We find that the model is well suited for distributed training, with the training time decreasing in direct relation to the number of GPUs used. However, further improvements through a more exhaustive and possibly distributed hyper-parameter search are required in order to achieve the reported accuracy of the original UCluster method.

Deep learning based DNA:RNA triplex forming potential prediction

BMC Bioinformatics ◽

10.1186/s12859-020-03864-0 ◽

2020 ◽

Vol 21 (1) ◽

Author(s):

Yu Zhang ◽

Yahui Long ◽

Chee Keong Kwoh

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Cross Validation ◽

Roc Curves ◽

Triplex Dna ◽

Triplex Formation ◽

Machine Learning Model ◽

Non Coding Rnas ◽

High Level ◽

Integrated Program

Background: Long non-coding RNAs (lncRNAs) can exert functions by forming triplexes with DNA. The current methods for predicting triplex formation mainly rely on mathematical statistics based on the base-pairing rules. However, these methods have two main limitations: (1) they identify a large number of triplex-forming lncRNAs, but the limited number of experimentally verified triplex-forming lncRNAs indicates that perhaps not all of them can form triplexes in practice, and (2) their predictions only consider the theoretical relationship and lack features from the experimentally verified data. Results: In this work, we develop an integrated program named TriplexFPP (Triplex Forming Potential Prediction), which is the first machine learning model for DNA:RNA triplex prediction. TriplexFPP predicts the most likely triplex-forming lncRNAs and DNA sites based on the experimentally verified data, where the high-level features are learned by convolutional neural networks. In the fivefold cross validation, the average values of the area under the ROC and PRC curves for the redundancy-removed triplex-forming lncRNA dataset with threshold 0.8 are 0.9649 and 0.9996, and these two values for triplex DNA site prediction are 0.8705 and 0.9671, respectively. We also briefly summarize the cis and trans targeting of triplex-forming lncRNAs. Conclusions: TriplexFPP is able to predict the most likely triplex-forming lncRNAs from all the lncRNAs with computationally defined triplex-forming capacities, as well as the potential of a DNA site to form a triplex. It may provide insights into the exploration of lncRNA functions.
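For readers unfamiliar with the area under the ROC curve quoted above, a minimal sketch follows. It uses the standard pairwise (Mann-Whitney) interpretation: the probability that a randomly chosen positive example scores higher than a randomly chosen negative one, with ties counted as one half. The function name and toy data are assumptions for illustration and are unrelated to the TriplexFPP code or datasets.

```python
def roc_auc(scores, labels):
    """Area under the ROC curve via pairwise comparison of scores.

    labels: 1 for positive examples, 0 for negative examples.
    Runs in O(n_pos * n_neg) time, which is fine for small illustrative inputs.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a win
    return wins / (len(pos) * len(neg))


# Toy scores and labels (hypothetical values).
scores = [0.9, 0.8, 0.4, 0.7, 0.2, 0.1]
labels = [1, 1, 1, 0, 0, 0]
print(f"AUC = {roc_auc(scores, labels):.4f}")
```

In a fivefold cross validation such as the one described, this quantity would be computed once per held-out fold and then averaged.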

A Novel Method for Sea-Land Clutter Separation Using Regularized Randomized and Kernel Ridge Neural Networks

Sensors ◽

10.3390/s20226491 ◽

2020 ◽

Vol 20 (22) ◽

pp. 6491

Author(s):

Le Zhang ◽

Jeyan Thiyagalingam ◽

Anke Xue ◽

Shuwen Xu

Keyword(s):

Neural Network ◽

Machine Learning ◽

Neural Networks ◽

Classification Accuracy ◽

Signal Amplitude ◽

Statistical Characteristics ◽

Support Vector ◽

Amplitude Change ◽

Novel Method

Classification of clutter, especially in the context of shore-based radars, plays a crucial role in several applications. However, the task of distinguishing and classifying sea clutter from land clutter has historically been performed using clutter models and/or coastal maps. In this paper, we propose two machine learning approaches based on neural networks for sea-land clutter separation, namely the regularized randomized neural network (RRNN) and the kernel ridge regression neural network (KRR). We use a number of features, such as energy variation, discrete signal amplitude change frequency, autocorrelation performance, and other statistical characteristics of the respective clutter distributions, to improve the performance of the classification. Our evaluation, based on a unique mixed dataset comprising partially synthetic clutter data for land and real clutter data from sea, shows improved classification accuracy. More specifically, the RRNN and KRR methods offer 98.50% and 98.75% accuracy, outperforming the conventional support vector machine and extreme learning based solutions.
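For orientation, the standard textbook form of kernel ridge regression is given below. It is a general formulation under assumed notation (training inputs x_i, targets y, kernel function k, regularization strength λ > 0) and is not necessarily the exact variant used by the authors.

```latex
\hat{\boldsymbol{\alpha}} = (\mathbf{K} + \lambda \mathbf{I})^{-1}\,\mathbf{y},
\qquad
K_{ij} = k(\mathbf{x}_i, \mathbf{x}_j),
\qquad
\hat{f}(\mathbf{x}) = \sum_{i=1}^{n} \hat{\alpha}_i \, k(\mathbf{x}_i, \mathbf{x})
```

For a two-class problem such as sea versus land clutter, one common choice is to encode the classes as target values of +1 and -1 and to classify a new sample by the sign of the predicted value.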

Scientists and Students from Hong Kong Join a Top Particle Physics Experiment

Asia Pacific Physics Newsletter ◽

10.1142/s2251158x14000289 ◽

2014 ◽

Vol 03 (02) ◽

pp. 23-24

Keyword(s):

Hong Kong ◽

Higgs Boson ◽

Particle Physics ◽

Hadron Collider ◽

New Physics ◽

Particle Accelerator ◽

Atlas Collaboration ◽

Particle Detectors ◽

The World ◽

Physics Experiment

A team of physicists from Hong Kong has now formally joined one of the most prestigious physics experiments in the world. Following a unanimous vote of approval today by its Collaboration Board, ATLAS has admitted the Hong Kong team as a member. The ATLAS Collaboration operates one of the largest particle detectors in the world, located at the Large Hadron Collider (LHC), the world's highest energy particle accelerator at CERN, Switzerland. In 2012, the ATLAS team, along with the CMS Collaboration, co-discovered the Higgs boson, the so-called 'God Particle'. The gigantic but sensitive and precise ATLAS detector, together with the unprecedentedly high collision energy and luminosity of the LHC, makes it possible to search for fundamentally new physics, such as dark matter, hidden extra dimensions, and supersymmetry (a proposed symmetry among elementary particles). The LHC is currently undergoing an upgrade, targeting a substantial increase in beam energy and intensity in a year's time. It is widely expected that the discovery of the Higgs boson is only the beginning of an era of new breakthroughs in fundamental physics. All these exciting opportunities are now open to scientists and students from Hong Kong.

Neural networks for classification of strokes in electrical impedance tomography on a 3D head model

Mathematics in Engineering ◽

10.3934/mine.2022029 ◽

2022 ◽

Vol 4 (4) ◽

pp. 1-22

Author(s):

Valentina Candiani ◽

Matteo Santacesaria

Keyword(s):

Neural Network ◽

Neural Networks ◽

Electrical Impedance Tomography ◽

Electrical Impedance ◽

Network Architectures ◽

Impedance Tomography ◽

Average Accuracy ◽

Neural Network Architectures ◽

Fully Connected

We consider the problem of the detection of brain hemorrhages from three-dimensional (3D) electrical impedance tomography (EIT) measurements. This is a condition requiring urgent treatment for which EIT might provide a portable and quick diagnosis. We employ two neural network architectures, a fully connected one and a convolutional one, for the classification of hemorrhagic and ischemic strokes. The networks are trained on a dataset with 40,000 samples of synthetic electrode measurements generated with the complete electrode model on realistic heads with a 3-layer structure. We consider changes in head anatomy and layers, electrode position, measurement noise and conductivity values. We then test the networks on several datasets of unseen EIT data, with more complex stroke modeling (different shapes and volumes), higher levels of noise and different amounts of electrode misplacement. On most test datasets we achieve ≥ 90% average accuracy with fully connected neural networks, while the convolutional ones display an average accuracy ≥ 80%. Despite the use of simple neural network architectures, the results obtained are very promising and motivate the applications of EIT-based classification methods on real phantoms and ultimately on human patients.

Hazards of data leakage in machine learning: a study on classification of breast cancer using deep neural networks

Medical Imaging 2020: Computer-Aided Diagnosis ◽

10.1117/12.2549313 ◽

2020 ◽

Author(s):

Ravi K. Samala ◽

Heang-Ping Chan ◽

Lubomir Hadjiiski ◽

Sathvik Koneru

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Neural Networks ◽

Deep Neural Networks

A Novel Architecture to Classify Histopathology Images Using Convolutional Neural Networks

Applied Sciences ◽

10.3390/app10082929 ◽

2020 ◽

Vol 10 (8) ◽

pp. 2929 ◽

Cited By ~ 2

Author(s):

Ibrahem Kandel ◽

Mauro Castelli

Keyword(s):

Neural Network ◽

Neural Networks ◽

State Of The Art ◽

Treatment Plan ◽

Tissue Structure ◽

Activation Functions ◽

Proposed Model ◽

Histopathology Images ◽

Fully Connected

Histopathology is the study of tissue structure under the microscope to determine whether the cells are normal or abnormal. Histopathology is a very important exam that is used to determine the patient's treatment plan. The classification of histopathology images is very difficult even for an experienced pathologist, and a second opinion is often needed. Convolutional neural networks (CNNs), a particular type of deep learning architecture, have obtained outstanding results in computer vision tasks such as image classification. In this paper, we propose a novel CNN architecture to classify histopathology images. The proposed model consists of 15 convolution layers and two fully connected layers. A comparison between different activation functions was performed to detect the most efficient one, taking into account two different optimizers. To train and evaluate the proposed model, the publicly available PatchCamelyon dataset was used. The dataset consists of 220,000 annotated images for training and 57,000 unannotated images for testing. The proposed model achieved higher performance compared to the state-of-the-art architectures, with an AUC of 95.46%.
