A Survey of On-Device Machine Learning

Sauptik Dhar; Junyao Guo; Jiayi (Jason) Liu; Samarth Tripathi; Unmesh Kurup; Mohak Shah

doi:10.1145/3450494

ACM Transactions on Internet of Things ◽

10.1145/3450494 ◽

2021 ◽

Vol 2 (3) ◽

pp. 1-49

Author(s):

Sauptik Dhar ◽

Junyao Guo ◽

Jiayi (Jason) Liu ◽

Samarth Tripathi ◽

Unmesh Kurup ◽

...

Keyword(s):

Machine Learning ◽

State Of The Art ◽

The State ◽

Smart Devices ◽

Model Adaptation ◽

Middle Ground ◽

Research Areas ◽

Comprehensive Survey ◽

Constrained Learning ◽

Model Training

The predominant paradigm for using machine learning models on a device is to train a model in the cloud and perform inference using the trained model on the device. However, with increasing numbers of smart devices and improved hardware, there is interest in performing model training on the device. Given this surge in interest, a comprehensive survey of the field from a device-agnostic perspective sets the stage for both understanding the state of the art and for identifying open challenges and future avenues of research. On-device learning is, however, an expansive field with connections to a large number of related topics in AI and machine learning (including online learning, model adaptation, one/few-shot learning, etc.), so covering all of them in a single survey is impractical. This survey finds a middle ground by reformulating the problem of on-device learning as resource-constrained learning, where the resources are compute and memory. This reformulation allows tools, techniques, and algorithms from a wide variety of research areas to be compared equitably. In addition to summarizing the state of the art, the survey also identifies a number of challenges and next steps for both the algorithmic and theoretical aspects of on-device learning.

Assuring the Machine Learning Lifecycle

ACM Computing Surveys ◽

10.1145/3453444 ◽

2021 ◽

Vol 54 (5) ◽

pp. 1-39

Author(s):

Rob Ashmore ◽

Radu Calinescu ◽

Colin Paterson

Keyword(s):

Machine Learning ◽

Iterative Process ◽

State Of The Art ◽

The State ◽

Enabling Technology ◽

Safety Critical ◽

Wide Range ◽

Comprehensive Survey ◽

Intended Use

Machine learning has evolved into an enabling technology for a wide range of highly successful applications. The potential for this success to continue and accelerate has placed machine learning (ML) at the top of research, economic, and political agendas. Such unprecedented interest is fuelled by a vision of ML applicability extending to healthcare, transportation, defence, and other domains of great societal importance. Achieving this vision requires the use of ML in safety-critical applications that demand levels of assurance beyond those needed for current ML applications. Our article provides a comprehensive survey of the state of the art in the assurance of ML, i.e., in the generation of evidence that ML is sufficiently safe for its intended use. The survey covers the methods capable of providing such evidence at different stages of the machine learning lifecycle, i.e., of the complex, iterative process that starts with the collection of the data used to train an ML component for a system, and ends with the deployment of that component within the system. The article begins with a systematic presentation of the ML lifecycle and its stages. We then define assurance desiderata for each stage, review existing methods that contribute to achieving these desiderata, and identify open challenges that require further research.

Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences

BMC Medical Informatics and Decision Making ◽

10.1186/s12911-020-01201-2 ◽

2020 ◽

Vol 20 (1) ◽

Cited By ~ 1

Author(s):

Julian Hatwell ◽

Mohamed Medhat Gaber ◽

R. Muhammad Atif Azad

Keyword(s):

Machine Learning ◽

State Of The Art ◽

Statistical Tests ◽

Black Box ◽

The State ◽

Related Data ◽

Research Areas ◽

Novel Approach ◽

Tightly Coupled ◽

Novel Algorithm

Background: Computer Aided Diagnostics (CAD) can support medical practitioners in making critical decisions about their patients’ disease conditions. Practitioners require access to the chain of reasoning behind CAD to build trust in the CAD advice and to supplement their own expertise. Yet, CAD systems might be based on black box machine learning models and high dimensional data sources such as electronic health records, magnetic resonance imaging scans, cardiotocograms, etc. These foundations make interpretation and explanation of the CAD advice very challenging. This challenge is recognised throughout the machine learning research community. eXplainable Artificial Intelligence (XAI) is emerging as one of the most important research areas of recent years because it addresses the interpretability and trust concerns of critical decision makers, including those in clinical and medical practice.
Methods: In this work, we focus on AdaBoost, a black box model that has been widely adopted in the CAD literature. We address the challenge – to explain AdaBoost classification – with a novel algorithm that extracts simple, logical rules from AdaBoost models. Our algorithm, Adaptive-Weighted High Importance Path Snippets (Ada-WHIPS), makes use of AdaBoost’s adaptive classifier weights. Using a novel formulation, Ada-WHIPS uniquely redistributes the weights among individual decision nodes of the internal decision trees of the AdaBoost model. Then, a simple heuristic search of the weighted nodes finds a single rule that dominated the model’s decision. We compare the explanations generated by our novel approach with the state of the art in an experimental study. We evaluate the derived explanations with simple statistical tests of well-known quality measures, precision and coverage, and a novel measure, stability, that is better suited to the XAI setting.
Results: Experiments on 9 CAD-related data sets showed that Ada-WHIPS explanations consistently generalise better (mean coverage 15%-68%) than the state of the art while remaining competitive for specificity (mean precision 80%-99%). A very small trade-off in specificity is shown to guard against over-fitting, which is a known problem in state-of-the-art methods.
Conclusions: The experimental results demonstrate the benefits of using our novel algorithm for explaining CAD AdaBoost classifiers widely found in the literature. Our tightly coupled, AdaBoost-specific approach outperforms model-agnostic explanation methods and should be considered by practitioners looking for an XAI solution for this class of models.
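The precision and coverage measures named in the abstract above can be illustrated with a small sketch. It assumes the definitions commonly used for rule-based explanations (coverage: the share of instances a rule applies to; precision: the share of covered instances for which the model predicts the rule's target class); the abstract does not spell these out, and the novel stability measure is not shown. All names and the toy data below are hypothetical and are not taken from the paper.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical representation: an instance is a feature dict, and a rule is a
# conjunction of boolean conditions over that dict.
Instance = Dict[str, float]
Rule = List[Callable[[Instance], bool]]


def rule_covers(rule: Rule, x: Instance) -> bool:
    """Return True when every condition of the rule holds for instance x."""
    return all(condition(x) for condition in rule)


def coverage_and_precision(
    rule: Rule,
    target_class: int,
    instances: Sequence[Instance],
    model_predictions: Sequence[int],
) -> Tuple[float, float]:
    """Compute (coverage, precision) of a rule against a model's predictions.

    Assumed definitions (not taken from the paper):
      coverage  = covered instances / all instances
      precision = covered instances predicted as target_class / covered instances
    """
    covered = [
        prediction
        for x, prediction in zip(instances, model_predictions)
        if rule_covers(rule, x)
    ]
    coverage = len(covered) / len(instances) if instances else 0.0
    matching = sum(1 for prediction in covered if prediction == target_class)
    precision = matching / len(covered) if covered else 0.0
    return coverage, precision


# Toy data, invented purely for illustration.
toy_instances = [
    {"age": 70, "bp": 150},
    {"age": 65, "bp": 120},
    {"age": 40, "bp": 160},
    {"age": 80, "bp": 145},
]
toy_predictions = [1, 0, 1, 1]  # stand-in for a black box model's outputs
toy_rule: Rule = [lambda x: x["age"] > 60, lambda x: x["bp"] > 140]

toy_coverage, toy_precision = coverage_and_precision(
    toy_rule, 1, toy_instances, toy_predictions
)
```

On this toy data the rule covers two of the four instances and the model predicts class 1 for both, so the coverage is 0.5 and the precision is 1.0. A more specific rule usually raises precision at the cost of coverage, which is the trade-off the Results paragraph refers to.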

Ada-WHIPS: Explaining AdaBoost Classification with Applications in the Health Sciences

10.21203/rs.2.19113/v5 ◽

2020 ◽

Author(s):

Julian Hatwell ◽

Mohamed Medhat Gaber ◽

R.M. Atif Azad

Keyword(s):

Machine Learning ◽

State Of The Art ◽

Statistical Tests ◽

Black Box ◽

The State ◽

Related Data ◽

Research Areas ◽

Novel Approach ◽

Tightly Coupled ◽

Novel Algorithm

Background: Computer Aided Diagnostics (CAD) can support medical practitioners in making critical decisions about their patients' disease conditions. Practitioners require access to the chain of reasoning behind CAD to build trust in the CAD advice and to supplement their own expertise. Yet, CAD systems might be based on black box machine learning (ML) models and high dimensional data sources (electronic health records, MRI scans, cardiotocograms, etc.). These foundations make interpretation and explanation of the CAD advice very challenging. This challenge is recognised throughout the machine learning research community. eXplainable Artificial Intelligence (XAI) is emerging as one of the most important research areas of recent years because it addresses the interpretability and trust concerns of critical decision makers, including those in clinical and medical practice.
Methods: In this work, we focus on AdaBoost, a black box ML model that has been widely adopted in the CAD literature. We address the challenge -- to explain AdaBoost classification -- with a novel algorithm that extracts simple, logical rules from AdaBoost models. Our algorithm, Adaptive-Weighted High Importance Path Snippets (Ada-WHIPS), makes use of AdaBoost's adaptive classifier weights. Using a novel formulation, Ada-WHIPS uniquely redistributes the weights among individual decision nodes of the internal decision trees (DT) of the AdaBoost model. Then, a simple heuristic search of the weighted nodes finds a single rule that dominated the model's decision. We compare the explanations generated by our novel approach with the state of the art in an experimental study. We evaluate the derived explanations with simple statistical tests of well-known quality measures, precision and coverage, and a novel measure, stability, that is better suited to the XAI setting.
Results: Experiments on 9 CAD-related data sets showed that Ada-WHIPS explanations consistently generalise better (mean coverage 15%-68%) than the state of the art while remaining competitive for specificity (mean precision 80%-99%). A very small trade-off in specificity is shown to guard against over-fitting, which is a known problem in state-of-the-art methods.
Conclusions: The experimental results demonstrate the benefits of using our novel algorithm for explaining CAD AdaBoost classifiers widely found in the literature. Our tightly coupled, AdaBoost-specific approach outperforms model-agnostic explanation methods and should be considered by practitioners looking for an XAI solution for this class of models.

A Systematic Review of Recommender Systems and Their Applications in Cybersecurity

Sensors ◽

10.3390/s21155248 ◽

2021 ◽

Vol 21 (15) ◽

pp. 5248

Author(s):

Aleksandra Pawlicka ◽

Marek Pawlicki ◽

Rafał Kozik ◽

Ryszard S. Choraś

Keyword(s):

Systematic Review ◽

Recommender Systems ◽

Recommender System ◽

State Of The Art ◽

The State ◽

Advantages And Disadvantages ◽

Comprehensive Survey ◽

Security Concerns ◽

Valuable Role

This paper discusses the valuable role recommender systems may play in cybersecurity. First, it gives a comprehensive overview of recommender system types, together with their advantages and disadvantages, possible applications, and security concerns. Then, the paper collects and presents the state of the art concerning the use of recommender systems in cybersecurity, covering both existing solutions and future ideas. The contribution of this paper is two-fold. First, to the best of our knowledge, there has been no prior work collecting the applications of recommenders for cybersecurity. Second, this paper attempts to complete a comprehensive survey of recommender types, after noticing that other works usually mention only two or three types and neglect the others.

Artificial intelligence and machine learning in design of mechanical materials

Materials Horizons ◽

10.1039/d0mh01451f ◽

2021 ◽

Author(s):

Kai Guo ◽

Zhenze Yang ◽

Chi-Hua Yu ◽

Markus J. Buehler

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

State Of The Art ◽

The State

This review revisits the state of the art of research efforts on the design of mechanical materials using machine learning.

The state of the art on buffer allocation problem: a comprehensive survey

Journal of Intelligent Manufacturing ◽

10.1007/s10845-012-0687-9 ◽

2012 ◽

Vol 25 (3) ◽

pp. 371-392 ◽

Cited By ~ 78

Author(s):

Leyla Demir ◽

Semra Tunali ◽

Deniz Tursel Eliiyi

Keyword(s):

State Of The Art ◽

The State ◽

Allocation Problem ◽

Buffer Allocation ◽

Comprehensive Survey

On the state of the art in machine learning: A personal review

Artificial Intelligence ◽

10.1016/s0004-3702(01)00125-4 ◽

2001 ◽

Vol 131 (1-2) ◽

pp. 199-222 ◽

Cited By ~ 27

Author(s):

Peter A. Flach

Keyword(s):

Machine Learning ◽

State Of The Art ◽

The State

Adversarial Machine Learning: A Multi-Layer Review of the State-of-the-Art and Challenges for Wireless and Mobile Systems

IEEE Communications Surveys & Tutorials ◽

10.1109/comst.2021.3136132 ◽

2021 ◽

pp. 1-1

Author(s):

Jinxin Liu ◽

Michele Nogueira ◽

Johan Fernandes ◽

Burak Kantarci

Keyword(s):

Machine Learning ◽

State Of The Art ◽

The State ◽

Mobile Systems

A Graph-based Evolutionary Algorithm for Automated Machine Learning

10.37686/ser.v1i2.77 ◽

2020 ◽

Author(s):

Fei Qi ◽

Zhaohui Xia ◽

Gaoyang Tang ◽

Hang Yang ◽

Yu Song ◽

...

Keyword(s):

Machine Learning ◽

Evolutionary Algorithm ◽

Parameter Optimization ◽

State Of The Art ◽

The State ◽

Complex Structures ◽

Architecture Evolution ◽

Automated Machine Learning ◽

Art Performance

As an emerging field, Automated Machine Learning (AutoML) aims to reduce or eliminate manual operations that require expertise in machine learning. In this paper, a graph-based architecture is employed to represent flexible combinations of ML models, which provides a larger search space than tree-based and stacking-based architectures. Based on this, an evolutionary algorithm is proposed to search for the best architecture, where the mutation and heredity operators are the key to architecture evolution. With Bayesian hyper-parameter optimization, the proposed approach can automate the workflow of machine learning. On the PMLB dataset, the proposed approach achieves state-of-the-art performance compared with TPOT, Autostacker, and auto-sklearn. Some of the optimized models have complex structures that are difficult to obtain by manual design.

A Network Parameter Database False Data Injection Correction Physics-Based Model: A Machine Learning Synthetic Measurement-Based Approach

Applied Sciences ◽

10.3390/app11178074 ◽

2021 ◽

Vol 11 (17) ◽

pp. 8074

Author(s):

Tierui Zou ◽

Nader Aljohani ◽

Keerthiraj Nagaraj ◽

Sheng Zou ◽

Cody Ruben ◽

...

Keyword(s):

Machine Learning ◽

Power Systems ◽

State Of The Art ◽

Real Life ◽

The State ◽

Wide Area ◽

Network Parameter ◽

False Data Injection ◽

Network Parameters ◽

Injection Attacks

In real-time monitoring of the cyber–physical security of power systems, false data injection attacks on wide-area measurements are of major concern. However, the database of the network parameters is just as crucial to the state estimation process. Maintaining the accuracy of the system model is the other part of the equation, since almost all applications in power systems depend heavily on the state estimator outputs. While much effort has been devoted to false data injection attacks on measurements, little work has been reported on the broader theme of false data injection on the database of network parameters. State-of-the-art physics-based model solutions correct false data injection on the network parameter database considering only the available wide-area measurements. In addition, deterministic models are used for correction. In this paper, an overdetermined physics-based parameter false data injection correction model is presented. The overdetermined model uses a parameter database correction Jacobian matrix and a Taylor series expansion approximation. The method further applies the concept of synthetic measurements, which refers to measurements that do not exist in the real-life system. A machine learning linear regression-based model for measurement prediction is integrated in the framework by deriving weights for the creation of synthetic measurements. Validation of the presented model is performed on the IEEE 118-bus system. Numerical results show that the approximation error is lower than that of the state of the art, while providing robustness to the correction process. The model is easy to implement on top of the classical weighted-least-squares solution, which highlights its potential for real-life implementation.
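For orientation, the classical weighted-least-squares state estimation that the abstract above builds on can be written in its standard textbook form. The symbols are assumptions introduced here for illustration and do not come from the abstract: $z$ is the measurement vector, $h(x)$ the measurement function of the state $x$, $W$ the diagonal weight matrix (typically inverse measurement variances), and $H$ the measurement Jacobian. The paper's parameter database correction Jacobian and its synthetic-measurement weights are not specified in the abstract and are not reproduced here.

```latex
% Weighted-least-squares objective
\hat{x} = \arg\min_{x} \; \bigl[z - h(x)\bigr]^{\top} W \,\bigl[z - h(x)\bigr]

% First-order Taylor expansion of the measurement function around the current estimate x^{(k)}
h\bigl(x^{(k)} + \Delta x\bigr) \approx h\bigl(x^{(k)}\bigr) + H\bigl(x^{(k)}\bigr)\,\Delta x,
\qquad H(x) = \frac{\partial h(x)}{\partial x}

% Resulting Gauss-Newton update (normal equations)
\Delta x^{(k)} = \bigl(H^{\top} W H\bigr)^{-1} H^{\top} W \,\bigl[z - h\bigl(x^{(k)}\bigr)\bigr],
\qquad x^{(k+1)} = x^{(k)} + \Delta x^{(k)}
```

The normal equations have a unique solution only when $H^{\top} W H$ is invertible, which requires at least as many independent measurements as unknowns. This is one reason an overdetermined formulation, with more equations than unknowns, is attractive.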
