Automated Labeling for Robotic Autonomous Navigation Through Multi-Sensory Semi-Supervised Learning on Big Data

This chapter presents some popular graph-based semi-supervised approaches. These techniques apply to classification and regression problems and can be extended to big data problems using recently developed anchor graph enhancements. The background necessary for understanding this Chapter includes linear algebra and optimization. No prior knowledge in methods of machine learning is necessary. An empirical demonstration of the techniques for these methods is also provided on real data set benchmarks.

Download Full-text

Supervised Learning of Natural-Terrain Traversability with Synthetic 3D Laser Scans

Applied Sciences ◽

10.3390/app10031140 ◽

2020 ◽

Vol 10 (3) ◽

pp. 1140 ◽

Cited By ~ 4

Author(s):

Jorge L. Martínez ◽

Mariano Morán ◽

Jesús Morales ◽

Alfredo Robles ◽

Manuel Sánchez

Keyword(s):

Supervised Learning ◽

Autonomous Navigation ◽

Three Dimensional ◽

Point Clouds ◽

Natural Environments ◽

Good Prediction ◽

Ground Vehicles ◽

3D Point Clouds ◽

Natural Terrain ◽

Robotic Simulator

Autonomous navigation of ground vehicles on natural environments requires looking for traversable terrain continuously. This paper develops traversability classifiers for the three-dimensional (3D) point clouds acquired by the mobile robot Andabata on non-slippery solid ground. To this end, different supervised learning techniques from the Python library Scikit-learn are employed. Training and validation are performed with synthetic 3D laser scans that were labelled point by point automatically with the robotic simulator Gazebo. Good prediction results are obtained for most of the developed classifiers, which have also been tested successfully on real 3D laser scans acquired by Andabata in motion.

Download Full-text

Regularized Instance Embedding for Deep Multi-Instance Learning

Applied Sciences ◽

10.3390/app10010064 ◽

2019 ◽

Vol 10 (1) ◽

pp. 64

Author(s):

Yi Lin ◽

Honggang Zhang

Keyword(s):

Neural Network ◽

Big Data ◽

Supervised Learning ◽

Regularization Method ◽

Gradient Descent ◽

State Of The Art ◽

Stochastic Gradient Descent ◽

Learning Framework ◽

Weakly Supervised ◽

The Cost

In the era of Big Data, multi-instance learning, as a weakly supervised learning framework, has various applications since it is helpful to reduce the cost of the data-labeling process. Due to this weakly supervised setting, learning effective instance representation/embedding is challenging. To address this issue, we propose an instance-embedding regularizer that can boost the performance of both instance- and bag-embedding learning in a unified fashion. Specifically, the crux of the instance-embedding regularizer is to maximize correlation between instance-embedding and underlying instance-label similarities. The embedding-learning framework was implemented using a neural network and optimized in an end-to-end manner using stochastic gradient descent. In experiments, various applications were studied, and the results show that the proposed instance-embedding-regularization method is highly effective, having state-of-the-art performance.

Download Full-text

Fraud detection in big data using supervised and semi-supervised learning techniques

2017 IEEE Colombian Conference on Communications and Computing (COLCOM) ◽

10.1109/colcomcon.2017.8088206 ◽

2017 ◽

Cited By ~ 5

Author(s):

German E. Melo-Acosta ◽

Freddy Duitama-Munoz ◽

Julian D. Arias-Londono

Keyword(s):

Big Data ◽

Supervised Learning ◽

Fraud Detection ◽

Learning Techniques

Download Full-text

Noise Removal Process from Label Classification using Machine Learning

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c3920.098319 ◽

2019 ◽

Vol 8 (3) ◽

pp. 172-175

Keyword(s):

Machine Learning ◽

Big Data ◽

Supervised Learning ◽

Noise Removal ◽

Error Rates ◽

Training Data ◽

Learning Performance ◽

Training Dataset ◽

Noise Filtering ◽

Label Noise

Text classification and clustering approach is essential for big data environments. In supervised learning applications many classification algorithms have been proposed. In the era of big data, a large volume of training data is available in many machine learning works. However, there is a possibility of mislabeled or unlabeled data that are not labeled properly. Some labels may be incorrect resulted in label noise which in turn regress learning performance of a classifier. A general approach to address label noise is to apply noise filtering techniques to identify and remove noise before learning. A range of noise filtering approaches have been developed to improve the classifiers performance. This paper proposes noise filtering approach in text data during the training phase. Many supervised learning algorithms generates high error rates due to noise in training dataset, our work eliminates such noise and provides accurate classification system.

Download Full-text

A Supervised Learning Algorithm to Forecast Weather Conditions for Playing Cricket

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.a4528.119119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 1560-1565

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Cloud Computing ◽

Big Data ◽

Internet Of Things ◽

Supervised Learning ◽

Learning Algorithm ◽

Weather Conditions ◽

Redundant Data ◽

Classification Technique

Now days, Machine learning is considered as the key technique in the field of technologies, such as, Internet of things (IOT), Cloud computing, Big data and Artificial Intelligence etc. As technology enhances, lots of incorrect and redundant data are collected from these fields. To make use of these data for a meaningful purpose, we have to apply mining or classification technique in the real world. In this paper, we have proposed two nobel approaches towards data classification by using supervised learning algorithm

Download Full-text

SMOTE for Learning from Imbalanced Data: Progress and Challenges, Marking the 15-year Anniversary

Journal of Artificial Intelligence Research ◽

10.1613/jair.1.11192 ◽

2018 ◽

Vol 61 ◽

pp. 863-905 ◽

Cited By ~ 150

Author(s):

Alberto Fernandez ◽

Salvador Garcia ◽

Francisco Herrera ◽

Nitesh V. Chawla

Keyword(s):

Big Data ◽

Open Source ◽

Supervised Learning ◽

Incremental Learning ◽

Class Imbalance ◽

Imbalanced Data ◽

Multilabel Classification ◽

Current State ◽

Software Packages ◽

State Of Affairs

The Synthetic Minority Oversampling Technique (SMOTE) preprocessing algorithm is considered "de facto" standard in the framework of learning from imbalanced data. This is due to its simplicity in the design of the procedure, as well as its robustness when applied to different type of problems. Since its publication in 2002, SMOTE has proven successful in a variety of applications from several different domains. SMOTE has also inspired several approaches to counter the issue of class imbalance, and has also significantly contributed to new supervised learning paradigms, including multilabel classification, incremental learning, semi-supervised learning, multi-instance learning, among others. It is standard benchmark for learning from imbalanced data. It is also featured in a number of different software packages - from open source to commercial. In this paper, marking the fifteen year anniversary of SMOTE, we reflect on the SMOTE journey, discuss the current state of affairs with SMOTE, its applications, and also identify the next set of challenges to extend SMOTE for Big Data problems.

Download Full-text