Machine Learning Predictions as Regression Covariates

Political Analysis ◽

10.1017/pan.2020.38 ◽

2020 ◽

pp. 1-18

Author(s):

Christian Fong ◽

Matthew Tyler

Keyword(s):

Machine Learning ◽

Prediction Error ◽

Learning Algorithm ◽

Data Sets ◽

Machine Learning Algorithm ◽

Regression Analyses ◽

Political Dialogue ◽

Latent Features ◽

Text Images ◽

True Values

Abstract: In text, images, merged surveys, voter files, and elsewhere, data sets are often missing important covariates, either because they are latent features of observations (such as sentiment in text) or because they are not collected (such as race in voter files). One promising approach for coping with this missing data is to find the true values of the missing covariates for a subset of the observations and then train a machine learning algorithm to predict the values of those covariates for the rest. However, plugging in these predictions without regard for prediction error renders regression analyses biased, inconsistent, and overconfident. We characterize the severity of the problem posed by prediction error, describe a procedure to avoid these inconsistencies under comparatively general assumptions, and demonstrate the performance of our estimators through simulations and a study of hostile political dialogue on the Internet. We provide software implementing our approach.
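
The bias described in this abstract can be illustrated with a small simulation. The sketch below is not the authors' estimator or software; it is a minimal illustration under assumed settings (a binary covariate, a hypothetical classifier that flips 20% of labels at random, and a true slope of 2.0, all chosen for this example). It compares the ordinary least squares slope obtained with the true covariate against the slope obtained by plugging in the noisy predictions.

```python
import random


def ols_slope(x, y):
    """Slope of a simple least-squares regression of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    return cov_xy / var_x


def simulate(n=20000, beta=2.0, error_rate=0.2, seed=0):
    """Return (slope using true covariate, slope using predicted covariate).

    All parameter values are illustrative assumptions, not taken from the paper.
    """
    rng = random.Random(seed)
    # True binary covariate, e.g. a hand-coded label.
    x_true = [1 if rng.random() < 0.5 else 0 for _ in range(n)]
    # Outcome depends on the TRUE covariate plus Gaussian noise.
    y = [1.0 + beta * x + rng.gauss(0.0, 1.0) for x in x_true]
    # Hypothetical classifier output: the true label, flipped with
    # probability error_rate, independently of everything else.
    x_pred = [1 - x if rng.random() < error_rate else x for x in x_true]
    return ols_slope(x_true, y), ols_slope(x_pred, y)


slope_true, slope_plugin = simulate()
print(f"slope with true covariate:      {slope_true:.3f}")
print(f"slope with predicted covariate: {slope_plugin:.3f}")
```

Under these assumptions the plug-in slope is pulled toward zero: with balanced classes and symmetric random flips, the expected attenuation factor is 1 - 2 × 0.2 = 0.6, so the plug-in estimate lands near 1.2 rather than 2.0. Real classifiers usually make errors that are correlated with other variables, in which case the bias need not take this simple form.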

Probabilistic Random Forest: A Machine Learning Algorithm for Noisy Data Sets

The Astronomical Journal ◽

10.3847/1538-3881/aaf101 ◽

2018 ◽

Vol 157 (1) ◽

pp. 16 ◽

Cited By ~ 7

Author(s):

Itamar Reis ◽

Dalya Baron ◽

Sahar Shahaf

Keyword(s):

Machine Learning ◽

Random Forest ◽

Learning Algorithm ◽

Noisy Data ◽

Data Sets ◽

Machine Learning Algorithm

Weather Prediction using Machine Learning and IOT

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.d9130.049420 ◽

2020 ◽

Vol 9 (4) ◽

pp. 2094-2098

Keyword(s):

Machine Learning ◽

Weather Forecasting ◽

Learning Algorithm ◽

Weather Prediction ◽

Weather Conditions ◽

Data Sets ◽

Machine Learning Algorithm ◽

Time Data ◽

Weather Parameters ◽

Set Up

This project proposes a method for forecasting weather conditions and predicting rainfall using machine learning. There are two setups: one measures weather parameters such as temperature and humidity using sensors connected to an Arduino, and the other displays the current values (status) and the predicted rainfall based on the trained machine learning data sets. Forecasting and prediction are based on previously collected data sets, which are compared with the current values. The user need not keep a backup of large amounts of data to predict rainfall; a machine learning algorithm suffices. Temperature and humidity sensor modules measure the weather parameters and are interfaced to an Arduino controller. The proposed setup compares the forecast value with real-time data and predicts rainfall based on the data set fed to the machine learning algorithm.

Aislamiento social obligatorio: un análisis de sentimientos mediante machine learning

Suma de Negocios ◽

10.14349/sumneg/2021.v12.n26.a1 ◽

2021 ◽

Vol 12 (26) ◽

pp. 1-13

Author(s):

Carlos Alberto Arango Pastrana ◽

Carlos Fernando Osorio Andrade

Keyword(s):

Machine Learning ◽

Social Network ◽

Social Network Analysis ◽

Network Analysis ◽

Learning Algorithm ◽

Data Sets ◽

Machine Learning Algorithm ◽

Economic Problems ◽

Colombian Government

To reduce the rate of contagion by Covid-19, the Colombian government has opted, among other measures, for mandatory isolation. The measure has divided opinion because, although it helps reduce the spread of the virus, it generates mental and economic problems that are difficult to overcome. The objective of this document was to analyze the underlying sentiments in the Twitter comments related to isolation, identifying the topics and words most frequently used in this context. A machine learning algorithm was built to identify sentiments in 72,564 posts, and a social network analysis was applied to establish the most frequent topics in the data sets. The results suggest that the algorithm is highly accurate in classifying sentiments. Also, as the isolation extends, comments related to the quarantine grow proportionally. Fear was identified as the predominant sentiment throughout the period of confinement in Colombia.

A Novel Machine Learning Algorithm to Reduce Prediction Error and Accelerate Learning Curve for Very Large Datasets

2019 IEEE 49th International Symposium on Multiple-Valued Logic (ISMVL) ◽

10.1109/ismvl.2019.00025 ◽

2019 ◽

Author(s):

Wenjun Hou ◽

Marek Perkowski

Keyword(s):

Machine Learning ◽

Learning Curve ◽

Prediction Error ◽

Learning Algorithm ◽

Large Datasets ◽

Machine Learning Algorithm ◽

Very Large Datasets

Aspect based feature extraction and sentiment classification of review data sets using Incremental machine learning algorithm

2017 Third International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB) ◽

10.1109/aeeicb.2017.7972395 ◽

2017 ◽

Cited By ~ 4

Author(s):

Rajalaxmi Hegde ◽

Seema S.

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Learning Algorithm ◽

Sentiment Classification ◽

Data Sets ◽

Machine Learning Algorithm

Improving forest above ground biomass estimates over Indian forests using multi source data sets with machine learning algorithm

Ecological Informatics ◽

10.1016/j.ecoinf.2021.101392 ◽

2021 ◽

pp. 101392

Author(s):

Rakesh Fararoda ◽

R. Suraj Reddy ◽

G. Rajashekar ◽

T.R. Kiran Chand ◽

C.S. Jha ◽

...

Keyword(s):

Machine Learning ◽

Learning Algorithm ◽

Data Sets ◽

Machine Learning Algorithm ◽

Above Ground Biomass ◽

Ground Biomass ◽

Source Data ◽

Indian Forests

Issues of COVID 19 Screening with Machine Learning Algorithm and Data Sets Availability

10.3233/apc210298 ◽

2021 ◽

Author(s):

G.N. Balaji ◽

S.V. Suryanarayana ◽

P. Vijayaragavan

Keyword(s):

Machine Learning ◽

Learning Algorithm ◽

School Management ◽

Data Sets ◽

Machine Learning Algorithm ◽

Shopping Malls ◽

Railway Stations ◽

Group Participation ◽

The Mathematical Model ◽

Screening Systems

Wearing a mask during the coronavirus outbreak is necessary to efficiently deter the transmission of COVID-19. In these circumstances, traditional facial screening technologies are obsolete for monitoring group entry at airports, shopping malls, railway stations, etc. It is therefore vital to boost the efficiency of screening. This paper addresses the machine learning algorithm for contactless face screening systems in group participation, social interaction, school management, mall entry management, and market resumption scenarios during COVID-19. A method to screen entry with masks is developed using machine learning, which depends on the various face specimens discussed here. The second part of the discussion is that, previously, few masked-face databases were freely accessible. To this end, various forms of masked face data sets are identified, namely MFDD, Real MFRD, and Simulated MFRD. These data sets have become widely accessible to businesses and academics, and specific apps for masked faces may be built on them. The mathematical model is given, along with the code. The availability and issues of the above data sets are discussed for the benefit of researchers.

Photometric selection of quasars in large astronomical data sets with a fast and accurate machine learning algorithm

Monthly Notices of the Royal Astronomical Society ◽

10.1093/mnras/stt2490 ◽

2014 ◽

Vol 439 (1) ◽

pp. 644-650 ◽

Cited By ~ 1

Author(s):

Pramod Gupta ◽

Andrew J. Connolly ◽

Jeffrey P. Gardner

Keyword(s):

Machine Learning ◽

Learning Algorithm ◽

Data Sets ◽

Machine Learning Algorithm ◽

Astronomical Data ◽

Selection Of

Analysis of Machine Learning Algorithm with Road Accidents Data Sets

International Journal of Engineering and Management Research ◽

10.31033/ijemr.10.2.3 ◽

2020 ◽

Vol 10 (02) ◽

pp. 20-25

Author(s):

P Sumanth ◽

P Sai Anudeep ◽

S Divya

Keyword(s):

Machine Learning ◽

Learning Algorithm ◽

Data Sets ◽

Machine Learning Algorithm ◽

Road Accidents

Machine Learning Algorithm to Predict Early Complications after Brain Tumor Surgery

10.1055/s-0038-1660728 ◽

2018 ◽

Author(s):

C.H.B. van Niftrik ◽

F. van der Wouden ◽

V. Staartjes ◽

J. Fierstra ◽

M. Stienen ◽

...

Keyword(s):

Machine Learning ◽

Brain Tumor ◽

Learning Algorithm ◽

Machine Learning Algorithm ◽

Tumor Surgery ◽

Early Complications ◽

Brain Tumor Surgery
