Multiple Linear Regression: Bayesian Inference for Distributed and Big Data in the Medical Informatics Platform of the Human Brain Project

Bought and Sold: Exploring the Effects of Big Data on User Agency and Commodification

10.32920/ryerson.14657883.v1 ◽

2021 ◽

Author(s):

Kristia M. Pavlakos

Keyword(s):

Social Sciences ◽

Big Data ◽

Data Privacy ◽

Large Data ◽

Data Sets ◽

Privacy Regulation ◽

Scholarly Literature ◽

User Interests ◽

The Social ◽

The Relationship

Big Data1is a phenomenon that has been increasingly studied in the academy in recent years, especially in technological and scientific contexts. However, it is still a relatively new field of academic study; because it has been previously considered in mainly technological contexts, more attention needs to be drawn to the contributions made in Big Data scholarship in the social sciences by scholars like Omar Tene and Jules Polonetsky, Bart Custers, Kate Crawford, Nick Couldry, and Jose van Dijk. The purpose of this Major Research Paper is to gain insight into the issues surrounding privacy and user rights, roles, and commodification in relation to Big Data in a social sciences context. The term “Big Data” describes the collection, aggregation, and analysis of large data sets. While corporations are usually responsible for the analysis and dissemination of the data, most of this data is user generated, and there must be considerations regarding the user’s rights and roles. In this paper, I raise three main issues that shape the discussion: how users can be more active agents in data ownership, how consent measures can be made to actively reflect user interests instead of focusing on benefitting corporations, and how user agency can be preserved. Through an analysis of social sciences scholarly literature on Big Data, privacy, and user commodification, I wish to determine how these concepts are being discussed, where there have been advancements in privacy regulation and the prevention of user commodification, and where there is a need to improve these measures. In doing this, I hope to discover a way to better facilitate the relationship between data collectors and analysts, and user-generators. 1 While there is no definitive resolution as to whether or not to capitalize the term “Big Data”, in capitalizing it I chose to conform with such authors as boyd and Crawford (2012), Couldry and Turow (2014), and Dalton and Thatcher (2015), who do so in the scholarly literature.

Download Full-text

Descriptive and Predictive Analytical Methods for Big Data

Web Services ◽

10.4018/978-1-5225-7501-6.ch018 ◽

2019 ◽

pp. 314-331 ◽

Cited By ~ 1

Author(s):

Sema A. Kalaian ◽

Rafa M. Kasim ◽

Nabeel R. Kasim

Keyword(s):

Big Data ◽

Standard Deviation ◽

Linear Regression ◽

Multiple Linear Regression ◽

Knowledge Discovery ◽

Data Visualization ◽

Analytical Methods ◽

Data Analytics ◽

Enterprise Performance ◽

Analytical Tools

Data analytics and modeling are powerful analytical tools for knowledge discovery through examining and capturing the complex and hidden relationships and patterns among the quantitative variables in the existing massive structured Big Data in efforts to predict future enterprise performance. The main purpose of this chapter is to present a conceptual and practical overview of some of the basic and advanced analytical tools for analyzing structured Big Data. The chapter covers descriptive and predictive analytical methods. Descriptive analytical tools such as mean, median, mode, variance, standard deviation, and data visualization methods (e.g., histograms, line charts) are covered. Predictive analytical tools for analyzing Big Data such as correlation, simple- and multiple- linear regression are also covered in the chapter.

Download Full-text

A Detailed Study on Classification Algorithms in Big Data

Big Data Analytics for Sustainable Computing - Advances in Data Mining and Database Management ◽

10.4018/978-1-5225-9750-6.ch002 ◽

2020 ◽

pp. 30-46

Author(s):

Saranya N. ◽

Saravana Selvam

Keyword(s):

Big Data ◽

Random Forest ◽

Linear Regression ◽

Comprehensive Evaluation ◽

Large Data ◽

Large Data Sets ◽

Data Sets ◽

Classification Methods ◽

Computing Science ◽

Data Collections

After an era of managing data collection difficulties, these days the issue has turned into the problem of how to process these vast amounts of information. Scientists, as well as researchers, think that today, probably the most essential topic in computing science is Big Data. Big Data is used to clarify the huge volume of data that could exist in any structure. This makes it difficult for standard controlling approaches for mining the best possible data through such large data sets. Classification in Big Data is a procedure of summing up data sets dependent on various examples. There are distinctive classification frameworks which help us to classify data collections. A few methods that discussed in the chapter are Multi-Layer Perception Linear Regression, C4.5, CART, J48, SVM, ID3, Random Forest, and KNN. The target of this chapter is to provide a comprehensive evaluation of classification methods that are in effect commonly utilized.

Download Full-text

Statistical Modelling of the Capital Asset Pricing Model (CAPM)

Accounting and Finance Research ◽

10.5430/afr.v7n2p146 ◽

2018 ◽

Vol 7 (2) ◽

pp. 146

Author(s):

Silvi Qemo ◽

Eahab Elsaid

Keyword(s):

Linear Regression ◽

Regression Model ◽

Multiple Linear Regression ◽

Linear Regression Model ◽

Multiple Linear Regression Model ◽

Statistical Modelling ◽

Expected Returns ◽

Explanatory Variables ◽

Bid Ask Spread ◽

Model Average

The purpose of this study is to derive a multiple linear regression model of the CAPM. More specifically, to test for other potential explanatory variables that can be added to the basic linear regression model for the expected returns on Apple Inc. The following explanatory variables were examined: share volume, outstanding shares, closing bid/ask spread, high/low spread and average spread. Using daily returns of Apple Inc. stock from 2007 till 2014 we were able to create a multiple linear regression model of CAPM that increase the R2 value from the basic linear regression model and enhances the amount of variability in the returns on an asset. This is an important modification that can help better forecast returns on assets.Keywords: CAPM; multiple linear regression model; average spread; variability in the returns

Download Full-text

Multiple linear regression with correlated explanatory variables and responses

Survey Review ◽

10.1179/1752270615y.0000000006 ◽

2016 ◽

Vol 49 (352) ◽

pp. 1-8 ◽

Cited By ~ 1

Author(s):

B. Li ◽

M. Wang ◽

Y. Yang

Keyword(s):

Linear Regression ◽

Multiple Linear Regression ◽

Explanatory Variables

Download Full-text

Bought and Sold: Exploring the Effects of Big Data on User Agency and Commodification

10.32920/ryerson.14657883 ◽

2021 ◽

Author(s):

Kristia M. Pavlakos

Keyword(s):

Social Sciences ◽

Big Data ◽

Data Privacy ◽

Large Data ◽

Data Sets ◽

Privacy Regulation ◽

Scholarly Literature ◽

User Interests ◽

The Social ◽

The Relationship

Big Data1is a phenomenon that has been increasingly studied in the academy in recent years, especially in technological and scientific contexts. However, it is still a relatively new field of academic study; because it has been previously considered in mainly technological contexts, more attention needs to be drawn to the contributions made in Big Data scholarship in the social sciences by scholars like Omar Tene and Jules Polonetsky, Bart Custers, Kate Crawford, Nick Couldry, and Jose van Dijk. The purpose of this Major Research Paper is to gain insight into the issues surrounding privacy and user rights, roles, and commodification in relation to Big Data in a social sciences context. The term “Big Data” describes the collection, aggregation, and analysis of large data sets. While corporations are usually responsible for the analysis and dissemination of the data, most of this data is user generated, and there must be considerations regarding the user’s rights and roles. In this paper, I raise three main issues that shape the discussion: how users can be more active agents in data ownership, how consent measures can be made to actively reflect user interests instead of focusing on benefitting corporations, and how user agency can be preserved. Through an analysis of social sciences scholarly literature on Big Data, privacy, and user commodification, I wish to determine how these concepts are being discussed, where there have been advancements in privacy regulation and the prevention of user commodification, and where there is a need to improve these measures. In doing this, I hope to discover a way to better facilitate the relationship between data collectors and analysts, and user-generators. 1 While there is no definitive resolution as to whether or not to capitalize the term “Big Data”, in capitalizing it I chose to conform with such authors as boyd and Crawford (2012), Couldry and Turow (2014), and Dalton and Thatcher (2015), who do so in the scholarly literature.

Download Full-text

A preliminary investigation into honey bee (Apis mellifera) pollination of canola (Brassica napus cv. Karoo) in Western Australia

Australian Journal of Experimental Agriculture ◽

10.1071/ea98148 ◽

2000 ◽

Vol 40 (3) ◽

pp. 439 ◽

Cited By ~ 15

Author(s):

R. Manning ◽

J. Boland ◽

J. Boland

Keyword(s):

Apis Mellifera ◽

Brassica Napus ◽

Linear Regression ◽

Multiple Linear Regression ◽

Honey Bee ◽

Preliminary Investigation ◽

Conservative Estimate ◽

Branch Number ◽

Explanatory Variables ◽

Pod Yield

The aim of this preliminary experiment was to evaluate the effect of distance from the apiary on pod yield in canola. Beehives were used at a density of 1.28 hives/ha. The results showed that the number of pods/plant decreased as distance from the apiary increased, when plant height and branch number were used as explanatory variables. Multiple linear regression indicated a predicted pod loss of 15.3 pods/plant over a distance of 1000 m from an apiary. This was equivalent to a 16% loss based on an average of 59 plants/m2 and average pod production of 5666 pods/m2 from this experiment. For a 2 t/ha crop this would be equivalent to about 320 kg/ha. The results are only indicative because of the variation in the crop studied and lack of replication, but may, in fact, be a conservative estimate.

Download Full-text