Data mining with molecular design rules identifies new class of dyes for dye-sensitised solar cells



Affordances of Data Science in Agriculture, Manufacturing, and Education

Web Services ◽

10.4018/978-1-5225-7501-6.ch052 ◽

2019 ◽

pp. 953-978

Author(s):

Krishnan Umachandran ◽

Debra Sharon Ferdinand-James

Keyword(s):

Big Data ◽

Large Scale ◽

Data Science ◽

Data Generation ◽

Large Scale Data ◽

Big Data Applications ◽

Effective Decision ◽

Effective Decision Making ◽

Text Images ◽

Scale Data

Continued technological advancements of the 21st century afford massive data generation in sectors of our economy, including agriculture, manufacturing, and education. However, harnessing such large-scale data with modern technologies for effective decision-making appears to be an evolving science that requires knowledge of Big Data management and analytics. Big Data in agriculture, manufacturing, and education takes varied forms, such as voluminous text, images, and graphs. Applying Big Data science techniques (e.g., functional algorithms) to extract intelligence affords decision makers a quick response to productivity, market resilience, and student enrollment challenges in today's unpredictable markets. This chapter employs data science to explore potential solutions for Big Data applications in agriculture, manufacturing, and, to a lesser extent, education, using modern technological tools such as Hadoop, Hive, Sqoop, and MongoDB.


Affordances of Data Science in Agriculture, Manufacturing, and Education

Privacy and Security Policies in Big Data - Advances in Information Security, Privacy, and Ethics ◽

10.4018/978-1-5225-2486-1.ch002 ◽

2017 ◽

pp. 14-40 ◽

Cited By ~ 2

Author(s):

Krishnan Umachandran ◽

Debra Sharon Ferdinand-James

Keyword(s):

Big Data ◽

Large Scale ◽

Data Science ◽

Data Generation ◽

Large Scale Data ◽

Big Data Applications ◽

Effective Decision ◽

Effective Decision Making ◽

Text Images ◽

Scale Data

Continued technological advancements of the 21st century afford massive data generation in sectors of our economy, including agriculture, manufacturing, and education. However, harnessing such large-scale data with modern technologies for effective decision-making appears to be an evolving science that requires knowledge of Big Data management and analytics. Big Data in agriculture, manufacturing, and education takes varied forms, such as voluminous text, images, and graphs. Applying Big Data science techniques (e.g., functional algorithms) to extract intelligence affords decision makers a quick response to productivity, market resilience, and student enrollment challenges in today's unpredictable markets. This chapter employs data science to explore potential solutions for Big Data applications in agriculture, manufacturing, and, to a lesser extent, education, using modern technological tools such as Hadoop, Hive, Sqoop, and MongoDB.


Big Data Mining or Turning Data Mining into Predictive Analytics from Large-Scale 3Vs Data: The Future Challenge for Knowledge Discovery

Model and Data Engineering - Lecture Notes in Computer Science ◽

10.1007/978-3-319-11587-0_2 ◽

2014 ◽

pp. 4-8 ◽

Cited By ~ 8

Author(s):

Alfredo Cuzzocrea

Keyword(s):

Data Mining ◽

Big Data ◽

Knowledge Discovery ◽

Large Scale ◽

Predictive Analytics ◽

Future Challenge ◽

Big Data Mining ◽

The Future


Large scale adverse event data mining for targeted therapies development.

Journal of Clinical Oncology ◽

10.1200/jco.2017.35.15_suppl.2538 ◽

2017 ◽

Vol 35 (15_suppl) ◽

pp. 2538-2538

Author(s):

Mayur Sarangdhar ◽

Bruce Aronow ◽

Anil Goud Jegga ◽

Brian Turpin ◽

Erin Haag Breese ◽

...

Keyword(s):

Data Mining ◽

Clinical Trials ◽

Adverse Event ◽

Big Data ◽

Real World ◽

Immune Checkpoint ◽

Immune Checkpoint Inhibitors ◽

Large Scale ◽

Checkpoint Inhibitors ◽

Anaplastic Lymphoma

2538 Background: Targeted anti-cancer small molecule drugs & immune therapies have had a dramatic impact in improving outcomes & the approach to clinical trials. Increasingly, regulatory approvals are expedited with small studies designed to identify strong efficacy signals. However, this may limit the extent of safety profiling. The use of large scale/big data meta-analyses can identify novel safety & efficacy signals in "real-world" medical settings. Methods: We used AERSMine, an open-source data mining platform, to identify drug toxicity signatures in the FDA’s Adverse Event Reporting System of 8.6 million patients. We identified patients (n = 732,198) who received either traditional or targeted cancer therapy & identified therapy-specific toxicity patterns. Patients were classified based on exposures: anthracyclines (n = 83,179), platinum (117,993), antimetabolites (93,062), alkylators (81,507), antimicrotubule agents (97,726), HER2 inhibitors (40,040), VEGFis (79,144), VEGF-TKis (90,734), multi TKis (34,457), anaplastic lymphoma Kis (7,635), PI3K-AKT-mTOR inhibitors (33,864), Bruton TKis (9,247), MEKis (4,018), immunomodulatory agents (174,810), proteasome inhibitors (44,681), immune checkpoint inhibitors (20,287). Pharmacovigilance metrics [Relative Risks & safety signals] were used to establish statistical correlation & toxicity signatures were differentiated using the Kolmogorov–Smirnov test. Results: To validate the use of AERSMine to detect AEs, we focused on cardiotoxicity. It identified classic drug-associated AEs (e.g. ventricular dysfunction with anthracyclines, HER2is & VEGFis; VEGFi hypertension & vascular toxicity; multi TKis vascular events). AERSMine also identified recently reported uncommon toxicities of myositis/myocarditis with immune checkpoint inhibitors. It indicated a higher frequency of myositis/myocarditis with combination immune checkpoint therapy, paralleling industry corporate safety databases. These toxicities were reported at higher frequencies in patients > 65 yrs. Conclusions: AERSMine “big data” analyses provide a sensitive tool to detect potential new patterns of AEs simultaneously across multiple clinical trials & in the real-world setting.
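
The two statistics named in the Methods, relative risk and the Kolmogorov–Smirnov test, can be illustrated with a minimal Python sketch. This is not AERSMine code: the function names, the counts, and the sample values are hypothetical, and the abstract does not say how the authors constructed the samples they compared.

```python
from bisect import bisect_right


def relative_risk(exposed_events, exposed_total, unexposed_events, unexposed_total):
    """Event rate among exposed reports divided by the rate among unexposed reports."""
    risk_exposed = exposed_events / exposed_total
    risk_unexposed = unexposed_events / unexposed_total
    return risk_exposed / risk_unexposed


def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between the empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    gap = 0.0
    for x in a + b:  # the maximum gap is always reached at an observed value
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        gap = max(gap, abs(cdf_a - cdf_b))
    return gap


# Hypothetical counts and samples, not taken from the abstract.
rr = relative_risk(exposed_events=30, exposed_total=1000,
                   unexposed_events=10, unexposed_total=1000)
d = ks_statistic([1, 2, 3, 4], [3, 4, 5, 6])
print(f"relative risk = {rr:.2f}, KS statistic = {d:.2f}")
```

A relative risk above 1 means the event is reported more often in the exposed group. A KS statistic near 0 means the two distributions look alike and a value near 1 means they barely overlap; turning the statistic into a p-value is left out of this sketch.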


Big Data Methods

Organizational Research Methods ◽

10.1177/1094428116677299 ◽

2016 ◽

Vol 21 (3) ◽

pp. 525-547 ◽

Cited By ~ 55

Author(s):

Scott Tonidandel ◽

Eden B. King ◽

Jose M. Cortina

Keyword(s):

Machine Learning ◽

Data Mining ◽

Big Data ◽

Data Analytics ◽

Data Science ◽

Data Sources ◽

Future Research ◽

Organizational Science ◽

Associated Data ◽

Organizational Sciences

Advances in data science, such as data mining, data visualization, and machine learning, are extremely well-suited to address numerous questions in the organizational sciences given the explosion of available data. Despite these opportunities, few scholars in our field have discussed the specific ways in which the lens of our science should be brought to bear on the topic of big data and big data's reciprocal impact on our science. The purpose of this paper is to provide an overview of the big data phenomenon and its potential for impacting organizational science in both positive and negative ways. We identify the biggest opportunities afforded by big data along with the biggest obstacles, and we discuss specifically how we think our methods will be most impacted by the data analytics movement. We also provide a list of resources to help interested readers incorporate big data methods into their existing research. Our hope is that we stimulate interest in big data, motivate future research using big data sources, and encourage the application of associated data science techniques more broadly in the organizational sciences.


Emergent Technologies in Big Data Sensing: A Survey

International Journal of Distributed Sensor Networks ◽

10.1155/2015/902982 ◽

2015 ◽

Vol 2015 ◽

pp. 1-13 ◽

Cited By ~ 5

Author(s):

Ting Zhu ◽

Sheng Xiao ◽

Qingquan Zhang ◽

Yu Gu ◽

Ping Yi ◽

...

Keyword(s):

Big Data ◽

Large Scale ◽

Data Science ◽

Mobile Sensing ◽

Emergent Technologies ◽

Sensing Applications ◽

Crowd Sensing ◽

Multiple Data ◽

Challenges And Opportunities ◽

Research Architecture

When the number of data-generating sensors increases and the amount of sensing data grows to a scale that traditional methods cannot handle, big data methods are needed for sensing applications. However, big data is a fuzzy data science concept, and there is neither an existing research architecture for it nor a generic application structure in the field of sensing. In this survey, we explore many scattered results that have been achieved by combining big data techniques with sensing and present our vision of big data in sensing. Firstly, we outline the application categories to generally summarize existing research achievements. Then we discuss the techniques proposed in these studies to demonstrate challenges and opportunities in this field. Finally, we present research trends and list some directions of big data in future sensing. Overall, mobile sensing and its related studies are hot topics, but other areas of large-scale sensing research are flourishing too. Although there are no “big data” techniques acting as research platforms or infrastructures to support various applications, multiple data science technologies, such as data mining, crowd sensing, and cloud computing, serve as foundations and bases of big data in the world of sensing.


Migrating From Data Mining to Big Data Mining

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i3.4.14667 ◽

2018 ◽

Vol 7 (3.4) ◽

pp. 13

Author(s):

Gourav Bathla ◽

Himanshu Aggarwal ◽

Rinkle Rani

Keyword(s):

Data Mining ◽

Big Data ◽

Response Time ◽

Large Scale ◽

Naive Bayes ◽

Naïve Bayes ◽

Data Mining Algorithm ◽

Big Data Mining ◽

Data Mining Algorithms ◽

Mining Algorithms

Data mining is one of the most researched fields in computer science. Several studies have been carried out to extract and analyse important information from raw data. Traditional data mining algorithms such as classification, clustering and statistical analysis can process small-scale data with great efficiency and accuracy. Social networking interactions, business transactions and other communications result in Big data: large-scale data that is beyond the capability of traditional data mining techniques. It is observed that traditional data mining algorithms are not capable of storing and processing large-scale data, and where some algorithms are capable, the response time is very high. Big data holds hidden information that, if analysed in an intelligent manner, can be highly beneficial for business organizations. In this paper, we analyse the advancement from traditional data mining algorithms to Big data mining algorithms. Applications of traditional data mining algorithms can be incorporated straightforwardly into Big data mining algorithms. Several studies have compared traditional data mining with Big data mining, but very few have analysed the most important algorithms within one research work, which is the core motive of our paper. Readers can easily observe the differences between these algorithms, along with their pros and cons. Mathematical concepts are applied in data mining algorithms: means and Euclidean distance calculation in K-means, vectors and margins in SVM, and Bayes' theorem and conditional probability in the Naïve Bayes algorithm are real examples. Classification and clustering are the most important applications of data mining. In this paper, the K-means, SVM and Naïve Bayes algorithms are analysed in detail to observe accuracy and response time from both conceptual and empirical perspectives. Big data technologies such as Hadoop and MapReduce are used for implementing Big data mining algorithms. Performance evaluation metrics such as speedup, scaleup and response time are used to compare traditional mining with Big data mining.
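
The role of means and Euclidean distance in K-means, and the speedup metric, can be made concrete with a minimal single-machine Python sketch. It is not the paper's Hadoop/MapReduce implementation: the function names and data points are hypothetical, and the speedup formula is the common textbook definition, which the abstract does not spell out.

```python
import math


def euclidean(p, q):
    """Euclidean distance between two points of equal dimension."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def kmeans_step(points, centroids):
    """One K-means iteration: assign points to the nearest centroid, then recompute means."""
    clusters = [[] for _ in centroids]
    for p in points:
        nearest = min(range(len(centroids)), key=lambda i: euclidean(p, centroids[i]))
        clusters[nearest].append(p)
    new_centroids = []
    for old, members in zip(centroids, clusters):
        if members:
            # Coordinate-wise mean of the cluster members.
            new_centroids.append(tuple(sum(c) / len(members) for c in zip(*members)))
        else:
            new_centroids.append(old)  # an empty cluster keeps its previous centroid
    return new_centroids


def speedup(time_one_node, time_n_nodes):
    """Assumed definition: run time on one node divided by run time on n nodes, same data."""
    return time_one_node / time_n_nodes


# Hypothetical data, not from the paper.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 10.0), (10.0, 12.0)]
centroids = kmeans_step(points, [(1.0, 1.0), (9.0, 9.0)])
print(centroids)
print(speedup(120.0, 30.0))
```

In a MapReduce setting the assignment loop would typically play the part of the map phase and the mean computation the reduce phase, but that mapping is an assumption here rather than something the abstract states.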


Large-Scale Data Mining and Distributed Processing in Big Data Internet

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.989-994.4594 ◽

2014 ◽

Vol 989-994 ◽

pp. 4594-4597

Author(s):

Chun Zhi Xing

Keyword(s):

Data Mining ◽

Big Data ◽

Decision Tree ◽

Large Scale ◽

Distributed Processing ◽

Processing Method ◽

Decision Tree Algorithm ◽

Data Query ◽

Large Scale Data ◽

Scale Data

With the development of the Internet, various kinds of Internet-based large-scale data are facing increasing competition. To satisfy the need for data query, it is necessary to use data mining and distributed processing. This paper therefore proposes a large-scale data mining and distributed processing method based on a decision tree algorithm.


Need for Dynamicity in Social Networking Sites

Data Mining in Dynamic Social Networks and Fuzzy Systems - Advances in Data Mining and Database Management ◽

10.4018/978-1-4666-4213-3.ch001 ◽

2013 ◽

pp. 1-24 ◽

Cited By ~ 2

Author(s):

Gurdeep S Hura

Keyword(s):

Data Mining ◽

Social Media ◽

Big Data ◽

Social Networking ◽

Real World ◽

Social Networking Sites ◽

Data Science ◽

Emerging Technology ◽

New Applications ◽

Future Status

This chapter presents the emerging technology of social media and networking, with a detailed discussion of basic definitions and applications, how the technology has evolved in the last few years, and the need for dynamicity in a data mining environment. It also provides a comprehensive design and analysis of the popular social networking media and sites available to users. A brief discussion of the data mining methodologies for implementing the variety of new applications dealing with huge/big data in data science is presented. Further, the chapter attempts to present a new perspective on data mining methodologies and their dynamicity for social networking media and sites, as a new trend and a needed framework for the collection, analysis and interpretation of huge amounts of data in a number of real-world applications. The current and future status of data mining of social media and networking applications is also discussed.
