Data mining with molecular design rules identifies new class of dyes for dye-sensitised solar cells

2014 ◽  
Vol 16 (48) ◽  
pp. 26684-26690 ◽  
Author(s):  
Jacqueline M. Cole ◽  
Kian Sing Low ◽  
Hiroaki Ozoe ◽  
Panagiota Stathi ◽  
Chitoshi Kitamura ◽  
...  

Big data science informs energy research: large-scale screening of crystal structures identifies unforeseen class of dyes for dye-sensitised solar cells.

Author(s):  
Gurdeep S Hura

This chapter presents this new emerging technology of social media and networking with a detailed discussion on: basic definitions and applications, how this technology evolved in the last few years, the need for dynamicity under data mining environment. It also provides a comprehensive design and analysis of popular social networking media and sites available for the users. A brief discussion on the data mining methodologies for implementing the variety of new applications dealing with huge/big data in data science is presented. Further, an attempt is being made in this chapter to present a new emerging perspective of data mining methodologies with its dynamicity for social networking media and sites as a new trend and needed framework for dealing with huge amount of data for its collection, analysis and interpretation for a number of real world applications. A discussion will also be provided for the current and future status of data mining of social media and networking applications.


Web Services ◽  
2019 ◽  
pp. 953-978
Author(s):  
Krishnan Umachandran ◽  
Debra Sharon Ferdinand-James

Continued technological advancements of the 21st Century afford massive data generation in sectors of our economy to include the domains of agriculture, manufacturing, and education. However, harnessing such large-scale data, using modern technologies for effective decision-making appears to be an evolving science that requires knowledge of Big Data management and analytics. Big data in agriculture, manufacturing, and education are varied such as voluminous text, images, and graphs. Applying Big data science techniques (e.g., functional algorithms) for extracting intelligence data affords decision markers quick response to productivity, market resilience, and student enrollment challenges in today's unpredictable markets. This chapter serves to employ data science for potential solutions to Big Data applications in the sectors of agriculture, manufacturing and education to a lesser extent, using modern technological tools such as Hadoop, Hive, Sqoop, and MongoDB.


Author(s):  
Krishnan Umachandran ◽  
Debra Sharon Ferdinand-James

Continued technological advancements of the 21st Century afford massive data generation in sectors of our economy to include the domains of agriculture, manufacturing, and education. However, harnessing such large-scale data, using modern technologies for effective decision-making appears to be an evolving science that requires knowledge of Big Data management and analytics. Big data in agriculture, manufacturing, and education are varied such as voluminous text, images, and graphs. Applying Big data science techniques (e.g., functional algorithms) for extracting intelligence data affords decision markers quick response to productivity, market resilience, and student enrollment challenges in today's unpredictable markets. This chapter serves to employ data science for potential solutions to Big Data applications in the sectors of agriculture, manufacturing and education to a lesser extent, using modern technological tools such as Hadoop, Hive, Sqoop, and MongoDB.


2017 ◽  
Vol 35 (15_suppl) ◽  
pp. 2538-2538
Author(s):  
Mayur Sarangdhar ◽  
Bruce Aronow ◽  
Anil Goud Jegga ◽  
Brian Turpin ◽  
Erin Haag Breese ◽  
...  

2538 Background: Targeted anti-cancer small molecule drugs & immune therapies have had a dramatic impact in improving outcomes & the approach to clinical trials. Increasingly, regulatory approvals are expedited with small studies designed to identify strong efficacy signals. However, this may limit the extent of safety profiling. The use of large scale/big data meta-analyses can identify novel safety & efficacy signals in "real-world" medical settings. Methods: We used AERSMine, an open-source data mining platform to identify drug toxicity signatures in the FDA’s Adverse Event Reporting System of 8.6 million patients. We identified patients (n = 732,198) who received either traditional and targeted cancer therapy & identified therapy-specific toxicity patterns. Patients were classified based on exposures: anthracyclines (n = 83,179), platinum (117,993), antimetabolites (93,062), alkylators (81,507), antimicrotubule agents (97,726), HER2 inhibitors (40,040), VEGFis (79,144), VEGF-TKis (90,734), multi TKis (34,457), anaplastic lymphoma Kis (7,635), PI3K-AKT-mTOR inhibitors (33,864), Bruton TKis (9,247), MEKis (4,018), immunomodulatory agents (174,810), proteasome inhibitors (44,681), immune checkpoint inhibitors (20,287). Pharmacovigilance metrics [Relative Risks & safety signals] were used to establish statistical correlation & toxicity signatures were differentiated using the Kolmogorov–Smirnov test. Results: To validate the use of the AERSMine to detect AEs, we focused on cardiotoxicity. It identified classic drug associated AEs (e.g. ventricular dysfunction with anthracyclines, HER2is & VEGFis; VEGFi hypertension & vascular toxicity; multi TKIs vascular events). AERSMine also identified recently reported uncommon toxicities of myositis/myocarditis with immune checkpoint inhibitors. It indicated a higher frequency of myositis/myocarditis with combination immune checkpoint therapy, paralleling industry corporate safety databases. These toxicities were reported at higher frequencies in patients > 65 yrs. Conclusions: AERSMine “big data” analyses provide a sensitive tool to detect potential new patterns of AEs simultaneously across multiple clinical trials & in the real-world setting.


2016 ◽  
Vol 21 (3) ◽  
pp. 525-547 ◽  
Author(s):  
Scott Tonidandel ◽  
Eden B. King ◽  
Jose M. Cortina

Advances in data science, such as data mining, data visualization, and machine learning, are extremely well-suited to address numerous questions in the organizational sciences given the explosion of available data. Despite these opportunities, few scholars in our field have discussed the specific ways in which the lens of our science should be brought to bear on the topic of big data and big data's reciprocal impact on our science. The purpose of this paper is to provide an overview of the big data phenomenon and its potential for impacting organizational science in both positive and negative ways. We identifying the biggest opportunities afforded by big data along with the biggest obstacles, and we discuss specifically how we think our methods will be most impacted by the data analytics movement. We also provide a list of resources to help interested readers incorporate big data methods into their existing research. Our hope is that we stimulate interest in big data, motivate future research using big data sources, and encourage the application of associated data science techniques more broadly in the organizational sciences.


2015 ◽  
Vol 2015 ◽  
pp. 1-13 ◽  
Author(s):  
Ting Zhu ◽  
Sheng Xiao ◽  
Qingquan Zhang ◽  
Yu Gu ◽  
Ping Yi ◽  
...  

When the number of data generating sensors increases and the amount of sensing data grows to a scale that traditional methods cannot handle, big data methods are needed for sensing applications. However, big data is a fuzzy data science concept and there is no existing research architecture for it nor a generic application structure in the field of sensing. In this survey, we explore many scattered results that have been achieved by combining big data techniques with sensing and present our vision of big data in sensing. Firstly, we outline the application categories to generally summarize existing research achievements. Then we discuss the techniques proposed in these studies to demonstrate challenges and opportunities in this field. Finally, we present research trends and list some directions of big data in future sensing. Overall, mobile sensing and its related studies are hot topics, but other large-scale sensing researches are flourishing too. Although there are no “big data” techniques acting as research platforms or infrastructures to support various applications, multiple data science technologies, such as data mining, crowd sensing, and cloud computing, serve as foundations and bases of big data in the world of sensing.


2018 ◽  
Vol 7 (3.4) ◽  
pp. 13
Author(s):  
Gourav Bathla ◽  
Himanshu Aggarwal ◽  
Rinkle Rani

Data mining is one of the most researched fields in computer science. Several researches have been carried out to extract and analyse important information from raw data. Traditional data mining algorithms like classification, clustering and statistical analysis can process small scale of data with great efficiency and accuracy. Social networking interactions, business transactions and other communications result in Big data. It is large scale of data which is not in competency for traditional data mining techniques. It is observed that traditional data mining algorithms are not capable for storage and processing of large scale of data. If some algorithms are capable, then response time is very high. Big data have hidden information, if that is analysed in intelligent manner can be highly beneficial for business organizations. In this paper, we have analysed the advancement from traditional data mining algorithms to Big data mining algorithms. Applications of traditional data mining algorithms can be straight forward incorporated in Big data mining algorithm. Several studies have analysed traditional data mining with Big data mining, but very few have analysed most important algortihsm within one research work, which is the core motive of our paper. Readers can easily observe the difference between these algorthithms with  pros and cons. Mathemtics concepts are applied in data mining algorithms. Means and Euclidean distance calculation in Kmeans, Vectors application and margin in SVM and Bayes therorem, conditional probability in Naïve Bayes algorithm are real examples.  Classification and clustering are the most important applications of data mining. In this paper, Kmeans, SVM and Naïve Bayes algorithms are analysed in detail to observe the accuracy and response time both on concept and empirical perspective. Hadoop, Mapreduce etc. Big data technologies are used for implementing Big data mining algorithms. Performace evaluation metrics like speedup, scaleup and response time are used to compare traditional mining with Big data mining.  


2014 ◽  
Vol 989-994 ◽  
pp. 4594-4597
Author(s):  
Chun Zhi Xing

With the development of Internet, various Internet-based large-scale data are facing increasing competition. With the hope of satisfying the need of data query, it is necessary to use data mining and distributed processing. As a consequence, this paper proposes a large-scale data mining and distributed processing method based on decision tree algorithm.


Author(s):  
Gurdeep S Hura

This chapter presents this new emerging technology of social media and networking with a detailed discussion on: basic definitions and applications, how this technology evolved in the last few years, the need for dynamicity under data mining environment. It also provides a comprehensive design and analysis of popular social networking media and sites available for the users. A brief discussion on the data mining methodologies for implementing the variety of new applications dealing with huge/big data in data science is presented. Further, an attempt is being made in this chapter to present a new emerging perspective of data mining methodologies with its dynamicity for social networking media and sites as a new trend and needed framework for dealing with huge amount of data for its collection, analysis and interpretation for a number of real world applications. A discussion will also be provided for the current and future status of data mining of social media and networking applications.


Sign in / Sign up

Export Citation Format

Share Document