Managing, Analysing, and Integrating Big Data in Medical Bioinformatics: Open Problems and Future Perspectives

BioMed Research International ◽

10.1155/2014/134023 ◽

2014 ◽

Vol 2014 ◽

pp. 1-13 ◽

Cited By ~ 63

Author(s):

Ivan Merelli ◽

Horacio Pérez-Sánchez ◽

Sandra Gesing ◽

Daniele D’Agostino

Keyword(s):

Big Data ◽

High Performance ◽

Biomedical Data ◽

Huge Amount ◽

Open Problems ◽

Starting Point ◽

Data Driven Approach ◽

Access Data ◽

Performance Computing ◽

Open Issues

The explosion of the data both in the biomedical research and in the healthcare systems demands urgent solutions. In particular, the research in omics sciences is moving from a hypothesis-driven to a data-driven approach. Healthcare is additionally always asking for a tighter integration with biomedical data in order to promote personalized medicine and to provide better treatments. Efficient analysis and interpretation of Big Data opens new avenues to explore molecular biology, new questions to ask about physiological and pathological states, and new ways to answer these open issues. Such analyses lead to better understanding of diseases and development of better and personalized diagnostics and therapeutics. However, such progresses are directly related to the availability of new solutions to deal with this huge amount of information. New paradigms are needed to store and access data, for its annotation and integration and finally for inferring knowledge and making it available to researchers. Bioinformatics can be viewed as the “glue” for all these processes. A clear awareness of present high performance computing (HPC) solutions in bioinformatics, Big Data analysis paradigms for computational biology, and the issues that are still open in the biomedical and healthcare fields represent the starting point to win this challenge.

Download Full-text

Perspectives on High-Performance Computing in a Big Data World

Proceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing - HPDC '19 ◽

10.1145/3307681.3325410 ◽

2019 ◽

Author(s):

Geoffrey C. Fox

Keyword(s):

Big Data ◽

High Performance Computing ◽

High Performance ◽

Performance Computing

Download Full-text

High-Performance Computing for Big Data Processing

Future Generation Computer Systems ◽

10.1016/j.future.2018.07.054 ◽

2018 ◽

Vol 88 ◽

pp. 693-695 ◽

Cited By ~ 1

Author(s):

Yulei Wu ◽

Yang Xiang ◽

Jingguo Ge ◽

Peter Muller

Keyword(s):

Big Data ◽

Data Processing ◽

High Performance Computing ◽

High Performance ◽

Big Data Processing ◽

Performance Computing

Download Full-text

High Performance Computing and Big Data

Studies in Big Data - Guide to Big Data Applications ◽

10.1007/978-3-319-53817-4_6 ◽

2017 ◽

pp. 125-147 ◽

Cited By ~ 1

Author(s):

Rishi Divate ◽

Sankalp Sah ◽

Manish Singh

Keyword(s):

Big Data ◽

High Performance Computing ◽

High Performance ◽

Performance Computing

Download Full-text

Big Data and IT Network Data Visualization

International Journal of Mathematical Engineering and Management Sciences ◽

10.33889/ijmems.2018.3.1-002 ◽

2018 ◽

Vol 3 (1) ◽

pp. 9-16 ◽

Cited By ~ 3

Author(s):

Lidong Wang

Keyword(s):

Big Data ◽

Network Analysis ◽

Graphics Processing Units ◽

Data Analytics ◽

High Performance ◽

Big Data Analytics ◽

Network Visualization ◽

Network Data ◽

Graphics Processing ◽

Performance Computing

Visualization with graphs is popular in the data analysis of Information Technology (IT) networks or computer networks. An IT network is often modelled as a graph with hosts being nodes and traffic being flows on many edges. General visualization methods are introduced in this paper. Applications and technology progress of visualization in IT network analysis and big data in IT network visualization are presented. The challenges of visualization and Big Data analytics in IT network visualization are also discussed. Big Data analytics with High Performance Computing (HPC) techniques, especially Graphics Processing Units (GPUs) helps accelerate IT network analysis and visualization.

Download Full-text

High Performance Numerical Computing for High Energy Physics: A New Challenge for Big Data Science

Advances in High Energy Physics ◽

10.1155/2014/507690 ◽

2014 ◽

Vol 2014 ◽

pp. 1-13 ◽

Cited By ~ 3

Author(s):

Florin Pop

Keyword(s):

Monte Carlo ◽

Big Data ◽

Numerical Methods ◽

High Performance ◽

Data Science ◽

Experimental Validation ◽

High Energy Physics ◽

High Energy ◽

Performance Computing ◽

Energy Physics

Modern physics is based on both theoretical analysis and experimental validation. Complex scenarios like subatomic dimensions, high energy, and lower absolute temperature are frontiers for many theoretical models. Simulation with stable numerical methods represents an excellent instrument for high accuracy analysis, experimental validation, and visualization. High performance computing support offers possibility to make simulations at large scale, in parallel, but the volume of data generated by these experiments creates a new challenge for Big Data Science. This paper presents existing computational methods for high energy physics (HEP) analyzed from two perspectives: numerical methods and high performance computing. The computational methods presented are Monte Carlo methods and simulations of HEP processes, Markovian Monte Carlo, unfolding methods in particle physics, kernel estimation in HEP, and Random Matrix Theory used in analysis of particles spectrum. All of these methods produce data-intensive applications, which introduce new challenges and requirements for ICT systems architecture, programming paradigms, and storage capabilities.

Download Full-text

Bringing High Performance Computing to Big Data Algorithms

Handbook of Big Data Technologies ◽

10.1007/978-3-319-49340-4_23 ◽

2017 ◽

pp. 777-806 ◽

Cited By ~ 2

Author(s):

H. Anzt ◽

J. Dongarra ◽

M. Gates ◽

J. Kurzak ◽

P. Luszczek ◽

...

Keyword(s):

Big Data ◽

High Performance Computing ◽

High Performance ◽

Performance Computing

Download Full-text

Synchronizing Execution of Big Data in Distributed and Parallelized Environments

Big Data ◽

10.4018/978-1-4666-9840-6.ch071 ◽

2016 ◽

pp. 1555-1581

Author(s):

Gueyoung Jung ◽

Tridib Mukherjee

Keyword(s):

Big Data ◽

Distributed System ◽

Data Analytics ◽

High Performance ◽

Large Scale ◽

Big Data Analytics ◽

Loosely Coupled ◽

Current Trends ◽

Distributed Computing Infrastructures ◽

Performance Computing

In the modern information era, the amount of data has exploded. Current trends further indicate exponential growth of data in the future. This prevalent humungous amount of data—referred to as big data—has given rise to the problem of finding the “needle in the haystack” (i.e., extracting meaningful information from big data). Many researchers and practitioners are focusing on big data analytics to address the problem. One of the major issues in this regard is the computation requirement of big data analytics. In recent years, the proliferation of many loosely coupled distributed computing infrastructures (e.g., modern public, private, and hybrid clouds, high performance computing clusters, and grids) have enabled high computing capability to be offered for large-scale computation. This has allowed the execution of the big data analytics to gather pace in recent years across organizations and enterprises. However, even with the high computing capability, it is a big challenge to efficiently extract valuable information from vast astronomical data. Hence, we require unforeseen scalability of performance to deal with the execution of big data analytics. A big question in this regard is how to maximally leverage the high computing capabilities from the aforementioned loosely coupled distributed infrastructure to ensure fast and accurate execution of big data analytics. In this regard, this chapter focuses on synchronous parallelization of big data analytics over a distributed system environment to optimize performance.

Download Full-text

Big data, high performance computing and Pharmaceutical innovations

Journal of Pharmaceutics & Drug Delivery Research ◽

10.4172/2325-9604.c1.001 ◽

2016 ◽

Vol 05 (03) ◽

Author(s):

Jun Xu

Keyword(s):

Big Data ◽

High Performance Computing ◽

High Performance ◽

Performance Computing

Download Full-text

Approaches of enhancing interoperations among high performance computing and big data analytics via augmentation

Cluster Computing ◽

10.1007/s10586-019-02960-y ◽

2019 ◽

Vol 23 (2) ◽

pp. 953-988 ◽

Cited By ~ 3

Author(s):

Ajeet Ram Pathak ◽

Manjusha Pandey ◽

Siddharth S. Rautaray

Keyword(s):

Big Data ◽

High Performance Computing ◽

Data Analytics ◽

High Performance ◽

Big Data Analytics ◽

Performance Computing

Download Full-text

Rethinking High Performance Computing System Architecture for Scientific Big Data Applications

2016 IEEE Trustcom/BigDataSE/ISPA ◽

10.1109/trustcom.2016.0248 ◽

2016 ◽

Author(s):

Yong Chen ◽

Chao Chen ◽

Yanlong Yin ◽

Xian-He Sun ◽

Rajeev Thakur ◽

...

Keyword(s):

Big Data ◽

High Performance Computing ◽

System Architecture ◽

High Performance ◽

Computing System ◽

Big Data Applications ◽

High Performance Computing System ◽

Performance Computing

Download Full-text