Applying big data paradigms to a large scale scientific workflow: Lessons learned and future directions

2020 ◽  
Vol 110 ◽  
pp. 440-452 ◽  
Author(s):  
S. Caíno-Lores ◽  
A. Lapin ◽  
J. Carretero ◽  
P. Kropf
2013 ◽  
Vol 14 (1) ◽  
pp. 51-61 ◽  
Author(s):  
Fabian Fischer ◽  
Johannes Fuchs ◽  
Florian Mansmann ◽  
Daniel A Keim

The enormous growth of data in the last decades has led to a wide variety of database technologies. Nowadays, we are capable of storing vast amounts of structured and unstructured data. To address the challenge of exploring and making sense of big data using visual analytics, tight integration with such backend services is needed. In this article, we introduce BANKSAFE, which was built for the VAST Challenge 2012 and won the outstanding comprehensive submission award. BANKSAFE is based on modern database technologies and is capable of visually analyzing vast amounts of monitoring data and security-related datasets of large-scale computer networks. To better describe and demonstrate the visualizations, we utilize the Visual Analytics Science and Technology (VAST) Challenge 2012 as a case study. Additionally, we discuss lessons learned during the design and development of BANKSAFE, which are also applicable to other visual analytics applications for big data.


Author(s):  
O. Melet ◽  
D. Youssefi ◽  
C. L'Helguen ◽  
J. Michel ◽  
E. Sarrazin ◽  
...  

Abstract. Earth Observation (EO) remote sensing missions are producing an increasing volume of data due to higher spatial and spectral resolutions and a higher frequency of acquisitions. Thus, in order to prepare for the future of image processing pipelines, CNES has carried out Research & Development studies on the use of Big Data and Cloud technologies for image processing chains. Since mid-2019, CNES, in partnership with Airbus Defense & Space, has been working on a new High Resolution Optical EO mission dedicated to very high resolution 3D observation called CO3D ("Constellation Optique 3D"). To achieve these objectives, a new image processing pipeline prototype is being developed, taking into account the lessons learned from the previous studies. The paper introduces this new image processing pipeline, the processing paradigms used to take advantage of big data technologies, and the results of production benchmarks at a large scale. The ongoing work to optimize the processing pipeline and Cloud cluster is also discussed.


2020 ◽  
Vol 29 (3S) ◽  
pp. 638-647 ◽  
Author(s):  
Janine F. J. Meijerink ◽  
Marieke Pronk ◽  
Sophia E. Kramer

Purpose The SUpport PRogram (SUPR) study was carried out in the context of a private-academic partnership and is the first study to evaluate the long-term effects of a communication program (SUPR) for older hearing aid users and their communication partners on a large scale in a hearing aid dispensing setting. The purpose of this research note is to reflect on the lessons that we learned during the different development, implementation, and evaluation phases of the SUPR project.
Procedure This research note describes the procedures that were followed during the different phases of the SUPR project and provides a critical discussion of the strengths and weaknesses of the approach taken.
Conclusion This research note might provide researchers and intervention developers with useful insights as to how aural rehabilitation interventions, such as the SUPR, can be developed by incorporating the needs of the different stakeholders, evaluated by using a robust research design (including a large sample size and a longer term follow-up assessment), and implemented widely by collaborating with a private partner (hearing aid dispensing practice chain).


2020 ◽  
Author(s):  
Anusha Ampavathi ◽  
Vijaya Saradhi T

Big data and its approaches are generally helpful for the healthcare and biomedical sectors in predicting disease. For trivial symptoms, it is difficult to meet doctors in the hospital at any time. Thus, big data provides essential information about diseases on the basis of the patient's symptoms. For several medical organizations, disease prediction is important for making the best feasible health care decisions. Conversely, the conventional medical care model offers structured input, which requires more accurate and consistent prediction. This paper develops multi-disease prediction using an improved deep learning concept. Here, different datasets pertaining to "Diabetes, Hepatitis, lung cancer, liver tumor, heart disease, Parkinson's disease, and Alzheimer's disease" are gathered from the benchmark UCI repository for conducting the experiment. The proposed model involves three phases: (a) data normalization, (b) weighted normalized feature extraction, and (c) prediction. Initially, the dataset is normalized in order to bring the attributes into a common range. Further, weighted feature extraction is performed, in which a weight function is multiplied with each attribute value to enlarge the deviation between attributes. Here, the weight function is optimized using a combination of two meta-heuristic algorithms, termed the Jaya Algorithm-based Multi-Verse Optimization algorithm (JA-MVO). The optimally extracted features are subjected to hybrid deep learning algorithms, namely the "Deep Belief Network (DBN) and Recurrent Neural Network (RNN)". As a modification to the hybrid deep learning architecture, the weights of both the DBN and RNN are optimized using the same hybrid optimization algorithm. Finally, a comparative evaluation of the proposed prediction against existing models certifies its effectiveness through various performance measures.
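The first two phases of the pipeline described above (normalization, then multiplying each attribute by an optimized weight) can be sketched in a few lines. This is a minimal illustration only: the toy matrix and the weight values are hypothetical, and in the paper the weights would be tuned by the JA-MVO optimizer rather than set by hand.

```python
import numpy as np

def normalize(X):
    """Min-max normalize each attribute (column) to the [0, 1] range."""
    mn, mx = X.min(axis=0), X.max(axis=0)
    span = np.where(mx - mn == 0, 1, mx - mn)  # guard against constant columns
    return (X - mn) / span

def weighted_features(X_norm, weights):
    """Multiply each normalized attribute by its weight, enlarging the
    deviation between attributes before they are fed to the predictor."""
    return X_norm * weights

# Toy example (hypothetical values, not from the paper's datasets)
X = np.array([[1.0, 200.0],
              [2.0, 400.0],
              [3.0, 600.0]])
w = np.array([0.8, 1.2])  # in the paper, these would come from JA-MVO

Xw = weighted_features(normalize(X), w)
# Each column now spans [0, w_j] instead of [0, 1]
```

The weighting step is a plain element-wise scaling; what makes the paper's approach distinctive is how the weight vector is searched for, which this sketch deliberately leaves out.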


2021 ◽  
Vol 51 (3) ◽  
pp. 9-16
Author(s):  
José Suárez-Varela ◽  
Miquel Ferriol-Galmés ◽  
Albert López ◽  
Paul Almasan ◽  
Guillermo Bernárdez ◽  
...  

During the last decade, Machine Learning (ML) has increasingly become a hot topic in the field of Computer Networks and is expected to be gradually adopted for a plethora of control, monitoring, and management tasks in real-world deployments. This poses the need for new generations of students, researchers, and practitioners with a solid background in ML applied to networks. During 2020, the International Telecommunication Union (ITU) organized the "ITU AI/ML in 5G challenge", an open global competition that introduced a broad audience to some of the current main challenges in ML for networks. This large-scale initiative gathered 23 different challenges proposed by network operators, equipment manufacturers, and academia, and attracted a total of 1300+ participants from 60+ countries. This paper narrates our experience organizing one of the proposed challenges: the "Graph Neural Networking Challenge 2020". We describe the problem presented to participants, the tools and resources provided, some organizational aspects and participation statistics, an outline of the top-3 awarded solutions, and a summary of the lessons learned throughout this journey. As a result, this challenge leaves a curated set of educational resources openly available to anyone interested in the topic.

