An Evaluation of Model-Based Approaches to Sensor Data Compression

Nguyen Quoc Viet Hung; Hoyoung Jeung; Karl Aberer

doi:10.1109/tkde.2012.237

Efficient seismic response data storage and transmission using ARX model-based sensor data compression algorithm

Earthquake Engineering & Structural Dynamics ◽

10.1002/eqe.551 ◽

2006 ◽

Vol 35 (6) ◽

pp. 781-788 ◽

Cited By ~ 8

Author(s):

Yunfeng Zhang ◽

Jian Li

Keyword(s):

Data Compression ◽

Seismic Response ◽

Data Storage ◽

Compression Algorithm ◽

Sensor Data ◽

Arx Model ◽

Response Data ◽

Model Based

Biosignal Compression Toolbox for Digital Biomarker Discovery

Sensors ◽

10.3390/s21020516 ◽

2021 ◽

Vol 21 (2) ◽

pp. 516

Author(s):

Brinnae Bent ◽

Baiying Lu ◽

Juseong Kim ◽

Jessilyn P. Dunn

Keyword(s):

Wavelet Transform ◽

Singular Value Decomposition ◽

Data Compression ◽

Biomarker Discovery ◽

Singular Value ◽

Sensor Data ◽

Discrete Wavelet ◽

Wearable Sensor ◽

Huffman Encoding ◽

Value Decomposition

A critical challenge to using longitudinal wearable sensor biosignal data for healthcare applications and digital biomarker development is the exacerbation of the healthcare “data deluge,” leading to new data storage and organization challenges and costs. Data aggregation, sampling rate minimization, and effective data compression are all methods for consolidating wearable sensor data to reduce data volumes. There has been limited research on appropriate, effective, and efficient data compression methods for biosignal data. Here, we examine the application of different data compression pipelines built using combinations of algorithmic- and encoding-based methods to biosignal data from wearable sensors and explore how these implementations affect data recoverability and storage footprint. Algorithmic methods tested include singular value decomposition, the discrete cosine transform, and the biorthogonal discrete wavelet transform. Encoding methods tested include run-length encoding and Huffman encoding. We apply these methods to common wearable sensor data, including electrocardiogram (ECG), photoplethysmography (PPG), accelerometry, electrodermal activity (EDA), and skin temperature measurements. Of the methods examined in this study, and in line with the characteristics of the different data types, we recommend direct data compression with Huffman encoding for ECG and PPG, singular value decomposition with Huffman encoding for EDA and accelerometry, and the biorthogonal discrete wavelet transform with Huffman encoding for skin temperature to maximize data recoverability after compression. We also report the best methods for maximizing the compression ratio. Finally, we develop and document open-source code and data for each compression method tested here, which can be accessed through the Digital Biomarker Discovery Pipeline as the “Biosignal Data Compression Toolbox,” an open-source, accessible software platform for compressing biosignal data.
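
As a rough illustration of the two encoding methods named in this abstract, the sketch below chains run-length encoding with Huffman coding and estimates a compression ratio. It is not the toolbox's implementation: the function names, the sample readings, and the 16-bit raw sample width are assumptions made for the example, and the estimate ignores the cost of storing the Huffman code table.

```python
import heapq
from collections import Counter
from itertools import groupby


def rle_encode(samples):
    """Collapse runs of equal values into (value, run_length) pairs."""
    return [(value, len(list(run))) for value, run in groupby(samples)]


def rle_decode(pairs):
    """Expand (value, run_length) pairs back into the original sequence."""
    return [value for value, count in pairs for _ in range(count)]


def huffman_code_lengths(symbols):
    """Return {symbol: code length in bits} for a Huffman code over symbols."""
    freq = Counter(symbols)
    if len(freq) == 1:  # a single distinct symbol still needs one bit
        return {next(iter(freq)): 1}
    # Heap entries: (weight, unique tie breaker, {symbol: depth so far}).
    heap = [(w, i, {s: 0}) for i, (s, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        w1, _, d1 = heapq.heappop(heap)
        w2, _, d2 = heapq.heappop(heap)
        # Merging two subtrees pushes every symbol in them one level deeper.
        merged = {s: depth + 1 for s, depth in {**d1, **d2}.items()}
        heapq.heappush(heap, (w1 + w2, tie, merged))
        tie += 1
    return heap[0][2]


def huffman_bits(symbols):
    """Total payload bits if symbols are Huffman coded (code table excluded)."""
    lengths = huffman_code_lengths(symbols)
    return sum(lengths[s] for s in symbols)


# Hypothetical quantized skin-temperature readings (tenths of a degree Celsius).
signal = [331] * 6 + [332] * 4 + [331] * 3 + [330] * 2 + [331] * 5
pairs = rle_encode(signal)
raw_bits = 16 * len(signal)        # assumes 16-bit raw samples
coded_bits = huffman_bits(pairs)   # Huffman code over the RLE pairs
compression_ratio = raw_bits / coded_bits
```

Both steps are lossless, so `rle_decode(pairs)` returns the original readings. Run-length encoding only helps on slowly varying, quantized signals like this one; on noisy signals it can make the data larger.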

Model-based sensor data fusion of quasi-redundant voltage and current measurements in a lithium-ion battery module

Journal of Power Sources ◽

10.1016/j.jpowsour.2019.227156 ◽

2019 ◽

Vol 440 ◽

pp. 227156 ◽

Cited By ~ 5

Author(s):

Dominik Schneider ◽

Ulrich Vögele ◽

Christian Endisch

Keyword(s):

Data Fusion ◽

Lithium Ion Battery ◽

Lithium Ion ◽

Sensor Data ◽

Sensor Data Fusion ◽

Model Based ◽

Battery Module

Routine Clustering of Mobile Sensor Data Facilitates Psychotic Relapse Prediction in Schizophrenia Patients (Preprint)

10.2196/preprints.31006 ◽

2021 ◽

Author(s):

Joanne Zhou ◽

Bishal Lamichhane ◽

Dror Ben-Zeev ◽

Andrew Campbell ◽

Akane Sano

Keyword(s):

Prediction Models ◽

Gaussian Mixture ◽

Mobile Sensing ◽

Cluster Models ◽

Sensor Data ◽

Sensing Data ◽

Model Based ◽

Clustering Model ◽

Relapse Prediction ◽

Lower Variability

BACKGROUND: Behavioral representations obtained from mobile sensing data could be helpful for the prediction of an oncoming psychotic relapse in schizophrenia patients and the delivery of timely interventions to mitigate such relapse. OBJECTIVE: In this work, we aim to develop clustering models to obtain behavioral representations from continuous multimodal mobile sensing data for relapse prediction tasks. The identified clusters could represent different routine behavioral trends related to the daily living of patients as well as atypical behavioral trends associated with impending relapse. METHODS: We used the mobile sensing data obtained in the CrossCheck project for our analysis. Continuous data from six different mobile sensing-based modalities (e.g., ambient light, sound/conversation, acceleration) obtained from a total of 63 schizophrenia patients, each monitored for up to a year, were used for the clustering models and relapse prediction evaluation. Two clustering models, Gaussian Mixture Model (GMM) and Partition Around Medoids (PAM), were used to obtain behavioral representations from the mobile sensing data. These models have different notions of similarity between behaviors as represented by the mobile sensing data and thus provide differing behavioral characterizations. The features obtained from the clustering models were used to train and evaluate a personalized relapse prediction model using Balanced Random Forest. The personalization was done by identifying optimal features for a given patient based on a personalization subset consisting of other patients of similar age. RESULTS: The clusters identified using the GMM and PAM models were found to represent different behavioral patterns (such as clusters representing sedentary days, active days with low communication, etc.). While GMM-based models better characterized routine behaviors by discovering dense clusters with low cluster spread, some other identified clusters had a larger cluster spread, likely indicating heterogeneous behavioral characterizations. PAM-based clusters, on the other hand, had lower variability of cluster spread, indicating more homogeneous behavioral characterization in the obtained clusters. Significant changes near the relapse periods were seen in the behavioral representation features obtained from the clustering models. The clustering model-based features, together with other features characterizing the mobile sensing data, resulted in an F2 score of 0.24 for the relapse prediction task in a leave-one-patient-out evaluation setting. This F2 score is significantly higher than a random classification baseline with an average F2 score of 0.042. CONCLUSIONS: Mobile sensing can capture behavioral trends using different sensing modalities. Clustering of the daily mobile sensing data may help discover routine as well as atypical behavioral trends. In this work, we used GMM- and PAM-based cluster models to obtain behavioral trends in schizophrenia patients. The features derived from the cluster models were found to be predictive for detecting an oncoming psychotic relapse. Such relapse prediction models can be helpful to enable timely interventions.
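
The F2 score reported in this abstract is the F-beta score with beta = 2, which weights recall more heavily than precision. The sketch below shows the standard formula computed from confusion-matrix counts. The function name and the example counts are hypothetical and are not taken from the study.

```python
def f_beta(true_positives, false_positives, false_negatives, beta=2.0):
    """F-beta score from confusion-matrix counts.

    Uses the identity
        F_beta = (1 + b^2) * TP / ((1 + b^2) * TP + b^2 * FN + FP),
    which equals (1 + b^2) * P * R / (b^2 * P + R) for precision P and recall R.
    Returns 0.0 when the denominator is zero (no positives predicted or present).
    """
    b2 = beta * beta
    denominator = (1 + b2) * true_positives + b2 * false_negatives + false_positives
    if denominator == 0:
        return 0.0
    return (1 + b2) * true_positives / denominator


# Hypothetical counts: 2 relapses caught, 6 false alarms, 2 relapses missed.
score = f_beta(true_positives=2, false_positives=6, false_negatives=2)
```

With beta = 2, a missed relapse (false negative) costs four times as much in the denominator as a false alarm, which suits a setting where missing a relapse is the more serious error.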

A Hybrid Approach: Dynamic Diagnostic Rules for Sensor Systems in Industry 4.0 Generated by Online Hyperparameter Tuned Random Forest

10.20944/preprints202007.0548.v1 ◽

2020 ◽

Author(s):

Ahlam Mallak ◽

Madjid Fathi

Keyword(s):

Random Forest ◽

Random Search ◽

Hybrid Approach ◽

Fault Detection And Diagnosis ◽

Data Driven ◽

Sensor Data ◽

Sensor Systems ◽

Hydraulic Test ◽

Model Based ◽

Detection And Diagnosis

In this work, a hybrid component Fault Detection and Diagnosis (FDD) approach for industrial sensor systems is established and analyzed, providing a hybrid schema that combines the advantages and eliminates the drawbacks of both model-based and data-driven methods of diagnosis. Moreover, the work sheds light on a new use of Random Forest (RF) together with model-based diagnosis, beyond its ordinary data-driven application. RF is trained and hyperparameter-tuned using 3-fold cross-validation over a random grid of parameters using random search, to finally generate diagnostic graphs as the dynamic, data-driven part of this system. Those graphs are then translated into model-based rules in the form of if-else statements, SQL queries, or semantic queries such as SPARQL, in order to feed the dynamic rules into a structured model essential for further diagnosis. The RF hyperparameters are continually updated online using the newly generated sensor data, in order to maintain the dynamicity and accuracy of the generated graphs and rules. The architecture of the proposed method is demonstrated in a comprehensive manner, and the dynamic rule extraction phase is applied in a case study on condition monitoring of a hydraulic test rig using multivariate time-series sensor readings.
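
The step of translating a learned decision graph into if-else rules can be sketched as a walk over every root-to-leaf path of a single tree. This is only an illustration of the general idea, not the authors' extraction procedure: the nested-dict tree layout, the feature names (`pressure_bar`, `temp_c`), the thresholds, and the fault labels are all hypothetical.

```python
def tree_to_rules(node, conditions=()):
    """Turn a binary decision tree into one IF ... THEN ... rule per leaf.

    Internal nodes are dicts with "feature", "threshold", "left" (<=) and
    "right" (>) keys; leaves are dicts with a single "label" key.
    """
    if "label" in node:
        antecedent = " AND ".join(conditions) if conditions else "TRUE"
        return [f"IF {antecedent} THEN {node['label']}"]
    feature, threshold = node["feature"], node["threshold"]
    left = tree_to_rules(node["left"], conditions + (f"{feature} <= {threshold}",))
    right = tree_to_rules(node["right"], conditions + (f"{feature} > {threshold}",))
    return left + right


# Hypothetical tree for a hydraulic rig; all names and numbers are made up.
tree = {
    "feature": "pressure_bar", "threshold": 150.0,
    "left": {"label": "pump_leak"},
    "right": {
        "feature": "temp_c", "threshold": 60.0,
        "left": {"label": "healthy"},
        "right": {"label": "cooler_fault"},
    },
}
rules = tree_to_rules(tree)
```

Each resulting string is a conjunction of threshold tests, so the same path information could in principle be emitted as a SQL `WHERE` clause or a SPARQL `FILTER` instead of plain text.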

AdaGUM: An Adaptive Graph Updating Model-Based Anomaly Detection Method for Edge Computing Environment

Security and Communication Networks ◽

10.1155/2021/9954951 ◽

2021 ◽

Vol 2021 ◽

pp. 1-12

Author(s):

Xiang Yu ◽

Chun Shan ◽

Jilong Bian ◽

Xianfei Yang ◽

Ying Chen ◽

...

Keyword(s):

Internet Of Things ◽

Anomaly Detection ◽

Real Time ◽

Detection Method ◽

Edge Computing ◽

Sensor Data ◽

Computing Environment ◽

The Real ◽

Edge Node ◽

Model Based

With the rapid development of the Internet of Things (IoT), massive sensor data are being generated at an unprecedented rate by sensors deployed everywhere. As the number of IoT devices is estimated to grow to 25 billion by 2021, it is necessary to develop an effective and efficient method for detecting explicit or implicit anomalies in the real-time sensor data collected from these devices. Recent advances in edge computing have had a significant impact on anomaly detection in IoT. In this study, an adaptive graph updating model is first presented, based on which a novel anomaly detection method for the edge computing environment is then proposed. At the cloud center, unknown patterns are classified by a deep learning model. Based on the classification results, the feature graphs are updated periodically, and the classification results are constantly transmitted to each edge node, where a cache temporarily keeps the newly emerging anomalies or normal patterns until the edge node receives a newly updated feature graph. Finally, a series of comparison experiments are conducted to demonstrate the effectiveness of the proposed anomaly detection method for edge computing. The results show that the proposed method can detect anomalies in real-time sensor data efficiently and accurately. Moreover, the proposed method performs well when newly emerging patterns exist, whether they are anomalous or normal.

A Distributed Covert Channel of the Packet Ordering Enhancement Model Based on Data Compression

Computers Materials & Continua ◽

10.32604/cmc.2020.011219 ◽

2020 ◽

Vol 64 (3) ◽

pp. 2013-2030

Author(s):

Zhang Lejun ◽

Huang Tianwen ◽

Hu Xiaoyan ◽

Zhang Zhijie ◽

Wang Weizheng ◽

...

Keyword(s):

Data Compression ◽

Covert Channel ◽

Model Based

Deformable model based data compression for gesture recognition

Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004. ◽

10.1109/icpr.2004.1333829 ◽

2004 ◽

Cited By ~ 4

Author(s):

F. Cheneviere ◽

S. Boukir

Keyword(s):

Data Compression ◽

Gesture Recognition ◽

Deformable Model ◽

Model Based

Model-based data compression for vibration monitoring using Wireless Sensor Networks

Life-Cycle of Civil Engineering Systems - Life-Cycle of Structural Systems ◽

10.1201/b17618-16 ◽

2014 ◽

pp. 138-145 ◽

Cited By ~ 2

Author(s):

R Klis ◽

E Chatzi

Keyword(s):

Wireless Sensor Networks ◽

Sensor Networks ◽

Data Compression ◽

Vibration Monitoring ◽

Wireless Sensor ◽

Model Based

Camera-Aware Multi-Resolution Analysis for Raw Image Sensor Data Compression

IEEE Transactions on Image Processing ◽

10.1109/tip.2018.2794179 ◽

2018 ◽

Vol 27 (6) ◽

pp. 2806-2817 ◽

Cited By ~ 8

Author(s):

Yeejin Lee ◽

Keigo Hirakawa ◽

Truong Q. Nguyen

Keyword(s):

Data Compression ◽

Image Sensor ◽

Sensor Data ◽

Multi Resolution Analysis ◽

Resolution Analysis
