A Discrete Hidden Markov Model for SMS Spam Detection

Tian Xia; Xuemin Chen

doi:10.3390/app10145011

A Discrete Hidden Markov Model for SMS Spam Detection

Applied Sciences ◽

10.3390/app10145011 ◽

2020 ◽

Vol 10 (14) ◽

pp. 5011 ◽

Cited By ~ 2

Author(s):

Tian Xia ◽

Xuemin Chen

Keyword(s):

Machine Learning ◽

Markov Model ◽

Hidden Markov Model ◽

Word Order ◽

Short Term Memory ◽

Hidden Markov ◽

Support Vector ◽

Spam Detection ◽

Term Frequency ◽

Length Limitation

Many machine learning methods have been applied for short messaging service (SMS) spam detection, including traditional methods such as naïve Bayes (NB), vector space model (VSM), and support vector machine (SVM), and novel methods such as long short-term memory (LSTM) and the convolutional neural network (CNN). These methods are based on the well-known bag of words (BoW) model, which assumes documents are unordered collection of words. This assumption overlooks an important piece of information, i.e., word order. Moreover, the term frequency, which counts the number of occurrences of each word in SMS, is unable to distinguish the importance of words, due to the length limitation of SMS. This paper proposes a new method based on the discrete hidden Markov model (HMM) to use the word order information and to solve the low term frequency issue in SMS spam detection. The popularly adopted SMS spam dataset from the UCI machine learning repository is used for performance analysis of the proposed HMM method. The overall performance is compatible with deep learning by employing CNN and LSTM models. A Chinese SMS spam dataset with 2000 messages is used for further performance evaluation. Experiments show that the proposed HMM method is not language-sensitive and can identify spam with high accuracy on both datasets.

Download Full-text

Bidirectional LSTM Recurrent Neural Network Plus Hidden Markov Model for Wearable Sensor-Based Dynamic State Estimation

ASME Letters in Dynamic Systems and Control ◽

10.1115/1.4046685 ◽

2020 ◽

Vol 1 (2) ◽

Author(s):

Ritika Sibal ◽

Ding Zhang ◽

Julie Rocho-Levine ◽

K. Alex Shorter ◽

Kira Barton

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

State Estimation ◽

Short Term Memory ◽

Hidden Markov ◽

Training Data ◽

Dynamic State ◽

Support Vector ◽

Similar Amount ◽

Dynamic State Estimation

Abstract Behavior of animals living in the wild is often studied using visual observations made by trained experts. However, these observations tend to be used to classify behavior during discrete time periods and become more difficult when used to monitor multiple individuals for days or weeks. In this work, we present automatic tools to enable efficient behavior and dynamic state estimation/classification from data collected with animal borne bio-logging tags, without the need for statistical feature engineering. A combined framework of an long short-term memory (LSTM) network and a hidden Markov model (HMM) was developed to exploit sequential temporal information in raw motion data at two levels: within and between windows. Taking a moving window data segmentation approach, LSTM estimates the dynamic state corresponding to each window by parsing the contiguous raw data points within the window. HMM then links all of the individual window estimations and further improves the overall estimation. A case study with bottlenose dolphins was conducted to demonstrate the approach. The combined LSTM–HMM method achieved a 6% improvement over conventional methods such as K-nearest neighbor (KNN) and support vector machine (SVM), pushing the accuracy above 90%. In addition to performance improvements, the proposed method requires a similar amount of training data to traditional machine learning methods, making the method easily adaptable to new tasks.

Download Full-text

Fault Diagnosis Method of Low Noise Amplifier Based on Support Vector Machine and Hidden Markov Model

Journal of Electronic Testing ◽

10.1007/s10836-021-05938-0 ◽

2021 ◽

Author(s):

Lu Sun ◽

Yang Li ◽

Han Du ◽

Peipei Liang ◽

Fushun Nian

Keyword(s):

Support Vector Machine ◽

Fault Diagnosis ◽

Markov Model ◽

Hidden Markov Model ◽

Hidden Markov ◽

Low Noise ◽

Low Noise Amplifier ◽

Support Vector ◽

Diagnosis Method ◽

Noise Amplifier

Download Full-text

Gait Phase Detection Based on a Foot-Mounted Inertial Sensor Using Long Short-Term Memory Enhanced by Hidden Markov Model

10.23919/icac50006.2021.9594161 ◽

2021 ◽

Author(s):

Zhipeng Yu ◽

Jianghai Zhao ◽

Xiong Zhou ◽

Kun Liu ◽

Yu Yan

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Short Term Memory ◽

Hidden Markov ◽

Inertial Sensor ◽

Phase Detection ◽

Short Term ◽

Term Memory ◽

Gait Phase ◽

Long Short Term Memory

Download Full-text

Prediksi Trend Pergerakan Harga Saham dengan Hidden Markov Model (HMM) dan Support Vector Machine (SVM)

Jurnal Matematika Integratif ◽

10.24198/jmi.v10.n1.10181.19-24 ◽

2020 ◽

Vol 10 (1) ◽

pp. 19

Author(s):

Firdaniza Firdaniza ◽

Jondri Jondri

Keyword(s):

Support Vector Machine ◽

Markov Model ◽

Hidden Markov Model ◽

Hidden Markov ◽

Support Vector

Download Full-text

Fault diagnosis approach based on hidden Markov model and support vector machine

Chinese Journal of Mechanical Engineering ◽

10.3901/cjme.2007.05.092 ◽

2007 ◽

Vol 20 (05) ◽

pp. 92 ◽

Cited By ~ 5

Author(s):

Guanjun LIU

Keyword(s):

Support Vector Machine ◽

Fault Diagnosis ◽

Markov Model ◽

Hidden Markov Model ◽

Hidden Markov ◽

Support Vector ◽

Diagnosis Approach

Download Full-text

Human gait based gender identification system using Hidden Markov Model and Support Vector Machines

International Conference on Computing, Communication & Automation ◽

10.1109/ccaa.2015.7148386 ◽

2015 ◽

Cited By ~ 8

Author(s):

Deepjoy Das ◽

Alok Chakrabarty

Keyword(s):

Support Vector Machines ◽

Markov Model ◽

Hidden Markov Model ◽

Hidden Markov ◽

Human Gait ◽

Support Vector ◽

Identification System ◽

Gender Identification ◽

Vector Machines

Download Full-text

Wavelet Transform to Hybrid Support Vector Machine and Hidden Markov Model for Speech Recognition

2005 IEEE International Symposium on Circuits and Systems ◽

10.1109/iscas.2005.1465466 ◽

2005 ◽

Cited By ~ 1

Author(s):

Yu Shao ◽

Chip-Hong Chang

Keyword(s):

Support Vector Machine ◽

Wavelet Transform ◽

Speech Recognition ◽

Markov Model ◽

Hidden Markov Model ◽

Hidden Markov ◽

Support Vector

Download Full-text

Activity Recognition in Surfing - A Comparative Study between Hidden Markov Model and Support Vector Machine

Procedia Engineering ◽

10.1016/j.proeng.2016.06.279 ◽

2016 ◽

Vol 147 ◽

pp. 912-917 ◽

Cited By ~ 6

Author(s):

Hannes Hoettinger ◽

Franziska Mally ◽

Anton Sabo

Keyword(s):

Support Vector Machine ◽

Comparative Study ◽

Markov Model ◽

Hidden Markov Model ◽

Activity Recognition ◽

Hidden Markov ◽

Support Vector

Download Full-text

Efficacy of hidden markov model over support vector machine on multiclass classification of healthy and cancerous cervical tissues

Optical Diagnostics and Sensing XVIII: Toward Point-of-Care Diagnostics ◽

10.1117/12.2291485 ◽

2018 ◽

Author(s):

Indrajit Kurmi ◽

Sukanya Mukherjee ◽

Ritwik Barman ◽

Prasanta K. Panigrahi ◽

Sabyasachi Mukhopadhyay ◽

...

Keyword(s):

Support Vector Machine ◽

Markov Model ◽

Hidden Markov Model ◽

Hidden Markov ◽

Multiclass Classification ◽

Support Vector

Download Full-text

Multi-Layer Hidden Markov Model Based Intrusion Detection System

Machine Learning and Knowledge Extraction ◽

10.3390/make1010017 ◽

2018 ◽

Vol 1 (1) ◽

pp. 265-286 ◽

Cited By ~ 7

Author(s):

Wondimu Zegeye ◽

Richard Dean ◽

Farzad Moazzami

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Markov Model ◽

Hidden Markov Model ◽

Hidden Markov ◽

Detection System ◽

Traffic Monitoring ◽

Machine Learning Algorithms ◽

Full Potential ◽

Multi Phase

The all IP nature of the next generation (5G) networks is going to open a lot of doors for new vulnerabilities which are going to be challenging in preventing the risk associated with them. Majority of these vulnerabilities might be impossible to detect with simple networking traffic monitoring tools. Intrusion Detection Systems (IDS) which rely on machine learning and artificial intelligence can significantly improve network defense against intruders. This technology can be trained to learn and identify uncommon patterns in massive volume of traffic and notify, using such as alert flags, system administrators for additional investigation. This paper proposes an IDS design which makes use of machine learning algorithms such as Hidden Markov Model (HMM) using a multi-layer approach. This approach has been developed and verified to resolve the common flaws in the application of HMM to IDS commonly referred as the curse of dimensionality. It factors a huge problem of immense dimensionality to a discrete set of manageable and reliable elements. The multi-layer approach can be expanded beyond 2 layers to capture multi-phase attacks over longer spans of time. A pyramid of HMMs can resolve disparate digital events and signatures across protocols and platforms to actionable information where lower layers identify discrete events (such as network scan) and higher layers new states which are the result of multi-phase events of the lower layers. The concepts of this novel approach have been developed but the full potential has not been demonstrated.

Download Full-text