Towards the Discovery of Influencers to Follow in Micro-Blogs (Twitter) by Detecting Topics in Posted Messages (Tweets)

Mubashir Ali; Anees Baqir; Giuseppe Psaila; Sayyam Malik

doi:10.3390/app10165715

Towards the Discovery of Influencers to Follow in Micro-Blogs (Twitter) by Detecting Topics in Posted Messages (Tweets)

Applied Sciences ◽

10.3390/app10165715 ◽

2020 ◽

Vol 10 (16) ◽

pp. 5715 ◽

Cited By ~ 1

Author(s):

Mubashir Ali ◽

Anees Baqir ◽

Giuseppe Psaila ◽

Sayyam Malik

Keyword(s):

Text Classification ◽

Web Sites ◽

Execution Time ◽

Cost Benefit ◽

Classification Model ◽

Classification Problems ◽

Large Pool ◽

Benefit Function ◽

Basic Brick

Micro-blogs, such as Twitter, have become important tools to share opinions and information among users. Messages concerning any topic are daily posted. A message posted by a given user reaches all the users that decided to follow her/him. Some users post many messages, because they aim at being recognized as influencers, typically on specific topics. How a user can discover influencers concerned with her/his interest? Micro-blog apps and web sites lack a functionality to recommend users with influencers, on the basis of the content of posted messages. In this paper, we envision such a scenario and we identify the problem that constitutes the basic brick for developing a recommender of (possibly influencer) users: training a classification model by exploiting messages labeled with topical classes, so as this model can be used to classify unlabeled messages, to let the hidden topic they talk about emerge. Specifically, the paper reports the investigation activity we performed to demonstrate the suitability of our idea. To perform the investigation, we developed an investigation framework that exploits various patterns for extracting features from within messages (labeled with topical classes) in conjunction with the mostly-used classifiers for text classification problems. By means of the investigation framework, we were able to perform a large pool of experiments, that allowed us to evaluate all the combinations of feature patterns with classifiers. By means of a cost-benefit function called “Suitability”, that combines accuracy with execution time, we were able to demonstrate that a technique for discovering topics from within messages suitable for the application context is available.

Download Full-text

An Improved Convolutional Neural Network for Text Classification

Journal of Physics Conference Series ◽

10.1088/1742-6596/2066/1/012091 ◽

2021 ◽

Vol 2066 (1) ◽

pp. 012091

Author(s):

Xiaojing Fan ◽

A Runa ◽

Zhili Pei ◽

Mingyang Jiang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Text Classification ◽

Classification Accuracy ◽

Classification Performance ◽

Classification Model ◽

Classification Problems ◽

Training Time ◽

Accuracy And Precision ◽

And Training

Abstract This paper studies the text classification based on deep learning. Aiming at the problem of over fitting and training time consuming of CNN text classification model, a SDCNN model is constructed based on sparse dropout convolutional neural network. Experimental results show that, compared with CNN, SDCNN further improves the classification performance of the model, and its classification accuracy and precision can reach 98.96% and 85.61%, respectively, indicating that SDCNN has more advantages in text classification problems.

Download Full-text

The Cost-Benefit Measure for Local Public Policies: Social Benefit Function and the Compensation Test

SSRN Electronic Journal ◽

10.2139/ssrn.2366236 ◽

2013 ◽

Author(s):

Tatsuhito Kono ◽

Akio Kishi

Keyword(s):

Public Policies ◽

Cost Benefit ◽

Social Benefit ◽

Benefit Function ◽

The Cost

Download Full-text

A Text Classification Model Base On Region Embedding AND LSTM

Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence ◽

10.1145/3404555.3404643 ◽

2020 ◽

Author(s):

Ying Li ◽

Ming Ye

Keyword(s):

Text Classification ◽

Classification Model ◽

Model Base

Download Full-text

Hybrid Chinese text classification model based on pretraining model

Journal of Physics Conference Series ◽

10.1088/1742-6596/1961/1/012002 ◽

2021 ◽

Vol 1961 (1) ◽

pp. 012002

Author(s):

Xing Zhaoye ◽

Liu Xiaoqun ◽

Sun Peijie

Keyword(s):

Chinese Text ◽

Text Classification ◽

Classification Model ◽

Chinese Text Classification ◽

Model Based

Download Full-text

WNTC: An Efficient Weight News Text Classification Model

2021 Asia-Pacific Conference on Communications Technology and Computer Science (ACCTCS) ◽

10.1109/acctcs52002.2021.00061 ◽

2021 ◽

Author(s):

Pei Yan ◽

Hongxuan Li ◽

Zhipeng Wang

Keyword(s):

Text Classification ◽

Classification Model

Download Full-text

Review of short-text classification

International Journal of Web Information Systems ◽

10.1108/ijwis-12-2017-0083 ◽

2019 ◽

Vol 15 (2) ◽

pp. 155-182 ◽

Cited By ~ 5

Author(s):

Issa Alsmadi ◽

Keng Hoon Gan

Keyword(s):

Social Networks ◽

Text Classification ◽

Development Trend ◽

Practical Reasons ◽

Significant Implication ◽

Classification Problems ◽

Content Type ◽

Short Text ◽

Promising Area ◽

Low Performance

PurposeRapid developments in social networks and their usage in everyday life have caused an explosion in the amount of short electronic documents. Thus, the need to classify this type of document based on their content has a significant implication in many applications. The need to classify these documents in relevant classes according to their text contents should be interested in many practical reasons. Short-text classification is an essential step in many applications, such as spam filtering, sentiment analysis, Twitter personalization, customer review and many other applications related to social networks. Reviews on short text and its application are limited. Thus, this paper aims to discuss the characteristics of short text, its challenges and difficulties in classification. The paper attempt to introduce all stages in principle classification, the technique used in each stage and the possible development trend in each stage.Design/methodology/approachThe paper as a review of the main aspect of short-text classification. The paper is structured based on the classification task stage.FindingsThis paper discusses related issues and approaches to these problems. Further research could be conducted to address the challenges in short texts and avoid poor accuracy in classification. Problems in low performance can be solved by using optimized solutions, such as genetic algorithms that are powerful in enhancing the quality of selected features. Soft computing solution has a fuzzy logic that makes short-text problems a promising area of research.Originality/valueUsing a powerful short-text classification method significantly affects many applications in terms of efficiency enhancement. Current solutions still have low performance, implying the need for improvement. This paper discusses related issues and approaches to these problems.

Download Full-text

ASSOCIATIVE CLASSIFICATION OF MAMMOGRAMS BASED ON PARALLEL MINING OF IMAGE BLOCKS

Biomedical Engineering Applications Basis and Communications ◽

10.4015/s1016237212500470 ◽

2012 ◽

Vol 24 (06) ◽

pp. 513-524

Author(s):

Mohsen Alavash Shooshtari ◽

Keivan Maghooli ◽

Kambiz Badie

Keyword(s):

Association Rules ◽

Classification Systems ◽

Classification Model ◽

Automated Classification ◽

Associative Classification ◽

Classification Problems ◽

Association Rules Mining ◽

Parallel Mining ◽

Transactional Databases ◽

Unique Decision

One of the main objectives of data mining as a promising multidisciplinary field in computer science is to provide a classification model to be used for decision support purposes. In the medical imaging domain, mammograms classification is a difficult diagnostic task which calls for development of automated classification systems. Associative classification, as a special case of association rules mining, has been adopted in classification problems for years. In this paper, an associative classification framework based on parallel mining of image blocks is proposed to be used for mammograms discrimination. Indeed, association rules mining is applied to a commonly used mammography image database to classify digital mammograms into three categories, namely normal, benign and malign. In order to do so, first images are preprocessed and then features are extracted from non-overlapping image blocks and discretized for rule discovery. Association rules are then discovered through parallel mining of transactional databases which correspond to the image blocks, and finally are used within a unique decision-making scheme to predict the class of unknown samples. Finally, experiments are conducted to assess the effectiveness of the proposed framework. Results show that the proposed framework proved successful in terms of accuracy, precision, and recall, and suggest that the framework could be used as the core of any future associative classifier to support mammograms discrimination.

Download Full-text

A Text Classification Model to Identify Performance Bonds Requirement in Public Bidding Notices

Advances in Intelligent Systems and Computing - Advances in Information and Communication ◽

10.1007/978-3-030-39442-4_50 ◽

2020 ◽

pp. 678-691

Author(s):

Urias Cruz da Cunha ◽

Ricardo Silva Carvalho ◽

Alexandre Zaghetto

Keyword(s):

Text Classification ◽

Classification Model

Download Full-text

Text Classification of Public Feedbacks using Convolutional Neural Network Based on Differential Evolution Algorithm

International Journal of Computers Communications & Control ◽

10.15837/ijccc.2019.1.3420 ◽

2019 ◽

Vol 14 (1) ◽

pp. 124-134 ◽

Cited By ~ 2

Author(s):

Shuai Zhang ◽

Yong Chen ◽

Xiaoling Huang ◽

Yishuai Cai

Keyword(s):

Neural Network ◽

Differential Evolution ◽

Convolutional Neural Network ◽

Text Classification ◽

Differential Evolution Algorithm ◽

Classification Performance ◽

Classification Model ◽

Evolution Algorithm ◽

Classification Prediction

Online feedback is an effective way of communication between government departments and citizens. However, the daily high number of public feedbacks has increased the burden on government administrators. The deep learning method is good at automatically analyzing and extracting deep features of data, and then improving the accuracy of classification prediction. In this study, we aim to use the text classification model to achieve the automatic classification of public feedbacks to reduce the work pressure of administrator. In particular, a convolutional neural network model combined with word embedding and optimized by differential evolution algorithm is adopted. At the same time, we compared it with seven common text classification models, and the results show that the model we explored has good classification performance under different evaluation metrics, including accuracy, precision, recall, and F1-score.

Download Full-text

Enhancing the performance of cancer text classification model based on cancer hallmarks

IAES International Journal of Artificial Intelligence (IJ-AI) ◽

10.11591/ijai.v10.i2.pp316-323 ◽

2021 ◽

Vol 10 (2) ◽

pp. 316

Author(s):

Noha Ali ◽

Ahmed H. AbuEl-Atta ◽

Hala H. Zayed

Keyword(s):

Neural Network ◽

Language Processing ◽

Text Classification ◽

State Of The Art ◽

Classification Model ◽

Biomedical Text ◽

Cancer Hallmarks ◽

Embedding Technique ◽

Proposed Model ◽

Biomedical Text Classification

<span id="docs-internal-guid-cb130a3a-7fff-3e11-ae3d-ad2310e265f8"><span>Deep learning (DL) algorithms achieved state-of-the-art performance in computer vision, speech recognition, and natural language processing (NLP). In this paper, we enhance the convolutional neural network (CNN) algorithm to classify cancer articles according to cancer hallmarks. The model implements a recent word embedding technique in the embedding layer. This technique uses the concept of distributed phrase representation and multi-word phrases embedding. The proposed model enhances the performance of the existing model used for biomedical text classification. The result of the proposed model overcomes the previous model by achieving an F-score equal to 83.87% using an unsupervised technique that trained on PubMed abstracts called PMC vectors (PMCVec) embedding. Also, we made another experiment on the same dataset using the recurrent neural network (RNN) algorithm with two different word embeddings Google news and PMCVec which achieving F-score equal to 74.9% and 76.26%, respectively.</span></span>

Download Full-text