scholarly journals A Framework for Identifying Influential People by Analyzing Social Media Data

2020 ◽  
Vol 10 (24) ◽  
pp. 8773
Author(s):  
Md. Sabbir Al Ahsan ◽  
Mohammad Shamsul Arefin ◽  
A. S. M. Kayes ◽  
Mohammad Hammoudeh ◽  
Omar Aldabbas

In this paper, we introduce a new framework for identifying the most influential people from social sensor networks. Selecting influential people from social networks is a complicated task as it depends on many metrics like the network of friends, followers, reactions, comments, shares, etc. (e.g., friends-of-a-friend, friends-of-a-friend-of-a-friend). Data on social media are increasing day-by-day at an enormous rate. It is also a challenge to store and process these data. Towards this goal, we use Hadoop to store data and Apache Spark for the fast computation of the data. To select influential people, we apply the mechanisms of skyline query and top-k query. To the best of our knowledge, this is the first work to apply the Apache Spark framework to identify influential people on social sensor network, such as online social media. Our proposed mechanism can find influential people very quickly and efficiently on the data pattern of Facebook.

Sentiment analysis is one of the heated topic in the field of text mining. As the social media data is increased day by day the main need of the data scientists is to classify the data so that it can be further used for decision making or knowledge discovery. Now –a-days everything and everyone available online so to check the latest trends in business or in daily life one must consider the online data. The main focus of sentiment analysis is to focus on positive or negative comments so that a well define picture is created that what is trending or not but the sarcasm manipulates the data as in sarcastic comment negative comment consider as positive because of the presence of positive words in the comment or data so it is necessary to detect the sarcasm in online data . The data on social media is available in various languages so sentiment analysis in regional languages is also a main step . In the proposed work we focus on two languages i.e Punjabi and English. Here we use deep learning based neural networks for the sarcasm detection in English as well as Punjabi language. In the proposed work we consider three datasets i.e. balanced English dataset, Balanced Punjabi Dataset and unbalanced Punjabi dataset. We used six different models to check the accuracy of the classified data the models we used are LSTM with word embedding layer, BiLSTM with , LSTM+LSTM, BiLSTM+BiLSTM, LSTM+BiLSTM, CNN respectively. LSTM provide better accuracy for balanced Punjabi and English dataset i.e. 95.63% and 94.17% respectively. The accuracy for unbalanced Punjabi dataset is provided by BiLSTM i.e.96.31%.


2021 ◽  
Vol 12 ◽  
Author(s):  
Muhammad Usman Tariq ◽  
Muhammad Babar ◽  
Marc Poulin ◽  
Akmal Saeed Khattak ◽  
Mohammad Dahman Alshehri ◽  
...  

Intelligent big data analysis is an evolving pattern in the age of big data science and artificial intelligence (AI). Analysis of organized data has been very successful, but analyzing human behavior using social media data becomes challenging. The social media data comprises a vast and unstructured format of data sources that can include likes, comments, tweets, shares, and views. Data analytics of social media data became a challenging task for companies, such as Dailymotion, that have billions of daily users and vast numbers of comments, likes, and views. Social media data is created in a significant amount and at a tremendous pace. There is a very high volume to store, sort, process, and carefully study the data for making possible decisions. This article proposes an architecture using a big data analytics mechanism to efficiently and logically process the huge social media datasets. The proposed architecture is composed of three layers. The main objective of the project is to demonstrate Apache Spark parallel processing and distributed framework technologies with other storage and processing mechanisms. The social media data generated from Dailymotion is used in this article to demonstrate the benefits of this architecture. The project utilized the application programming interface (API) of Dailymotion, allowing it to incorporate functions suitable to fetch and view information. The API key is generated to fetch information of public channel data in the form of text files. Hive storage machinist is utilized with Apache Spark for efficient data processing. The effectiveness of the proposed architecture is also highlighted.


Author(s):  
Igor Araujo ◽  
Paulo Henrique Lopes Rettore ◽  
João Guilherme Maia de Menezes

Nowadays, understanding urban mobility, transit, people viewpoint, and social behaviors has been the focus of many research and investments. However, data access is restricted to private companies and governments. In addition, the costs to create a sensor infrastructure on a given area is prohibitive. Then, using Location-Based Social Media (LBSM) may provide a new way to better comprehend the social behaviors, by the use of a users viewpoint. In this work, we propose the use of LBSM as participatory sensing, designing the Participatory Social Sensor (PSS), a friendly framework to social media data acquisition and analysis. We develop the Twitter data acquisition and analysis process, aiming to achieve the user application goals through a file setup,where the user specifies the spatial area, temporal interval, tags, and other parameters. As a result, the PSS shows a set of visual analysis which provides a context overview, allowing an easy way to researchers make-decision. A case study, Detection and Enrichment Service for Road Events Based on Heterogeneous Data Merger for VANETs, based on PSS framework was published in the current conference.


2020 ◽  
Vol 34 (01) ◽  
pp. 346-353 ◽  
Author(s):  
Mansi Agarwal ◽  
Maitree Leekha ◽  
Ramit Sawhney ◽  
Rajiv Ratn Shah

In times of a disaster, the information available on social media can be useful for several humanitarian tasks as disseminating messages on social media is quick and easily accessible. Disaster damage assessment is inherently multi-modal, yet most existing work on damage identification has focused solely on building generic classification models that rely exclusively on text or image analysis of online social media sessions (e.g., posts). Despite their empirical success, these efforts ignore the multi-modal information manifested in social media data. Conventionally, when information from various modalities is presented together, it often exhibits complementary insights about the application domain and facilitates better learning performance. In this work, we present Crisis-DIAS, a multi-modal sequential damage identification, and severity detection system. We aim to support disaster management and aid in planning by analyzing and exploiting the impact of linguistic cues on a unimodal visual system. Through extensive qualitative, quantitative and theoretical analysis on a real-world multi-modal social media dataset, we show that the Crisis-DIAS framework is superior to the state-of-the-art damage assessment models in terms of bias, responsiveness, computational efficiency, and assessment performance.


2018 ◽  
Author(s):  
Bernard J. Jansen ◽  
Soon-gyo Jung ◽  
Joni Salminen ◽  
Jisun An ◽  
Haewoon Kwak

2016 ◽  
Vol 28 (3) ◽  
pp. 268-274 ◽  
Author(s):  
Feng Yu ◽  
Theodore Peng ◽  
Kaiping Peng ◽  
Sam Xianjun Zheng ◽  
Zhiyuan Liu

2017 ◽  
Vol 10 (3) ◽  
pp. 644-652
Author(s):  
Asha Asha ◽  
Dr. Balkishan

Escalating crimes on digital facet alarms the law enforcement bodies to keep a gaze on online activities which involve massive amount of data. This will raise a need to detect suspicious activities on online available social media data by optimizing investigations using data mining tools. This paper intends to throw some light on the data mining techniques which are designed and developed for closely examining social media data for suspicious activities and profiles in different domains. Additionally, this study will categorize the techniques under various groups highlighting their important features, challenges and application realm.


Sign in / Sign up

Export Citation Format

Share Document