Performance of System for Analyzing Diffusion of Social Media Messages in Real Time

Author(s):  
Miki Enoki ◽  
Issei Yoshida ◽  
Masato Oguchi
Author(s):  
Erica Briscoe ◽  
Scott Appling ◽  
Edward Clarkson ◽  
Nikolay Lipskiy ◽  
James Tyson ◽  
...  

ObjectiveThe objective of this analysis is to leverage recent advances innatural language processing (NLP) to develop new methods andsystem capabilities for processing social media (Twitter messages)for situational awareness (SA), syndromic surveillance (SS), andevent-based surveillance (EBS). Specifically, we evaluated the useof human-in-the-loop semantic analysis to assist public health (PH)SA stakeholders in SS and EBS using massive amounts of publiclyavailable social media data.IntroductionSocial media messages are often short, informal, and ungrammatical.They frequently involve text, images, audio, or video, which makesthe identification of useful information difficult. This complexityreduces the efficacy of standard information extraction techniques1.However, recent advances in NLP, especially methods tailoredto social media2, have shown promise in improving real-time PHsurveillance and emergency response3. Surveillance data derived fromsemantic analysis combined with traditional surveillance processeshas potential to improve event detection and characterization. TheCDC Office of Public Health Preparedness and Response (OPHPR),Division of Emergency Operations (DEO) and the Georgia TechResearch Institute have collaborated on the advancement of PH SAthrough development of new approaches in using semantic analysisfor social media.MethodsTo understand how computational methods may benefit SS andEBS, we studied an iterative refinement process, in which the datauser actively cultivated text-based topics (“semantic culling”) in asemi-automated SS process. This ‘human-in-the-loop’ process wascritical for creating accurate and efficient extraction functions in large,dynamic volumes of data. The general process involved identifyinga set of expert-supplied keywords, which were used to collect aninitial set of social media messages. For purposes of this analysisresearchers applied topic modeling to categorize related messages intoclusters. Topic modeling uses statistical techniques to semanticallycluster and automatically determine salient aggregations. A user thensemantically culled messages according to their PH relevance.In June 2016, researchers collected 7,489 worldwide English-language Twitter messages (tweets) and compared three samplingmethods: a baseline random sample (C1, n=2700), a keyword-basedsample (C2, n=2689), and one gathered after semantically cullingC2 topics of irrelevant messages (C3, n=2100). Researchers utilizeda software tool, Luminoso Compass4, to sample and perform topicmodeling using its real-time modeling and Twitter integrationfeatures. For C2 and C3, researchers sampled tweets that theLuminoso service matched to both clinical and layman definitions ofRash, Gastro-Intestinal syndromes5, and Zika-like symptoms. Laymanterms were derived from clinical definitions from plain languagemedical thesauri. ANOVA statistics were calculated using SPSSsoftware, version. Post-hoc pairwise comparisons were completedusing ANOVA Turkey’s honest significant difference (HSD) test.ResultsAn ANOVA was conducted, finding the following mean relevancevalues: 3% (+/- 0.01%), 24% (+/- 6.6%) and 27% (+/- 9.4%)respectively for C1, C2, and C3. Post-hoc pairwise comparison testsshowed the percentages of discovered messages related to the eventtweets using C2 and C3 methods were significantly higher than forthe C1 method (random sampling) (p<0.05). This indicates that thehuman-in-the-loop approach provides benefits in filtering socialmedia data for SS and ESB; notably, this increase is on the basis ofa single iteration of semantic culling; subsequent iterations could beexpected to increase the benefits.ConclusionsThis work demonstrates the benefits of incorporating non-traditional data sources into SS and EBS. It was shown that an NLP-based extraction method in combination with human-in-the-loopsemantic analysis may enhance the potential value of social media(Twitter) for SS and EBS. It also supports the claim that advancedanalytical tools for processing non-traditional SA, SS, and EBSsources, including social media, have the potential to enhance diseasedetection, risk assessment, and decision support, by reducing the timeit takes to identify public health events.


2019 ◽  
Vol 118 (6) ◽  
pp. 97-99
Author(s):  
Arockia Jeyasheela A ◽  
Dr.S. Chandramohan

This study is discussed about the viral marketing. It is a one of the key success of marketing. This paper gave the techniques of viral marketing. It can be delivered word of mouth. It can be created by both the representatives of a company and consumer (individuals or communities). The right viral message with go to right consumer to the right time. Viral marketing is easy to attract the consumer. It is most important advertising to consumer. It involves consumer perception, organization contribution, blogs, SMO (Social Media Optimize), SEO (Social Engine Optimize). Principles of viral marketing are social profile gathering, Proximity Market, Real time Key word density.


2021 ◽  
pp. 193896552199308
Author(s):  
Kathryn A. LaTour ◽  
Ana Brant

Most hospitality operators use social media in their communications as a means to communicate brand image and provide information to customers. Our focus is on a two-way exchange whereby a customer’s social posting is reacted to in real-time by the provider to enhance the customer’s current experience. Using social media in this way is new, and the provider needs to carefully balance privacy and personalization. We describe the process by which the Dorchester Collection Customer Experience (CX) Team approached its social listening program and share lessons to identify best practices for hospitality operators wanting to delight their customers through insights gained from social listening.


2021 ◽  
Vol 7 (1) ◽  
Author(s):  
Suppawong Tuarob ◽  
Poom Wettayakorn ◽  
Ponpat Phetchai ◽  
Siripong Traivijitkhun ◽  
Sunghoon Lim ◽  
...  

AbstractThe explosion of online information with the recent advent of digital technology in information processing, information storing, information sharing, natural language processing, and text mining techniques has enabled stock investors to uncover market movement and volatility from heterogeneous content. For example, a typical stock market investor reads the news, explores market sentiment, and analyzes technical details in order to make a sound decision prior to purchasing or selling a particular company’s stock. However, capturing a dynamic stock market trend is challenging owing to high fluctuation and the non-stationary nature of the stock market. Although existing studies have attempted to enhance stock prediction, few have provided a complete decision-support system for investors to retrieve real-time data from multiple sources and extract insightful information for sound decision-making. To address the above challenge, we propose a unified solution for data collection, analysis, and visualization in real-time stock market prediction to retrieve and process relevant financial data from news articles, social media, and company technical information. We aim to provide not only useful information for stock investors but also meaningful visualization that enables investors to effectively interpret storyline events affecting stock prices. Specifically, we utilize an ensemble stacking of diversified machine-learning-based estimators and innovative contextual feature engineering to predict the next day’s stock prices. Experiment results show that our proposed stock forecasting method outperforms a traditional baseline with an average mean absolute percentage error of 0.93. Our findings confirm that leveraging an ensemble scheme of machine learning methods with contextual information improves stock prediction performance. Finally, our study could be further extended to a wide variety of innovative financial applications that seek to incorporate external insight from contextual information such as large-scale online news articles and social media data.


2021 ◽  
Vol 4 (1) ◽  
Author(s):  
James T. H. Teo ◽  
Vlad Dinu ◽  
William Bernal ◽  
Phil Davidson ◽  
Vitaliy Oliynyk ◽  
...  

AbstractAnalyses of search engine and social media feeds have been attempted for infectious disease outbreaks, but have been found to be susceptible to artefactual distortions from health scares or keyword spamming in social media or the public internet. We describe an approach using real-time aggregation of keywords and phrases of freetext from real-time clinician-generated documentation in electronic health records to produce a customisable real-time viral pneumonia signal providing up to 4 days warning for secondary care capacity planning. This low-cost approach is open-source, is locally customisable, is not dependent on any specific electronic health record system and can provide an ensemble of signals if deployed at multiple organisational scales.


2021 ◽  
pp. 016555152110077
Author(s):  
Sulong Zhou ◽  
Pengyu Kan ◽  
Qunying Huang ◽  
Janet Silbernagel

Natural disasters cause significant damage, casualties and economical losses. Twitter has been used to support prompt disaster response and management because people tend to communicate and spread information on public social media platforms during disaster events. To retrieve real-time situational awareness (SA) information from tweets, the most effective way to mine text is using natural language processing (NLP). Among the advanced NLP models, the supervised approach can classify tweets into different categories to gain insight and leverage useful SA information from social media data. However, high-performing supervised models require domain knowledge to specify categories and involve costly labelling tasks. This research proposes a guided latent Dirichlet allocation (LDA) workflow to investigate temporal latent topics from tweets during a recent disaster event, the 2020 Hurricane Laura. With integration of prior knowledge, a coherence model, LDA topics visualisation and validation from official reports, our guided approach reveals that most tweets contain several latent topics during the 10-day period of Hurricane Laura. This result indicates that state-of-the-art supervised models have not fully utilised tweet information because they only assign each tweet a single label. In contrast, our model can not only identify emerging topics during different disaster events but also provides multilabel references to the classification schema. In addition, our results can help to quickly identify and extract SA information to responders, stakeholders and the general public so that they can adopt timely responsive strategies and wisely allocate resource during Hurricane events.


2020 ◽  
Vol 7 (Supplement_1) ◽  
pp. S714-S715
Author(s):  
Jean-Etienne Poirrier ◽  
Theodore Caputi ◽  
John Ayers ◽  
Mark Dredze ◽  
Sara Poston ◽  
...  

Abstract Background A small number of powerful users (“influencers”) dominates conversations on social media platforms: less than 1% of Twitter accounts have at least 3,000 followers and even fewer have hundreds of thousands or millions of followers. Beyond simple metrics (number of tweets, retweets...) little is known about these “influencers”, particularly in relation to their role in shaping online narratives about vaccines. Our goal was to describe influential Twitter accounts that are driving conversations about vaccines and present new metrics of influence. Methods Using publicly-available data from Twitter, we selected posts from 1-Jan-2016 to 31-Dec-2018 and extracted the top 5% of accounts tweeting about vaccines with the most followers. Using automated classifiers, we determined the location of these accounts, and grouped them into those that primarily tweet pro- versus anti-vaccine content. We further characterized the demographics of these influencer accounts. Results From 25,381 vaccine-related tweets available in our sample representing 10,607 users, 530 accounts represented the top 5% by number of followers. These accounts had on average 1,608,637 followers (standard deviation=5,063,421) and 340,390 median followers. Among the accounts for which sentiment was successfully estimated by the classifier, 10.4% (n=55) posted anti-vaccine content and 33.6% (n=178) posted pro-vaccine content. Of the 55 anti-vaccine accounts, 50% (n=18) of the accounts for which location was successfully determined were from the United States. Of the 178 pro-vaccine accounts, 42.5% (n=54) were from the United States. Conclusion This study showed that only a small proportion of Twitter accounts (A) post about vaccines and (B) have a high follower count and post anti-vaccine content. Further analysis of these users may help researchers and policy makers better understand how to amplify the impact of pro-vaccine social media messages. Disclosures Jean-Etienne Poirrier, PhD, MBA, The GSK group of companies (Employee, Shareholder) Theodore Caputi, PhD, Good Analytics Inc. (Consultant) John Ayers, PhD, GSK (Grant/Research Support) Mark Dredze, PhD, Bloomberg LP (Consultant)Good Analytics (Consultant) Sara Poston, PharmD, The GlaxoSmithKline group of companies (Employee, Shareholder) Cosmina Hogea, PhD, GlaxoSmithKline (Employee, Shareholder)


2021 ◽  
Author(s):  
Jingzhong Xie ◽  
Jun Lai ◽  
Dongying Zhang

BACKGROUND Social media has become an important tool to implement risk communication in COVID-19 pandemic, and made health information can gain more exposure by re-posting. OBJECTIVE This paper attempts to identify the factors associated with re-posting of social media messages about health information METHODS Content analysis was applied to scrutinize 4396 Weibo posts that were posted by national and provincial public health agencies Weibo accounts and identified features of information sources and information features, and adopted Zero-Inflated Negative Binomial (ZINB) model to analyze the association between these features and the frequency of message being re-posted. RESULTS Results showed that the followers and the governmental level of information sources are correlated with increased message reposting. The information features, such as hashtags#, picture, video, emotional(!), and the usage of severity, reassurance, efficacy and action frame were associated with increased message reposting behaviors, while hyperlink and usage of uncertainty frame correlated with reduced message reposting behaviors. CONCLUSIONS The features of health information sources, structures , style and content should be paid close attention by health organizations and medical professionals to satisfy the public’s information needs and preferences, promote the public's health engagement. Suitable information systems designing, and health communication strategies making during different stages of the pandemic may improve public awareness of the COVID-19, alleviate negative emotions, promote preventive measures to curb the spread of the virus.


2022 ◽  
Vol 140 ◽  
pp. 49-61
Author(s):  
João S. Oliveira ◽  
Kemefasu Ifie ◽  
Martin Sykora ◽  
Eleni Tsougkou ◽  
Vitor Castro ◽  
...  

Author(s):  
Kai Ma ◽  
Yongjian Tan ◽  
Miao Tian ◽  
Xuejing Xie ◽  
Qinjun Qiu ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document