Visualization Insights From Jen Christiansen and Moritz Stefaner’s “175 Years of Word Usage Patterns in Scientific American”

2021 ◽  
Author(s):  
Andy Kirk
1998 ◽  
Vol 86 (2) ◽  
pp. 595-613E ◽  
Author(s):  
Cynthia Whissell

This article describes a new data base for English word-usage patterns. It improves on older efforts by including television and personal commentaries as sources for the main corpus studied. More than a third of a million words were sampled from media and nonmedia sources and analyzed to produce a parsimonious listing of 6505 words (types) and their frequencies. The reliability and validity of this list were established in a variety of ways, and a computer program based on the list was used to analyze two different sets of data (an exploratory set and one representing an a priori hypothesis about word usage). A mere 206 different words were seen to account for 57% of all the words in the corpus, and 95% of this small set had its roots in Middle English or some older form of English.


10.2196/27613 ◽  
2021 ◽  
Vol 23 (12) ◽  
pp. e27613
Author(s):  
Ahmed Shahriar Sakib ◽  
Md Saddam Hossain Mukta ◽  
Fariha Rowshan Huda ◽  
A K M Najmul Islam ◽  
Tohedul Islam ◽  
...  

Background Many people suffer from insomnia, a sleep disorder characterized by difficulty falling and staying asleep during the night. As social media have become a ubiquitous platform to share users’ thoughts, opinions, activities, and preferences with their friends and acquaintances, the shared content across these platforms can be used to diagnose different health problems, including insomnia. Only a few recent studies have examined the prediction of insomnia from Twitter data, and we found research gaps in predicting insomnia from word usage patterns and correlations between users’ insomnia and their Big 5 personality traits as derived from social media interactions. Objective The purpose of this study is to build an insomnia prediction model from users’ psycholinguistic patterns, including the elements of word usage, semantics, and their Big 5 personality traits as derived from tweets. Methods In this paper, we exploited both psycholinguistic and personality traits derived from tweets to identify insomnia patients. First, we built psycholinguistic profiles of the users from their word choices and the semantic relationships between the words of their tweets. We then determined the relationship between a users’ personality traits and insomnia. Finally, we built a double-weighted ensemble classification model to predict insomnia from both psycholinguistic and personality traits as derived from user tweets. Results Our classification model showed strong prediction potential (78.8%) to predict insomnia from tweets. As insomniacs are generally ill-tempered and feel more stress and mental exhaustion, we observed significant correlations of certain word usage patterns among them. They tend to use negative words (eg, “no,” “not,” “never”). Some people frequently use swear words (eg, “damn,” “piss,” “fuck”) with strong temperament. They also use anxious (eg, “worried,” “fearful,” “nervous”) and sad (eg, “crying,” “grief,” “sad”) words in their tweets. We also found that the users with high neuroticism and conscientiousness scores for the Big 5 personality traits likely have strong correlations with insomnia. Additionally, we observed that users with high conscientiousness scores have strong correlations with insomnia patterns, while negative correlation between extraversion and insomnia was also found. Conclusions Our model can help predict insomnia from users’ social media interactions. Thus, incorporating our model into a software system can help family members detect insomnia problems in individuals before they become worse. The software system can also help doctors to diagnose possible insomnia in patients.


2019 ◽  
Vol 39 (6) ◽  
pp. 593-617
Author(s):  
Jacqueline Laws

This corpus-based study provides a baseline of complex word usage patterns in the spontaneous speech of English preschool children to ascertain the characteristics of their derivative vocabulary before literacy development affects language skills. Frequencies of suffixed derivatives produced by ( N = 243) children aged 2–5 and their caregivers were extracted for 58 suffix variants, yielding 558 types from the former and 1,364 from the latter. Between the youngest and oldest groups, 11 suffix categories increased significantly in type frequency, compared with 22 in the caregiver dataset. All derivative types were classified for transparency of meaning and simplicity of form on a five-point analysability scale. Around 59% of both the child and caregiver derivative vocabulary sets were classified as transparent regardless of age, suggesting that the potential analysability of the preschool child’s input remains surprisingly invariant over time. The study provides baseline data for future studies on the development of morphological awareness in English-speaking schoolchildren.


2021 ◽  
Author(s):  
Ahmed Shahriar Sakib ◽  
Md Saddam Hossain Mukta ◽  
Fariha Rowshan Huda ◽  
A K M Najmul Islam ◽  
Tohedul Islam ◽  
...  

BACKGROUND Many people suffer from insomnia, a sleep disorder characterized by difficulty falling and staying asleep during the night. As social media have become a ubiquitous platform to share users’ thoughts, opinions, activities, and preferences with their friends and acquaintances, the shared content across these platforms can be used to diagnose different health problems, including insomnia. Only a few recent studies have examined the prediction of insomnia from Twitter data, and we found research gaps in predicting insomnia from word usage patterns and correlations between users’ insomnia and their Big 5 personality traits as derived from social media interactions. OBJECTIVE The purpose of this study is to build an insomnia prediction model from users’ psycholinguistic patterns, including the elements of word usage, semantics, and their Big 5 personality traits as derived from tweets. METHODS In this paper, we exploited both psycholinguistic and personality traits derived from tweets to identify insomnia patients. First, we built psycholinguistic profiles of the users from their word choices and the semantic relationships between the words of their tweets. We then determined the relationship between a users’ personality traits and insomnia. Finally, we built a double-weighted ensemble classification model to predict insomnia from both psycholinguistic and personality traits as derived from user tweets. RESULTS Our classification model showed strong prediction potential (78.8%) to predict insomnia from tweets. As insomniacs are generally ill-tempered and feel more stress and mental exhaustion, we observed significant correlations of certain word usage patterns among them. They tend to use negative words (eg, “no,” “not,” “never”). Some people frequently use swear words (eg, “damn,” “piss,” “fuck”) with strong temperament. They also use anxious (eg, “worried,” “fearful,” “nervous”) and sad (eg, “crying,” “grief,” “sad”) words in their tweets. We also found that the users with high neuroticism and conscientiousness scores for the Big 5 personality traits likely have strong correlations with insomnia. Additionally, we observed that users with high conscientiousness scores have strong correlations with insomnia patterns, while negative correlation between extraversion and insomnia was also found. CONCLUSIONS Our model can help predict insomnia from users’ social media interactions. Thus, incorporating our model into a software system can help family members detect insomnia problems in individuals before they become worse. The software system can also help doctors to diagnose possible insomnia in patients.


1987 ◽  
Vol 18 (2) ◽  
pp. 112-130
Author(s):  
Mary Ann Romski ◽  
Sharon Ellis Joyner ◽  
Rose A. Sevcik

Studies of first-word acquisition in typical language-learning children frequently take the form of diary studies. Comparable diary data from language-impaired children with developmental delays, however, are not currently available. This report describes the spontaneous vocalizations of a child with a developmental delay for 14 months, from the time he was age 6:5 to age 7:7. From a corpus of 285 utterances, 47 phonetic forms were identified and categorized. Analysis focused on semantic, communicative, and phonological usage patterns.


Author(s):  
Thomas Mößle ◽  
Florian Rehbein

Aim: The aim of this article is to work out the differential significance of risk factors of media usage, personality and social environment in order to explain problematic video game usage in childhood and adolescence. Method: Data are drawn from the Berlin Longitudinal Study Media, a four-year longitudinal control group study with 1 207 school children. Data from 739 school children who participated at 5th and 6th grade were available for analysis. Result: To explain the development of problematic video game usage, all three areas, i. e. specific media usage patterns, certain aspects of personality and certain factors pertaining to social environment, must be taken into consideration. Video game genre, video gaming in reaction to failure in the real world (media usage), the children’s/adolescents’ academic self-concept (personality), peer problems and parental care (social environment) are of particular significance. Conclusion: The results of the study emphasize that in future – and above all also longitudinal – studies different factors regarding social environment must also be taken into account with the recorded variables of media usage and personality in order to be able to explain the construct of problematic video game usage. Furthermore, this will open up possibilities for prevention.


1981 ◽  
Vol 26 (6) ◽  
pp. 484-484
Author(s):  
Michael I. Posner

Sign in / Sign up

Export Citation Format

Share Document