Scaling in words on Twitter

Eszter Bokányi; Dániel Kondor; Gábor Vattay

doi:10.1098/rsos.190027

Scaling in words on Twitter

Royal Society Open Science ◽

10.1098/rsos.190027 ◽

2019 ◽

Vol 6 (10) ◽

pp. 190027 ◽

Cited By ~ 1

Author(s):

Eszter Bokányi ◽

Dániel Kondor ◽

Gábor Vattay

Keyword(s):

The United States ◽

City Size ◽

Zipf’S Law ◽

Zipf's Law ◽

Scaling Properties ◽

Scaling Relationship ◽

City Population ◽

Urban Scaling ◽

Core Vocabulary ◽

Relationship Of

Scaling properties of language are a useful tool for understanding generative processes in texts. We investigate the scaling relations in citywise Twitter corpora coming from the metropolitan and micropolitan statistical areas of the United States. We observe a slightly superlinear urban scaling with the city population for the total volume of the tweets and words created in a city. We then find that a certain core vocabulary follows the scaling relationship of that of the bulk text, but most words are sensitive to city size, exhibiting a super- or a sublinear urban scaling. For both regimes, we can offer a plausible explanation based on the meaning of the words. We also show that the parameters for Zipf’s Law and Heaps' Law differ on Twitter from that of other texts, and that the exponent of Zipf’s Law changes with city size.

Download Full-text

A Dynamic Generalization of Zipf's Rank-Size Rule

Environment and Planning A Economy and Space ◽

10.1068/a141449 ◽

1982 ◽

Vol 14 (11) ◽

pp. 1449-1467 ◽

Cited By ~ 12

Author(s):

B Roehner ◽

K E Wiese

Keyword(s):

Stochastic Model ◽

Size Distribution ◽

Urban Growth ◽

Simple Form ◽

Deterministic Model ◽

City Size ◽

Zipf’S Law ◽

Zipf's Law ◽

City Size Distribution ◽

Zero Growth

A dynamic deterministic model of urban growth is proposed, which in its most simple form yields Zipf's law for city-size distribution, and in its general form may account for distributions that deviate strongly from Zipf's law. The qualitative consequences of the model are examined, and a corresponding stochastic model is introduced, which permits, in particular, the study of zero-growth situations.

Download Full-text

Zipf's Law, Pareto's Law, and the Evolution of Top Incomes in the United States

American Economic Journal Macroeconomics ◽

10.1257/mac.20150051 ◽

2017 ◽

Vol 9 (3) ◽

pp. 36-71 ◽

Cited By ~ 15

Author(s):

Shuhei Aoki ◽

Makoto Nirei

Keyword(s):

United States ◽

Income Distribution ◽

The United States ◽

Zipf’S Law ◽

Tax Rates ◽

Firm Level ◽

Productivity Shocks ◽

Zipf's Law ◽

Top Incomes ◽

Pareto’S Law

We construct a tractable neoclassical growth model that generates Pareto's law of income distribution and Zipf's law of the firm size distribution from idiosyncratic, firm-level productivity shocks. Executives and entrepreneurs invest in risk-free assets, as well as their own firms' risky stocks, through which their wealth and income depend on firm-level shocks. By using the model, we evaluate how changes in tax rates can account for the evolution of top incomes in the United States. The model matches the decline in the Pareto exponent of the income distribution and the trend of the top 1 percent income share in recent decades. (JEL D31, H24, L11)

Download Full-text

COVID-19 attack rate increases with city size

10.1101/2020.03.22.20041004 ◽

2020 ◽

Cited By ~ 23

Author(s):

Andrew J. Stier ◽

Marc G. Berman ◽

Luis M. A. Bettencourt

Keyword(s):

Social Network ◽

Urban Areas ◽

City Size ◽

Economic Threat ◽

Scaling Relationship ◽

The Social ◽

City Population ◽

Current Outbreak ◽

Novel Coronavirus ◽

Us Cities

The current outbreak of novel coronavirus disease 2019 (COVID-19) poses an unprecedented global health and economic threat to interconnected human societies. Until a vaccine is developed, strategies for controlling the outbreak rely on aggressive social distancing. These measures largely disconnect the social network fabric of human societies, especially in urban areas. Here, we estimate the growth rates and reproductive numbers of COVID-19 in US cities from March 14th through March 19th to reveal a power-law scaling relationship to city population size. This means that COVID-19 is spreading faster on average in larger cities with the additional implication that, in an uncontrolled outbreak, larger fractions of the population are expected to become infected in more populous urban areas. We discuss the implications of these observations for controlling the COVID-19 outbreak, emphasizing the need to implement more aggressive distancing policies in larger cities while also preserving socioeconomic activity.

Download Full-text

Scaling of urban income inequality in the USA

Journal of The Royal Society Interface ◽

10.1098/rsif.2021.0223 ◽

2021 ◽

Vol 18 (181) ◽

pp. 20210223

Author(s):

Elisa Heinrich Mora ◽

Cate Heine ◽

Jacob J. Jackson ◽

Geoffrey B. West ◽

Vicky Chuqiao Yang ◽

...

Keyword(s):

Income Inequality ◽

Income Distribution ◽

City Size ◽

Housing Cost ◽

Scaling Analysis ◽

Total Income ◽

Income Distributions ◽

City Population ◽

Leibler Divergence ◽

Urban Scaling

Urban scaling analysis, the study of how aggregated urban features vary with the population of an urban area, provides a promising framework for discovering commonalities across cities and uncovering dynamics shared by cities across time and space. Here, we use the urban scaling framework to study an important, but under-explored feature in this community—income inequality. We propose a new method to study the scaling of income distributions by analysing total income scaling in population percentiles. We show that income in the least wealthy decile (10%) scales close to linearly with city population, while income in the most wealthy decile scale with a significantly superlinear exponent. In contrast to the superlinear scaling of total income with city population, this decile scaling illustrates that the benefits of larger cities are increasingly unequally distributed. For the poorest income deciles, cities have no positive effect over the null expectation of a linear increase. We repeat our analysis after adjusting income by housing cost, and find similar results. We then further analyse the shapes of income distributions. First, we find that mean, variance, skewness and kurtosis of income distributions all increase with city size. Second, the Kullback–Leibler divergence between a city’s income distribution and that of the largest city decreases with city population, suggesting the overall shape of income distribution shifts with city population. As most urban scaling theories consider densifying interactions within cities as the fundamental process leading to the superlinear increase of many features, our results suggest this effect is only seen in the upper deciles of the cities. Our finding encourages future work to consider heterogeneous models of interactions to form a more coherent understanding of urban scaling.

Download Full-text

Demography and the emergence of universal patterns in urban systems

Nature Communications ◽

10.1038/s41467-020-18205-1 ◽

2020 ◽

Vol 11 (1) ◽

Author(s):

Luís M. A. Bettencourt ◽

Daniel Zünd

Keyword(s):

Urban Areas ◽

Small Towns ◽

City Size ◽

Zipf’S Law ◽

Urban Systems ◽

Size Distributions ◽

Vital Rates ◽

Zipf's Law ◽

Large Cities ◽

Population Sizes

Abstract Urban areas exist in a wide variety of population sizes, from small towns to huge megacities. No proposed form for the statistical distribution of city sizes has received more attention than Zipf’s law, a Pareto distribution with power law exponent equal to one. However, this distribution is typically violated by empirical evidence for small and large cities. Moreover, no theory presently exists to derive city size distributions from fundamental demographic choices while also explaining consistent variations. Here we develop a comprehensive framework based on demography to show how the structure of migration flows between cities, together with the differential magnitude of their vital rates, determine a variety of city size distributions. This approach provides a powerful mathematical methodology for deriving Zipf’s law as well as other size distributions under specific conditions, and to resolve puzzles associated with their deviations in terms of concepts of choice, symmetry, information, and selection.

Download Full-text

Investigation of urban regularities for Croatia in the period from 1857 to 2011

Ekonomski pregled ◽

10.32910/ep.71.4.1 ◽

2020 ◽

Vol 71 (4) ◽

pp. 307-330

Author(s):

Hrvoje Jošić ◽

Berislav Žmuk

Keyword(s):

Size Distribution ◽

Unit Root ◽

Urban Economics ◽

Unit Roots ◽

City Size ◽

Zipf’S Law ◽

Urban Hierarchy ◽

Zipf's Law ◽

Gibrat’S Law ◽

Gibrat's Law

Two main regularities in the field of urban economics are Zipf’s law and Gibrat’s law. Zipf’s law states that distribution of largest cities should obey the Pareto rank-size distribution while Gibrat’s law states that proportionate growth of cities is independent of its size. These two laws are interconnected and therefore are often considered together. The objective of this paper is the investigation of urban regularities for Croatia in the period from 1857 to 2011. In order to estimate and evaluate the structure of Croatian urban hierarchy, Pareto or Zipf’s coefficients are calculated. The results have shown that the coefficient values for the largest settlements in different years are close to one, indicating that the Croatian urban hierarchy system follows the rank-size distribution and therefore obeys Zipf's law. The independence of city growth regarding the city size is tested using penal unit roots. Results for Gibrat's law testing using panel unit root tests have shown that there is a presence of unit root in growth of settlements therefore leading to the acceptance of Gibrat’s law.

Download Full-text

Zipf’s law and city size distribution: A survey of the literature and future research agenda

Physica A Statistical Mechanics and its Applications ◽

10.1016/j.physa.2017.10.005 ◽

2018 ◽

Vol 492 ◽

pp. 75-92 ◽

Cited By ~ 31

Author(s):

Sidra Arshad ◽

Shougeng Hu ◽

Badar Nadeem Ashraf

Keyword(s):

Size Distribution ◽

Research Agenda ◽

City Size ◽

Zipf’S Law ◽

Future Research ◽

Zipf's Law ◽

Future Research Agenda ◽

City Size Distribution

Download Full-text

Zipf’s law for atlas models

Journal of Applied Probability ◽

10.1017/jpr.2020.64 ◽

2020 ◽

Vol 57 (4) ◽

pp. 1276-1297

Author(s):

Ricardo T. Fernholz ◽

Robert Fernholz

Keyword(s):

Pareto Distribution ◽

Household Wealth ◽

City Size ◽

Zipf’S Law ◽

Mathematical Explanation ◽

Zipf's Law ◽

First Order ◽

Pareto Distributions ◽

Straight Line ◽

Atlas Model

AbstractA set of data with positive values follows a Pareto distribution if the log–log plot of value versus rank is approximately a straight line. A Pareto distribution satisfies Zipf’s law if the log–log plot has a slope of $-1$. Since many types of ranked data follow Zipf’s law, it is considered a form of universality. We propose a mathematical explanation for this phenomenon based on Atlas models and first-order models, systems of strictly positive continuous semimartingales with parameters that depend only on rank. We show that the stationary distribution of an Atlas model will follow Zipf’s law if and only if two natural conditions, conservation and completeness, are satisfied. Since Atlas models and first-order models can be constructed to approximate systems of time-dependent rank-based data, our results can explain the universality of Zipf’s law for such systems. However, ranked data generated by other means may follow non-Zipfian Pareto distributions. Hence, our results explain why Zipf’s law holds for word frequency, firm size, household wealth, and city size, while it does not hold for earthquake magnitude, cumulative book sales, and the intensity of wars, all of which follow non-Zipfian Pareto distributions.

Download Full-text

A dynamic model for city size distribution beyond Zipf's law

Physica A Statistical Mechanics and its Applications ◽

10.1016/j.physa.2007.05.059 ◽

2007 ◽

Vol 384 (2) ◽

pp. 613-627 ◽

Cited By ~ 28

Author(s):

Lucien Benguigui ◽

Efrat Blumenfeld-Lieberthal

Keyword(s):

Size Distribution ◽

Dynamic Model ◽

City Size ◽

Zipf’S Law ◽

Zipf's Law ◽

City Size Distribution

Download Full-text

The Statistics of Urban Scaling and Their Connection to Zipf’s Law

PLoS ONE ◽

10.1371/journal.pone.0040393 ◽

2012 ◽

Vol 7 (7) ◽

pp. e40393 ◽

Cited By ~ 69

Author(s):

Andres Gomez-Lievano ◽

HyeJin Youn ◽

Luís M. A. Bettencourt

Keyword(s):

Zipf’S Law ◽

Zipf's Law ◽

Urban Scaling

Download Full-text