Web scraping based online consumer price index: The “IPC Online” case

2020 ◽  
Vol 44 (2-3) ◽  
pp. 141-159
Author(s):  
Juan I. Uriarte ◽  
Gonzalo R. Ramírez Muñoz de Toro ◽  
Juan M.C. Larrosa
2021 ◽  
Vol 43 ◽  
pp. 251-269
Author(s):  
Adam Juszczak ◽  

Aim/purpose – Web-scraping is a technique used to automatically extract data from websites. After the rise-up of online shopping, it allows the acquisition of information about prices of goods sold by retailers such as supermarkets or internet shops. This study examines the possibility of using web-scrapped data from one clothing store. It aims at comparing known price index formulas being implemented to the web-scraping case and verifying their sensitivity on the choice of data filter type. Design/methodology/approach – The author uses the price data scrapped from one of the biggest online shops in Poland. The data were obtained as part of eCPI (electronic Consumer Price Index) project conducted by the National Bank of Poland. The author decided to select three types of products for this analysis – female ballerinas, male shoes, and male oxfords to compare their prices in over one-year time period. Six price indexes were used for calculation – The Jevons and Dutot indexes with their chain and GEKS (acronym from the names of creators – Gini–Éltető–Köves–Szulc) versions. Apart from the analysis conducted on a full data set, the author introduced filters to remove outliers. Findings – Clothing and footwear are considered one of the most difficult groups of goods to measure price change indexes due to high product churn, which undermines the possibility to use the traditional Jevons and Dutot indexes. However, it is possible to use chained indexes and GEKS indexes instead. Still, these indexes are fairly sensitive to large price changes. As observed in case of both product groups, the results provided by the GEKS and chained versions of indexes were different, which could lead to conclu- sion that even though they are lending promising results, they could be better suited for other COICOP (Classification of Individual Consumption by Purpose) groups. Research implications/limitations – The findings of the paper showed that usage of filters did not significantly reduce the difference between price indexes based on GEKS and chain formulas. Originality/value/contribution – The usage of web-scrapped data is a fairly new topic in the literature. Research on the possibility of using different price indexes provides useful insights for future usage of these data by statistics offices. Keywords: inflation, CPI, web-scraping, online shopping, big data. JEL Classification: C43, C49


Data Science ◽  
2021 ◽  
pp. 1-17
Author(s):  
Muhammad Ghozy Al Haqqoni ◽  
Setia Pramana

Digital Economy in recent years, especially in Southeast Asia, including Indonesia, is growing rapidly. E-commerce is one part of the Digital Economy. BPS-Statistics Indonesia as a Non-ministerial Government Agency responsible directly to the president has conducted an E-commerce Survey in 2019. From this publication, it is concluded that the interest of Indonesian traders using the internet in selling in recent years has increased. So, the urgency of using e-commerce data in its application in official Statistics is increasingly needed. Several studies have carried out the application of e-commerce data in the calculation of The Consumer Price Index (CPI). In this research, e-commerce data is applied with a case study using the data from one of online marketplaces in Indonesia in calculating CPI at city level in Java. The purpose of this study is to compare the marketplace-based CPI data and BPS-Statistics’ survey-based CPI. The data is collected through web scraping techniques and followed by preprocessing data and analyzed descriptively. Web scraper that is built can be used in obtaining data. Commodity-level CPI with marketplace data tends to have relatively large prices which result in higher CPI being compared to BPS-Statistics CPI. Meanwhile, at the expenditure group level, the CPI between the two approaches is broadly similar in general.


2017 ◽  
Vol 1 (1) ◽  
pp. 37
Author(s):  
Hansen Rusliani

Penelitian ini bertujuan untuk mengetahui dampak perbankan syari’ah terhadap pertumbuhan ekonomi di Indonesia dan Malaysia. Data yang digunakan dalam penelitian ini merupakan data primer (interview) dan data sekunder dalam bentuk bulanan yang diperoleh dari Badan Pusat Statistik Ekonomi dan Keuangan Indonesia Bank Indonesia (SEKI-BI) dan Statistik Perbankan Syari’ah Bank Indonesia (SPS-BI) serta data dari Bank Negara Malaysia dan Departemen Statistik Malaysia dalam periode waktu kurun waktu 16 tahun, 2000 sampai dengan 2015. Observasi penelitian dilakukan di Indonesia dan Malaysia untuk memperkaya analisis. Penelitian ini menggunakan Vector Autoregression (VAR), Uji Kointegrasi serta dikombinasikan dengan Response Function (IRF) dan Decomposition (FEVD) untuk melihat interaksi antara faktor makro ekonomi dengan pembiayaan dalam jangka panjang. Adapun variabel yang digunakan adalah total pembiayan syari’ah (Total Syari’ah Financing) dan Gross Domestic Product (GDP) sebagai representasi pertumbuhan ekonomi. Untuk tambahan variabel digunakan Consumer Price Index (CPI) sebagai representasi tingkat inflasi. Hipotesis penelitian yaitu terdapat pertumbuhan ekonomi setiap tahunnya dikedua negara tersebut pasca krisis moneter.


2016 ◽  
Author(s):  
Steven Paben ◽  
William Johnson ◽  
John Schilp

1994 ◽  
Vol 9 (3) ◽  
pp. 391-401
Author(s):  
Joseph Rabianski ◽  
Julian Diaz ◽  
Neil Carn

1976 ◽  
Vol 78 ◽  
pp. 48-57 ◽  
Author(s):  
A.J.H. Dean

This article examines the accuracy of the National Institute's forecasts of incomes, inflation and employment from 1965 to 1975. It is found that in recent years the Institute has tended to underestimate inflation, although less seriously for the consumer price index than the other current price variables studied. The accuracy of the forecasts has generally increased in relative terms, although it has deteriorated in absolute terms. The forecasting performance in 1974 was particularly poor but there has been a distinct improvement in 1975 and 1976.


2012 ◽  
Vol 01 (07) ◽  
pp. 17-29
Author(s):  
Furrukh Bashir ◽  
Shahbaz Nawaz ◽  
Rahat Ullah ◽  
Muhammad Ramzan Arshad ◽  
Munwar Bagum ◽  
...  

Education is always considered as the major determinant for the development of any economy. Enrollment at various levels also shows that how much education is common within the citizens of the country. Considering the importance of enrollment, the current study examines the influence of some macroeconomic variables on various levels i.e. primary, secondary, higher, college, professional and university enrollment in Pakistan. Time series data has been gathered on consumer price index, government revenue, employed labor force, government expenditure, and health expenditure for the period from 1972 to 2010. For long run estimates, Johansen Co integration test is used and short run estimates are taken through error correction model. The results of the study exhibit positive association of employed labor force, government expenditure and health expenditure with primary, secondary, higher, college, professional and university enrollment in Pakistan. On the other side, consumer price index and government revenue have been found to be inversely influencing enrollment at various levels. Short run results are also much favorable for the economy and reveals convergence towards long run equilibrium due to any disturbances in the short run period. At the end study gives some policy implications that government should decrease consumer price index and tax rate and to increase government expenditure in terms of education and health for higher enrollment rates in Pakistan.


Sign in / Sign up

Export Citation Format

Share Document