scholarly journals Robust Inference after Random Projections via Hellinger Distance for Location-Scale Family

Entropy ◽  
2019 ◽  
Vol 21 (4) ◽  
pp. 348 ◽  
Author(s):  
Lei Li ◽  
Anand Vidyashankar ◽  
Guoqing Diao ◽  
Ejaz Ahmed

Big data and streaming data are encountered in a variety of contemporary applications in business and industry. In such cases, it is common to use random projections to reduce the dimension of the data yielding compressed data. These data however possess various anomalies such as heterogeneity, outliers, and round-off errors which are hard to detect due to volume and processing challenges. This paper describes a new robust and efficient methodology, using Hellinger distance, to analyze the compressed data. Using large sample methods and numerical experiments, it is demonstrated that a routine use of robust estimation procedure is feasible. The role of double limits in understanding the efficiency and robustness is brought out, which is of independent interest.

Algorithms ◽  
2020 ◽  
Vol 13 (7) ◽  
pp. 159 ◽  
Author(s):  
Shinichi Yamagiwa ◽  
Eisaku Hayakawa ◽  
Koichi Marumo

Toward strong demand for very high-speed I/O for processors, physical performance growth of hardware I/O speed was drastically increased in this decade. However, the recent Big Data applications still demand the larger I/O bandwidth and the lower latency for the speed. Because the current I/O performance does not improve so drastically, it is the time to consider another way to increase it. To overcome this challenge, we focus on lossless data compression technology to decrease the amount of data itself in the data communication path. The recent Big Data applications treat data stream that flows continuously and never allow stalling processing due to the high speed. Therefore, an elegant hardware-based data compression technology is demanded. This paper proposes a novel lossless data compression, called ASE coding. It encodes streaming data by applying the entropy coding approach. ASE coding instantly assigns the fewest bits to the corresponding compressed data according to the number of occupied entries in a look-up table. This paper describes the detailed mechanism of ASE coding. Furthermore, the paper demonstrates performance evaluations to promise that ASE coding adaptively shrinks streaming data and also works on a small amount of hardware resources without stalling or buffering any part of data stream.


Entropy ◽  
2021 ◽  
Vol 23 (6) ◽  
pp. 773
Author(s):  
Amichai Painsky ◽  
Meir Feder

Learning and making inference from a finite set of samples are among the fundamental problems in science. In most popular applications, the paradigmatic approach is to seek a model that best explains the data. This approach has many desirable properties when the number of samples is large. However, in many practical setups, data acquisition is costly and only a limited number of samples is available. In this work, we study an alternative approach for this challenging setup. Our framework suggests that the role of the train-set is not to provide a single estimated model, which may be inaccurate due to the limited number of samples. Instead, we define a class of “reasonable” models. Then, the worst-case performance in the class is controlled by a minimax estimator with respect to it. Further, we introduce a robust estimation scheme that provides minimax guarantees, also for the case where the true model is not a member of the model class. Our results draw important connections to universal prediction, the redundancy-capacity theorem, and channel capacity theory. We demonstrate our suggested scheme in different setups, showing a significant improvement in worst-case performance over currently known alternatives.


Urban Studies ◽  
2021 ◽  
pp. 004209802110140
Author(s):  
Sarah Barns

This commentary interrogates what it means for routine urban behaviours to now be replicating themselves computationally. The emergence of autonomous or artificial intelligence points to the powerful role of big data in the city, as increasingly powerful computational models are now capable of replicating and reproducing existing spatial patterns and activities. I discuss these emergent urban systems of learned or trained intelligence as being at once radical and routine. Just as the material and behavioural conditions that give rise to urban big data demand attention, so do the generative design principles of data-driven models of urban behaviour, as they are increasingly put to use in the production of replicable, autonomous urban futures.


2021 ◽  
Vol 13 ◽  
pp. 175628722199813
Author(s):  
B. M. Zeeshan Hameed ◽  
Aiswarya V. L. S. Dhavileswarapu ◽  
Nithesh Naik ◽  
Hadis Karimi ◽  
Padmaraj Hegde ◽  
...  

Artificial intelligence (AI) has a proven record of application in the field of medicine and is used in various urological conditions such as oncology, urolithiasis, paediatric urology, urogynaecology, infertility and reconstruction. Data is the driving force of AI and the past decades have undoubtedly witnessed an upsurge in healthcare data. Urology is a specialty that has always been at the forefront of innovation and research and has rapidly embraced technologies to improve patient outcomes and experience. Advancements made in Big Data Analytics raised the expectations about the future of urology. This review aims to investigate the role of big data and its blend with AI for trends and use in urology. We explore the different sources of big data in urology and explicate their current and future applications. A positive trend has been exhibited by the advent and implementation of AI in urology with data available from several databases. The extensive use of big data for the diagnosis and treatment of urological disorders is still in its early stage and under validation. In future however, big data will no doubt play a major role in the management of urological conditions.


2021 ◽  
pp. 097226292110225
Author(s):  
Shobhana Chandra ◽  
Sanjeev Verma

Big data (BD) is making advances in promoting sustainable consumption behaviour and has attracted the attention of researchers worldwide. Despite the increased focus, the findings of studies on this topic are fragmented, and future researchers need a systematic understanding of the existing literature for identification of the research scope. This study offers a systematic review of the role of BD in promoting sustainable-consumption behaviour with the help of a bibliometric analysis, followed by a thematic analysis. The findings suggest that businesses deploy BD to create sustainable consumer experiences, predict consumer buying patterns, design and alter business models and create nudges for sustainable consumption, while consumers are forcing businesses to develop green operations and supply chains to reduce the latter’s carbon footprint. The major research gaps for future researchers are in the following areas: the impact of big data analytics (BDA) on consumerism, the role of BD in the formation of sustainable habits and consumer knowledge creation for sustainable consumption and prediction of green consumer behaviour.


IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 54595-54614 ◽  
Author(s):  
Syed Attique Shah ◽  
Dursun Zafer Seker ◽  
Sufian Hameed ◽  
Dirk Draheim

2017 ◽  
Vol 93 (1) ◽  
pp. 79-95 ◽  
Author(s):  
Eric T. Bradlow ◽  
Manish Gangwar ◽  
Praveen Kopalle ◽  
Sudhir Voleti

Sign in / Sign up

Export Citation Format

Share Document