Estimation of the Population Mean for Incomplete Data by using Information of Simple Linear Relationship Model in Data Set

A Radial Basis Boltzmann Machine (RBBM) is a specialized Boltzmann Machine architecture that combines feed-forward mapping with probability estimation in the input space, and for which very efficient learning rules exist. The hidden representation of the network displays symmetry breaking as a function of the noise in the dynamics. Thus, generalization can be studied as a function of the noise in the neuron dynamics instead of as a function of the number of hidden units. We show that the RBBM can be seen as an elegant alternative of k-nearest neighbor, leading to comparable performance without the need to store all data. We show that the RBBM has good classification performance compared to the MLP. The main advantage of the RBBM is that simultaneously with the input-output mapping, a model of the input space is obtained which can be used for learning with missing values. We derive learning rules for the case of incomplete data, and show that they perform better on incomplete data than the traditional learning rules on a 'repaired' data set.

Download Full-text

A simple linear relationship and definition of a unit for proteinase activity

Archives of Biochemistry and Biophysics ◽

10.1016/0003-9861(51)90254-8 ◽

1951 ◽

Vol 32 (1) ◽

pp. 200-206 ◽

Cited By ~ 8

Author(s):

Byron S. Miller ◽

John A. Johnson

Keyword(s):

Linear Relationship ◽

Proteinase Activity ◽

Simple Linear Relationship ◽

Definition Of

Download Full-text

A MODIFIED RATIO TYPE ESTIMATOR OF FINITE POPULATION MEAN UNDER STRATIFIED RANDOM SAMPLING SCHEME

10.36106/4917049 ◽

2021 ◽

pp. 58-60

Author(s):

Naziru Fadisanku Haruna ◽

Ran Vijay Kumar Singh ◽

Samsudeen Dahiru

Keyword(s):

Mean Square Error ◽

Random Sampling ◽

Minimum Mean Square Error ◽

Real Life ◽

Population Data ◽

Auxiliary Variable ◽

Mean Square ◽

Data Set ◽

Population Mean ◽

The Mean

In This paper a modied ratio-type estimator for nite population mean under stratied random sampling using single auxiliary variable has been proposed. The expression for mean square error and bias of the proposed estimator are derived up to the rst order of approximation. The expression for minimum mean square error of proposed estimator is also obtained. The mean square error the proposed estimator is compared with other existing estimators theoretically and condition are obtained under which proposed estimator performed better. A real life population data set has been considered to compare the efciency of the proposed estimator numerically.

Download Full-text

Comment: Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data

Statistical Science ◽

10.1214/07-sts227c ◽

2007 ◽

Vol 22 (4) ◽

pp. 540-543 ◽

Cited By ~ 32

Author(s):

Greg Ridgeway ◽

Daniel F. McCaffrey

Keyword(s):

Incomplete Data ◽

Alternative Strategies ◽

Population Mean ◽

Double Robustness

Download Full-text

Rejoinder: Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data

Statistical Science ◽

10.1214/07-sts227rej ◽

2007 ◽

Vol 22 (4) ◽

pp. 574-580 ◽

Cited By ~ 11

Author(s):

Joseph D. Y. Kang ◽

Joseph L. Schafer

Keyword(s):

Incomplete Data ◽

Alternative Strategies ◽

Population Mean ◽

Double Robustness

Download Full-text

Do the flipped impacts of hotels matter to the popularity of Airbnb?

International Journal of Contemporary Hospitality Management ◽

10.1108/ijchm-06-2020-0582 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Bowen Yi ◽

Da Shi ◽

Fangfang Shi ◽

Liang Zhang

Keyword(s):

Spillover Effects ◽

Set Covering ◽

Negative Effects ◽

Data Set ◽

Content Type ◽

Competitive Effects ◽

Competition Theory ◽

Relationship Model ◽

Hotel Managers ◽

Multidimensional Effects

Purpose By building on cooperation–competition theory, this study aims to investigate the multidimensional flipped effects of neighborhood hotels on Airbnb listings’ popularity, examining the degree to which such impacts are influenced by hotel types and geographical areas. Design/methodology/approach This study explores the interdependent and competitive relationship between neighborhood hotels and Airbnb from the perspective of effects on Airbnb listings’ popularity by exploring a data set covering 10,492 Airbnb listings and 2,691 hotels from Ctrip. Findings Results reveal that neighborhood hotels’ number of reviews, review ratings and prices each have positive spillover effects on Airbnb listings’ popularity, while quality assurance labels and negative review topic sentiments exert competitive effects on Airbnb popularity. Moreover, the number of budget chain hotels and high-star hotels have positive and negative effects on Airbnb popularity, respectively. Geographical areas also have a moderating effect on the relationship between various hotel-related influencing factors and Airbnb. Practical implications This study can offer hotel managers and Airbnb operators a clearer understanding of these businesses’ coexisting relationship. Findings can also provide Airbnb-specific guidelines for practitioners in terms of site selection, promotional features and development strategies for Airbnb listings. Originality/value This study establishes a cooperation–competition relationship model between hotels and Airbnb and considers the flipped effects of hotels on Airbnb for the first time. It expands previous studies by considering the multidimensional effects of hotels on Airbnb listings’ popularity and by examining the influences of hotel types and geographical areas on hotels’ impacts on Airbnb.

Download Full-text

Data Mining with Incomplete Data

Encyclopedia of Data Warehousing and Mining ◽

10.4018/978-1-59140-557-3.ch056 ◽

2011 ◽

pp. 293-296 ◽

Cited By ~ 2

Author(s):

Hai Wang ◽

Shouhong Wang

Keyword(s):

Data Mining ◽

Missing Data ◽

Survey Data ◽

Incomplete Data ◽

Knowledge Engineering ◽

Data Set ◽

Survey Question ◽

Data Source ◽

The Common ◽

Extent Of Damage

Survey is one of the common data acquisition methods for data mining (Brin, Rastogi & Shim, 2003). In data mining one can rarely find a survey data set that contains complete entries of each observation for all of the variables. Commonly, surveys and questionnaires are often only partially completed by respondents. The possible reasons for incomplete data could be numerous, including negligence, deliberate avoidance for privacy, ambiguity of the survey question, and aversion. The extent of damage of missing data is unknown when it is virtually impossible to return the survey or questionnaires to the data source for completion, but is one of the most important parts of knowledge for data mining to discover. In fact, missing data is an important debatable issue in the knowledge engineering field (Tseng, Wang, & Lee, 2003).

Download Full-text

Estimating the Entropy for Lomax Distribution Based on Generalized Progressively Hybrid Censoring

Symmetry ◽

10.3390/sym11101219 ◽

2019 ◽

Vol 11 (10) ◽

pp. 1219 ◽

Cited By ~ 1

Author(s):

Shuhan Liu ◽

Wenhao Gui

Keyword(s):

Incomplete Data ◽

Real Data ◽

Unknown Parameters ◽

Data Set ◽

Hybrid Censoring ◽

Lomax Distribution ◽

Bayesian Estimates ◽

Squared Error ◽

General Entropy Loss Function ◽

Two Parameter

As it is often unavoidable to obtain incomplete data in life testing and survival analysis, research on censoring data is becoming increasingly popular. In this paper, the problem of estimating the entropy of a two-parameter Lomax distribution based on generalized progressively hybrid censoring is considered. The maximum likelihood estimators of the unknown parameters are derived to estimate the entropy. Further, Bayesian estimates are computed under symmetric and asymmetric loss functions, including squared error, linex, and general entropy loss function. As we cannot obtain analytical Bayesian estimates directly, the Lindley method and the Tierney and Kadane method are applied. A simulation study is conducted and a real data set is analyzed for illustrative purposes.

Download Full-text

Incomplete Big Data Distributed Clustering

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.687-691.1496 ◽

2014 ◽

Vol 687-691 ◽

pp. 1496-1499

Author(s):

Yong Lin Leng

Keyword(s):

Big Data ◽

Incomplete Data ◽

Clustering Algorithm ◽

Design Method ◽

Complete Data ◽

Similarity Metrics ◽

Distributed Clustering ◽

Computing Technology ◽

Data Set ◽

Affinity Propagation Clustering

Partially missing or blurring attribute values make data become incomplete during collecting data. Generally we use inputation or discarding method to deal with incomplete data before clustering. In this paper we proposed an a new similarity metrics algorithm based on incomplete information system. First algorithm divided the data set into a complete data set and non complete data set, and then the complete data set was clustered using the affinity propagation clustering algorithm, incomplete data according to the design method of the similarity metric is divided into the corresponding cluster. In order to improve the efficiency of the algorithm, designing the distributed clustering algorithm based on cloud computing technology. Experiment demonstrates the proposed algorithm can cluster the incomplete big data directly and improve the accuracy and effectively.

Download Full-text