scholarly journals Estimation of the Population Mean for Incomplete Data by using Information of Simple Linear Relationship Model in Data Set

2021 ◽  
Vol 6 (4) ◽  
pp. 161-169
Author(s):  
Juthaphorn Sinsomboonthong ◽  
Saichon Sinsomboonthong
1997 ◽  
Vol 08 (03) ◽  
pp. 301-315 ◽  
Author(s):  
Marcel J. Nijman ◽  
Hilbert J. Kappen

A Radial Basis Boltzmann Machine (RBBM) is a specialized Boltzmann Machine architecture that combines feed-forward mapping with probability estimation in the input space, and for which very efficient learning rules exist. The hidden representation of the network displays symmetry breaking as a function of the noise in the dynamics. Thus, generalization can be studied as a function of the noise in the neuron dynamics instead of as a function of the number of hidden units. We show that the RBBM can be seen as an elegant alternative of k-nearest neighbor, leading to comparable performance without the need to store all data. We show that the RBBM has good classification performance compared to the MLP. The main advantage of the RBBM is that simultaneously with the input-output mapping, a model of the input space is obtained which can be used for learning with missing values. We derive learning rules for the case of incomplete data, and show that they perform better on incomplete data than the traditional learning rules on a 'repaired' data set.


2021 ◽  
pp. 58-60
Author(s):  
Naziru Fadisanku Haruna ◽  
Ran Vijay Kumar Singh ◽  
Samsudeen Dahiru

In This paper a modied ratio-type estimator for nite population mean under stratied random sampling using single auxiliary variable has been proposed. The expression for mean square error and bias of the proposed estimator are derived up to the rst order of approximation. The expression for minimum mean square error of proposed estimator is also obtained. The mean square error the proposed estimator is compared with other existing estimators theoretically and condition are obtained under which proposed estimator performed better. A real life population data set has been considered to compare the efciency of the proposed estimator numerically.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Bowen Yi ◽  
Da Shi ◽  
Fangfang Shi ◽  
Liang Zhang

Purpose By building on cooperation–competition theory, this study aims to investigate the multidimensional flipped effects of neighborhood hotels on Airbnb listings’ popularity, examining the degree to which such impacts are influenced by hotel types and geographical areas. Design/methodology/approach This study explores the interdependent and competitive relationship between neighborhood hotels and Airbnb from the perspective of effects on Airbnb listings’ popularity by exploring a data set covering 10,492 Airbnb listings and 2,691 hotels from Ctrip. Findings Results reveal that neighborhood hotels’ number of reviews, review ratings and prices each have positive spillover effects on Airbnb listings’ popularity, while quality assurance labels and negative review topic sentiments exert competitive effects on Airbnb popularity. Moreover, the number of budget chain hotels and high-star hotels have positive and negative effects on Airbnb popularity, respectively. Geographical areas also have a moderating effect on the relationship between various hotel-related influencing factors and Airbnb. Practical implications This study can offer hotel managers and Airbnb operators a clearer understanding of these businesses’ coexisting relationship. Findings can also provide Airbnb-specific guidelines for practitioners in terms of site selection, promotional features and development strategies for Airbnb listings. Originality/value This study establishes a cooperation–competition relationship model between hotels and Airbnb and considers the flipped effects of hotels on Airbnb for the first time. It expands previous studies by considering the multidimensional effects of hotels on Airbnb listings’ popularity and by examining the influences of hotel types and geographical areas on hotels’ impacts on Airbnb.


Author(s):  
Hai Wang ◽  
Shouhong Wang

Survey is one of the common data acquisition methods for data mining (Brin, Rastogi & Shim, 2003). In data mining one can rarely find a survey data set that contains complete entries of each observation for all of the variables. Commonly, surveys and questionnaires are often only partially completed by respondents. The possible reasons for incomplete data could be numerous, including negligence, deliberate avoidance for privacy, ambiguity of the survey question, and aversion. The extent of damage of missing data is unknown when it is virtually impossible to return the survey or questionnaires to the data source for completion, but is one of the most important parts of knowledge for data mining to discover. In fact, missing data is an important debatable issue in the knowledge engineering field (Tseng, Wang, & Lee, 2003).


Symmetry ◽  
2019 ◽  
Vol 11 (10) ◽  
pp. 1219 ◽  
Author(s):  
Shuhan Liu ◽  
Wenhao Gui

As it is often unavoidable to obtain incomplete data in life testing and survival analysis, research on censoring data is becoming increasingly popular. In this paper, the problem of estimating the entropy of a two-parameter Lomax distribution based on generalized progressively hybrid censoring is considered. The maximum likelihood estimators of the unknown parameters are derived to estimate the entropy. Further, Bayesian estimates are computed under symmetric and asymmetric loss functions, including squared error, linex, and general entropy loss function. As we cannot obtain analytical Bayesian estimates directly, the Lindley method and the Tierney and Kadane method are applied. A simulation study is conducted and a real data set is analyzed for illustrative purposes.


2014 ◽  
Vol 687-691 ◽  
pp. 1496-1499
Author(s):  
Yong Lin Leng

Partially missing or blurring attribute values make data become incomplete during collecting data. Generally we use inputation or discarding method to deal with incomplete data before clustering. In this paper we proposed an a new similarity metrics algorithm based on incomplete information system. First algorithm divided the data set into a complete data set and non complete data set, and then the complete data set was clustered using the affinity propagation clustering algorithm, incomplete data according to the design method of the similarity metric is divided into the corresponding cluster. In order to improve the efficiency of the algorithm, designing the distributed clustering algorithm based on cloud computing technology. Experiment demonstrates the proposed algorithm can cluster the incomplete big data directly and improve the accuracy and effectively.


Sign in / Sign up

Export Citation Format

Share Document