The Data Selection Criteria for HSC and SVM Algorithms

Author(s):  
Qing He ◽  
Fuzhen Zhuang ◽  
Zhongzhi Shi
2018 ◽  
Vol 25 (1) ◽  
pp. 129-143 ◽  
Author(s):  
Guo-Yuan Lien ◽  
Daisuke Hotta ◽  
Eugenia Kalnay ◽  
Takemasa Miyoshi ◽  
Tse-Chun Chen

Abstract. To successfully assimilate data from a new observing system, it is necessary to develop appropriate data selection strategies, assimilating only the generally useful data. This development work is usually done by trial and error using observing system experiments (OSEs), which are very time and resource consuming. This study proposes a new, efficient methodology to accelerate the development using ensemble forecast sensitivity to observations (EFSO). First, non-cycled assimilation of the new observation data is conducted to compute EFSO diagnostics for each observation within a large sample. Second, the average EFSO conditionally sampled in terms of various factors is computed. Third, potential data selection criteria are designed based on the non-cycled EFSO statistics, and tested in cycled OSEs to verify the actual assimilation impact. The usefulness of this method is demonstrated with the assimilation of satellite precipitation data. It is shown that the EFSO-based method can efficiently suggest data selection criteria that significantly improve the assimilation results.


2019 ◽  
Vol 28 (08) ◽  
pp. 1950066 ◽  
Author(s):  
N. G. Kelkar ◽  
H. Kamada ◽  
M. Skurzok

The possibility for the existence of the exotic [Formula: see text] states is explored with the objective of calculating the [Formula: see text] momentum distribution inside such nuclei. Even though the latter is an essential ingredient for the analysis of the experimental data on the [Formula: see text], [Formula: see text] and [Formula: see text] reactions aimed at finding an [Formula: see text]-mesic 3He, the data analysis is usually performed by approximating the [Formula: see text] momentum distribution by that of a nucleon. Here, we present calculations performed by solving the three-body Faddeev equations to obtain the momentum distribution of the [Formula: see text] inside possible ([Formula: see text])[Formula: see text], ([Formula: see text])[Formula: see text] and ([Formula: see text])[Formula: see text]-[Formula: see text] states. The [Formula: see text] momentum distributions are found to be much narrower than those of the nucleons and influence the data selection criteria.


2009 ◽  
Vol 66 (8) ◽  
pp. 1673-1680 ◽  
Author(s):  
Mark R. Payne ◽  
Lotte Worsøe Clausen ◽  
Henrik Mosegaard

Abstract Payne, M. R., Clausen, L. W., and Mosegaard, H. 2009. Finding the signal in the noise: objective data-selection criteria improve the assessment of western Baltic spring-spawning herring. – ICES Journal of Marine Science, 66: 1673–1680. In the art of fish-stock assessment, it is common practice to include all available data without properly testing their validity in terms of their signal-to-noise ratio. The western Baltic spring-spawning herring (Clupea harengus) stock has been historically difficult to assess in a reliable manner. The population is spread between the Skagerrak, Kattegat, the Danish islands, and the western Baltic, but the distribution depends on age and season. Although the distribution area is covered by five separate surveys, none covers the entire stock. Using all time-series data may cause high noise levels and could lead to a poor-quality assessment. We examine the temporal and spatial coverage of each survey in terms of current biological understanding of stock distribution and, employing the observed internal consistency between age classes within cohorts as additional criteria, select the most appropriate data subsets. Assessments based on the revised dataset show greatly improved quality in terms of both accuracy and precision. The results highlight the often-ignored principle that a judicious choice of input data, based on rational and justifiable selection criteria, can enhance the ultimate quality of a stock assessment.


2000 ◽  
Vol 2000 (10) ◽  
pp. 57-72
Author(s):  
Philip N. Brink ◽  
Andrew D. Santini ◽  
Benjamin J. Sherman ◽  
Mark J. TenBroek

F1000Research ◽  
2014 ◽  
Vol 3 ◽  
pp. 218 ◽  
Author(s):  
Ye Hu ◽  
Jürgen Bajorath

Drug promiscuity and polypharmacology are much discussed topics in pharmaceutical research. Experimentally, promiscuity can be studied by profiling of compounds on arrays of targets. Computationally, promiscuity rates can be estimated by mining of compound activity data. In this study, we have assessed drug promiscuity over time by systematically collecting activity records for approved drugs. For 518 diverse drugs, promiscuity rates were determined over different time intervals. Significant differences between the number of reported drug targets and the promiscuity rates derived from activity records were frequently observed. On the basis of high-confidence activity data, an increase in average promiscuity rates from 1.5 to 3.2 targets per drug was detected between 2000 and 2014. These promiscuity rates are lower than often assumed. When the stringency of data selection criteria was reduced in subsequent steps, non-realistic increases in promiscuity rates from ~6 targets per drug in 2000 to more than 28 targets were obtained. Hence, estimates of drug promiscuity significantly differ depending on the stringency with which target annotations and activity data are considered.


Ecology ◽  
1999 ◽  
Vol 80 (4) ◽  
pp. 1132-1141 ◽  
Author(s):  
Göran Englund ◽  
Orlando Sarnelle ◽  
Scott D. Cooper

Sign in / Sign up

Export Citation Format

Share Document