Confidence Sets for Statistical Classification

Wei Liu; Frank Bretz; Natchalee Srimaneekarn; Jianan Peng; Anthony J. Hayter

doi:10.3390/stats2030024

Confidence Sets for Statistical Classification

Stats ◽

10.3390/stats2030024 ◽

2019 ◽

Vol 2 (3) ◽

pp. 332-346 ◽

Cited By ~ 1

Author(s):

Wei Liu ◽

Frank Bretz ◽

Natchalee Srimaneekarn ◽

Jianan Peng ◽

Anthony J. Hayter

Keyword(s):

Confidence Set ◽

Unknown Parameters ◽

Confidence Sets ◽

Single Class ◽

Wide Range ◽

Desirable Feature ◽

Point Estimator ◽

Point Estimators ◽

Future Object

Classification has applications in a wide range of fields including medicine, engineering, computer science and social sciences among others. In statistical terms, classification is inference about the unknown parameters, i.e., the true classes of future objects. Hence, various standard statistical approaches can be used, such as point estimators, confidence sets and decision theoretic approaches. For example, a classifier that classifies a future object as belonging to only one of several known classes is a point estimator. The purpose of this paper is to propose a confidence-set-based classifier that classifies a future object into a single class only when there is enough evidence to warrant this, and into several classes otherwise. By allowing classification of an object into possibly more than one class, this classifier guarantees a pre-specified proportion of correct classification among all future objects. An example is provided to illustrate the method, and a simulation study is included to highlight the desirable feature of the method.
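
The idea of a set-valued classifier can be illustrated with a minimal sketch. It is not the procedure of the paper: it assumes univariate normal classes whose means and standard deviations are known exactly, whereas the abstract concerns unknown parameters and a critical constant that is not reproduced here. The names `confidence_set` and `CLASSES` and all numeric values are hypothetical.

```python
from statistics import NormalDist


def confidence_set(x, classes, level=0.95):
    """Return every class label whose acceptance region contains x.

    Assumption: each class is normal with KNOWN (mean, sd), so the
    two-sided region |x - mean| / sd <= z has probability `level`
    under that class.
    """
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    return {
        label
        for label, (mean, sd) in classes.items()
        if abs(x - mean) / sd <= z
    }


# Hypothetical classes: A and B overlap heavily, C is well separated.
CLASSES = {"A": (0.0, 1.0), "B": (1.0, 1.0), "C": (10.0, 1.0)}

print(confidence_set(0.5, CLASSES))   # ambiguous object: several classes
print(confidence_set(10.0, CLASSES))  # clear-cut object: a single class
```

Under these assumptions the returned set contains the true class of an object with probability `level`, because each class's region has that coverage. An object far from every class would receive an empty set in this simplified version.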


Confidence Sets for Statistical Classification (II): Exact Confidence Sets

Stats ◽

10.3390/stats2040030 ◽

2019 ◽

Vol 2 (4) ◽

pp. 439-446

Author(s):

Wei Liu ◽

Frank Bretz ◽

Anthony J. Hayter

Keyword(s):

Social Sciences ◽

Statistical Classification ◽

Confidence Set ◽

Confidence Sets ◽

Single Class ◽

Wide Range ◽

Future Object ◽

Critical Constant ◽

Better Than

Classification has applications in a wide range of fields including medicine, engineering, computer science and social sciences among others. Liu et al. (2019) proposed a confidence-set-based classifier that classifies a future object into a single class only when there is enough evidence to warrant this, and into several classes otherwise. By allowing classification of an object into possibly more than one class, this classifier guarantees a pre-specified proportion of correct classification among all future objects. However, the classifier uses a conservative critical constant. In this paper, we show how to determine the exact critical constant in applications where prior knowledge about the proportions of the future objects from each class is available. As the exact critical constant is smaller than the conservative critical constant given by Liu et al. (2019), the classifier using the exact critical constant is better than the classifier by Liu et al. (2019) as expected. An example is provided to illustrate the method.


Modes of Interaction in Naturally Occurring Medical Encounters With General Practitioners: The “One in a Million” Study

Qualitative Health Research ◽

10.1177/1049732321993790 ◽

2021 ◽

pp. 104973232199379

Author(s):

Olaug S. Lian ◽

Sarah Nettleton ◽

Åge Wifstad ◽

Christopher Dowrick

Keyword(s):

General Practitioners ◽

Narrative Analysis ◽

Mode Competition ◽

General Applicability ◽

Naturally Occurring ◽

Wide Range ◽

The One ◽

Narrative Mode ◽

Modes Of Interaction

In this article, we qualitatively explore the manner and style in which medical encounters between patients and general practitioners (GPs) are mutually conducted, as exhibited in situ in 10 consultations sourced from the One in a Million: Primary Care Consultations Archive in England. Our main objectives are to identify interactional modes, to develop a classification of these modes, and to uncover how modes emerge and shift both within and between consultations. Deploying an interactional perspective and a thematic and narrative analysis of consultation transcripts, we identified five distinctive interactional modes: question and answer (Q&A) mode, lecture mode, probabilistic mode, competition mode, and narrative mode. Most modes are GP-led. Mode shifts within consultations generally map on to the chronology of the medical encounter. Patient-led narrative modes are initiated by patients themselves, which demonstrates agency. Our classification of modes derives from complete naturally occurring consultations, covering a wide range of symptoms, and may have general applicability.


Uncertainty-Aware Deep Learning-Based Cardiac Arrhythmias Classification Model of Electrocardiogram Signals

Computers ◽

10.3390/computers10060082 ◽

2021 ◽

Vol 10 (6) ◽

pp. 82

Author(s):

Ahmad O. Aseeri

Keyword(s):

Deep Learning ◽

Cardiac Arrhythmias ◽

Large Scale ◽

Clinical Decision Making ◽

Probabilistic Approach ◽

Classification Model ◽

Gating Mechanism ◽

Uncertainty Estimates ◽

Wide Range

Deep learning-based methods have emerged as one of the most effective and practical solutions to a wide range of medical problems, including the diagnosis of cardiac arrhythmias. A critical step toward early diagnosis of many heart dysfunctions is the accurate detection and classification of cardiac arrhythmias, which can be achieved via electrocardiograms (ECGs). Motivated by the desire to enhance conventional clinical methods for diagnosing cardiac arrhythmias, we introduce an uncertainty-aware deep learning-based predictive model for accurate large-scale classification of cardiac arrhythmias, trained and evaluated on three benchmark medical datasets. In addition, because quantifying uncertainty is vital for clinical decision-making, our method incorporates a probabilistic approach that captures the model’s uncertainty using a Bayesian-based approximation method without introducing additional parameters or significant changes to the network’s architecture. Although many arrhythmia classification solutions with various ECG feature engineering techniques have been reported in the literature, the AI-based probabilistic method introduced in this paper outperforms existing methods in multiclass classification, with F1 scores of 98.62% and 96.73% on the MIT-BIH dataset (20 annotations), 99.23% and 96.94% on the INCART dataset (eight annotations), and 97.25% and 96.73% on the BIDMC dataset (six annotations), for the deep ensemble and probabilistic mode, respectively. We demonstrate our method’s high performance and statistical reliability in numerical experiments on language modeling using the gating mechanism of Recurrent Neural Networks.


Classification of unlabeled online media

Scientific Reports ◽

10.1038/s41598-021-85608-5 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Sakthi Kumar Arul Prakash ◽

Conrad Tucker

Keyword(s):

Social Media ◽

Real World ◽

Graphical Model ◽

Ground Truth ◽

Classification Problem ◽

Machine Learning Algorithms ◽

Social Media Networks ◽

Online Social Media ◽

Wide Range

This work investigates the ability to classify misinformation in online social media networks in a manner that avoids the need for ground truth labels. Rather than approach the classification problem as a task for humans or machine learning algorithms, this work leverages user–user and user–media (i.e., media likes) interactions to infer the type of information (fake vs. authentic) being spread, without needing to know the actual details of the information itself. To study the inception and evolution of user–user and user–media interactions over time, we create an experimental platform that mimics the functionality of real-world social media networks. We develop a graphical model that considers the evolution of this network topology to model the uncertainty (entropy) propagation when fake and authentic media disseminate across the network. The creation of a real-world social media network enables a wide range of hypotheses to be tested pertaining to users, their interactions with other users, and with media content. The discovery that the entropy of user–user and user–media interactions approximates fake and authentic media likes enables us to classify fake media in an unsupervised manner.


Sequence data from isolated lichen-associated melanized fungi enhance delimitation of two new lineages within Chaetothyriomycetidae

Mycological Progress ◽

10.1007/s11557-021-01706-8 ◽

2021 ◽

Vol 20 (7) ◽

pp. 911-927

Author(s):

Lucia Muggia ◽

Yu Quan ◽

Cécile Gueidan ◽

Abdullah M. S. Al-Hatmi ◽

Martin Grube ◽

...

Keyword(s):

Sequence Data ◽

Single Species ◽

Sister Group ◽

Asexual Propagation ◽

Dna Sequence Data ◽

Wide Range ◽

The Family ◽

Rock Inhabiting Fungi ◽

Stable Habitat

Lichen thalli provide a long-lived and stable habitat for colonization by a wide range of microorganisms. Increased interest in these lichen-associated microbial communities has revealed an impressive diversity of fungi, including several novel lineages which still await formal taxonomic recognition. Among these, members of the Eurotiomycetes and Dothideomycetes usually occur asymptomatically in the lichen thalli, even if they share ancestry with fungi that may be parasitic on their host. Mycelia of the isolates are characterized by melanized cell walls and the fungi display exclusively asexual propagation. Their taxonomic placement requires, therefore, the use of DNA sequence data. Here, we consider recently published sequence data from lichen-associated fungi and characterize and formally describe two new, individually monophyletic lineages at family, genus, and species levels. The Pleostigmataceae fam. nov. and Melanina gen. nov. both comprise rock-inhabiting fungi that associate with epilithic, crust-forming lichens in subalpine habitats. The phylogenetic placement and the monophyly of Pleostigmataceae lack statistical support, but the family was resolved as sister to the order Verrucariales. This family comprises the species Pleostigma alpinum sp. nov., P. frigidum sp. nov., P. jungermannicola, and P. lichenophilum sp. nov. The placement of the genus Melanina is supported as a lineage within the Chaetothyriales. To date, this genus comprises the single species M. gunde-cimermaniae sp. nov. and forms a sister group to a large lineage including Herpotrichiellaceae, Chaetothyriaceae, Cyphellophoraceae, and Trichomeriaceae. The new phylogenetic analysis of the subclass Chaetothyriomycetidae provides new insight into genus and family level delimitation and classification of this ecologically diverse group of fungi.


Introduction of a new AAC symbol classification system: the multidimensional quaternary symbol continuum (MQSC)

Journal of Enabling Technologies ◽

10.1108/jet-04-2021-0024 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Eliada Pampoulou ◽

Donald R. Fuller

Keyword(s):

Peer Review ◽

Classification System ◽

Augmentative And Alternative Communication ◽

Social Issues ◽

Content Type ◽

Design Attributes ◽

Wide Range ◽

Current Classification ◽

Symbol Classification

Purpose: When the augmentative and alternative communication (AAC) model (Lloyd et al., 1990) was proposed, these components of symbols were not considered, nor were they contemplated when superordinate (Lloyd and Fuller, 1986) and subordinate levels (Fuller et al., 1992) of AAC symbol taxonomy were developed. The purpose of this paper is to revisit the AAC model and propose a new symbol classification system called the multidimensional quaternary symbol continuum (MQSC).

Design/methodology/approach: The field of AAC is evolving at a rapid rate in terms of its clinical, social, research and theoretical underpinnings. Advances in assessment and intervention methods, technology and social issues are all responsible to some degree for the significant changes that have occurred in the field of AAC over the last 30 years. For example, the number of aided symbol collections has increased almost exponentially over the past couple of decades. The proliferation of such a large variety of symbol collections represents a wide range of design attributes, physical attributes and linguistic characteristics for aided symbols, and design attributes and linguistic characteristics for unaided symbols.

Findings: Therefore, it may be time to revisit the AAC model and, more specifically, one of its transmission processes referred to as the means to represent.

Originality/value: The focus of this theoretical paper, then, is on the current classification of symbols, issues with respect to the current classification of symbols in terms of ambiguity of terminology and the evolution of symbols, and a proposal for a new means of classifying the means to represent.

Peer review: The peer review history for this article is available at: https://publons.com/publon10.1108/JET-04-2021-0024


Classification of Fake Product Ratings Using a Timeline Based Approach

International Journal of Business Administration and Management Research ◽

10.24178/ijbamr.2017.3.2.12 ◽

2017 ◽

Vol 3 (2) ◽

pp. 12 ◽

Cited By ~ 1

Author(s):

Neha Thomas ◽

Susan Elias

Keyword(s):

Language Processing ◽

Opinion Mining ◽

Optimal Point ◽

Linear Classifiers ◽

Wide Range ◽

Text Content ◽

Classification Tool ◽

Fake Reviews ◽

Product Ratings

Detection of fake reviews and reviewers is currently a challenging problem in cyberspace. It is challenging primarily because the methodology used to fake reviews is dynamic. Several aspects must be considered when analyzing reviews to classify them effectively as genuine or fake. Sentiment analysis, opinion mining and intent mining are fields of research that try to accomplish this goal through Natural Language Processing of the text content of the review. In this paper, an approach that uses the review ratings evaluated along a timeline is presented. An Amazon dataset comprising ratings for a wide range of products was used for the analysis presented here. The analysis of the ratings was carried out for an electronic product over a period of six years. The computed average rating helps to identify linear classifiers that define solution boundaries within the data space. This enables a product-specific classification of review ratings, and suitable recommendations can also be generated automatically. The paper explains a methodology to evaluate the average product ratings over time and presents the research outcomes using a novel classification tool. The proposed approach helps to determine the optimal point to distinguish between fake and genuine ratings for each product. Index Terms: Fake reviews, Fake Ratings, Product Ratings, Online Shopping, Amazon Dataset.


CLASSIFICATION OF ARTISTIC GLAZED CERAMICS OF DERBENT OF THE END VIII-X CENTURIES. (on the example of materials from excavations XXVII and XXXIII)

History Archeology and Ethnography of the Caucasus ◽

10.32653/ch163714-759 ◽

2020 ◽

Vol 16 (3) ◽

pp. 714-759

Author(s):

Zuhra Z. Kuzeeva

Keyword(s):

Large Set ◽

Red Clay ◽

Large Section ◽

Wide Range ◽

Opaque Glaze ◽

Clay Ceramics ◽

The City ◽

Household Ceramics ◽

Archaeological Excavations

The article is devoted to the classification analysis of the glazed ceramics of Derbent, originating from the materials of archaeological excavations (excavations XXVII and XXXIII) carried out in the city in 2014-2015. The chronological framework of the study was provisionally set at the end of the VIII-X centuries. The relevance of the topic lies in the importance of studying the glazed ceramics of Derbent as a source of a large set of information (historical, cultural and socio-economic interactions of Derbent with a wide range of countries of the Near and Far Abroad). The typology of glazed ceramics in Derbent at the end of the 8th-10th centuries is considered in the article on the basis of modern methodological developments built on three main approaches to the study of any ceramics: the study of technology together with the morphology and decor of the dishes. All investigated ceramics, consisting of fragments of rims, bodies, bases and handles of vessels, are included in one large Section: Household ceramics. This section includes three departments, which are based on the analysis of the clay color of the shard (red clay, brown clay, beige clay ceramics), which determines the technology for the production of dishes. Based on the presence or absence of engobe on the ceramics, two sub-departments are allocated in each department. The next division is the groups, which are formed according to the degree of transparency of the glaze. There are three of them: ceramics with transparent, translucent, and opaque (dull) glaze. Within some groups, four subgroups are additionally distinguished, determined by the color of the glaze. According to the peculiarities of the additional decor, the types (overglaze, underglaze ornament) and subtypes (painting, engraving, combination of painting with engraving, relief ornament) of ceramics are distinguished. Thus, the characteristics of the glazed ceramics of Derbent from these excavations include: Section, department, sub-department, group, subgroup, type, subtype.


COMPARATIVE ANALYSIS OF DIFFERENT CRITERIA OF SITE STRUCTURAL STABILITY IN THE TIME SERIES

VESTNIK OF ASTRAKHAN STATE TECHNICAL UNIVERSITY SERIES MANAGEMENT COMPUTER SCIENCE AND INFORMATICS ◽

10.24143/2072-9502-2019-1-119-128 ◽

2019 ◽

pp. 119-128

Author(s):

Liliya Andreevna Landman ◽

Andrei Vladimirovich Faddeenkov

Keyword(s):

Time Series ◽

Regression Model ◽

Structural Changes ◽

Least Squares Method ◽

Complex Structure ◽

Standard Normal Distribution ◽

Regression Equations ◽

Unknown Parameters ◽

Chow Test ◽

Wide Range

The concept of structure is used to describe a set of stable relations between the main parts of an object, which describe its integrity and identity, i.e., the preservation of its basic properties under a wide range of internal and external changes. This concept usually relates to the concepts of system and organization. The structure expresses a stable part of the system that changes only slightly during different reforms. Over the years, structural changes take place because of active economic policy or as a result of spontaneous, uncontrollable processes. Therefore, it seems quite natural to find out whether there have been structural changes in the observation period, and to reflect them in the specification of the model. The basic ideas of methods for determining structural changes in time series dynamics have been considered, such as the Chow test, the Gujarati test and the Poirier method. The power study was conducted for three possible cases of change in time series trends. The random error was modeled according to the standard normal distribution. A linear multiple regression model with three independent variables was used as the time series model. The vector of unknown parameters of the model was estimated using the least squares method. For each of the three criteria, the test of the null hypothesis about time series instability was carried out using the F-criterion, which involves finding the residual sum of squares of a regression model and analyzing the relation between its decline and the loss of degrees of freedom. It can be noted that the Gujarati and Poirier equations have a more complex structure than the equation of the Chow test; however, using the Chow test requires estimating the parameters of three regression equations.
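
The three regressions behind the Chow test can be sketched as follows. This is an illustrative simplification, not the authors' experiment: it assumes a single regressor with an intercept (k = 2 parameters) instead of the three independent variables mentioned above, a known break point, and made-up data. The function names are hypothetical.

```python
def rss_simple_ols(xs, ys):
    """Residual sum of squares of the least-squares line y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))


def chow_f(xs, ys, split, k=2):
    """Chow F statistic for a break at index `split`.

    Three regressions are fitted: one on the pooled sample and one on
    each sub-sample. k is the number of parameters per regression.
    """
    rss_pooled = rss_simple_ols(xs, ys)
    rss_split = (rss_simple_ols(xs[:split], ys[:split])
                 + rss_simple_ols(xs[split:], ys[split:]))
    dof = len(xs) - 2 * k  # residual degrees of freedom of the split model
    return ((rss_pooled - rss_split) / k) / (rss_split / dof)


# Made-up series with small alternating noise.
xs = list(range(10))
noise = [0.1 if i % 2 == 0 else -0.1 for i in xs]
stable = [x + e for x, e in zip(xs, noise)]                   # one trend
broken = [(x if x < 5 else 10 - x) + e for x, e in zip(xs, noise)]  # trend reverses

print(chow_f(xs, stable, split=5))  # small F: no evidence of a break
print(chow_f(xs, broken, split=5))  # large F: strong evidence of a break
```

The statistic compares the drop in residual sum of squares obtained by splitting the sample against the k degrees of freedom spent on the extra parameters; it is referred to an F distribution with k and n - 2k degrees of freedom.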


Classification of Actions or Inheritance also for Methods

DAIMI Report Series ◽

10.7146/dpb.v16i231.7587 ◽

1987 ◽

Vol 16 (231) ◽

Author(s):

Bent Bruun Kristensen ◽

Ole Lehrmann Madsen ◽

Birger Møller-Pedersen ◽

Kristen Nygaard

Keyword(s):

General Class ◽

Object Oriented ◽

Wide Range ◽

Language Classes ◽

Virtual Classes ◽

Main Thing

The main thing with the sub-class mechanism as found in languages like C++, SIMULA and Smalltalk is its possibility to express specializations. A general class, covering a wide range of objects, may be specialized to cover more specific objects. This is obtained by three properties of sub-classing: an object of a sub-class inherits the attributes of the super-class, virtual procedure/method attributes (of the super-class) may be specialized in the sub-class, and (in SIMULA only) it inherits the actions of the super-class. In the languages mentioned above, virtual procedures/methods of a super-class are specialized in sub-classes in a very primitive manner: they are simply re-defined and need not bear any resemblance to the virtual in the super-class. In BETA, a new object-oriented language, classes and methods are unified into one concept, and by an extension of the virtual concept, virtual procedures/methods in sub-classes are defined as specializations of the virtuals in the super-class. The virtual procedures/methods of the sub-classes thus inherit the attributes (e.g. parameters) and actions from the "super-procedure/method". In the languages mentioned above only procedures/methods may be virtual. As classes and procedures/methods are unified in BETA, this also gives virtual classes. The paper demonstrates how this may be used to parameterize types and enforce constraints on types.
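
The contrast between re-defining a method and specializing it can be sketched in Python. This is only an emulation: Python has no construct for inheriting the actions of a super-method, so the sketch assumes a template-method hook in its place. The class and method names are hypothetical and do not come from the paper.

```python
class Account:
    def __init__(self):
        self.log = []

    def withdraw(self, amount):
        """Super-method: its actions always run around the specialization."""
        self.log.append("check balance")
        self._withdraw_inner(amount)  # placeholder for sub-class actions
        self.log.append("record transaction")

    def _withdraw_inner(self, amount):
        """Hook that sub-classes specialize; does nothing by default."""


class SavingsAccount(Account):
    # Specialization: adds actions inside the super-method's frame,
    # so the super-class actions are inherited rather than replaced.
    def _withdraw_inner(self, amount):
        self.log.append("apply withdrawal limit")


class OverridingAccount(Account):
    # Plain re-definition: nothing of the super-method survives.
    def withdraw(self, amount):
        self.log.append("do something unrelated")


savings = SavingsAccount()
savings.withdraw(10)
print(savings.log)

overriding = OverridingAccount()
overriding.withdraw(10)
print(overriding.log)
```

In the specialized version the super-class decides where sub-class actions run, so a sub-class cannot drop the balance check or the transaction record. In the re-defined version the sub-class is free to ignore both.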
