Selected Papers from InTech'04

Author(s):  
Richard Aló ◽  
Vladik Kreinovich

The main objective of the annual International Conference on Intelligent Technologies (InTech) is to bring together researchers and practitioners who implement intelligent and fuzzy technologies in real-world environments. The Fifth International Conference on Intelligent Technologies, InTech'04, was held in Houston, Texas, on December 2-4, 2004. Topics of InTech'04 included mathematical foundations of intelligent technologies, traditional Artificial Intelligence techniques, uncertainty processing and methods of soft computing, learning/adaptive systems/data mining, and applications of intelligent technologies. This special issue contains versions of 15 selected papers originally presented at InTech'04. These papers cover most of the topics of the conference.

Several papers describe new applications of existing intelligent techniques. R. Aló et al. show how traditional <I>statistical</I> hypothesis testing techniques – originally designed for processing measurement results – need to be modified when applied to simulated data, e.g., when we compare the quality of two algorithms. Y. Frayman et al. use <I>mathematical morphology</I> and <I>genetic algorithms</I> in the design of a machine vision system for detecting surface defects in aluminum die casting. Y. Murai et al. propose a new, faster <I>entropy</I>-based placement algorithm for VLSI circuit design and similar applications. A. P. Salvatore et al. show how <I>expert system</I>-type techniques can help in scheduling botox treatments for voice disorders. H. Tsuji et al. propose a new method, based on <I>partial differential equations</I>, for automatically identifying and extracting objects from a video. N. Ward uses <I>Ordered Weighted Average</I> (OWA) techniques to design a model that predicts the admission of computer science students into different graduate schools.

An important aspect of intelligence is the ability to <I>learn</I>. In A. Mahaweerawat et al., neural-based machine learning is <I>used</I> to identify and predict software faults. J. Han et al. show that we can drastically <I>improve</I> the quality of machine learning if, in addition to discovering traditional (positive) rules, we also search for negative rules. A serious problem with many neural-based machine learning algorithms is that the results of their learning are often unintelligible rules and numbers. M. I. Khan et al. show, on the example of robotic arm applications, that if we allow neurons with different input-output dependencies – including linear neurons – then we can <I>extract</I> meaningful <I>knowledge</I> from the resulting network.

Several papers analyze the Equivalent Transformation (ET) model, which allows the user to <I>automatically generate code from specifications</I>. A general description of this model is given by K. Akama et al. P. Chippimolchai et al. describe how, within this model, we can transform a user's query into an equivalent, more efficient one. H. Koike et al. apply this approach to <I>natural language processing</I>. Y. Shigeta et al. show how existing <I>constraint</I> techniques can be translated into equivalent transformation rules and thus combined with other specifications. I. Takarajima et al. extend the ET approach to situations like <I>parallel computations</I>, where the order in which different computations are performed on different processors depends on other processes and is thus non-deterministic.

Finally, a paper by J. Chandra – based on his invited talk at InTech'04 – describes a <I>general framework</I> for robust and resilient critical infrastructure systems, with potential applications to transportation systems, power grids, communication networks, water resources, health delivery systems, and financial networks.
We want to thank all the authors for their outstanding work, the participants of InTech'04 for their helpful suggestions, the anonymous reviewers for their thorough analysis and constructive help, and – last but not least – Professor Kaoru Hirota for his kind suggestion to host this issue, as well as the entire staff of the journal for their tireless work.

2021 ◽  
Vol 48 (4) ◽  
pp. 41-44
Author(s):  
Dena Markudova ◽  
Martino Trevisan ◽  
Paolo Garza ◽  
Michela Meo ◽  
Maurizio M. Munafo ◽  
...  

With the spread of broadband Internet, Real-Time Communication (RTC) platforms have become increasingly popular and have transformed the way people communicate. It is thus fundamental that the network adopt traffic management policies that ensure an appropriate Quality of Experience for users of RTC applications. A key step is the identification of the applications behind RTC traffic, which in turn allows the network to allocate adequate resources and make decisions based on the specific application's requirements. In this paper, we introduce a machine learning-based system for identifying the traffic of RTC applications. It builds on the domains contacted before a call starts and leverages techniques from Natural Language Processing (NLP) to build meaningful features. Our system works in real time and is robust to the peculiarities of the RTP implementations of different applications, since it uses only control traffic. Experimental results show that our approach classifies 5 well-known meeting applications with an F1 score of 0.89.
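As a rough illustration of the idea (not the authors' implementation), the stdlib-only sketch below treats the domains contacted before a call as a bag of character trigrams and assigns an unseen call to the application with the most similar profile; all application names and domains are invented:

```python
from collections import Counter
import math

def ngrams(domain, n=3):
    """Character n-grams of a domain name, e.g. 'api.example' -> 'api', 'pi.', ..."""
    return [domain[i:i + n] for i in range(len(domain) - n + 1)]

def featurize(domains):
    """Bag of character trigrams over all domains contacted before a call."""
    bag = Counter()
    for d in domains:
        bag.update(ngrams(d))
    return bag

def cosine(a, b):
    dot = sum(a[k] * b[k] for k in a if k in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# Hypothetical per-application profiles: domains seen before calls of each app.
profiles = {
    "AppA": featurize(["turn.appa.example", "api.appa.example"]),
    "AppB": featurize(["relay.appb.example", "cdn.appb.example"]),
}

def classify(domains):
    """Assign an unseen call to the application with the most similar profile."""
    feats = featurize(domains)
    return max(profiles, key=lambda app: cosine(feats, profiles[app]))

print(classify(["turn.appa.example", "stun.appa.example"]))
```

Because only domain names from control traffic are used, the approach is independent of how each application implements RTP.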


Proceedings ◽  
2021 ◽  
Vol 77 (1) ◽  
pp. 17
Author(s):  
Andrea Giussani

In the last decade, advances in statistical modeling and computer science have boosted the production of machine-generated content in different fields: from language to image generation, the quality of the generated outputs is remarkably high, sometimes better than that produced by a human being. Modern technological advances such as OpenAI’s GPT-2 (and recently GPT-3) permit automated systems to dramatically alter reality with synthetic outputs, so that humans are not able to distinguish the real copy from its counterfeit. An example is given by an article entirely written by GPT-2, but many other examples exist. In the field of computer vision, Nvidia’s Generative Adversarial Network, commonly known as StyleGAN (Karras et al. 2018), has become the de facto reference point for the production of a huge amount of fake human face portraits; additionally, recent algorithms have been developed to create both musical scores and mathematical formulas. This presentation aims to give participants an overview of the state-of-the-art results in this field: we will cover both GANs and language modeling with recent applications. The novelty here is that we apply a transformer-based machine learning technique, namely RoBERTa (Liu et al. 2019), to the detection of human-produced versus machine-produced text in the context of fake news detection. RoBERTa is a recent algorithm based on the well-known Bidirectional Encoder Representations from Transformers algorithm, known as BERT (Devlin et al. 2018); this is a bidirectional transformer for natural language processing, developed by Google and pre-trained over a huge amount of unlabeled textual data to learn embeddings. We then use these representations as input to our classifier to detect real vs. machine-produced text. The application is demonstrated in the presentation.


2021 ◽  
Vol 22 (Supplement_1) ◽  
Author(s):  
M Omer ◽  
A Amir-Khalili ◽  
A Sojoudi ◽  
T Thao Le ◽  
S A Cook ◽  
...  

Abstract
Funding Acknowledgements: Type of funding sources: Public grant(s) – National budget only. Main funding source(s): SmartHeart EPSRC programme grant (www.nihr.ac.uk), London Medical Imaging and AI Centre for Value-Based Healthcare.
Background: Quality measures for machine learning algorithms include clinical measures such as end-diastolic (ED) and end-systolic (ES) volume, volumetric overlaps such as the Dice similarity coefficient, and surface distances such as the Hausdorff distance. These measures capture differences between manually drawn and automated contours but fail to capture a clinician's trust in an automatically generated contour.
Purpose: We propose to directly capture clinicians' trust in a systematic way. We display manual and automated contours sequentially in random order and ask clinicians to score the contour quality. We then perform statistical analysis for both sources of contours and stratify the results by contour type.
Data: The data selected for this experiment came from the National Heart Centre Singapore. It comprises CMR scans from 313 patients with diverse pathologies, including: healthy, dilated cardiomyopathy (DCM), hypertension (HTN), hypertrophic cardiomyopathy (HCM), ischemic heart disease (IHD), left ventricular non-compaction (LVNC), and myocarditis. Each study contains a short axis (SAX) stack, with ED and ES phases manually annotated. Automated contours are generated for each SAX image for which a manual annotation is available. For this, a machine learning algorithm trained at Circle Cardiovascular Imaging Inc. is applied and the resulting predictions are saved to be displayed in the contour quality scoring (CQS) application.
Methods: The CQS application displays manual and automated contours in random order and presents the user with the option to assign a contour quality score (1: Unacceptable, 2: Bad, 3: Fair, 4: Good). The UK Biobank standard operating procedure is used for assessing the quality of the contoured images. Quality scores are assigned based on how the contour affects clinical outcomes. However, as images are presented independent of spatiotemporal context, contour quality is assessed based on how well the area of the delineated structure is approximated. Consequently, small contours and small deviations are rarely assigned a quality score of less than 2, as they are not clinically relevant. Special attention is given to the RV-endo contours, as often, mostly in basal images, two separate contours appear. In such cases, a score of 3 is given if the two disjoint contours sufficiently encompass the underlying anatomy; otherwise they are scored as 2 or 1.
Results: A total of 50991 quality scores (24208 manual and 26783 automated) are generated by five expert raters. The mean scores for all manual and automated contours are 3.77 ± 0.48 and 3.77 ± 0.52, respectively. The breakdown of mean quality scores by contour type is included in Fig. 1a, while the distribution of quality scores for the various raters is shown in Fig. 1b.
Conclusion: We proposed a method of comparing the quality of manual versus automated contouring methods. Results suggest similar statistics in quality scores for both sources of contours.
Abstract Figure 1
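The stratified analysis reduces to mean ± standard deviation per contour source; a minimal sketch with invented toy scores (the study's real data has ~24k manual and ~27k automated scores):

```python
import statistics

# Hypothetical quality scores (1=Unacceptable .. 4=Good), keyed by contour source.
scores = {
    "manual":    [4, 4, 3, 4, 4, 3, 4, 4],
    "automated": [4, 3, 4, 4, 4, 4, 3, 4],
}

# Report mean +/- sample standard deviation, stratified by source.
for source, vals in scores.items():
    print(f"{source}: {statistics.mean(vals):.2f} "
          f"± {statistics.stdev(vals):.2f} (n={len(vals)})")
```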


2020 ◽  
Vol 7 (10) ◽  
pp. 380-389
Author(s):  
Asogwa D.C ◽  
Anigbogu S.O ◽  
Anigbogu G.N ◽  
Efozia F.N

Author age prediction is the task of determining an author's age by studying the texts they have written. Predicting an author's age can be enlightening about the different trends, opinions, and social and political views of an age group. Marketers often use this to promote a product or a service to an age group in line with its conveyed interests and opinions. Methodologies in natural language processing have made it possible to predict an author's age from text by examining the variation of linguistic characteristics. Many machine learning algorithms have also been used for author age prediction. However, in social networks, computational linguists are challenged with numerous issues, just as machine learning techniques, being performance-driven, face their own challenges in realistic scenarios. This work developed a model that can predict an author's age from text with a machine learning algorithm (Naïve Bayes) using three types of features, namely content-based, style-based, and topic-based. The trained model gave a prediction accuracy of 80%.
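A minimal sketch of the kind of classifier involved, assuming a multinomial Naïve Bayes over raw word counts with invented toy texts and age groups (the work's actual content-, style-, and topic-based features are richer):

```python
import math
from collections import Counter, defaultdict

# Hypothetical training texts labelled with author age groups.
train = [
    ("lol this game is so cool", "13-19"),
    ("omg cant wait for the weekend", "13-19"),
    ("the quarterly report is attached for review", "40-59"),
    ("please find the meeting minutes enclosed", "40-59"),
]

# Count words per class for multinomial Naive Bayes.
word_counts = defaultdict(Counter)
class_counts = Counter()
vocab = set()
for text, label in train:
    words = text.split()
    word_counts[label].update(words)
    class_counts[label] += 1
    vocab.update(words)

def predict(text):
    """Pick the age group maximizing log P(class) + sum log P(word|class),
    with Laplace (add-one) smoothing for unseen words."""
    best, best_lp = None, -math.inf
    for label in class_counts:
        total = sum(word_counts[label].values())
        lp = math.log(class_counts[label] / sum(class_counts.values()))
        for w in text.split():
            lp += math.log((word_counts[label][w] + 1) / (total + len(vocab)))
        if lp > best_lp:
            best, best_lp = label, lp
    return best

print(predict("this game is cool"))
```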


Author(s):  
Rashida Ali ◽  
Ibrahim Rampurawala ◽  
Mayuri Wandhe ◽  
Ruchika Shrikhande ◽  
Arpita Bhatkar

The Internet provides a medium to connect with individuals of similar or different interests, creating a hub. Since a huge number of users participate on these platforms, a user can receive a high volume of messages from different individuals, creating chaos and unwanted messages. These messages sometimes contain true information and sometimes false, which leads to a state of confusion in the minds of users and is a first step towards spam messaging. A spam message is an irrelevant and unsolicited message sent by a known or unknown user, which may lead to a sense of insecurity among users. In this paper, different machine learning algorithms were trained and tested with natural language processing (NLP) to classify whether messages are spam or ham.
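The paper trains several algorithms; as one illustrative instance, the sketch below trains a single perceptron over bag-of-words features on invented toy messages and then labels an unseen message:

```python
from collections import Counter

# Hypothetical labelled messages (1 = spam, 0 = ham).
train = [
    ("win a free prize now", 1),
    ("free cash claim now", 1),
    ("are we meeting for lunch", 0),
    ("see you at the lecture", 0),
]

weights = Counter()  # one weight per word, default 0
bias = 0.0

def score(text):
    """Linear score over bag-of-words features (positive means spam)."""
    return bias + sum(weights[w] for w in text.split())

# Perceptron training: nudge weights only on misclassified messages.
for _ in range(10):
    for text, label in train:
        pred = 1 if score(text) > 0 else 0
        if pred != label:
            update = 1 if label == 1 else -1
            for w in text.split():
                weights[w] += update
            bias += update

print("spam" if score("claim your free prize") > 0 else "ham")
```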


2021 ◽  
Vol 218 ◽  
pp. 44-51
Author(s):  
D. Venkata Vara Prasad ◽  
Lokeswari Y. Venkataramana ◽  
P. Senthil Kumar ◽  
G. Prasannamedha ◽  
K. Soumya ◽  
...  

Author(s):  
Saugata Bose ◽  
Ritambhra Korpal

In this chapter, an initiative is proposed in which natural language processing (NLP) techniques and supervised machine learning algorithms are combined to detect external plagiarism. The major emphasis is on constructing a framework to detect plagiarism in monolingual texts by implementing an n-gram frequency comparison approach. The framework is based on 120 features extracted during pre-processing steps using a simple NLP approach. Afterward, filter metrics are applied to select the most relevant features, and a supervised classification learning algorithm is then used to classify the documents into four levels of plagiarism. A confusion matrix is then built to estimate the false positives and false negatives. Finally, the authors show the suitability, in terms of accuracy, of a C4.5 decision-tree-based classifier over naive Bayes. The framework achieved 89% accuracy with low false positive and false negative rates, and shows higher precision and recall values compared to the passage similarity, sentence similarity, and search space reduction methods.
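The n-gram frequency comparison at the heart of such frameworks can be sketched as a containment score between a suspicious document and a candidate source; the texts below are invented, and a real classifier (the chapter uses C4.5) would consume such scores among its ~120 features:

```python
def word_ngrams(text, n=3):
    """Set of word n-grams of a text, lowercased."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}

def containment(suspicious, source, n=3):
    """Fraction of the suspicious document's n-grams also found in the source;
    values near 1 suggest heavy copying, near 0 suggest independent text."""
    s = word_ngrams(suspicious, n)
    if not s:
        return 0.0
    return len(s & word_ngrams(source, n)) / len(s)

src = "machine learning techniques are widely used in text analysis"
susp = "machine learning techniques are widely used in many domains"
print(round(containment(susp, src), 2))
```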


2012 ◽  
pp. 13-22 ◽  
Author(s):  
João Gama ◽  
André C.P.L.F. de Carvalho

Machine learning techniques have been successfully applied to several real-world problems in areas as diverse as image analysis, the Semantic Web, bioinformatics, text processing, natural language processing, telecommunications, finance, and medical diagnosis. A particular application where machine learning plays a key role is data mining, where machine learning techniques have been extensively used for the extraction of association, clustering, prediction, diagnosis, and regression models. This text presents our personal view of the main aspects, major tasks, frequently used algorithms, current research, and future directions of machine learning research. To this end, it is organized as follows: background information concerning machine learning is presented in the second section. The third section discusses different definitions for machine learning. Common tasks faced by machine learning systems are described in the fourth section. Popular machine learning algorithms and the importance of the loss function are commented on in the fifth section. The sixth and seventh sections present the current trends and future research directions, respectively.


Energies ◽  
2020 ◽  
Vol 13 (17) ◽  
pp. 4368 ◽  
Author(s):  
Chun-Wei Chen ◽  
Chun-Chang Li ◽  
Chen-Yu Lin

The energy baseline is an important method for measuring the energy-saving benefits of a chiller system, and the benefits can be calculated by comparing prediction models and actual results. Currently, machine learning is often adopted as the prediction model for energy baselines. Common models include regression, ensemble learning, and deep learning models. In this study, we first reviewed several machine learning algorithms, which were used to establish prediction models. Then, clustering was adopted to preprocess the chiller data. Data mining, K-means clustering, and the gap statistic were used to successfully identify the critical variables for clustering chiller modes. Applying these key variables effectively enhanced the quality of the chiller data, and combining the clustering results with the machine learning model effectively improved the prediction accuracy of the model and the reliability of the energy baselines.
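As a minimal illustration of the clustering step, the stdlib sketch below runs plain K-means on a single invented variable (chiller load) to separate two operating modes; the actual study clusters on several key variables selected via the gap statistic:

```python
# Hypothetical chiller load readings (kW); two operating modes are visible.
loads = [102, 98, 105, 99, 310, 305, 298, 315]

def kmeans(points, centers, iters=20):
    """Plain 1-D k-means: assign each point to its nearest center, re-average."""
    for _ in range(iters):
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda j: abs(p - centers[j]))
            clusters[nearest].append(p)
        # Recompute each center as the mean of its cluster (keep it if empty).
        centers = [sum(c) / len(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
    return centers, clusters

# Initialize centers at the extremes, then iterate to convergence.
centers, clusters = kmeans(loads, [float(min(loads)), float(max(loads))])
print(sorted(round(c, 1) for c in centers))
```

Each resulting cluster (operating mode) can then be given its own baseline prediction model, which is the sense in which clustering improves prediction accuracy.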

