Supplier Clustering Based on Unstructured Manufacturing Capability Data

Volume 1B: 38th Computers and Information in Engineering Conference ◽

10.1115/detc2018-85865 ◽

2018 ◽

Cited By ~ 1

Author(s):

Ramin Sabbagh ◽

Farhad Ameri

Keyword(s):

Natural Language ◽

Language Processing ◽

Quantitative Methods ◽

Ad Hoc ◽

Semantic Analysis ◽

Reduction Process ◽

Manufacturing Companies ◽

Web Based ◽

Manufacturing Capability ◽

Processing Techniques

The descriptions of capabilities of manufacturing companies can be found in multiple locations including company websites, legacy system databases, and ad hoc documents and spreadsheets. The capability descriptions are often represented using natural language. To unlock the value of unstructured capability information and learn from it, there is a need for developing advanced quantitative methods supported by machine learning and natural language processing techniques. This research proposes a multi-step unsupervised learning methodology using K-means clustering and topic modeling techniques in order to build clusters of suppliers based on their capabilities, extract and organize the manufacturing capability terminology, and discover nontrivial patterns in manufacturing capability corpora. The capability data is extracted either directly from the website of manufacturing firms or from their profiles in e-sourcing portals and directories. Feature extraction and dimensionality reduction process in this work in supported by Ngram extraction and Latent Semantic Analysis (LSA) methods. The proposed clustering method is validated experimentally based a dataset composed of 150 capability descriptions collected from web-based sourcing directories such as the Thomas Net directory for manufacturing companies. The results of the experiment show that the proposed method creates supplier cluster with high accuracy.

Download Full-text

A Framework Based on K-Means Clustering and Topic Modeling for Analyzing Unstructured Manufacturing Capability Data

Journal of Computing and Information Science in Engineering ◽

10.1115/1.4044506 ◽

2019 ◽

Vol 20 (1) ◽

Cited By ~ 3

Author(s):

Ramin Sabbagh ◽

Farhad Ameri

Keyword(s):

Natural Language ◽

Language Processing ◽

Topic Modeling ◽

Quantitative Methods ◽

Ad Hoc ◽

Semantic Analysis ◽

Reduction Process ◽

Manufacturing Companies ◽

Manufacturing Capability ◽

N Gram

Abstract The natural language descriptions of the capabilities of manufacturing companies can be found in multiple locations including company websites, legacy system databases, and ad hoc documents and spreadsheets. To unlock the value of unstructured capability data and learn from it, there is a need for developing advanced quantitative methods supported by machine learning and natural language processing techniques. This research proposes a hybrid unsupervised learning methodology using K-means clustering and topic modeling techniques in order to build clusters of suppliers based on their capabilities, automatically infer topics from the created clusters, and discover nontrivial patterns in manufacturing capability corpora. The capability data is extracted either directly from the website of manufacturing firms or from their profiles in e-sourcing portals and directories. Feature extraction and dimensionality reduction process in this work are supported by N-gram extraction and latent semantic analysis (LSA) methods. The proposed clustering method is validated experimentally based on a dataset composed of 150 capability descriptions collected from web-based sourcing directories such as the Thomas Net directory for manufacturing companies. The results of the experiment show that the proposed method creates supplier cluster with high accuracy. Two example applications of the proposed framework, related to supplier similarity measurement and automated thesaurus creation, are introduced in this paper.

Download Full-text

Interpretation of iconic utterances based on contents representation: Semantic analysis in the PVI system

Natural Language Engineering ◽

10.1017/s1351324997001836 ◽

1998 ◽

Vol 4 (1) ◽

pp. 17-40

Author(s):

PASCAL VAILLANT

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Augmentative And Alternative Communication ◽

Semantic Analysis ◽

Alternative Communication ◽

Language Disabilities ◽

The Disabled ◽

Conceptual Graph ◽

Processing Techniques

This article focuses on the need for technological aid for agrammatics, and presents a system designed to meet this need. The field of Augmentative and Alternative Communication (AAC) explores ways to allow people with speech or language disabilities to communicate. The use of computers and natural language processing techniques offers a range of new possibilities in this direction. Yet AAC addresses speech deficits mainly, not linguistic disabilities. A model of aided AAC interfaces with a place for natural language processing is presented. The PVI system, described in this contribution, makes use of such advanced techniques. It has been developed at Thomson-CSF for the use of children with cerebral palsy. It presents a customizable interface helping the disabled to compose sequences of icons displayed on a computer screen. A semantic parser, using lexical semantics information, is used to determine the best case assignments for predicative icons in the sequence. It maximizes a global value, the ‘semantic harmony’ of the sequence. The resulting conceptual graph is fed to a natural language generation module which uses Tree Adjoining Grammars (TAG) to generate French sentences. Evaluation by users demonstrates the system's strengths and limitations, and shows the ways for future developments.

Download Full-text

A Review for Semantic Analysis and Text Document Annotation Using Natural Language Processing Techniques

SSRN Electronic Journal ◽

10.2139/ssrn.3418747 ◽

2019 ◽

Cited By ~ 1

Author(s):

Nikita Pande ◽

Mandar Karyakarte

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Semantic Analysis ◽

Text Document ◽

Processing Techniques ◽

Document Annotation

Download Full-text

Mining the biomedical literature using semantic analysis and natural language processing techniques

BIOSILICO ◽

10.1016/s1478-5382(03)02330-8 ◽

2003 ◽

Vol 1 (2) ◽

pp. 69-80 ◽

Cited By ~ 15

Author(s):

Ronen Feldman ◽

Yizhar Regev ◽

Eyal Hurvitz ◽

Michal Finkelstein-Landau

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Semantic Analysis ◽

Biomedical Literature ◽

Processing Techniques

Download Full-text

UGLEO: A WEB BASED INTELLIGENCE CHATBOT FOR STUDENT ADMISSION PORTAL USING MEGAHAL STYLE

Jurnal Ilmiah Informatika Komputer ◽

10.35760/ik.2018.v23i3.2373 ◽

2018 ◽

Vol 23 (3) ◽

pp. 175-191

Author(s):

Anneke Annassia Putri Siswadi ◽

Avinanta Tarigan

Keyword(s):

Markov Chain ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Information Need ◽

Web Based ◽

Markov Chain Method ◽

Information Center

To fulfill the prospective student's information need about student admission, Gunadarma University has already many kinds of services which are time limited, such as website, book, registration place, Media Information Center, and Question Answering’s website (UG-Pedia). It needs a service that can serve them anytime and anywhere. Therefore, this research is developing the UGLeo as a web based QA intelligence chatbot application for Gunadarma University's student admission portal. UGLeo is developed by MegaHal style which implements the Markov Chain method. In this research, there are some modifications in MegaHal style, those modifications are the structure of natural language processing and the structure of database. The accuracy of UGLeo reply is 65%. However, to increase the accuracy there are some improvements to be applied in UGLeo system, both improvement in natural language processing and improvement in MegaHal style.

Download Full-text

A Natural Language Processing Approach to Measuring Treatment Adherence and Consistency Using Semantic Similarity

AERA Open ◽

10.1177/23328584211028615 ◽

2021 ◽

Vol 7 ◽

pp. 233285842110286

Author(s):

Kylie L. Anglin ◽

Vivian C. Wong ◽

Arielle Boguslav

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Semantic Similarity ◽

Language Processing ◽

Intervention Implementation ◽

Proof Of Concept ◽

Coaching Intervention ◽

Processing Techniques ◽

Teacher Coaching ◽

The Impact

Though there is widespread recognition of the importance of implementation research, evaluators often face intense logistical, budgetary, and methodological challenges in their efforts to assess intervention implementation in the field. This article proposes a set of natural language processing techniques called semantic similarity as an innovative and scalable method of measuring implementation constructs. Semantic similarity methods are an automated approach to quantifying the similarity between texts. By applying semantic similarity to transcripts of intervention sessions, researchers can use the method to determine whether an intervention was delivered with adherence to a structured protocol, and the extent to which an intervention was replicated with consistency across sessions, sites, and studies. This article provides an overview of semantic similarity methods, describes their application within the context of educational evaluations, and provides a proof of concept using an experimental study of the impact of a standardized teacher coaching intervention.

Download Full-text

Detecting Malicious Windows Commands Using Natural Language Processing Techniques

Innovative Security Solutions for Information Technology and Communications - Lecture Notes in Computer Science ◽

10.1007/978-3-030-12942-2_13 ◽

2019 ◽

pp. 157-169

Author(s):

Muhammd Mudassar Yamin ◽

Basel Katt

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Processing Techniques

Download Full-text

Identification of spam comments using natural language processing techniques

2014 IEEE 10th International Conference on Intelligent Computer Communication and Processing (ICCP) ◽

10.1109/iccp.2014.6936976 ◽

2014 ◽

Cited By ~ 6

Author(s):

Cristina Radulescu ◽

Mihaela Dinsoreanu ◽

Rodica Potolea

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Processing Techniques

Download Full-text

Using Natural Language Processing Techniques for Stock Return Predictions

SSRN Electronic Journal ◽

10.2139/ssrn.2940564 ◽

2017 ◽

Cited By ~ 1

Author(s):

Ming Li Chew ◽

Sahil Puri

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Stock Return ◽

Processing Techniques

Download Full-text

Identifying and intercepting health misinformation on Reddit dermatology forums with artificially intelligent bots using natural language processing (Preprint)

10.2196/preprints.20975 ◽

2021 ◽

Author(s):

Monique B. Sager ◽

Aditya M. Kashyap ◽

Mila Tamminga ◽

Sadhana Ravoori ◽

Christopher Callison-Burch ◽

...

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

The United States ◽

Test Accuracy ◽

Limited Data ◽

Test Environment ◽

Data Set ◽

Inappropriate Care ◽

Processing Techniques

BACKGROUND Reddit, the fifth most popular website in the United States, boasts a large and engaged user base on its dermatology forums where users crowdsource free medical opinions. Unfortunately, much of the advice provided is unvalidated and could lead to inappropriate care. Initial testing has shown that artificially intelligent bots can detect misinformation on Reddit forums and may be able to produce responses to posts containing misinformation. OBJECTIVE To analyze the ability of bots to find and respond to health misinformation on Reddit’s dermatology forums in a controlled test environment. METHODS Using natural language processing techniques, we trained bots to target misinformation using relevant keywords and to post pre-fabricated responses. By evaluating different model architectures across a held-out test set, we compared performances. RESULTS Our models yielded data test accuracies ranging from 95%-100%, with a BERT fine-tuned model resulting in the highest level of test accuracy. Bots were then able to post corrective pre-fabricated responses to misinformation. CONCLUSIONS Using a limited data set, bots had near-perfect ability to detect these examples of health misinformation within Reddit dermatology forums. Given that these bots can then post pre-fabricated responses, this technique may allow for interception of misinformation. Providing correct information, even instantly, however, does not mean users will be receptive or find such interventions persuasive. Further work should investigate this strategy’s effectiveness to inform future deployment of bots as a technique in combating health misinformation. CLINICALTRIAL N/A

Download Full-text