Machine-learning methods for text named entity recognition

PROBLEMS IN PROGRAMMING ◽

10.15407/pp2016.02-03.150 ◽

2016 ◽

pp. 150-157

Author(s):

O.O. Marchenko ◽

Keyword(s):

Machine Learning ◽

Random Fields ◽

Conditional Random Fields ◽

Named Entity Recognition ◽

Entity Recognition ◽

Learning Methods ◽

Named Entities ◽

Named Entity ◽

Machine Learning Methods ◽

Multi Classification

The article describes machine learning methods for the named entity recognition. To build named entity classifiers two basic models of machine learning, The Naїve Bayes and Conditional Random Fields, were used. A model for multi-classification of named entities using Error Correcting Output Codes was also researched. The paper describes a method for classifiers' training and the results of test experiments. Conditional Random Fields overcome other models in precision and recall evaluations.

Download Full-text

Conditional Random Fields for Biomedical Named Entity Recognition Revisited

10.21203/rs.3.rs-36431/v1 ◽

2020 ◽

Author(s):

Xie-Yuan Xie

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Named Entity Recognition ◽

Entity Recognition ◽

Biomedical Domain ◽

Minimal Set ◽

Named Entities ◽

Named Entity ◽

Biomedical Texts ◽

Biomedical Named Entity Recognition

Abstract Named Entity Recognition (NER) is a key task which automatically extracts Named Entities (NE) from the text. Names of persons, places, date and time are examples of NEs. We are applying Conditional Random Fields (CRFs) for NER in biomedical domain. Examples of NEs in biomedical texts are gene, proteins. We used a minimal set of features to train CRF algorithm and obtained a good results for biomedical texts.

Download Full-text

Clinical Named Entity Recognition From Chinese Electronic Health Records via Machine Learning Methods

JMIR Medical Informatics ◽

10.2196/medinform.9965 ◽

2018 ◽

Vol 6 (4) ◽

pp. e50 ◽

Cited By ~ 10

Author(s):

Yu Zhang ◽

Xuwen Wang ◽

Zhen Hou ◽

Jiao Li

Keyword(s):

Machine Learning ◽

Electronic Health Records ◽

Named Entity Recognition ◽

Entity Recognition ◽

Learning Methods ◽

Health Records ◽

Named Entity ◽

Machine Learning Methods ◽

Electronic Health

Download Full-text

An Annotated Corpus of Crime-Related Portuguese Documents for NLP and Machine Learning Processing

Data ◽

10.3390/data6070071 ◽

2021 ◽

Vol 6 (7) ◽

pp. 71

Author(s):

Gonçalo Carnaz ◽

Mário Antunes ◽

Vitor Beires Nogueira

Keyword(s):

Machine Learning ◽

Language Processing ◽

Named Entity Recognition ◽

Entity Recognition ◽

Automatic Identification ◽

Named Entities ◽

Related Data ◽

Named Entity ◽

Chain Of Custody ◽

Evidence Collection

Criminal investigations collect and analyze the facts related to a crime, from which the investigators can deduce evidence to be used in court. It is a multidisciplinary and applied science, which includes interviews, interrogations, evidence collection, preservation of the chain of custody, and other methods and techniques of investigation. These techniques produce both digital and paper documents that have to be carefully analyzed to identify correlations and interactions among suspects, places, license plates, and other entities that are mentioned in the investigation. The computerized processing of these documents is a helping hand to the criminal investigation, as it allows the automatic identification of entities and their relations, being some of which difficult to identify manually. There exists a wide set of dedicated tools, but they have a major limitation: they are unable to process criminal reports in the Portuguese language, as an annotated corpus for that purpose does not exist. This paper presents an annotated corpus, composed of a collection of anonymized crime-related documents, which were extracted from official and open sources. The dataset was produced as the result of an exploratory initiative to collect crime-related data from websites and conditioned-access police reports. The dataset was evaluated and a mean precision of 0.808, recall of 0.722, and F1-score of 0.733 were obtained with the classification of the annotated named-entities present in the crime-related documents. This corpus can be employed to benchmark Machine Learning (ML) and Natural Language Processing (NLP) methods and tools to detect and correlate entities in the documents. Some examples are sentence detection, named-entity recognition, and identification of terms related to the criminal domain.

Download Full-text

Bidirectional Long Short-Term Memory (BILSTM) with Conditional Random Fields (CRF) for Knowledge Named Entity Recognition in Online Judges (OJS)

International Journal on Natural Language Computing ◽

10.5121/ijnlc.2018.7401 ◽

2018 ◽

Vol 7 (4) ◽

pp. 01-08

Author(s):

Muhammad Asif Khan ◽

Tayyab Naveed ◽

Elmaam Yagoub ◽

Guojin Zhu

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Short Term Memory ◽

Named Entity Recognition ◽

Entity Recognition ◽

Short Term ◽

Term Memory ◽

Named Entity ◽

Long Short Term Memory

Download Full-text

Named entity recognition based on conditional random fields

Cluster Computing ◽

10.1007/s10586-017-1146-3 ◽

2017 ◽

Vol 22 (S3) ◽

pp. 5195-5206 ◽

Cited By ~ 4

Author(s):

Shengli Song ◽

Nan Zhang ◽

Haitao Huang

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity

Download Full-text

Chinese Named Entity Recognition with Conditional Random Fields in the Light of Chinese Characteristics

Language Processing and Intelligent Information Systems - Lecture Notes in Computer Science ◽

10.1007/978-3-642-38634-3_8 ◽

2013 ◽

pp. 57-68 ◽

Cited By ~ 9

Author(s):

Aaron L. -F. Han ◽

Derek F. Wong ◽

Lidia S. Chao

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity ◽

Chinese Characteristics

Download Full-text

The Optimization of Portuguese Named-Entity Recognition and Classification by Combining Local Grammars and Conditional Random Fields Trained with a Parsed Corpus

Communications in Computer and Information Science - Formalising Natural Languages: Applications to Natural Language Processing and Digital Humanities ◽

10.1007/978-3-030-70629-6_17 ◽

2021 ◽

pp. 196-205

Author(s):

Diego Alves ◽

Božo Bekavac ◽

Marko Tadić

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity

Download Full-text

Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition

BMC Medical Informatics and Decision Making ◽

10.1186/s12911-019-0865-1 ◽

2019 ◽

Vol 19 (1) ◽

Author(s):

Wangjin Lee ◽

Jinwook Choi

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity

Download Full-text

Cybersecurity named entity recognition using bidirectional long short-term memory with conditional random fields

Tsinghua Science & Technology ◽

10.26599/tst.2019.9010033 ◽

2021 ◽

Vol 26 (3) ◽

pp. 259-265

Author(s):

Pingchuan Ma ◽

Bo Jiang ◽

Zhigang Lu ◽

Ning Li ◽

Zhengwei Jiang

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Short Term Memory ◽

Named Entity Recognition ◽

Entity Recognition ◽

Short Term ◽

Term Memory ◽

Named Entity ◽

Long Short Term Memory

Download Full-text

Conditional Random Fields for Spanish Named Entity Recognition Using Unsupervised Features

Lecture Notes in Computer Science - Advances in Artificial Intelligence - IBERAMIA 2016 ◽

10.1007/978-3-319-47955-2_15 ◽

2016 ◽

pp. 175-186

Author(s):

Jenny Copara ◽

Jose Ochoa ◽

Camilo Thorne ◽

Goran Glavaš

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity

Download Full-text