CTSS: A Tool for Efficient Information Extraction with Soft Matching Rules for Text Mining

This paper proposes a system called CFP Manager specialized on IT field and designed to ease the process of searching conference suitable to one's need. At present, the handling of CFP faces two problems: for emails, the huge quantity of CFP received can be easily skimmed through. For websites, the reviewing of some of the main CFP aggregators available online points out the lack of usable criteria. This system proposes to answer to these problems via its architecture consisting of three components: firstly an Information Extraction module extracting relevant information (as date, location, etc...) from CFP using rule based text mining algorithm. The second component enriches the now extracted data with external one from ontology models. Finally the last one displays the said data and allows the end user to perform complex queries on the CFP dataset and thus allow him to only access to CFP suitable for him. In order to validate the authors' proposal, they eventually process the well-known precision / recall metric on our information extraction component with an average of 0.95 for precision and 0.91 for recall on three different 100 CFP dataset. This paper finally discusses the validity of our approach by confronting our system for different queries with two systems already available online (WikiCFP and IEEE Conference Search) and basic text searching approach standing for searching in an email box. On a 100 CFP dataset with the wide variety of usable data and the possibility to perform complex queries we surpass basic text searching method and WikiCFP by not returning the false positive usually returned by them and find a result close to the IEEE system.

Download Full-text

Constructing efficient information extraction pipelines

Proceedings of the 20th ACM international conference on Information and knowledge management - CIKM '11 ◽

10.1145/2063576.2063935 ◽

2011 ◽

Cited By ~ 5

Author(s):

Henning Wachsmuth ◽

Benno Stein ◽

Gregor Engels

Keyword(s):

Information Extraction ◽

Efficient Information

Download Full-text

Text Mining of the Electronic Health Record: An Information Extraction Approach for Automated Identification and Subphenotyping of HFpEF Patients for Clinical Trials

Journal of Cardiovascular Translational Research ◽

10.1007/s12265-017-9752-2 ◽

2017 ◽

Vol 10 (3) ◽

pp. 313-321 ◽

Cited By ~ 17

Author(s):

Siddhartha R. Jonnalagadda ◽

Abhishek K. Adupa ◽

Ravi P. Garg ◽

Jessica Corona-Cox ◽

Sanjiv J. Shah

Keyword(s):

Clinical Trials ◽

Text Mining ◽

Electronic Health Record ◽

Information Extraction ◽

Health Record ◽

Automated Identification ◽

Electronic Health

Download Full-text

Attention information extraction of the foreign visitors using Text Mining

International Journal of Intelligent Systems Technologies and Applications ◽

10.1504/ijista.2013.056523 ◽

2013 ◽

Vol 12 (3/4) ◽

pp. 194

Author(s):

Koichi Tsujii ◽

Yoshikatsu Fujita ◽

Kazuhiko Tsuda

Keyword(s):

Text Mining ◽

Information Extraction

Download Full-text

Text Mining-Supported Information Extraction: An Extended Methodology for Developing Information Extraction Systems

2011 22nd International Workshop on Database and Expert Systems Applications ◽

10.1109/dexa.2011.79 ◽

2011 ◽

Cited By ~ 5

Author(s):

Christina Feilmayr

Keyword(s):

Text Mining ◽

Information Extraction

Download Full-text

Information extraction and text mining of Ancient Vattezhuthu characters in historical documents using image zoning

2016 International Conference on Asian Language Processing (IALP) ◽

10.1109/ialp.2016.7875929 ◽

2016 ◽

Cited By ~ 2

Author(s):

E.K. Vellingiriraj ◽

M. Balamurugan ◽

P. Balasubramanie

Keyword(s):

Text Mining ◽

Information Extraction ◽

Historical Documents

Download Full-text

Efficient Information Extraction over Evolving Text Data

2008 IEEE 24th International Conference on Data Engineering ◽

10.1109/icde.2008.4497503 ◽

2008 ◽

Cited By ~ 17

Author(s):

Fei Chen ◽

AnHai Doan ◽

Jun Yang ◽

Raghu Ramakrishnan

Keyword(s):

Information Extraction ◽

Text Data ◽

Efficient Information

Download Full-text

Analysis of E-mental Health Research: Mapping the Relationship between Information Technology and Mental Healthcare

10.21203/rs.3.rs-741015/v1 ◽

2021 ◽

Author(s):

tatsawan timakum ◽

Min Song ◽

Qing Xie

Keyword(s):

Mental Health ◽

Information Technology ◽

Text Mining ◽

Information Extraction ◽

Health Research ◽

Health Care Services ◽

Text Message ◽

Mental Healthcare ◽

Mental Health Research ◽

Mental Wellbeing

Abstract Background: E-mentalhealthcare is the convergence of digital technologies with mental health services. It has beendevelopedto fill a gap in healthcare for people who need mental wellbeing support and may never otherwise receive psychological treatment.This study aimed to apply text mining techniques to analyze the huge data of e-mental health researches and to report on research clusters and trends as well as the co-occurrence of biomedical and the use of information technology in this field.Methods: The e-mentalhealth research data was obtainedfrom 3,663 bibliographicrecords from Web of Science (WoS)and 3,172 full-text articlesfrom PubMed Central (PMC). The text mining techniques utilized for this study includedbibliometric analysis, information extraction, and visualization.Results: The e-mental health research topic trendsprimarily involvede-health care services and medical informatics research. The clusters of research comprise 16 clusters, which refer to mental sickness, ehealth, diseases, IT, and self-management. Based onthe information extraction analysis, in the biomedical domain, a “depression” entity was frequently detected and it pairs with other entities in the network with a betweenness centrality weighted at 0.046869 (eg. depression-online, depression-diabetes, depression-measure, and depression-mobile).The IT entity-relations of “mobile” were the most frequently found(weighted at 0.043466). The top pairs are related to depression, mobile health, and text message.Conclusions: E-mental health research trends focused on disease related-depression and using IT for treatment and prevention, primarily via online and mobile devices. Producing AI and machine learning are also being studied for e-mental healthcare. The results illustrate that physical sickness is likely to cause a mental health problem and identify the IT that was applied to help manage and mitigate mental health impacts.

Download Full-text

Effects of Negation and Uncertainty Stratification on Text-Derived Patient Profile Similarity

Frontiers in Digital Health ◽

10.3389/fdgth.2021.781227 ◽

2021 ◽

Vol 3 ◽

Author(s):

Luke T. Slater ◽

Andreas Karwath ◽

Robert Hoehndorf ◽

Georgios V. Gkoutos

Keyword(s):

Differential Diagnosis ◽

Text Mining ◽

Information Extraction ◽

Semantic Similarity ◽

Outcome Prediction ◽

Primary Diagnosis ◽

Patient Profile ◽

Profile Similarity ◽

Standard Component ◽

Mimic Iii

Semantic similarity is a useful approach for comparing patient phenotypes, and holds the potential of an effective method for exploiting text-derived phenotypes for differential diagnosis, text and document classification, and outcome prediction. While approaches for context disambiguation are commonly used in text mining applications, forming a standard component of information extraction pipelines, their effects on semantic similarity calculations have not been widely explored. In this work, we evaluate how inclusion and disclusion of negated and uncertain mentions of concepts from text-derived phenotypes affects similarity of patients, and the use of those profiles to predict diagnosis. We report on the effectiveness of these approaches and report a very small, yet significant, improvement in performance when classifying primary diagnosis over MIMIC-III patient visits.

Download Full-text