Detection of Topic from Unstructured Text With Mixed Languages

Author(s):  
Suraj Sharma ◽  
Sabitra Sankalp Panigrahi ◽  
Biswajit Paul ◽  
Narayan Panigrahi
Author(s):  
Nadhia Salsabila Azzahra ◽  
Muhammad Okky Ibrohim ◽  
Junaedi Fahmi ◽  
Bagus Fajar Apriyanto ◽  
Oskar Riandi

2012 ◽  
Vol 2 (1) ◽  
pp. 52-82 ◽  
Author(s):  
Bart Jacobs

This paper takes as a point of departure the hypothesis that Papiamentu descends from Upper Guinea Portuguese Creole (a term covering the sister varieties of the Cape Verde Islands and Guinea-Bissau and Casamance), speakers of which arrived on Curaçao in the second half of the 17th century, subsequently shifted their basic content vocabulary towards Spanish, but maintained the original morphosyntax. This scenario raises the question of whether, in addition to being a creole, Papiamentu can be analyzed as a so-called mixed (or intertwined) language. The present paper positively answers this question by drawing parallels between (the emergence of) Papiamentu and recognized mixed languages.


2018 ◽  
Vol 4 ◽  
Author(s):  
Faez Ahmed ◽  
Mark Fuge

Bisociative knowledge discovery is an approach that combines elements from two or more ‘incompatible’ domains to generate creative solutions and insight. Inspired by Koestler’s notion of bisociation, in this paper we propose a computational framework for the discovery of new connections between domains to promote creative discovery and inspiration in design. Specifically, we propose using topic models on a large collection of unstructured text ideas from multiple domains to discover creative sources of inspiration. We use these topics to generate a Bisociative Information Network – a graph that captures conceptual similarity between ideas – that helps designers find creative links within that network. Using a dataset of thousands of ideas from OpenIDEO, an online collaborative community, our results show usefulness of representing conceptual bridges through collections of words (topics) in finding cross-domain inspiration. We show that the discovered links between domains, whether presented on their own or via ideas they inspired, are perceived to be more novel and can also be used as creative stimuli for new idea generation.


2011 ◽  
Vol 15 (2) ◽  
pp. 378-396 ◽  
Author(s):  
CARMEL O'SHANNESSY ◽  
FELICITY MEAKINS

Crosslinguistic influence has been seen in bilingual adult and child learners when compared to monolingual learners. For speakers of Light Warlpiri and Gurindji Kriol there is no monolingual group for comparison, yet crosslinguistic influence can be seen in how the speakers resolve competition between case-marking and word order systems in each language. Light Warlpiri and Gurindji Kriol are two new Australian mixed languages, spoken in similar, yet slightly different, sociolinguistic contexts, and with similar, yet slightly different, argument marking systems. The different sociolinguistic situations and systems of argument marking lead to a difference in how speakers of each language interpret simple transitive sentences in a comprehension task. Light Warlpiri speakers rely on ergative case-marking as an indicator of agents more often than Gurindji Kriol speakers do. Conversely, Gurindji Kriol speakers rely on word order more often than Light Warlpiri speakers do.


Author(s):  
Ralph Grishman

Information extraction constructs a structured knowledge representation from unstructured text, so that the knowledge may be further used for search, inference, and analysis. Given a specification of select types of entities, semantic relations, and events, it builds a database from instances of this information in text. This chapter describes the stages of processing involved and considers how such systems may be built using hand-coded rules, supervised training, and semi-supervised training.


Sign in / Sign up

Export Citation Format

Share Document