A Microblogs Location Inferring Method Based on Hybrid Language Model

Speakers of any (minoritised or majority) language sometimes make language mistakes. Bilingual speakers may use a hybrid language, mixing languages within a sentence or even within a word, especially when they are formally similar, as Spanish and Galician are. For minoritised languages, language errors may contribute to a negative perception towards the minoritised language. The Galician public broadcaster Televisión de Galicia (TVG) has received criticism for not being a high-quality language model, permitting the intrusion of language mistakes in its content. From an exclusively linguistic viewpoint, these errors should be corrected in subtitling. Conversely, subtitling guides and target users favour a verbatim rendition of the audio, in which oral language mistakes should not be corrected. Dialectal features, even if they are not considered errors, are non-standard language. This paper aims at answering the question of “to correct or not to correct” oral errors and dialectal features in the case of minoritised languages. It presents the most relevant data from a literature review, and an analysis of subtitling guidelines & standards and of current practices at TVG. These results have yielded an original protocol for the correction or reproduction of oral errors according to speech control, target audience and broadcast genre, the effect of a mistake, and the type of language error (vocabulary vs. grammar).

Download Full-text

A hybrid language model based on a combination of N -grams and stochastic context-free grammars

ACM Transactions on Asian Language Information Processing ◽

10.1145/1034780.1034783 ◽

2004 ◽

Vol 3 (2) ◽

pp. 113-127 ◽

Cited By ~ 6

Author(s):

Diego Linares ◽

José-Miguel Benedí ◽

Joan-Andreu Sánchez

Keyword(s):

Language Model ◽

Model Based ◽

Hybrid Language ◽

Stochastic Context Free Grammars ◽

Context Free ◽

Context Free Grammars

Download Full-text

A HYBRID LANGUAGE MODEL BASED ON STATISTICS AND LINGUISTIC RULES

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001405003934 ◽

2005 ◽

Vol 19 (01) ◽

pp. 109-128 ◽

Cited By ~ 2

Author(s):

XIAOLONG WANG ◽

DANIEL S. YEUNG ◽

JAMES N. K. LIU ◽

ROBERT LUK ◽

XUAN WANG

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Large Scale ◽

Language Model ◽

Language Models ◽

Long Distance ◽

Language Understanding ◽

Linguistic Rules ◽

Hybrid Language ◽

N Gram

Language modeling is a current research topic in many domains including speech recognition, optical character recognition, handwriting recognition, machine translation and spelling correction. There are two main types of language models, the mathematical and the linguistic. The most widely used mathematical language model is the n-gram model inferred from statistics. This model has three problems: long distance restriction, recursive nature and partial language understanding. Language models based on linguistics present many difficulties when applied to large scale real texts. We present here a new hybrid language model that combines the advantages of the n-gram statistical language model with those of a linguistic language model which makes use of grammatical or semantic rules. Using suitable rules, this hybrid model can solve problems such as long distance restriction, recursive nature and partial language understanding. The new language model has been effective in experiments and has been incorporated in Chinese sentence input products for Windows and Macintosh OS.

Download Full-text

Documents Ranking Based on a Hybrid Language Model for Chinese Information Retrieval

2006 IEEE International Conference on Information Acquisition ◽

10.1109/icia.2006.306010 ◽

2006 ◽

Cited By ~ 3

Author(s):

Dequan Zheng ◽

Feng Yu ◽

Tiejun Zhao ◽

Sheng Li

Keyword(s):

Information Retrieval ◽

Language Model ◽

Hybrid Language

Download Full-text

Interpretation of Nonsense as a Method of Children Imagination Development (as exemplified in Carroll L. «Through the Looking-Glass and What Alice Found There»)

Current Issues in Philology and Pedagogical Linguistics ◽

10.29025/2079-6021-2020-4-200-212 ◽

2020 ◽

pp. 200-212

Author(s):

Larisa V. Kalashnikova

Keyword(s):

Creative Thinking ◽

Language Model ◽

Word Formation ◽

The Third ◽

New Meanings ◽

The World ◽

Looking Glass ◽

Rules Of The Game ◽

Powerful Incentive ◽

Algorithmic Thinking

The article enlightens the probem of nonsense and its role in the development of creative thinking and fantasy, and the way how the interpretation of nonsense affects children imagination. The function of imagination inherent to a person, and especially to a child, has a powerful potential – to create artificially new metaphorical models, absurd and most incredible situations based on self-amazement. Children are able to measure the properties of unfamiliar objects with the properties of known things. It is not difficult for small researchers to replace incomprehensible meanings with familiar ones; to think over situations, to make analogies, to transfer signs and properties of one object to another. The problem of nonsense research is interesting and relevant. The element of the game is an integral component of nonsense. In the process of playing, children cognize the world, learn to interact with the world, imitating the adults behavior. Imagination and fantasy help the child to invent his own rules of the game, to choose language elements that best suit his ideas. The child uses the learned productive models of the language system to create their own models and their own language, attracting language signs: words, morphs, sentences. Children’s dictionary stimulates word formation and language nomination processes. Nonsense-words are the result of children’s dictionary, speech errors and occazional formations, presented in the form of contamination, phonetic transformations, lexical substitution, implemented on certain models. The first two models are phonetic imitation and hybrid speech, based on the natural language model. The third model of designing nonsense is represented by words that have no meaning at all and can be attributed to words-portmonaie. Due to the flexibility of interframe relationships and the lack of algorithmic thinking, children can not only capture the implicit similarity of objects and phenomena, but also create it through their imagination. Interpretation of nonsense is an effective method of developing imagination in children, because metaphors, nonsense as a means of creating new meanings, modeling new content from fragments of one’s own experience, are a powerful incentive for creative thinking.

Download Full-text