Use of syntactic context to produce term association lists for text retrieval

AbstractThis paper explores the added value of studying intra- and inter-speaker variation in grammaticalisation based on idiolect corpora. It analyses the usage patterns of the English let alone construction in a self-compiled William Faulkner corpus against the backdrop of aggregated community data. Vast individual differences (early Faulkner vs. late Faulkner vs. peers) in frequencies of use are observed, and these frequency differences correlate with different degrees of grammaticalisation as measured in terms of host-class and syntactic context expansion. The corpus findings inform general issues in current cognitive-functional research, such as the from-corpus-to-cognition issue and the cause/consequence issue of frequency. They lend support to the usage-based view of grammaticalisation as a lifelong, frequency-sensitive process of cognitive automation. To substantiate this view, this paper proposes a self-feeding cycle of constructional generalisation that is driven by the interplay of frequency, entrenchment, partial sanction and habituation.

Download Full-text

Semantic-Preserving Metric Learning for Video-Text Retrieval

10.1109/icip42928.2021.9506697 ◽

2021 ◽

Author(s):

Sungkwon Choo ◽

Seong Jong Ha ◽

Joonsoo Lee

Keyword(s):

Metric Learning ◽

Text Retrieval

Download Full-text

A Deep Semantic Alignment Network for Cross-Modal Image-Text Retrieval in Remote Sensing

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing ◽

10.1109/jstars.2021.3070872 ◽

2021 ◽

pp. 1-1

Author(s):

Qimin Cheng ◽

Yuzhuo Zhou ◽

Peng Fu ◽

Yuan Xu ◽

Liang Zhang

Keyword(s):

Remote Sensing ◽

Text Retrieval ◽

Semantic Alignment

Download Full-text

Reduced forms in the nominal morphology of the Lindisfarne Gospel Gloss. A case of accusative/dative syncretism?

Folia Linguistica ◽

10.1515/flih-2020-0002 ◽

2020 ◽

Vol 54 (s41) ◽

pp. 37-65

Author(s):

Julia Fernández-Cuesta ◽

Nieves Rodríguez-Ledesma

Keyword(s):

Statistical Analysis ◽

Old English ◽

Frequency Condition ◽

Facsimile Edition ◽

Noun Class ◽

Characteristic Features ◽

Nominal Morphology ◽

Reduced Forms ◽

Syntactic Context

Abstract One of the most characteristic features of the grammar of the Lindisfarne Gospel gloss is the absence of the etymological -e inflection in the dative singular in the paradigm of the strong masculine and neuter declension (a-stems). Ross (1960: 38) already noted that endingless forms of the nominative/accusative cases were quite frequent in contexts where a dative singular in -e would be expected, to the extent that he labeled the forms in -e ‘rudimentary dative.’ The aim of this article is to assess to what extent the dative singular is still found as a separate case in the paradigms of the masculine and neuter a-stems and root nouns. To this end a quantitative/statistical analysis of nouns belonging to these classes has been carried out in contexts where the Latin lemma is either accusative or dative. We have tried to determine whether variables such as syntactic context, noun class, and frequency condition the presence or absence of the -e inflection, and whether the distribution of the inflected and uninflected forms is different in the various demarcations that have been identified in the gloss. The data have been retrieved using the Dictionary of Old English Corpus. All tokens have been checked against the facsimile edition and the digitised manuscript in order to detect possible errors.

Download Full-text

A review of new developments in text retrieval systems

Journal of Information Science ◽

10.1177/016555159402000608 ◽

1994 ◽

Vol 20 (6) ◽

pp. 438-443

Author(s):

Andy Ewers

Keyword(s):

Text Retrieval ◽

New Developments ◽

Retrieval Systems

Download Full-text

Quantitative similarity-based evaluation of text retrieval algorithms

2009 14th International CSI Computer Conference ◽

10.1109/csicc.2009.5349403 ◽

2009 ◽

Author(s):

Parastoo Didari ◽

Behrad Babai ◽

Azadeh Shakery

Keyword(s):

Text Retrieval ◽

Retrieval Algorithms

Download Full-text

Techniques of document management: a review of text retrieval and related technologies

Journal of Documentation ◽

10.1108/eum0000000007082 ◽

2001 ◽

Vol 57 (2) ◽

pp. 192-217 ◽

Cited By ~ 3

Author(s):

D.C. Veal Doverton

Keyword(s):

Text Retrieval ◽

Document Management

Download Full-text

Experiments with Language-based Aids in Information Retrieval Systems

Nordic Journal of Linguistics ◽

10.1017/s0332586500001736 ◽

1988 ◽

Vol 11 (1-2) ◽

pp. 33-46 ◽

Cited By ~ 2

Author(s):

Tove Fjeldvig ◽

Anne Golden

Keyword(s):

Information Retrieval ◽

Text Retrieval ◽

Considerable Improvement ◽

Controlled Experiments ◽

Compound Words ◽

Search Results ◽

Retrieval Systems ◽

Complete Search ◽

Information Retrieval Systems ◽

Search Quality

The fact that a lexeme can appear in various forms causes problems in information retrieval. As a solution to this problem, we have developed methods for automatic root lemmatization, automatic truncation and automatic splitting of compound words. All the methods have as their basis a set of rules which contain information regarding inflected and derived forms of words – and not a dictionary. The methods have been tested on several collections of texts, and have produced very good results. By controlled experiments in text retrieval, we have studied the effects on search results. These results show that both the method of automatic root lemmatization and the method of automatic truncation make a considerable improvement on search quality. The experiments with splitting of compound words did not give quite the same improvement, however, but all the same this experiment showed that such a method could contribute to a richer and more complete search request.

Download Full-text