scholarly journals The Finno-Ugric Languages and The Internet Project

2015 ◽  
pp. 87 ◽  
Author(s):  
Heidi Jauhiainen ◽  
Tommi Jauhiainen ◽  
Krister Lindén

 This paper describes a Kone Foundation funded project called "The Finno-Ugric Languages and The Internet" together with some of the achieved results. The main activity of the project is to crawl the internet and gather texts written in small Uralic languages. The sentences and words of the found texts will be assembled into a freely available corpus. Crawling is done using the open source crawler Heritrix, which is developed by the Internet Archive. Heritrix crawls through the pages and passes the found texts to a language identifier. We are using a state of the art language identifier, which has been further developed within the project and has been evaluated using 285 languages. We describe the language identification evaluation results concerning the 34 Uralic languages known by the language identifier. We also describe the initial observations and results from the first five large crawls which were done in the national internet domains of Finland, Sweden, Norway, Russia, and Estonia. 

Author(s):  
James Bankoski ◽  
Matthew Frost ◽  
Adrian Grange

In this paper, we present the argument in favor of an open source, a royalty-free video codec that will keep pace with the evolution of video traffic. Additionally, we argue that the availability of a state-of-the-art, royalty-free codec levels the playing field, allowing small content owners, and application developers to compete with the larger companies that operate in this space.


2016 ◽  
Vol 11 (1) ◽  
pp. 91-103
Author(s):  
Carole Cusack ◽  
David Pecotic

The occult and the internet intersect in four ways: as a static medium for information; as a space where contested information or ideological conflict may occur; as a facilitator of communication; and as a medium for esoteric practice. The last type of activity is rare, but it is intriguing, in that technology can shape and inform beliefs and practices in unanticipated ways. Online engagement with the ‘Work’, the movement produced by the Greek Armenian spiritual teacher and esotericist G. I. Gurdjieff (c. 1866-1949) and his immediate followers, is an under-researched instance of online esoteric practice. This article addresses this scholarly desideratum, bringing the theoretical approaches of online religion and digital ethnography to bear on the Gurdjieff Internet Guide (GIG) website, founded by Reijo Oksanen (b. 1942) and later maintained by Kristina Turner, who created an accompanying Facebook page. The GIG manifests a shift away from the sectarian secrecy of the ‘Foundation’ groups, founded by Jeanne de Salzmann (1889-1990) after Gurdjieff’s death to formalise and protect the content of the Work, and the limited web presence that the Foundation permits. The GIG moves towards an ecumenical ‘open source’ approach to the dissemination of Gurdjieff’s teachings rooted in independent groups founded by other first generation followers of Gurdjieff who remained outside of the Foundation. It is argued that the deregulation of the religious and spiritual marketplace of the contemporary West, coupled with the dominant role played by the Internet in disseminating information, has radically transformed the Gurdjieff tradition, collapsing hierarchies and esoteric strategies, democratizing access for seekers, and creating new ritual and teaching modes.


Author(s):  
Elly Mufida ◽  
David Wardana Agus Rahayu

The VoIP communication system at OMNI Hospital Alam Sutera uses the Elastix 2.5 server with the Centos 5.11 operating system. Elastix 2.5 by the developer has been declared End of Life. The server security system is a serious concern considering that VoIP servers can be accessed from the internet. Iptables and fail2ban applications are applications that are used to limit and counteract those who try to attack the VoIP server. One application that can be used as an open source VoIP server is the Issabel Application version 4.0. The migration process from Elastix 2.5 application to Issabel 4.0 by backing up all configurations in the Elastix 2.5 application through a web browser including the configuration of endpoints, fax, e-mail, asterisk. After the backup file is downloaded then upload the backup file to the Issabel 4.0 application then run the migration process. Adding a backup path as a failover connection is needed because the VoIP communication protocol between the OMNI Hospitals Group still uses one path so that when there is a problem in the connection path, the communication protocol will stop. The tunnel EoIP is a protocol used as a backup path between the OMNI Hospitals Group site.


Author(s):  
Anna Udelkina

This article is devoted to the study of the multimedia environment of the polemic discourse in German media with its diverse formats of impact on the audience and the actively developing internal dynamics of texts. If at the end of the XXth century the specifics of German media were the use of the Internet site as one of the possibilities to present copies of newspapers and magazines in electronic form, today we can speak of modified, hybrid Internet versions of printed publications that do not just create websites on the Internet that duplicate their main activity, but also combines the features of the traditional press and features of the functioning of texts on the Internet. The transition from linear, monomedia broadcasting platforms to discrete, multimedia ones has a significant impact on the process of creating, designing and placing modern polemics. Texts of articles and user comments are considered in the article as tmaterialization of the polemic discourse in the media. Polemic texts are formed on the basis of intertextual structures and have a hypertext nature. The use of multimedia tools (a variety of fonts, graphics, animation, photo, video and sound) in the text of the article allows the author not only to expand the amount of information provided, but also to qualitatively supplement its content through inline inclusions tn the text, to express the meaning of information by referring to verbal and non-verbal means; to provide a visual and figurative presentation of information (graphs, charts, tables), to attract attention and influence the audience, as well as to provide readers with the opportunity to participate in information exchange.


2006 ◽  
Vol 40 (3) ◽  
pp. 286-295 ◽  
Author(s):  
Andrew Buxton

PurposeTo review the variety of software solutions available for putting CDS/ISIS databases on the internet. To help anyone considering which route to take.Design/methodology/approachBriefly describes the characteristics, history, origin and availability of each package. Identifies the type of skills required to implement the package and the kind of application it is suited to. Covers CDS/ISIS Unix version, JavaISIS, IsisWWW, WWWISIS Versions 3 and 5, Genisis, IAH, WWW‐ISIS, and OpenIsis.FindingsThere is no obvious single “best” solution. Several are free but may require more investment in acquiring the skills to install and configure them. The choice will depend on the user's experience with CDS/ISIS formatting language, HTML, programming languages, operating systems, open source software, and so on.Originality/valueThere is detailed documentation available for most of these packages, but little previous guidance to help potential users to distinguish and choose between them.


Author(s):  
Emily Kalah Gade ◽  
Sarah Dreier ◽  
John Wilkerson ◽  
Anne Washington

Abstract The Internet Archive curated a 90-terabyte sub-collection of captures from the US government's public website domain (‘.gov’). Such archives provide largely untapped resources for measuring attributes, behaviors and outcomes relevant to political science research. This study leverages this archive to measure a novel dimension of federal legislators' religiosity: their proportional use of religious rhetoric on official congressional websites (2006–2012). This scalable, time-variant measure improves upon more costly, time-invariant conventional approaches to measuring legislator attributes. The authors demonstrate the validity of this method for measuring legislators' public-facing religiosity and discuss the contributions and limitations of using archived Internet data for scientific analysis. This research makes three applied methodological contributions: (1) it develops a new measure for legislator religiosity, (2) it models an improved, more comprehensive approach to analyzing congressional communications and (3) it demonstrates the unprecedented potential that archived Internet data offer to researchers seeking to develop meaningful, cost-effective approaches to analyzing political phenomena.


2020 ◽  
pp. 1-31
Author(s):  
Ilia Markov ◽  
Vivi Nastase ◽  
Carlo Strapparava

Abstract Native language identification (NLI)—the task of automatically identifying the native language (L1) of persons based on their writings in the second language (L2)—is based on the hypothesis that characteristics of L1 will surface and interfere in the production of texts in L2 to the extent that L1 is identifiable. We present an in-depth investigation of features that model a variety of linguistic phenomena potentially involved in native language interference in the context of the NLI task: the languages’ structuring of information through punctuation usage, emotion expression in language, and similarities of form with the L1 vocabulary through the use of anglicized words, cognates, and other misspellings. The results of experiments with different combinations of features in a variety of settings allow us to quantify the native language interference value of these linguistic phenomena and show how robust they are in cross-corpus experiments and with respect to proficiency in L2. These experiments provide a deeper insight into the NLI task, showing how native language interference explains the gap between baseline, corpus-independent features, and the state of the art that relies on features/representations that cover (indiscriminately) a variety of linguistic phenomena.


Author(s):  
J.M. Murray ◽  
P. Pfeffer ◽  
R. Seifert ◽  
A. Hermann ◽  
J. Handke ◽  
...  

Objective: Manual plaque segmentation in microscopy images is a time-consuming process in atherosclerosis research and potentially subject to unacceptable user-to-user variability and observer bias. We address this by releasing Vesseg a tool that includes state-of-the-art deep learning models for atherosclerotic plaque segmentation. Approach and Results: Vesseg is a containerized, extensible, open-source, and user-oriented tool. It includes 2 models, trained and tested on 1089 hematoxylin-eosin stained mouse model atherosclerotic brachiocephalic artery sections. The models were compared to 3 human raters. Vesseg can be accessed at https://vesseg .online or downloaded. The models show mean Soerensen-Dice scores of 0.91±0.15 for plaque and 0.97±0.08 for lumen pixels. The mean accuracy is 0.98±0.05. Vesseg is already in active use, generating time savings of >10 minutes per slide. Conclusions: Vesseg brings state-of-the-art deep learning methods to atherosclerosis research, providing drastic time savings, while allowing for continuous improvement of models and the underlying pipeline.


Leonardo ◽  
1999 ◽  
Vol 32 (5) ◽  
pp. 353-358 ◽  
Author(s):  
Noah Wardrip-Fruin

We look to media as memory, and a place to memorialize, when we have lost. Hypermedia pioneers such as Ted Nelson and Vannevar Bush envisioned the ultimate media within the ultimate archive—with each element in continual flux, and with constant new addition. Dynamism without loss. Instead we have the Web, where “Not Found” is a daily message. Projects such as the Internet Archive and Afterlife dream of fixing this uncomfortable impermanence. Marketeers promise that agents (indentured information servants that may be the humans of About.com or the software of “Ask Jeeves”) will make the Web comfortable through filtering—hiding the impermanence and overwhelming profluence that the Web's dynamism produces. The Impermanence Agent—a programmatic, esthetic, and critical project created by the author, Brion Moss, a.c. chapman, and Duane Whitehurst— operates differently. It begins as a storytelling agent, telling stories of impermanence, stories of preservation, memorial stories. It monitors each user's Web browsing, and starts customizing its storytelling by weaving in images and texts that the user has pulled from the Web. In time, the original stories are lost. New stories, collaboratively created, have taken their place.


Sign in / Sign up

Export Citation Format

Share Document