Using Data Crawlers and Semantic Web to Build Financial XBRL Data Generators: The SONAR Extension Approach

The Scientific World JOURNAL ◽

10.1155/2014/506740 ◽

2014 ◽

Vol 2014 ◽

pp. 1-18 ◽

Cited By ~ 2

Author(s):

Miguel Ángel Rodríguez-García ◽

Alejandro Rodríguez-González ◽

Ricardo Colomo-Palacios ◽

Rafael Valencia-García ◽

Juan Miguel Gómez-Berbís ◽

...

Keyword(s):

Language Processing ◽

Financial Services ◽

Financial Information ◽

Added Value ◽

Semantic Technologies ◽

Proof Of Concept ◽

Data Source ◽

Using Data ◽

The Web ◽

Significant Data

Precise, reliable and real-time financial information is critical for added-value financial services after the economic turmoil from which markets are still struggling to recover. Since the Web has become the most significant data source, intelligent crawlers based on Semantic Technologies have become trailblazers in the search of knowledge combining natural language processing and ontology engineering techniques. In this paper, we present the SONAR extension approach, which will leverage the potential of knowledge representation by extracting, managing, and turning scarce and disperse financial information into well-classified, structured, and widely used XBRL format-oriented knowledge, strongly supported by a proof-of-concept implementation and a thorough evaluation of the benefits of the approach.

Download Full-text

Effect of Profitability and Leverage on Disclosure of Corporate Social Responsibility in Islamic Commercial Banks

Journal of Finance and Islamic Banking ◽

10.22515/jfib.v1i2.1494 ◽

2019 ◽

Vol 1 (2) ◽

pp. 133

Author(s):

Siti Nur Alfiyah

Keyword(s):

Corporate Social Responsibility ◽

Social Responsibility ◽

Financial Services ◽

Commercial Banks ◽

Quantitative Research ◽

Secondary Data ◽

Sampling Technique ◽

Data Source ◽

Using Data ◽

Corporate Social

The purpose of this study is to determine the effect of profitability and leverage on Corporate Social Responsibility. The population in this study are all Sharia Commercial Banks registered in the Financial Services Authority from 2014-2016. The sampling technique used purposive sampling method and obtained 10 research samples. For the dependent variable (y) of this research is Corporate Social Responsibility. For independent variable (x) that is profitability and leverage. The method used is quantitative research method. For data source is secondary data by using data analysis with statistic used SPSS software program assistance Smart SPSS 20. The results of this study indicate that profitability has no significant effect on Corporate Social Responsibility. The study also concluded that Leverage significantly influence Corporate Social Responsibility.

Download Full-text

The Glossaryfication Web Service – an automated glossary creation tool to support One Health communication

ARPHA Conference Abstracts ◽

10.3897/aca.4.e68843 ◽

2021 ◽

Vol 4 ◽

Author(s):

Estibaliz Lopez de Abechuco ◽

Nazareno Scaccia ◽

Taras Günther ◽

Matthias Filter

Keyword(s):

Web Service ◽

Language Processing ◽

One Health ◽

Text Processing ◽

Added Value ◽

Web Interface ◽

Source Information ◽

Text Type ◽

Tag Cloud ◽

The Web

Efficient communication and collaboration across sectors is an important precondition for true One Health Surveillance (OHS) activities. Despite the overall willingness to embrace the One Health paradigm, it is still challenging to accomplish this in day-to-day practice due to the differences in terminology and interpretation of sector-specific terms. In this sense, simple interventions like the inclusion of integrative glossaries in OHS documents (e.g. reports, research papers and guidelines) would help to reduce misunderstandings and could significantly improve the written communication in OHS. Here, we present the Glossaryfication Web Service that generates a document-specific glossary for any text file provided by the user. The web service automatically adds the available definitions with their corresponding references for the words in the document that match with terms in the user-selected glossaries. The Glossaryfication Web Service was developed to provide added value to the OHEJP Glossary that was developed within the OHEJP project ORION. The OHEJP Glossary improves the communication and collaboration among OH sectors by providing an online resource that lists relevant OH terms and sector-specific definitions. The Glossaryfication Web Service supports the practical use of the curated OHEJP Glossary and can also source information from other glossaries relevant for OH professionals (currently supporting the online CDC, WHO and EFSA glossaries). The Glossaryfication Web Service was created using the open-source software KNIME and the KNIME Text Processing extension (https://www.knime.com/knime-text-processing). The Glossaryfication KNIME workflow is deployed on BfR’s KNIME Server infrastructure providing an easy-to-use web interface where the users can upload their documents (any text-type file e.g. PDF, Word, Excel) and select the desired glossary to compare with. The Glossaryfication KNIME workflow reads in the document provided via the web interface and applies natural language processing (e.g. text cleaning, stemming), transforming (bag-of-words generation) and information retrieval methods to identify the matching terms in the selected glossaries. The Glossaryfication Web Service generates as an output a table containing all the terms that match with the selected glossaries. It also provides the available definitions, corresponding references and additional meta-information, e.g. the term frequency, i.e., how often each term appears in the given text, and the sectoral classification (only for the OHEJP Glossary terms). Furthermore, the workflow generates a tag cloud where the terms are categorized as: (i) exact match when the term in the text matches exactly with the entry of this term in the glossary; (ii) inexact match when the term appears in the text slightly modified (e.g. plural forms or suffixes) and (iii) non-matching that corresponds to all the other words appearing in the text that do not match with any glossary term. Through the user interface, the users can then choose if they want to download the whole list of terms, select only the exact/inexact matching terms, or just choose those terms and definitions that match with the meaning intended for this term in the user-provided document. The resulting table of terms can be downloaded as an Excel file and added to the user’s document as a document-specific glossary. The Glossaryfication Web Service provides an easy-to-adopt solution to enrich documents and reports with more comprehensive and unambiguous glossaries. Furthermore, it improves the referentiality of terms and definitions from different OH sectors. An additional feature provided by the Glossaryfication Web Service is the possibility of extending its use to other glossaries from other national or international institutions allowing the user to customize this glossary creation service.

Download Full-text

Governance Disclosure on the Internet by Leading Indian Public Sector Companies

Think India ◽

10.26643/think-india.v22i2.8716 ◽

2019 ◽

Vol 22 (2) ◽

pp. 174-187

Author(s):

Harmandeep Singh ◽

Arwinder Singh

Keyword(s):

Public Sector ◽

Financial Information ◽

Three Dimensions ◽

The Internet ◽

Web Pages ◽

Non Profit ◽

Index Approach ◽

Significant Difference ◽

Governance Disclosure ◽

The Web

Nowadays, internet satisfying people with different services related to different fields. The profit, as well as non-profit organization, uses the internet for various business purposes. One of the major is communicated various financial as well as non-financial information on their respective websites. This study is conducted on the top 30 BSE listed public sector companies, to measure the extent of governance disclosure (non-financial information) on their web pages. The disclosure index approach to examine the extent of governance disclosure on the internet was used. The governance index was constructed and broadly categorized into three dimensions, i.e., organization and structure, strategy & Planning and accountability, compliance, philosophy & risk management. The empirical evidence of the study reveals that all the Indian public sector companies have a website, and on average, 67% of companies disclosed some kind of governance information directly on their websites. Further, we found extreme variations in the web disclosure between the three categories, i.e., The Maharatans, The Navratans, and Miniratans. However, the result of Kruskal-Wallis indicates that there is no such significant difference between the three categories. The study provides valuable insights into the Indian economy. It explored that Indian public sector companies use the internet for governance disclosure to some extent, but lacks symmetry in the disclosure. It is because there is no such regulation for web disclosure. Thus, the recommendation of the study highlighted that there must be such a regulated framework for the web disclosure so that stakeholders ensure the transparency and reliability of the information.

Download Full-text

An Analysis of Web Home Page Disclosure Practices of Top Public and Private Indian Banks

International Journal of Banking Risk and Insurance ◽

10.21863/ijbri/2015.3.2.009 ◽

2015 ◽

Vol 3 (2) ◽

Author(s):

Mandeep Kaur ◽

Manpreet Kaur

Keyword(s):

Positive Relationship ◽

Home Page ◽

Financial Information ◽

Public And Private ◽

Communication Device ◽

Indian Banks ◽

Public And Private Sector ◽

The Relationship ◽

Source Of Information ◽

The Web

Internet is a very powerful communication device to disclose financial and non-financial information. Almost every company today maintains its website and disseminates their information voluntarily. Internet is very exciting medium to disclose information in the form of presentation. It has become most frequently used source of information. This paper tries to examine the web home page disclosure practices of top public and private Indian banks and try to find out the relationship between the disclosure score and size of bank by using the sample of 20 banks which constitute of top public and private sector banks. The results show that there is positive relationship between the disclosure score and size of bank.

Download Full-text

Financial Services Credit Scoring System Using Data Mining

2020 IEEE 14th International Conference on Application of Information and Communication Technologies (AICT) ◽

10.1109/aict50176.2020.9368572 ◽

2020 ◽

Author(s):

Adel Hassan ◽

Rashid Jayousi

Keyword(s):

Data Mining ◽

Financial Services ◽

Scoring System ◽

Credit Scoring ◽

Using Data

Download Full-text

A Natural Language Processing Approach to Measuring Treatment Adherence and Consistency Using Semantic Similarity

AERA Open ◽

10.1177/23328584211028615 ◽

2021 ◽

Vol 7 ◽

pp. 233285842110286

Author(s):

Kylie L. Anglin ◽

Vivian C. Wong ◽

Arielle Boguslav

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Semantic Similarity ◽

Language Processing ◽

Intervention Implementation ◽

Proof Of Concept ◽

Coaching Intervention ◽

Processing Techniques ◽

Teacher Coaching ◽

The Impact

Though there is widespread recognition of the importance of implementation research, evaluators often face intense logistical, budgetary, and methodological challenges in their efforts to assess intervention implementation in the field. This article proposes a set of natural language processing techniques called semantic similarity as an innovative and scalable method of measuring implementation constructs. Semantic similarity methods are an automated approach to quantifying the similarity between texts. By applying semantic similarity to transcripts of intervention sessions, researchers can use the method to determine whether an intervention was delivered with adherence to a structured protocol, and the extent to which an intervention was replicated with consistency across sessions, sites, and studies. This article provides an overview of semantic similarity methods, describes their application within the context of educational evaluations, and provides a proof of concept using an experimental study of the impact of a standardized teacher coaching intervention.

Download Full-text

Using NLP for Fact Checking: A Survey

Designs ◽

10.3390/designs5030042 ◽

2021 ◽

Vol 5 (3) ◽

pp. 42

Author(s):

Eric Lazarski ◽

Mahmood Al-Khassaweneh ◽

Cynthia Howard

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Computer Science ◽

Language Processing ◽

The Internet ◽

Fake News ◽

Fact Checking ◽

The Many ◽

Human Powered ◽

The Web

In recent years, disinformation and “fake news” have been spreading throughout the internet at rates never seen before. This has created the need for fact-checking organizations, groups that seek out claims and comment on their veracity, to spawn worldwide to stem the tide of misinformation. However, even with the many human-powered fact-checking organizations that are currently in operation, disinformation continues to run rampant throughout the Web, and the existing organizations are unable to keep up. This paper discusses in detail recent advances in computer science to use natural language processing to automate fact checking. It follows the entire process of automated fact checking using natural language processing, from detecting claims to fact checking to outputting results. In summary, automated fact checking works well in some cases, though generalized fact checking still needs improvement prior to widespread use.

Download Full-text

A Study of the Effects of the COVID-19 Pandemic on the Experience of Back Pain Reported on Twitter® in the United States: A Natural Language Processing Approach

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph18094543 ◽

2021 ◽

Vol 18 (9) ◽

pp. 4543

Author(s):

Krzysztof Fiok ◽

Waldemar Karwowski ◽

Edgar Gutierrez ◽

Maham Saeidi ◽

Awad M. Aljuaid ◽

...

Keyword(s):

United States ◽

Natural Language Processing ◽

Back Pain ◽

Natural Language ◽

Language Processing ◽

The United States ◽

Daily Routine ◽

Body Movements ◽

Data Source ◽

Twitter Users

The COVID-19 pandemic has changed our lifestyles, habits, and daily routine. Some of the impacts of COVID-19 have been widely reported already. However, many effects of the COVID-19 pandemic are still to be discovered. The main objective of this study was to assess the changes in the frequency of reported physical back pain complaints reported during the COVID-19 pandemic. In contrast to other published studies, we target the general population using Twitter as a data source. Specifically, we aim to investigate differences in the number of back pain complaints between the pre-pandemic and during the pandemic. A total of 53,234 and 78,559 tweets were analyzed for November 2019 and November 2020, respectively. Because Twitter users do not always complain explicitly when they tweet about the experience of back pain, we have designed an intelligent filter based on natural language processing (NLP) to automatically classify the examined tweets into the back pain complaining class and other tweets. Analysis of filtered tweets indicated an 84% increase in the back pain complaints reported in November 2020 compared to November 2019. These results might indicate significant changes in lifestyle during the COVID-19 pandemic, including restrictions in daily body movements and reduced exposure to routine physical exercise.

Download Full-text

The Added Value of Non-Financial Information in Internet Firms Pricing

SSRN Electronic Journal ◽

10.2139/ssrn.328781 ◽

2002 ◽

Cited By ~ 1

Author(s):

Francisco Guijarro Martínez ◽

Ismael Moya Clemente

Keyword(s):

Financial Information ◽

Added Value ◽

Internet Firms

Download Full-text

Text Simplification

The Oxford Handbook of Computational Linguistics 2nd edition ◽

10.1093/oxfordhb/9780199573691.013.52 ◽

2018 ◽

Cited By ~ 1

Author(s):

Horacio Saggion

Keyword(s):

Language Processing ◽

Language Resources ◽

The Past ◽

Text Simplification ◽

Text Readability ◽

Target User ◽

Evaluation Approaches ◽

Linguistic Impairment ◽

Automatic Text ◽

The Web

Over the past decades, information has been made available to a broad audience thanks to the availability of texts on the Web. However, understanding the wealth of information contained in texts can pose difficulties for a number of people including those with poor literacy, cognitive or linguistic impairment, or those with limited knowledge of the language of the text. Text simplification was initially conceived as a technology to simplify sentences so that they would be easier to process by natural-language processing components such as parsers. However, nowadays automatic text simplification is conceived as a technology to transform a text into an equivalent which is easier to read and to understand by a target user. Text simplification concerns both the modification of the vocabulary of the text (lexical simplification) and the modification of the structure of the sentences (syntactic simplification). In this chapter, after briefly introducing the topic of text readability, we give an overview of past and recent methods to address these two problems. We also describe simplification applications and full systems also outline language resources and evaluation approaches.

Download Full-text