Discriminative Training for Log-Linear Based SMT

Lemao Liu; Tiejun Zhao; Taro Watanabe; Hailong Cao; Conghui Zhu

doi:10.1145/2637478

Wide-Coverage Efficient Statistical Parsing with CCG and Log-Linear Models

Computational Linguistics ◽

10.1162/coli.2007.33.4.493 ◽

2007 ◽

Vol 33 (4) ◽

pp. 493-552 ◽

Cited By ~ 125

Author(s):

Stephen Clark ◽

James R. Curran

Keyword(s):

Linear Models ◽

Parallel Implementation ◽

Discriminative Training ◽

Training Data ◽

Highly Efficient ◽

Independent Events ◽

Cluster Dynamic ◽

Statistical Parsing ◽

Parsing Algorithm ◽

Log Linear

This article describes a number of log-linear parsing models for an automatically extracted lexicalized grammar. The models are “full” parsing models in the sense that probabilities are defined for complete parses, rather than for independent events derived by decomposing the parse tree. Discriminative training is used to estimate the models, which requires incorrect parses for each sentence in the training data as well as the correct parse. The lexicalized grammar formalism used is Combinatory Categorial Grammar (CCG), and the grammar is automatically extracted from CCGbank, a CCG version of the Penn Treebank. The combination of discriminative training and an automatically extracted grammar leads to a significant memory requirement (up to 25 GB), which is satisfied using a parallel implementation of the BFGS optimization algorithm running on a Beowulf cluster. Dynamic programming over a packed chart, in combination with the parallel implementation, allows us to solve one of the largest-scale estimation problems in the statistical parsing literature in under three hours. A key component of the parsing system, for both training and testing, is a Maximum Entropy supertagger which assigns CCG lexical categories to words in a sentence. The supertagger makes the discriminative training feasible, and also leads to a highly efficient parser. Surprisingly, given CCG's “spurious ambiguity,” the parsing speeds are significantly higher than those reported for comparable parsers in the literature. We also extend the existing parsing techniques for CCG by developing a new model and efficient parsing algorithm which exploits all derivations, including CCG's nonstandard derivations. This model and parsing algorithm, when combined with normal-form constraints, give state-of-the-art accuracy for the recovery of predicate-argument dependencies from CCGbank. 
The parser is also evaluated on DepBank and compared against the RASP parser, outperforming RASP overall and on the majority of relation types. The evaluation on DepBank raises a number of issues regarding parser evaluation. This article provides a comprehensive blueprint for building a wide-coverage CCG parser. We demonstrate that both accurate and highly efficient parsing is possible with CCG.

Polytomous IRT models under log-linear model framework

PsycEXTRA Dataset ◽

10.1037/e713402011-001 ◽

2011 ◽

Author(s):

Zhushan Mandy Li

Keyword(s):

Linear Model ◽

Model Framework ◽

Irt Models ◽

Polytomous Irt Models ◽

Log Linear

The relationship between free T4 and thyrotropin receptor antibodies is log-linear and negatively influenced by age and smoking in patients with Graves' disease

Endocrine Abstracts ◽

10.1530/endoabs.50.p392 ◽

2017 ◽

Author(s):

Earn H Gan ◽

Vasileios Tsatlidis ◽

David Kennedy ◽

Salman Razvi

Keyword(s):

Graves Disease ◽

Thyrotropin Receptor ◽

Free T4 ◽

Thyrotropin Receptor Antibodies ◽

Log Linear ◽

The Relationship ◽

Receptor Antibodies

Stochastic gradient descent training for L1-regularized log-linear models with cumulative penalty

Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - ACL-IJCNLP '09 ◽

10.3115/1687878.1687946 ◽

2009 ◽

Cited By ~ 45

Author(s):

Yoshimasa Tsuruoka ◽

Jun'ichi Tsujii ◽

Sophia Ananiadou

Keyword(s):

Gradient Descent ◽

Linear Models ◽

Stochastic Gradient ◽

Stochastic Gradient Descent ◽

Log Linear

When and why are log-linear models self-normalizing?

10.3115/v1/n15-1027 ◽

2015 ◽

Author(s):

Jacob Andreas ◽

Dan Klein

Keyword(s):

Linear Models ◽

Log Linear

Living Arrangements Among the Elderly in the United States: A Log linear Approach

Journal of Comparative Family Studies ◽

10.3138/jcfs.7.2.351 ◽

1976 ◽

Vol 7 (2) ◽

pp. 351-366 ◽

Cited By ~ 15

Author(s):

Beth Soldo ◽

Patience Lauriat

Keyword(s):

United States ◽

Living Arrangements ◽

The Elderly ◽

The United States ◽

Linear Approach ◽

Log Linear

Improvement Comparison of Different Lattice-based Discriminative Training Methods in Chinese-monolingual and Chinese-English-bilingual Speech Recognition

ACTA AUTOMATICA SINICA ◽

10.3724/sp.j.1004.2012.01162 ◽

2012 ◽

Vol 38 (7) ◽

pp. 1162

Author(s):

Yan-Min QIAN ◽

Yu-Xiang SHAN ◽

Lin-Fang WANG ◽

Jia LIU

Keyword(s):

Speech Recognition ◽

Discriminative Training ◽

Training Methods ◽

English Bilingual

Leveraging Second-Order Log-Linear Model for Improved Deep Learning Based ASR Performance

10.21437/interspeech.2018-1156 ◽

2018 ◽

Author(s):

Ankit Raj ◽

Shakti P Rath ◽

Jithendra Vepa

Keyword(s):

Deep Learning ◽

Linear Model ◽

Second Order ◽

Log Linear

Impact of Training and Development Programmes on the Productivity of Employees in the Banks

Journal of Strategic Human Resource Management ◽

10.21863/jshrm/2016.5.1.023 ◽

2016 ◽

Vol 5 (1) ◽

Author(s):

Jaspreet Kaur

Keyword(s):

Regression Analysis ◽

Banking Sector ◽

Human Resources Management ◽

Training And Development ◽

Linear Forms ◽

Knowledge And Skills ◽

Resources Management ◽

Level Of Satisfaction ◽

Log Linear ◽

The Impact

Manpower training and development is an important aspect of human resources management, which must be undertaken either proactively or reactively to meet changes that arise over time. Training is a continuous and perennial activity. It provides employees with the knowledge and skills to perform more effectively. The study examines trainees' opinions on the impact of training and development programmes on the productivity of employees in the selected banks. To evaluate this impact on productivity in the banking sector, multiple regression analysis was employed in both log and log-linear forms. The impact of three aspects of training (objectives, methods and basics) on respondents' level of satisfaction with the training was examined using regression analysis in the same manner.

Review of "Analyzing Qualitative/Categorical Data: Log-Linear Models and Latent-Structure Analysis, by Leo A. Goodman", Abt Books, 1978

ACM SIGSIM Simulation Digest ◽

10.1145/1102815.1102830 ◽

1979 ◽

Vol 10 (4) ◽

p. 69

Keyword(s):

Structure Analysis ◽

Categorical Data ◽

Linear Models ◽

Latent Structure ◽

Latent Structure Analysis ◽

Log Linear ◽

Data Log
