Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info


New from Oxford University Press!

ad

The Language Hoax

By John H. McWhorter

The Language Hoax "argues that that all humans process life the same way, regardless of their language."


New from Cambridge University Press!

ad

Language and Development in Africa

By H. Ekkehard Wolff

Language and Development in Africa "discusses the resourcefulness of languages, both local and global, in view of the ongoing transformation of African societies as much as for economic development.. "


The LINGUIST List is dedicated to providing information on language and language analysis, and to providing the discipline of linguistics with the infrastructure necessary to function in the digital world. LINGUIST is a free resource, run by linguistics students and faculty, and supported primarily by your donations. Please support LINGUIST List during the 2016 Fund Drive.

Academic Paper


Title: Using Machine-Learning to Assign Function Labels to Parser Output for Spanish
Paper URL: http://acl.ldc.upenn.edu/P/P06/P06-2018.pdf
Author: Grzegorz Chrupała
Email: click here TO access email
Homepage: http://www.lsv.uni-saarland.de/personalPages/gchrupala/index.html
Institution: Saarland University
Author: Josef Van Genabith
Email: click here TO access email
Institution: Dublin City University
Linguistic Field: Computational Linguistics
Subject Language: Spanish
Spanish
Abstract: Data-driven grammatical function tag assignment has been studied for English using the Penn-II Treebank data. In this paper we address the question of whether such methods can be applied successfully to other languages and treebank resources. In addition to tag assignment accuracy and f-scores we also present results of a task-based evaluation. We use three machine-learning methods to assign Cast3LB function tags to sentences parsed with Bikel's parser trained on the Cast3LB treebank. The best performing method, SVM, achieves an f-score of 86.87% on gold-standard trees and 66.67% on parser output - a statistically significant improvement of 6.74% over the baseline. In a task-based evaluation we generate LFG functional-structures from the function-tag-enriched trees. On this task we achieve an f-score of 75.67%, a statistically significant 3.4% improvement over the baseline.
Type: Individual Paper
Status: Completed
Publication Info: Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions
URL: http://acl.ldc.upenn.edu/P/P06/P06-2018.pdf


Add a new paper
Return to Academic Papers main page
Return to Directory of Linguists main page