Publishing Partner: Cambridge University Press CUP Extra Wiley-Blackwell Publisher Login
amazon logo
More Info


New from Oxford University Press!

ad

The Vulgar Tongue: Green's History of Slang

By Jonathon Green

A comprehensive history of slang in the English speaking world by its leading lexicographer.


New from Cambridge University Press!

ad

The Universal Structure of Categories: Towards a Formal Typology

By Martina Wiltschko

This book presents a new theory of grammatical categories - the Universal Spine Hypothesis - and reinforces generative notions of Universal Grammar while accommodating insights from linguistic typology.


New from Brill!

ad

Brill's MyBook Program

Do you have access to Dynamics of Morphological Productivity through your library? Then you can by the paperback for only €25 or $25! Find out more about Brill's MyBook program!


Book Information

   
Sun Image

Title: Definition Extraction for Glossary Creation
Subtitle: A study on extracting definitions for semi-automatic glossary creation in Dutch
Written By: Eline Westerhout
Series Title: LOT Dissertation Series
Description:

The central topic of this thesis is the automatic extraction of definitions
from text. Definition extraction can play a role in various applications
including the semi-automatic development of glossaries in an eLearning
context, which constitutes the main focus of this dissertation. A glossary
provides definitions
for the most important terms that are discussed in a text. The
semi-automatic extraction approach presented in this study consists of two
phases. As a first step, a method entirely based on lexico-syntactic
patterns has been used to distinguish between definitions and
non-definitions. A corpus consisting of 600 definitions has been employed
to identify recurrent definition patterns. Since many of these patterns are
not unique to definitions, a second step was employed to reduce the number
of non-definitions identified. It has been investigated whether other
textual characteristics can contribute to the correct classification of
definitions, in addition to the lexico-syntactic patterns. The properties
that have been examined vary from the importance of the defined word
(phrase) within a text to the layout of the definition. Machine learning
techniques have been employed to identify which are the most relevant
(combinations of) definition properties. The results of this dissertation
are relevant for researchers in linguistics and lexicography as well as for
the development of language technology applications.

Publication Year: 2010
Publisher: Netherlands Graduate School of Linguistics / Landelijke (LOT)
Review: Not available for review. If you would like to review a book on The LINGUIST List, please login to view the AFR list.
BibTex: View BibTex record
Linguistic Field(s): Computational Linguistics
Lexicography
Subject Language(s): Dutch
Issue: All announcements sent out by The LINGUIST List are emailed to our subscribers and archived with the Library of Congress.
Click here to see the original emailed issue.

Versions:
Format: Paperback
ISBN-13: 9789460930348
Prices: U.K. £ 24.86