Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info


New from Oxford University Press!

ad

Words in Time and Place: Exploring Language Through the Historical Thesaurus of the Oxford English Dictionary

By David Crystal

Offers a unique view of the English language and its development, and includes witty commentary and anecdotes along the way.


New from Cambridge University Press!

ad

The Indo-European Controversy: Facts and Fallacies in Historical Linguistics

By Asya Pereltsvaig and Martin W. Lewis

This book "asserts that the origin and spread of languages must be examined primarily through the time-tested techniques of linguistic analysis, rather than those of evolutionary biology" and "defends traditional practices in historical linguistics while remaining open to new techniques, including computational methods" and "will appeal to readers interested in world history and world geography."


Academic Paper


Title: Modeling Word Senses with Fuzzy Clustering
Paper URL: http://wo.uio.no/as/WebObjects/theses.woa/wo/2.3.9
Author: Erik Velldal
Email: click here TO access email
Institution: University of Oslo
Linguistic Field: Computational Linguistics; Language Acquisition; Semantics; Text/Corpus Linguistics
Abstract: This thesis describes a clustering approach to automatically inferring soft semantic classes and characterizing senses of a set of Norwegian nouns. The words are represented by way of their distribution in text, identified as local contexts in the form of lexical-syntactic relations. Through a shallow processing step the context features are extracted for lemmatized word forms in syntactically tagged corpora. The corresponding frequency counts of noun-context co-occurrences are weighted with a statistical association measure, and the distributional profile of a given word is represented in the form of a feature vector in a semantic space model. A hybrid approach is taken when clustering the word vectors; a bottom-up hierarchical method is used to initialize various types of fuzzy partitional clusterings. With the purpose of capturing the notion of typicality the clusters are construed as fuzzy sets, and the words are assigned varying degrees of membership with respect to the various classes. Words are assigned graded memberships in clusters on the basis of their resemblance towards a class prototype. The goal is to automatically uncover semantic classes, where the various memberships of a given word in these fuzzy clusters can be used to characterize its various senses.
Type: Individual Paper
Status: Completed
URL: http://wo.uio.no/as/WebObjects/theses.woa/wo/2.3.9


Add a new paper
Return to Academic Papers main page
Return to Directory of Linguists main page