Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info


New from Oxford University Press!

ad

Vowel Length From Latin to Romance

By Michele Loporcaro

This book "draws on extensive empirical data, including from lesser known varieties" and "puts forward a new account of a well-known diachronic phenomenon."


New from Cambridge University Press!

ad

Letter Writing and Language Change

Edited By Anita Auer, Daniel Schreier, and Richard J. Watts

This book "challenges the assumption that there is only one 'legitimate' and homogenous form of English or of any other language" and "supports the view of different/alternative histories of the English language and will appeal to readers who are skeptical of 'standard' language ideology."


Academic Paper


Title: Modeling Word Senses with Fuzzy Clustering
Paper URL: http://wo.uio.no/as/WebObjects/theses.woa/wo/2.3.9
Author: Erik Velldal
Email: click here TO access email
Institution: University of Oslo
Linguistic Field: Computational Linguistics; Language Acquisition; Semantics; Text/Corpus Linguistics
Abstract: This thesis describes a clustering approach to automatically inferring soft semantic classes and characterizing senses of a set of Norwegian nouns. The words are represented by way of their distribution in text, identified as local contexts in the form of lexical-syntactic relations. Through a shallow processing step the context features are extracted for lemmatized word forms in syntactically tagged corpora. The corresponding frequency counts of noun-context co-occurrences are weighted with a statistical association measure, and the distributional profile of a given word is represented in the form of a feature vector in a semantic space model. A hybrid approach is taken when clustering the word vectors; a bottom-up hierarchical method is used to initialize various types of fuzzy partitional clusterings. With the purpose of capturing the notion of typicality the clusters are construed as fuzzy sets, and the words are assigned varying degrees of membership with respect to the various classes. Words are assigned graded memberships in clusters on the basis of their resemblance towards a class prototype. The goal is to automatically uncover semantic classes, where the various memberships of a given word in these fuzzy clusters can be used to characterize its various senses.
Type: Individual Paper
Status: Completed
URL: http://wo.uio.no/as/WebObjects/theses.woa/wo/2.3.9


Add a new paper
Return to Academic Papers main page
Return to Directory of Linguists main page