This book "supplies a vocabulary of English words and idiomatic phrases 'arranged … according to the ideas which they express'. The thesaurus, continually expanded and updated, has always remained in print, but this reissued first edition shows the impressive breadth of Roget's own knowledge and interests."
The aim of this book is try to illustrate with numerous examples how quantitative methods can most fruitfully contribute to linguistic analysis and research. In addition, it does not intend to offer an exhaustive presentation of all statistical techniques available to linguistics, but to demonstrate the contribution that statistics can and should make to linguistic studies.
This book shows how quantitative methods and statistical techniques can supplement qualitative analyses of language. It attempts to present some mathematical and statistical properties of natural languages, and introduces some of the quantitative methods which are of the most value in working empirically with texts and corpora, illustrating the various issues with numerous examples and moving from the most basic descriptive techniques to decision-taking techniques and to more sophisticated multivariate statistical language models.