Publishing Partner: Cambridge University Press CUP Extra Wiley-Blackwell Publisher Login
amazon logo
More Info


New from Oxford University Press!

ad

Language Planning as a Sociolinguistic Experiment

By: Ernst Jahr

Provides richly detailed insight into the uniqueness of the Norwegian language development. Marks the 200th anniversary of the birth of the Norwegian nation following centuries of Danish rule


New from Cambridge University Press!

ad

Acquiring Phonology: A Cross-Generational Case-Study

By Neil Smith

The study also highlights the constructs of current linguistic theory, arguing for distinctive features and the notion 'onset' and against some of the claims of Optimality Theory and Usage-based accounts.


New from Brill!

ad

Language Production and Interpretation: Linguistics meets Cognition

By Henk Zeevat

The importance of Henk Zeevat's new monograph cannot be overstated. [...] I recommend it to anyone who combines interests in language, logic, and computation [...]. David Beaver, University of Texas at Austin


Summary Details


Query:   Corpus Linguistics and Frequency
Author:  Peyton Todd
Submitter Email:  click here to access email
Linguistic LingField(s):   Text/Corpus Linguistics

Summary:   Many thanks to Roger Levy, Maria Giagkou, Balint Tanos, Aida Zitouni, Holly
Jacobson, Cedric Krummes, Karen Englander, Gill Philip, Martin Volk, N.
Wiedenmann, and Josh Viau for their answers to my recent query regarding
sources of information about corpus linguistics and frequency. In further
expression of my gratitude, and as a boon to others sharing my interest, I
provide below a summary of the replies I received.

Peyton Todd

BOOKS AND ARTICLES:

1. Baker, Paul (2006). Using Corpora in Discourse Analysis. London:
Continuum, 0-8264-7725-9

2. Biber, Douglas. Dimensions of Register Variation using
Multifeature/multidimensional analysis.

3. Hunston, S. & G.Francis, Pattern Grammar (J. Benjamins)

4. Meyer, Charles F. (2002). English Corpus Linguistics: An Introduction .
Cambridge University Press. (ISBN: 052100490X)

5. Roland, Douglas, Frederic Dick, and Jeffrey L. Elman (2007). Frequency
of basic English grammatical structures: A corpus analysis. Journal of
Memory and Language 57(3):348-379.

6. Sinclair, John. Reading Concordances.

7. Sinclair, John. Trust the Text.

Also, 'the works of Joan Bybee', listed at http://www.unm.edu/~jbybee/

HANDS-ON SEARCHES:

1. Bank of English (= Collins, below)

2. British National Corpus: http://www.natcorp.ox.ac.uk/

3. Collins WordbanksOnline concordance sampler
http://www.collins.co.uk/Corpus/CorpusSearch.aspx

4. Introductory website:
http://www.essex.ac.uk/linguistics/clmt/w3c/corpus_ling/content/introduction.html

5. Linguistic Data Consortium (LDC) at the University of Pennsylvania.WebSearch

6. Phrases in English: http://pie.usna.edu/, which uses the BNC

7. http://childes.psy.cmu.edu/

8. http://www.natcorp.ox.ac.uk/

9. TIGER-Search (freely available from the University of Stuttgart)

10.The Penn Treebank (for English).

SOFTWARE

1. The TextSTAT (free):
http://www.niederlandistik.fu-berlin.de/textstat/software-en.html

2. The WordSmith Tools (not free, but inexpensive):
http://www.lexically.net/wordsmith/

3. AntConc: downloadable for free at:
http://www.antlab.sci.waseda.ac.jp/software.html

4. ConcApp: available from www.edict.com.hk/PUB/concapp/


A LIST SERVER FOR FURTHER QUERIES:


1. Corpora@uib.no


And an INDIVIDUAL:

Prof. Dr. Dietmar Zaefferer,
Ludwig-Maximilians-University at Munich, Germany
(who is very friendly) who has data on all languages of the world
(Computational Linguistics)

LL Issue: 18.3088
Date Posted: 22-Oct-2007
Original Query: Read original query


Back

Sums main page