A range of electronic corpora has become increasingly accessible via the
WWW and CD-ROM. This development coincided with improvements in the
standards governing the collecting, encoding and archiving of such data.
Less attention, however, has been paid to making other types of digital
data available. This is especially true of that which one might describe as
'unconventional', namely, dialects, child language and bilingual databases.
This book is a first step toward developing similar standards for enriching
and preserving these neglected resources.