The NPS Chat Corpus, Release 1.0 is now available. Release 1.0 consists of 10,567 posts out of approximately 500,000 posts we have gathered from various online chat services in accordance with their terms of service. Future releases will contain more posts from more domains.
The posts included in Release 1.0 have been:
1) Hand privacy masked; 2) Part-of-speech tagged; and 3) Dialogue-act tagged.
The NPS Chat Corpus will be part of the Natural Language Tool Kit (NLTK) as of ver 0.9.4.
For license information and instructions on how to obtain the corpus, please see