This interface allows University of Illinois researchers to download copyrighted corpora for which the University has a license. At present, the collection is limited mainly to text and speech transcript corpora. If you're looking for data used in specific experiments by Cognitive Computation Group members, please visit our Data page.
To see which corpora match your criteria, enter values in the fields below and click Submit.
If you have corpora you would like to make available to the wider University of Illinois research community, or if you find errors or omissions in the data here, please contact Mark Sammons at mssammon at illinois dot edu.