The TestSuite Corpora contain structured test suites for natural language processing applications, built on the principles of software testing and linguistic fieldwork. For general background on the philosophy behind these test suites, see:

K. Bretonnel Cohen, Lorraine Tanabe, Shuhei Kinoshita, and Lawrence Hunter (2004). A resource for constructing customized test suites for molecular biology entity identification systems. BioLINK 2004: Linking biological literature, ontologies, and databases: tools for users, pp. 1-8. Association for Computational Linguistics.

The test suites that are available include: