BioNLP-Corpora is a repository of biologically and linguistically annotated corpora and biological datasets.
It is one of the projects of the BioNLP initiative by the Center for Computational Pharmacology at the University of Colorado Denver Health Sciences Center to create and distribute code, software, and data for applying natural language processing techniques to biomedical texts.
There are many resources available for download at BioNLP-Corpora: