CRAFT: THE COLORADO
RICHLY
ANNOTATED FULL TEXT CORPUS
|
NOTE: This site hosts CRAFT v2.0 and below. For more recent versions of the corpus, including the most recent version, please go to
the CRAFT GitHub site
.
|
Quick
Facts
|
The Colorado Richly Annotated Full Text Corpus (CRAFT) is a manually annotated corpus consisting of 67 full-text biomedical journal articles. Each article is a member of the PubMed Central Open Access Subset. Annotation guidelines used during the construction of CRAFT: LICENSE
The CRAFT annotations are licensed under the Creative Commons Attribution 3.0 license (CC BY). NEWS
RELEVANT PUBLICATIONS
To reference the
CRAFT corpus, please cite one of:
UIMA
INTEGRATION
Accompanying the release of CRAFT is a software module
that integrates CRAFT with the Unstructured
Information Management Architecture (UIMA). The software module is
a Maven project. It includes a Collection Reader for the CRAFT corpus
as well as the annotations themselves (in the form of UIMA XMI).
LICENSE: The craft-code software module has been released under the BSD New (3-Clause) license DOCUMENTATION: API 1.0 SOURCE CODE: Download craft-code-2.0 MAVEN COORDINATES (CCP type system): <!-- the
craft collection reader using the ccp type system -->
<dependency> <groupId>edu.ucdenver.ccp</groupId> <artifactId>craft-collection-reader-ccp</artifactId> <version>2.0</version> </dependency> <repository> <id>bionlp-sourceforge</id> <url>http://svn.code.sf.net/p/bionlp/code/repo/</url> </repository> STAY UPDATED
To receive up-to-date information about the CRAFT corpus and future releases, please sign up for the BioNLP-Corpora-CRAFT mailing list. |