CRAFT: THE COLORADO
RICHLY
ANNOTATED FULL TEXT CORPUS
|
|
Quick
Facts
|
The Colorado Richly Annotated Full Text Corpus (CRAFT) is a manually annotated corpus consisting of 67 full-text biomedical journal articles. Each article is a member of the PubMed Central Open Access Subset. Annotation guidelines used during the construction of CRAFT: LICENSE
The CRAFT annotations are licensed under the Creative Commons Attribution 3.0 license (CC BY). NEWS
REFERENCING
CRAFT
To reference the
CRAFT corpus, please cite one or
both of:
UIMA
INTEGRATION
Accompanying the 1.0 release of CRAFT is a software module
that integrates CRAFT with the Unstructured
Information Management Architecture (UIMA). The software module is
a Maven project. It includes a Collection Reader for the CRAFT corpus
as well as the annotations themselves (in the form of UIMA XMI). The
CRAFT annotations are made available in two UIMA type systems: CCP and
U-Compare. LICENSE: The craft-code software module has been released under the BSD New (3-Clause) license DOCUMENTATION: API 1.0 SOURCE CODE: Download craft-code-1.0 MAVEN COORDINATES (CCP type system): <!-- the
craft collection reader using the ccp type system -->
<dependency> <groupId>edu.ucdenver.ccp</groupId> <artifactId>craft-collection-reader-ccp</artifactId> <version>1.0</version> </dependency> <repository> <id>bionlp-sourceforge</id> <url>http://bionlp.svn.sourceforge.net/viewvc/bionlp/repo/</url> </repository> MAVEN COORDINATES (U-Compare type system): <!-- the
craft collection reader using the u-compare type system -->
<dependency> <groupId>edu.ucdenver.ccp</groupId> <artifactId>craft-collection-reader-ucompare</artifactId> <version>1.0</version> </dependency> <repository> <id>bionlp-sourceforge</id> <url>http://bionlp.svn.sourceforge.net/viewvc/bionlp/repo/</url> </repository> STAY UPDATED
To receive up-to-date information about the CRAFT corpus and future releases, please sign up for the BioNLP-Corpora-CRAFT mailing list. |