January 30, 2004

Editing of NCI Thesaurus 03.12a was completed on December 5, 2003.  Version 
03.12a was December's first build in our development cycle.

This directory contains seven files:

	ReadMe.txt			This file
	NCI_THESAURUS_license.txt	A description of the licensing terms of the NCI Thesaurus
	ThesaurusTermsofUse.htm		As above, in html format for web browsing
	Thesaurus_03.12a.XML.zip	The NCI Thesaurus version 03.12a in Apelon's XML format
	Thesaurus_03.12a.FLAT.zip	The NCI Thesaurus 03.12a in flat file format
	Thesaurus_03.12a.OWL.zip	The NCI Thesaurus 03.12a in OWL
	ontylog.dtd			The document type definition (DTD) for Apelon's XML format, from Apelon, Inc.

The NCI_THESAURUS_license.txt file contains the terms of use for the 
NCI Thesaurus.  Please refer to that file for the exact licensing terms.


The zip files unpack to the following files:
	Thesaurus_03.12a.XML.zip	Thesaurus_03.12a.xml
	Thesaurus_03.12a.FLAT.zip	Thesaurus_03.12a.txt
	Thesaurus_03.12a.OWL.zip	Thesaurus.owl
	
In all three formats described below, the ontology is in a defined state, 
i.e., relations are as stated by the editors; no inferred relations are
specified.

The Thesaurus_03.12a.xml file contains the entire terminology and associated 
ontologic constructions from the NCI Thesaurus, including properties, roles, 
and kinds.  The DTD for the XML is as defined by Apelon, Inc., whose editing 
tools are being used in the construction of the Thesaurus.  Properties of 
use only to the EVS (e.g., editor notes) are absent from the released terminology. 


The Thesaurus_03.12a.txt flat file is in tab-delimited format.  Included in this 
format are all the terms associated with NCI Thesaurus concepts (names and 
synonyms), a text definition of the concept (if one is present), and stated 
parent-child relations, sufficient to reconstruct the hierarchy.  The fields 
are:

	code <tab> concept name <tab> parents <tab> synonyms <tab> definition

The "parents" field contains the concept name(s) of the superconcept(s).
If a "parents" or "synonyms" field contains multiple entries, these 
are pipe-delimited.  For root concepts without "parents", this field
contains the string "root_node".  The first entry in the "synonyms" field 
is the preferred name of the concept.  If no preferred name has been stated
for the concept, this field contains the concept name.  The 
"definition" field contains only one definition, even if more than one 
definition is associated with the concept; not all concepts have 
definitions.  
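
As an illustration of the field layout described above, the following is a 
minimal Python sketch of a reader for the flat file.  It is not an official 
tool.  The names Concept, parse_flat and children_by_parent, and the sample 
rows (codes X1 to X3 and their concept names), are hypothetical and exist only 
for this example.  The sketch assumes every row carries the five fields in 
the order given, and pads a row whose trailing empty fields are missing.

```python
from collections import defaultdict
from typing import List, NamedTuple


class Concept(NamedTuple):
    code: str
    name: str
    parents: List[str]   # empty for root concepts
    synonyms: List[str]  # first entry is the preferred name
    definition: str      # empty string when no definition is present


def parse_flat(lines):
    """Parse rows of: code, concept name, parents, synonyms, definition."""
    concepts = []
    for line in lines:
        line = line.rstrip("\r\n")
        if not line:
            continue
        fields = line.split("\t")
        # Assumption: a row may lack trailing empty fields, so pad to five.
        fields += [""] * (5 - len(fields))
        code, name, parents, synonyms, definition = fields[:5]
        # Root concepts carry the literal string "root_node" as their parent.
        if parents == "root_node":
            parent_list = []
        else:
            parent_list = [p for p in parents.split("|") if p]
        synonym_list = [s for s in synonyms.split("|") if s]
        concepts.append(
            Concept(code, name, parent_list, synonym_list, definition)
        )
    return concepts


def children_by_parent(concepts):
    """Invert the stated parent links into a parent name -> child names map."""
    tree = defaultdict(list)
    for concept in concepts:
        for parent in concept.parents:
            tree[parent].append(concept.name)
    return dict(tree)


# Hypothetical sample rows, invented for this example only.
SAMPLE = (
    "X1\tExample_Kind\troot_node\tExample Kind\tA made-up root concept.\n"
    "X2\tExample_Child\tExample_Kind\tExample Child|Sample Child\t\n"
    "X3\tExample_Grandchild\tExample_Child|Example_Kind\tExample Grandchild\t\n"
)

if __name__ == "__main__":
    parsed = parse_flat(SAMPLE.splitlines())
    for parent, children in children_by_parent(parsed).items():
        print(parent, "->", ", ".join(children))
```

To read the released file, pass an open file object for Thesaurus_03.12a.txt 
to parse_flat in place of the sample rows.  The character encoding of the 
file is not stated here, so the encoding chosen when opening it is an 
assumption.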

The Thesaurus.owl file contains the entire terminology expressed in the OWL 
web ontology language (http://www.w3.org/TR/owl-ref/), with the exception of
the Ontylog namespace declaration, which was deemed unnecessary.  The Ontylog
Roles were converted to restrictions on OWL properties, and most of the 
concept annotations in Ontylog properties were converted to OWL 
AnnotationProperty; as in the Ontylog XML file, properties of use only to 
the EVS (e.g., editor notes) are absent from the OWL file.  Because 
Roles in Ontylog are mapped from a domain kind to a range kind, the OWL 
version of the Thesaurus has each kind as a root class to facilitate the 
conversion of Roles to OWL properties.
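
As a rough illustration of how the class hierarchy might be read from an OWL 
file of this kind, the following Python sketch lists each named class with 
its named superclasses.  It assumes a standard RDF/XML serialization using 
the W3C rdf, rdfs and owl namespaces; the actual layout of Thesaurus.owl has 
not been checked against it.  The function name class_parents and the sample 
classes are hypothetical.  Superclasses given as nested restrictions (the 
form the converted Roles would take) have no rdf:resource attribute and are 
skipped.

```python
import io
import xml.etree.ElementTree as ET

# Standard W3C namespaces; their use in Thesaurus.owl is an assumption.
OWL = "http://www.w3.org/2002/07/owl#"
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"


def class_parents(source):
    """Map each named owl:Class to the named classes it is a subclass of."""
    result = {}
    root = ET.parse(source).getroot()
    for cls in root.iter(f"{{{OWL}}}Class"):
        name = cls.get(f"{{{RDF}}}ID") or cls.get(f"{{{RDF}}}about")
        if name is None:
            continue  # anonymous class expression
        parents = result.setdefault(name.lstrip("#"), [])
        for sub in cls.findall(f"{{{RDFS}}}subClassOf"):
            ref = sub.get(f"{{{RDF}}}resource")
            if ref is not None:  # nested restrictions have no rdf:resource
                parents.append(ref.lstrip("#"))
    return result


# Hypothetical sample document, invented for this example only.
SAMPLE_OWL = f"""<?xml version="1.0"?>
<rdf:RDF xmlns:rdf="{RDF}" xmlns:rdfs="{RDFS}" xmlns:owl="{OWL}">
  <owl:Class rdf:ID="Example_Kind"/>
  <owl:Class rdf:ID="Example_Child">
    <rdfs:subClassOf rdf:resource="#Example_Kind"/>
    <rdfs:subClassOf>
      <owl:Restriction>
        <owl:onProperty rdf:resource="#exampleRole"/>
        <owl:someValuesFrom rdf:resource="#Example_Kind"/>
      </owl:Restriction>
    </rdfs:subClassOf>
  </owl:Class>
</rdf:RDF>
"""

if __name__ == "__main__":
    hierarchy = class_parents(io.BytesIO(SAMPLE_OWL.encode("utf-8")))
    for name, parents in hierarchy.items():
        print(name, "<-", parents)
```

Passing the path of the unpacked Thesaurus.owl to class_parents in place of 
the in-memory sample would apply the same logic to the released file, 
provided the assumptions above hold.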

For additional information, please see the Release Notes of caCORE 2.0.