Editing of NCI Thesaurus 16.06d was completed on June 27, 2016.  Version 16.06d
was June's fourth build in our development cycle.

This directory contains the following files:

	ReadMe.txt			This file
	Thesaurus_16.06d.FLAT.zip	The NCI Thesaurus 16.06d in flat file format
	Thesaurus_16.06d.OWL.zip	The NCI Thesaurus 16.06d in OWL
	Thesaurus_16.06d.OWL-byName.zip	The NCI Thesaurus 16.06d in OWL byName format
	ThesaurusInf_16.06d.OWL.zip	The NCI Thesaurus 16.06d in Inferred OWL


The zip files unpack to the following files:

	Thesaurus_16.06d.FLAT.zip	Thesaurus.txt
	Thesaurus_16.06d.OWL.zip	Thesaurus.owl
	Thesaurus_16.06d.OWL-byName.zip	Thesaurus-byName.owl
	ThesaurusInf_16.06d.OWL.zip	ThesaurusInferred.owl

In the first two formats below, the relations are as stated by the editors; no inferred
relations are specified.

The Thesaurus.txt flat file packaged in Thesaurus_16.06d.FLAT.zip is in tab-delimited
format.  Included in this format are all the terms associated with NCI Thesaurus concepts
(names and synonyms), a text definition of the concept (if one is present), and stated
parent-child relations, sufficient to reconstruct the hierarchy.  The fields are:

	code <tab> concept name <tab> parents <tab> synonyms <tab> definition

The "parents" field contains the concept name(s) of the superconcept(s).
If a "parents" or "synonyms" field contains multiple entries, these are pipe-delimited.  
For root concepts without "parents", this field contains the string "root_node".  The 
first entry in the "synonyms" field is the preferred name of the concept.  If no preferred 
name has been stated for the concept, this field contains the concept name.  The "definition"
field contains only one definition, even if more than one definition is associated with the
concept; not all concepts have definitions.
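
The field layout described above can be read with a few lines of Python.  The following
is only a minimal sketch, not an official EVS tool: the function names, the dictionary
keys, and the two sample lines (including their codes and concept names) are made up for
illustration and do not come from the released file.

```python
from collections import defaultdict

ROOT_MARKER = "root_node"  # value of the "parents" field for root concepts


def parse_flat_line(line):
    """Split one tab-delimited line into the five documented fields."""
    fields = line.rstrip("\n").split("\t")
    # Pad short lines so a missing trailing definition does not raise.
    fields += [""] * (5 - len(fields))
    code, name, parents, synonyms, definition = fields[:5]
    if parents == ROOT_MARKER:
        parent_names = []
    else:
        parent_names = [p for p in parents.split("|") if p]
    synonym_list = [s for s in synonyms.split("|") if s]
    return {
        "code": code,
        "name": name,
        "parents": parent_names,
        # The first "synonyms" entry is the preferred name.
        "preferred_name": synonym_list[0] if synonym_list else name,
        "synonyms": synonym_list,
        "definition": definition or None,
    }


def build_children(records):
    """Map each parent concept name to the names of its stated children."""
    children = defaultdict(list)
    for rec in records:
        for parent in rec["parents"]:
            children[parent].append(rec["name"])
    return dict(children)


# Hypothetical sample lines, for illustration only.
SAMPLE_LINES = [
    "C0001\tExample_Root\troot_node\tExample Root|Root Term\tA made-up root concept.",
    "C0002\tExample_Child\tExample_Root\tExample Child\t",
]
records = [parse_flat_line(line) for line in SAMPLE_LINES]
hierarchy = build_children(records)
```

To process the real file, the same function could be applied to each line of Thesaurus.txt
opened in text mode; the encoding of that file is not stated here and would need to be
checked.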

The Thesaurus.owl file packaged in Thesaurus_16.06d.OWL.zip
(asserted OWL byCode) contains the entire terminology expressed in the
Web Ontology Language (OWL), and the rdf:about (rdf:ID) utilizes semantically
meaningless identifiers, e.g. "C123456", which is the value of the "code"
property found in the asserted "byName" file.  Relations are as stated by the
editors; no inferred relations are specified.  Annotations of use only to the
EVS (for example, editor's notes) are absent in the released terminology.
The rdf:about values of annotation and object properties are also replaced with
their corresponding codes.
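
As an illustration of the identifiers described above, the following Python sketch collects
the rdf:about (or rdf:ID) values of owl:Class elements.  It is a sketch under assumptions:
the sample document is made up, and the assumption that the code follows a "#" in the
identifier has not been checked against the released Thesaurus.owl.  Only the RDF and OWL
namespace URIs are standard.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
OWL_NS = "http://www.w3.org/2002/07/owl#"

# Hypothetical fragment, for illustration only; not taken from Thesaurus.owl.
SAMPLE = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:owl="http://www.w3.org/2002/07/owl#">
  <owl:Class rdf:about="#C123456"/>
  <owl:Class rdf:about="#C654321"/>
</rdf:RDF>"""


def class_identifiers(xml_text):
    """Return the identifier of every named owl:Class in the document."""
    root = ET.fromstring(xml_text)
    identifiers = []
    for cls in root.iter("{%s}Class" % OWL_NS):
        about = cls.get("{%s}about" % RDF_NS) or cls.get("{%s}ID" % RDF_NS)
        if about:  # anonymous classes carry neither attribute
            # Assumption: the code is the part after the last "#".
            identifiers.append(about.rsplit("#", 1)[-1])
    return identifiers


codes = class_identifiers(SAMPLE)
```

For a file the size of the full terminology, an incremental parser such as
xml.etree.ElementTree.iterparse would likely be a better fit than loading the whole
document at once.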

The Thesaurus-byName.owl file packaged in Thesaurus_16.06d.OWL-byName.zip
(asserted OWL byName) contains the entire terminology expressed in the
Web Ontology Language (OWL, http://www.w3.org/TR/owl-ref/), and the rdf:about
(rdf:ID) utilizes semantically meaningful identifiers, e.g. "Gene".  Relations
are as stated by the editors; no inferred relations are specified.  Annotations
of use only to the EVS (for example, editor's notes) are absent in the released
terminology.

The unzipped Thesaurus.owl is available directly at http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl

The ThesaurusInferred.owl file packaged in ThesaurusInf_16.06d.OWL.zip
contains the terminology from the NCI Thesaurus but excludes retired concepts and 
includes inferred relationships. This file is created for import into the UMLS 
and NCI Metathesaurus. Properties of use only to the EVS (e.g. editor notes) are
absent in the released terminology.  It also uses semantically meaningless 
rdf:about identifiers as described above for "asserted OWL byCode".

The file nci_code_cui_map_201604.dat contains the map of NCI Thesaurus (NCIt) codes
to NCI Metathesaurus (NCIMeta) CUIs (concept unique identifiers).  The format of the
file is code <pipe> CUI <pipe>.  The CUIs are either derived from the UMLS Metathesaurus
(CUIs that begin with "C-digit") or specific to the NCI Metathesaurus (CUIs that begin
with "CL").  This mapping file is created for each release of the NCIMeta, which is
published quarterly; it is therefore not current with every release of NCIt, which
is published monthly.  Newly created Thesaurus concepts will not appear in this file
until the next release of the Metathesaurus, and newly retired Thesaurus concepts will
continue to appear until the next release of the Metathesaurus.  Archived mapping files
may be found in the archives directory beneath the last release of the Thesaurus from
which they were generated.  The current mapping file is derived from NCIt version 16.04d
and NCIMeta version 201604.
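
The mapping format and the two CUI prefixes described above can be handled as in the
following Python sketch.  The function name, the returned labels ("UMLS", "NCIMeta",
"unknown"), and the sample lines are assumptions made for illustration; the sample codes
and CUIs are not taken from the released mapping file.

```python
def parse_mapping_line(line):
    """Parse one 'code|CUI|' line and label where the CUI comes from."""
    parts = line.strip().split("|")
    code, cui = parts[0], parts[1]
    if cui.startswith("CL"):
        source = "NCIMeta"  # specific to the NCI Metathesaurus
    elif len(cui) > 1 and cui[0] == "C" and cui[1].isdigit():
        source = "UMLS"  # derived from the UMLS Metathesaurus
    else:
        source = "unknown"
    return code, cui, source


# Hypothetical sample lines, for illustration only.
SAMPLE_MAPPING = [
    "C123456|C0000001|",
    "C654321|CL000001|",
]
mapping = [parse_mapping_line(line) for line in SAMPLE_MAPPING]
```

Because the mapping file lags behind monthly NCIt releases, a lookup built this way should
expect some current NCIt codes to be missing and some retired codes to still be present.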

For previous releases of the NCI Thesaurus please see the archives at
ftp://ftp1.nci.nih.gov/pub/cacore/EVS/NCI_Thesaurus/archive/

For additional information, please see the LexEVS summary page at
https://wiki.nci.nih.gov/display/EVS/LexEVS+Servers+and+APIs+Summary