Editing of NCI Thesaurus 08.10e was completed on October 31, 2008. Version 08.10e was October's fifth build in our development cycle. This directory contains files: ReadMe.txt This file Thesaurus_08.10e.XML.zip The NCI Thesaurus version 08.10e in Apelon's XML format Thesaurus_08.10e.Inferred.zip The NCI Thesaurus version 08.10e in Apelon's Inferred XML format Thesaurus_08.10e.FLAT.zip The NCI Thesaurus 08.10e in flat file format Thesaurus_08.10e.OWL.zip The NCI Thesaurus 08.10e in OWL The zip files unpack the following files: Thesaurus_08.10e.XML.zip Thesaurus_08.10e.xml Thesaurus_08.10e.Inferred.zip Thesaurus_08.10e.Inferred.xml Thesaurus_08.10e.FLAT.zip Thesaurus_08.10e.txt Thesaurus_08.10e.OWL.zip Thesaurus.owl In all three formats below, the ontology is in a defined state, i.e. relations are as stated by the editors, no inferred relations are specified. The Thesaurus_08.10e.xml file contains the entire terminology and associated ontologic constructions from the NCI Thesaurus, including properties, roles, and kinds. The DTD for the XML is as defined by Apelon, Inc, whose editing tools are being used in the construction of the Thesaurus. Properties of use only to the EVS (e.g. editor notes) are absent in the released terminology. The Thesaurus_08.10e.InferredXML.zip file contains the terminology from the NCI Thesaurus but excludes retired concepts and includes inferred relationships. This file is created for import into the UMLS and NCI Metathesaurus. The DTD for the XML is as defined by Apelon, Inc, whose editing tools are being used in the construction of the Thesaurus. Properties of use only to the EVS (e.g. editor notes) are absent in the released terminology. The Thesaurus_08.10e.txt flat file is in tab-delimited format. Included in this format are all the terms associated with NCI Thesaurus concepts (names and synonyms), a text definition of the concept (if one is present), and stated parent-child relations, sufficient to reconstruct the hierarchy. The fields are: code concept name parents synonyms definition The "parents" field contains the concept name(s) of the superconcept(s). If a "parents" or "synonyms" field contains multiple entries, these are pipe-delimited. For root concepts without "parents", this field contains the string "root_node". The first entry in the "synonyms" field is the preferred name of the concept. If no preferred name has been stated for the concept, this field contains the concept name. The "definition" field contains only one definition if more than one definition is associated with the concept; not all concepts contain definitions. The Thesaurus.owl file contains the entire terminology expressed in the OWL web ontology language (http://www.w3.org/TR/owl-ref/), with the exception of the Ontylog namespace declaration, which was deemed unnecessary. The Ontylog Roles where converted to restrictions on OWL properties, and most of the concept annotations in Ontylog properties were converted to OWL AnnotationProperty; as in the Ontylog xml file, properties of use only to the EVS (e.g. editor notes) are absent in the OWL file. Because Roles in Ontylog are mapped from a domain kind to a range kind, the OWL version of the Thesaurus has each kind as a root class to facilitate the conversion of Roles to OWL properties. The kind root classes are declared disjoint in the OWL file. The unzipped Thesaurus.owl is available directly at http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl For additional information, please see the Release Notes of caCORE 4.1.