Geopolitical Entities, Names, and Codes (GENC) Terminology Files

The NCIt-GENC terminology files provided here are to support dissemination of the GENC system of information and support the Food and Drug Administration Structured Product Labeling (FDA SPL). The efforts of GENC are described more fully on the GENC web page, which also includes extensive data on the administrative subdivisions and mappings. The efforts of the FDA SPL are described more fully on the FDA SPL web page. The data supplied by GENC includes the following files:

The current GENC_Standard_Index.xlsx
NCIt-GENC_Terminology.xlsx
NCIt-GENC_Terminology.txt (Tab-delimited text)
The NCIt-GENC spreadsheet contains the GENC list, comprised of approximately 280 entries. These concepts are also assigned NCIt codes as they are part of the NCI Thesaurus and belong to the subset Geopolitical Entities, Names, and Codes (GENC) Terminology. Updates are made as they are provided by the GENC Working Group.


Each file has the following column headers on the first row:
Spreadsheet Column Content Description
NCIt Concept Code The NCIt concept code attached to the concept. NCIt Codes are unique strings that begin with a "C" and are followed by a series of digits.
NCIt Preferred Term The preferred term for the concept chosen by NCI; these mirror the GENC Name.
GENC Name The preferred term of the geopolitical entity chosen by GENC.
GENC 2 Letter Code The 2 letter code for the geopolitical entity chosen by GENC.
GENC Standard 3 Letter Code The 3 letter code for the geopolitical entity chosen by GENC.
GENC Number The 3 digit code for the geopolitical entity chosen by GENC.
NCIt Subset Code The NCIt concept code attached to the subset concept. NCIt Codes are unique strings that begin with a "C" and are followed by a series of digits.
NCIt Subset Name The preferred term for the subset concept.

Also included on the NCI EVS ftp site (https://evs.nci.nih.gov/ftp1/GENC/) are the following additional files:
About (This file)
Changes.txt (A text file of changes between the most recent and the current version of the GENC terminology. For each change record, the Changes.txt contains a complete row of tab delimited data with the same data elements as described above. An "A" will precede any new concept additions, a "C" will precede any modification to existing concepts, and a "D" will precede any concepts that have been deleted.)
Version.txt (A text file that contains the version of NCI Thesaurus that corresponds to the current spreadsheet data. The database is reconciled the last Monday of every month. The files will be posted during the following two weeks. The version appears as YR.MOweek. An example is 17.02d which corresponds to the year 2017, the month of February, and the "d" refers to the fourth Monday of the month.)
Archived files are available at: Help requests on these files should go to NCIThesaurus@mail.nih.gov