PreProcess - JDI, phase III
This page describes the automated pre-processing tasks that generate input files for JDI (Journal Descriptor Indexing). The JDI pre-process has three phases:
- Phase I:
Generate all files in Java input format from the Lisp files. This data set is tested by comparing it against all Lisp files and the results of file.9801, and is used in tc2006.
- Phase II:
Use Java programs to generate the files from the original data (MEDLINE) and the Lisp files. This data set is tested by comparing it against all files from Phase I and the results of file.9801, and is used in tc2007.
- Phase III:
Use Java programs to generate the files from scratch (MEDLINE, Metathesaurus, etc.). This data set is tested by comparing it against the final files from Phase II by similarity (test suite), and has been used since tc2008.
The detailed procedures of the Phase III approach are described below: