ASCII LEXICON
The Specialist LEXICON is distributed in UTF-8 format annually with UMLS. There are some NLP projects uses the Specialist LEXICON and still only dealing with ASCII characters. Due to the requests from user groups, the pure ASCII version of LEXICON is generated since 2009.
- The 1st version of ASCII LEXICON generation:
For years of 09 and 10 - The 2nd (enhanced) version of ASCII LEXICON generation
This enhanced version is developed to support MetaMap projects on the migration from 'C-code' to Java Lexical Tools/LexAccess interface since 2011. A set of reports is generated from the ASCII conversion log files. These reports should be reviewed by following procedures before the finalization as described in the design documents of ASCII LEXICON reports and review