Stri Java

Introduction

The Stri tool is based on the Jdi methodology. It uses ST (semantic type) documents; an ST document is the set of one-word UMLS Metathesaurus strings belonging to an ST. Stri takes input that may be text phrases or MeSH terms. Filters such as word extraction algorithms, stopwords, and minimum word length are applied to text input. Stri then ranks the STs for an input by the similarity between the JDI of the input (the result of running the Jdi tool on the input in real time) and the pre-calculated JDI of each ST document, and sends the ranked STs with their scores to the output.
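The filter-then-rank flow described above can be sketched roughly as follows. This is an illustration only, not Stri's actual code: the class name StRankSketch, the stopword list, the minimum word length of 3, the representation of a JDI result as a map from journal descriptor to score, and the use of cosine similarity are all assumptions made for the example. The text above does not state which similarity measure Stri uses, and the scores in main are made-up sample values.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

class StRankSketch {
    // Hypothetical filter settings; Stri's real stopword list and length limit may differ.
    static final Set<String> STOPWORDS = new HashSet<>(Arrays.asList("the", "of", "and", "in"));
    static final int MIN_WORD_LENGTH = 3;

    // Splits text into lowercase words, dropping stopwords and words that are too short.
    static List<String> filterWords(String text) {
        List<String> words = new ArrayList<>();
        for (String w : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (w.length() >= MIN_WORD_LENGTH && !STOPWORDS.contains(w)) {
                words.add(w);
            }
        }
        return words;
    }

    // Cosine similarity between two sparse score vectors (assumed similarity measure).
    static double cosine(Map<String, Double> a, Map<String, Double> b) {
        double dot = 0.0, normA = 0.0, normB = 0.0;
        for (Map.Entry<String, Double> e : a.entrySet()) {
            normA += e.getValue() * e.getValue();
            Double other = b.get(e.getKey());
            if (other != null) {
                dot += e.getValue() * other;
            }
        }
        for (double v : b.values()) {
            normB += v * v;
        }
        return (normA == 0.0 || normB == 0.0) ? 0.0 : dot / Math.sqrt(normA * normB);
    }

    // Scores every ST document against the input and returns the STs in descending score order.
    static List<Map.Entry<String, Double>> rank(Map<String, Double> inputJdi,
                                                Map<String, Map<String, Double>> stJdi) {
        List<Map.Entry<String, Double>> ranked = new ArrayList<>();
        for (Map.Entry<String, Map<String, Double>> e : stJdi.entrySet()) {
            ranked.add(new AbstractMap.SimpleEntry<>(e.getKey(), cosine(inputJdi, e.getValue())));
        }
        ranked.sort((x, y) -> Double.compare(y.getValue(), x.getValue()));
        return ranked;
    }

    public static void main(String[] args) {
        System.out.println(filterWords("The heart of the patient"));

        // Made-up JDI vectors, for illustration only.
        Map<String, Double> inputJdi = new HashMap<>();
        inputJdi.put("Cardiology", 0.8);
        inputJdi.put("Neurology", 0.1);

        Map<String, Double> dsyn = new HashMap<>();
        dsyn.put("Cardiology", 0.7);
        dsyn.put("Neurology", 0.2);
        Map<String, Double> orch = new HashMap<>();
        orch.put("Chemistry", 0.9);
        orch.put("Cardiology", 0.1);

        Map<String, Map<String, Double>> stJdi = new HashMap<>();
        stJdi.put("dsyn", dsyn);
        stJdi.put("orch", orch);

        for (Map.Entry<String, Double> e : rank(inputJdi, stJdi)) {
            System.out.println(e.getKey() + " " + e.getValue());
        }
    }
}
```

In this sketch the JDI of each ST document would be loaded once from pre-calculated data, while the JDI of the input would be computed per request, which mirrors the real-time versus pre-calculated split described above.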

SetUp

Follow the installation instructions to install the Text Categorization tools and run the sti program. Check the following items only if you do not use the provided script to install the Text Categorization tools.

TestRun

Input

Stri takes text as input:

Output

Stri calculates the average ST scores of the input text for both word counts and document counts and sends the top-ranked ST to the output. If the detail flag, -d, is used, the results include the rank and ST scores in the following format:

Rank    ST Scores    ST abbreviation    ST name
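As a rough illustration of the averaging and the detail output, the sketch below reads "average ST scores" as the mean of per-word ST scores, computed once per count type (word count or document count). That reading, the class name StOutputSketch, the tab delimiter, and the four-decimal score format are all assumptions; the actual field separator and precision of Stri's output are not specified here. The scores in main are made-up sample values.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

class StOutputSketch {
    // Averages per-word ST scores; an ST missing for a word contributes 0 for that word.
    static Map<String, Double> averageScores(List<Map<String, Double>> perWordScores) {
        Map<String, Double> averages = new HashMap<>();
        if (perWordScores.isEmpty()) {
            return averages;
        }
        for (Map<String, Double> wordScores : perWordScores) {
            for (Map.Entry<String, Double> e : wordScores.entrySet()) {
                averages.merge(e.getKey(), e.getValue(), Double::sum);
            }
        }
        int wordCount = perWordScores.size();
        averages.replaceAll((st, sum) -> sum / wordCount);
        return averages;
    }

    // Builds detail rows: rank, ST score, ST abbreviation, ST name (tab-separated by assumption).
    static List<String> formatDetail(Map<String, Double> scores, Map<String, String> stNames) {
        List<Map.Entry<String, Double>> sorted = new ArrayList<>(scores.entrySet());
        sorted.sort((a, b) -> Double.compare(b.getValue(), a.getValue()));
        List<String> lines = new ArrayList<>();
        int rank = 1;
        for (Map.Entry<String, Double> e : sorted) {
            String name = stNames.getOrDefault(e.getKey(), "");
            lines.add(String.format(Locale.ROOT, "%d\t%.4f\t%s\t%s",
                    rank++, e.getValue(), e.getKey(), name));
        }
        return lines;
    }

    public static void main(String[] args) {
        // Made-up per-word scores for one count type, for illustration only.
        Map<String, Double> word1 = new HashMap<>();
        word1.put("dsyn", 0.6);
        word1.put("orch", 0.2);
        Map<String, Double> word2 = new HashMap<>();
        word2.put("dsyn", 0.8);

        List<Map<String, Double>> perWord = new ArrayList<>();
        perWord.add(word1);
        perWord.add(word2);

        Map<String, String> names = new HashMap<>();
        names.put("dsyn", "Disease or Syndrome");
        names.put("orch", "Organic Chemical");

        for (String line : formatDetail(averageScores(perWord), names)) {
            System.out.println(line);
        }
    }
}
```

Calling averageScores once with word-count scores and once with document-count scores would give the two sets of averages; without the detail flag, only the first row's ST would be reported.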

Stri Options

Please refer to the design document.