Generate Inflectional Variants - Specifying Output Categories and Inflections
- Short Description: Generate inflectional variants, restricted to the specified output categories and output inflections. Each restriction is given as a bitwise-OR'ed value.
- Full Description:
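- Detail format for a fact (appended when -m is specified): |FACT|uninflected term|category|uninflected inflection|inflected term|category|inflected inflection|EUI|
- Detail format for a rule (appended when -m is specified): |RULE|uninflected term|matched pattern|category|uninflected inflection|replaced pattern|category|inflected inflection|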
The inflection operation can be qualified to restrict output category and inflection by specifying the category bit vector and the inflection bit vector.
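The category value is formed by OR'ing the bit values of the wanted categories (for example, 128 for noun; 2047 selects all categories).
The inflection value is formed by OR'ing the bit values of the wanted inflections (for example, 1 for base, 8 for plural, 512 for singular; 16777215 selects all inflections).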
In some domains, nouns are the vast majority of terms in the vocabulary. Furthermore, for terms which can be interpreted as either nouns or as some other categories, the noun sense is much more likely. Under these circumstances, one might want to restrict one's output to nouns and ignore the other senses of words. Further, many indexing vocabularies have nouns in their plural form rather than in their singular form, so one might want to restrict one's output to plural nouns.
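As a rough illustration of how such a restriction is composed, the sketch below ORs bit values into a single bit vector and tests a value against it. It uses only the values that appear in the examples on this page (noun = 128, base = 1, plural = 8, singular = 512). The helper names `combine` and `allows` are hypothetical and are not part of lvg.

```python
# Bit values as they appear in the examples on this page.
NOUN = 128        # category
BASE = 1          # inflection
PLURAL = 8        # inflection
SINGULAR = 512    # inflection

ALL_CATEGORIES = 2047         # equals 2**11 - 1, so all 11 low bits are set
ALL_INFLECTIONS = 16777215    # equals 2**24 - 1, so all 24 low bits are set


def combine(*values: int) -> int:
    """OR the given bit values into one bit vector (hypothetical helper)."""
    mask = 0
    for value in values:
        mask |= value
    return mask


def allows(mask: int, value: int) -> bool:
    """Return True if the bit vector `mask` includes `value`."""
    return (mask & value) != 0


# Restrict inflections to singular or plural: 512 | 8 == 520.
singular_or_plural = combine(SINGULAR, PLURAL)
```

With these values, a request for plural nouns corresponds to category 128 and inflection 8, which is the `-f:ici~128+8` form used in the examples.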
Users may input "all" instead of "2047" or "16777215" to represent all categories or all inflections, respectively. For instance, users may construct a command such as "-f:ici~128+all" to get all inflectional variants for all nouns (including all inflections).
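For scripts that assemble lvg command lines, the option string can be built mechanically. The following is a minimal sketch, assuming the `-f:ici~<categories>+<inflections>` form shown on this page; `build_ici_flow` is a hypothetical helper, not an lvg API. Both parts are always written out, with "all" standing in where no restriction is wanted.

```python
from typing import Union


def build_ici_flow(categories: Union[int, str], inflections: Union[int, str]) -> str:
    """Build an option string of the form -f:ici~<categories>+<inflections>.

    Each argument is either a positive integer bit vector or the keyword "all".
    This is a hypothetical convenience helper, not part of lvg itself.
    """
    def render(value: Union[int, str], name: str) -> str:
        if value == "all":
            return "all"
        # bool is a subclass of int, so exclude it explicitly.
        if isinstance(value, int) and not isinstance(value, bool) and value > 0:
            return str(value)
        raise ValueError(f"{name} must be a positive integer or 'all', got {value!r}")

    return f"-f:ici~{render(categories, 'categories')}+{render(inflections, 'inflections')}"


print(build_ici_flow(128, "all"))   # all inflections of nouns
print(build_ici_flow(128, 8))       # plural nouns only
```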
The results are sorted as in the inflection flow component: by frequency of category, then by length, and then in case-insensitive alphabetical order.
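The three-level ordering can be pictured as a composite sort key. The sketch below is only an illustration: the category ranking in `CATEGORY_RANK` is a made-up assumption (the actual frequency order is not given here), the records are toy data, and it assumes that more frequent categories come first.

```python
# Hypothetical ranking: a lower number means a more frequent category.
CATEGORY_RANK = {"noun": 0, "adj": 1, "verb": 2}


def sort_key(record):
    """Key for (term, category): category rank, then length, then lowercase term."""
    term, category = record
    rank = CATEGORY_RANK.get(category, len(CATEGORY_RANK))  # unknown categories go last
    return (rank, len(term), term.lower())


# Toy records, not real lvg output.
records = [
    ("Lefts", "noun"),
    ("left", "verb"),
    ("left", "noun"),
    ("leave", "verb"),
    ("left", "adj"),
]
sorted_records = sorted(records, key=sort_key)
for term, category in sorted_records:
    print(term, category)
```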
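If the -m flag is specified, one of two types of detail information may be appended to each output: a FACT record for variants found in the inflection table, or a RULE record for variants produced by morphology rules. Both formats are listed at the top of the Full Description.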
- Please refer to inflections.
- In the Java version, both the category and the inflection options must be specified; neither can be omitted. In other words, the keyword "all" is needed if users don't care about one of the options. For example, to get the noun variants of an input regardless of inflection, use -f:ici~128+all (equivalent to -f:ici~128+16777215).
- EUI information was added to the -m output (fact) in 2012.
- Fact: Find all inflectional variants from inflection table.
- Rules: Find all inflectional variants from morphology rules.
- Assign category and inflection for all outputs
- Filter output according to the restricted categories and inflections.
- Filter output according to the restriction flag (-ki)
- Display output by the frequency of categories.
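The steps above can be sketched roughly as follows. This is a toy model, not the lvg implementation: `FACT_TABLE` holds a few rows copied from the example output on this page, `toy_rules` is a hypothetical stand-in for the morphology rules, consulting rules only when the table has no entry is an assumption, and the -ki restriction and category-frequency ordering are left out.

```python
from typing import List, NamedTuple


class Variant(NamedTuple):
    term: str
    category: int
    inflection: int
    source: str  # "FACT" or "RULE"


NOUN, BASE, PLURAL, SINGULAR = 128, 1, 8, 512

# Toy inflection table; rows mirror the example output on this page.
FACT_TABLE = {
    "neoplasm": [
        Variant("neoplasm", NOUN, BASE, "FACT"),
        Variant("neoplasm", NOUN, SINGULAR, "FACT"),
        Variant("neoplasms", NOUN, PLURAL, "FACT"),
    ],
    "leaf": [
        Variant("leafs", NOUN, PLURAL, "FACT"),
        Variant("leaves", NOUN, PLURAL, "FACT"),
    ],
}


def toy_rules(term: str) -> List[Variant]:
    """Hypothetical rule step: treat the term as a noun and append 's'."""
    return [Variant(term, NOUN, BASE, "RULE"), Variant(term + "s", NOUN, PLURAL, "RULE")]


def inflect(term: str, category_mask: int, inflection_mask: int) -> List[Variant]:
    # Steps 1-3: collect candidates that already carry category and inflection.
    candidates = FACT_TABLE.get(term) or toy_rules(term)
    # Step 4: keep only variants allowed by both bit vectors.
    kept = [
        v for v in candidates
        if (v.category & category_mask) and (v.inflection & inflection_mask)
    ]
    # Final step, simplified: order by length, then case-insensitively.
    return sorted(kept, key=lambda v: (len(v.term), v.term.lower()))


for variant in inflect("neoplasm", 128, 8):
    print(variant.term, variant.category, variant.inflection, variant.source)
```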
shell> lvg -f:ici~128+8 -m
elderly
elderly|elderly|128|8|ici|1|FACT|elderly|noun|base|elderly|noun|plural|E0024667|
elderly|elderlies|128|8|ici|1|FACT|elderly|noun|base|elderlies|noun|plural|E0024667|
leaf
leaf|leafs|128|8|ici|1|FACT|leaf|noun|base|leafs|noun|plural|E0037070|
leaf|leaves|128|8|ici|1|FACT|leaf|noun|base|leaves|noun|plural|E0037070|
neoplasm
neoplasm|neoplasms|128|8|ici|1|FACT|neoplasm|noun|base|neoplasms|noun|plural|E0042193|

shell> lvg -f:ici~128+all -m
neoplasm
neoplasm|neoplasm|128|1|ici|1|FACT|neoplasm|noun|base|neoplasm|noun|base|E0042193|
neoplasm|neoplasm|128|512|ici|1|FACT|neoplasm|noun|base|neoplasm|noun|singular|E0042193|
neoplasm|neoplasms|128|8|ici|1|FACT|neoplasm|noun|base|neoplasms|noun|plural|E0042193|

shell> lvg -f:ici~all+8 -m
neoplasm
neoplasm|neoplasms|128|8|ici|1|FACT|neoplasm|noun|base|neoplasms|noun|plural|E0042193|
left
left|left|128|8|ici|1|FACT|left|noun|base|left|noun|plural|E0037124|
left|lefts|128|8|ici|1|FACT|left|noun|base|lefts|noun|plural|E0037124|
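For downstream processing, an output line produced with -m can be split on "|" and mapped to named fields. The sketch below assumes the first six fields are input term, output term, category, inflection, flow name, and flow number, as the example lines suggest. The dictionary keys are labels chosen for this illustration, and `parse_line` is a hypothetical helper, not part of lvg.

```python
from typing import Dict

# Labels are illustrative; they are not official lvg field names.
STANDARD_FIELDS = ["input", "output", "category", "inflection", "flow", "flow_number"]
DETAIL_FIELDS = {
    "FACT": ["uninflected_term", "uninflected_category", "uninflected_inflection",
             "inflected_term", "inflected_category", "inflected_inflection", "eui"],
    "RULE": ["uninflected_term", "matched_pattern", "uninflected_category",
             "uninflected_inflection", "replaced_pattern", "inflected_category",
             "inflected_inflection"],
}


def parse_line(line: str) -> Dict[str, str]:
    """Split one output line into named fields, including -m detail if present."""
    parts = line.strip().split("|")
    if parts[-1] == "":
        parts.pop()  # drop the empty piece after the trailing '|'
    if len(parts) < len(STANDARD_FIELDS):
        raise ValueError(f"too few fields: {line!r}")
    record = dict(zip(STANDARD_FIELDS, parts[:6]))
    if len(parts) > 6:
        kind, detail = parts[6], parts[7:]
        names = DETAIL_FIELDS.get(kind)
        if names is None or len(detail) != len(names):
            raise ValueError(f"unexpected detail section: {line!r}")
        record["kind"] = kind
        record.update(zip(names, detail))
    return record


sample = "leaf|leaves|128|8|ici|1|FACT|leaf|noun|base|leaves|noun|plural|E0037070|"
record = parse_line(sample)
print(record["output"], record["inflected_inflection"], record["eui"])
```

All values are kept as strings; convert the category and inflection fields with `int(...)` before applying bit tests.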
- Call ToInflection.InflectWords (sorted)
- Filter out results by output categories and inflections.