Term Synonyms

Description

Produces a list of synonyms for the terms in the given term list. Synonyms are assumed to be modeled as anonymous sets, where all terms attached to the same set are synonyms of each other:

  • (synset1, synonym, mb)
  • (synset1, synonym, megabyte)
  • (synset2, synonym, gb)
  • (synset2, synonym, gigabyte)
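
The set-based model above can be illustrated with a minimal Python sketch. It is not the component's implementation: the in-memory TRIPLES list stands in for the dataset, and the helper names build_synsets and synonyms_of are hypothetical.

```python
from collections import defaultdict

# Hypothetical in-memory stand-in for the dataset: (subject, property, value) triples.
TRIPLES = [
    ("synset1", "synonym", "mb"),
    ("synset1", "synonym", "megabyte"),
    ("synset2", "synonym", "gb"),
    ("synset2", "synonym", "gigabyte"),
]


def build_synsets(triples, prop="synonym"):
    """Group terms by the anonymous set they are attached to."""
    synsets = defaultdict(set)
    for subject, predicate, term in triples:
        if predicate == prop:
            synsets[subject].add(term)
    return dict(synsets)


def synonyms_of(term, synsets):
    """Return every other term that shares a set with `term`."""
    found = set()
    for members in synsets.values():
        if term in members:
            found |= members - {term}
    return found


synsets = build_synsets(TRIPLES)
print(synonyms_of("mb", synsets))  # {'megabyte'}
```

Because the sets are anonymous, the identifiers synset1 and synset2 never appear in the result; they only serve to group the terms.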

Input

  • DS [OBJ]: the dataset where synonym relations are defined.
  • SOURCE [STRING]: a list of strings (single-token strings are expected).

Output

  • PAIRS [STRING,STRING]: pairs of an original term and one of its synonyms. Original terms without a defined synonym do not appear here.
  • EXPANDED [STRING]: the original terms (always present) plus any synonyms found.
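
The difference between the two outputs can be sketched as follows. This is an illustration under assumptions, not the component's code: the SYNSETS dictionary and the function term_synonyms are hypothetical, and the ordering and de-duplication of EXPANDED shown here are choices made for the example only.

```python
# Hypothetical pre-grouped synonym sets, keyed by their anonymous set identifier.
SYNSETS = {
    "synset1": {"mb", "megabyte"},
    "synset2": {"gb", "gigabyte"},
}


def term_synonyms(source, synsets):
    """Return (pairs, expanded) for the given source terms."""
    pairs = []               # (original term, synonym); only terms with synonyms appear
    expanded = list(source)  # original terms are always kept
    for term in source:
        for members in synsets.values():
            if term in members:
                # sorted() only makes the example deterministic
                for synonym in sorted(members - {term}):
                    pairs.append((term, synonym))
                    if synonym not in expanded:
                        expanded.append(synonym)
    return pairs, expanded


pairs, expanded = term_synonyms(["mb", "disk"], SYNSETS)
print(pairs)     # [('mb', 'megabyte')]
print(expanded)  # ['mb', 'disk', 'megabyte']
```

Here "disk" has no synonym defined, so it is absent from PAIRS but still present in EXPANDED.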

Parameters

  • Property: the string property that identifies synonyms. The default is synonym.
  • Use sub-properties: when set to true, the values of all sub-properties are also included. Sub-properties can be defined in the data with the rdfs:subPropertyOf relation.
  • Language: when a language is selected, only the strings in this language are extracted. This uses the language tags that are defined in the data.
  • Stemming: tokens can be stemmed for a specific language or left as they are.
  • Case-sensitive: if set to false, upper/lower case is ignored.
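
As a rough illustration of how the Use sub-properties and Case-sensitive parameters might affect matching, consider the sketch below. Everything in it is an assumption made for the example: the property names abbreviation and acronym, the helper names, and the choice to follow rdfs:subPropertyOf transitively. The optional stem argument is a placeholder for whichever stemmer is selected; no particular stemming library is implied.

```python
# Hypothetical rdfs:subPropertyOf statements as (sub-property, parent property) pairs.
SUB_PROPERTY_OF = [
    ("abbreviation", "synonym"),
    ("acronym", "abbreviation"),
]


def sub_properties(prop, sub_property_of):
    """Return `prop` plus every property declared, directly or indirectly, as its sub-property."""
    result = {prop}
    frontier = [prop]
    while frontier:
        parent = frontier.pop()
        for child, child_parent in sub_property_of:
            if child_parent == parent and child not in result:
                result.add(child)
                frontier.append(child)
    return result


def normalize(term, case_sensitive=True, stem=None):
    """Prepare a term for comparison according to the chosen parameters."""
    if not case_sensitive:
        term = term.casefold()  # ignore upper/lower case
    if stem is not None:
        term = stem(term)       # caller-supplied stemming function (placeholder)
    return term


print(sorted(sub_properties("synonym", SUB_PROPERTY_OF)))  # ['abbreviation', 'acronym', 'synonym']
print(normalize("MB", case_sensitive=False))               # mb
```

With Case-sensitive set to false, a source term such as "MB" would be compared as "mb" and could therefore match the synonym set from the description.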