Term List

Description

Provides a list of terms (string tokens) as a data source.

Output

  • TERM [STRING]: a list of strings

Parameters

  • Value: a string to provide in output to this block, possibly after tokenization
  • Tokenization: the method to tokenize the input strings.
    • Spaces: all valid Unicode space characters
    • Spaces/Punctuation: Spaces + all valid Unicode punctuation characters
    • Spaces/Punctuation/Digits: Spaces/Punctuation + all valid Unicode digit characters
    • Spaces/Punctuation/Digits/Symbols: Spaces/Punctuation/Digits + all valid Unicode symbol characters
    • Custom Regular Expression: any regular expression
  • Min token length: tokens whose character length is shorter than this value are discarded