MediaWiki:Cirrussearch-morelikethis-settings

#

# This message lets you configure the settings of the "more like this" feature.
# Changes to this take effect immediately.
# The syntax is as follows:
#   * Everything from a "#" character to the end of the line is a comment.
#   * Every non-blank line is the setting name followed by a ":" character followed by the setting value
# The settings are:
#   * min_doc_freq (integer): Minimum number of documents (per shard) that need a term for it to be considered.
#   * max_doc_freq (integer): Maximum number of documents (per shard) that have a term for it to be considered.
#                   High frequency terms are generally "stop words".
#   * max_query_terms (integer): Maximum number of terms to be considered. This value is limited to $wgCirrusSearchMoreLikeThisMaxQueryTermsLimit (100).
#   * min_term_freq (integer): Minimum number of times the term appears in the input to doc to be considered. For small fields (title) this value should be 1.
#   * minimum_should_match (percentage -100% to 100%, or integer number of terms): The percentage of terms to match on. Defaults to 30%.
#   * min_word_len (integer): Minimal length of a term to be considered. Defaults to 0.
#   * max_word_len (integer): The maximum word length above which words will be ignored. Defaults to unbounded (0).
#   * fields (comma separated list of values): These are the fields to use. Allowed fields are title, text, auxiliary_text, opening_text, headings.
# Examples of good lines:
# min_doc_freq:2
# max_doc_freq:20000
# max_query_terms:25
# min_term_freq:2
# minimum_should_match:30%
# min_word_len:2
# max_word_len:40
# fields:text,opening_text
#