MediaWiki:Cirrussearch-morelikethis-settings
#
# This message lets you configure the settings of the "more like this" feature.
# Changes to this take effect immediately.
# The syntax is as follows:
#   * Everything from a "#" character to the end of the line is a comment.
#   * Every non-blank line is the setting name followed by a ":" character followed by the setting value.
# The settings are:
#   * min_doc_freq (integer): Minimum number of documents (per shard) that need a term for it to be considered.
#   * max_doc_freq (integer): Maximum number of documents (per shard) that have a term for it to be considered.
#     High frequency terms are generally "stop words".
#   * max_query_terms (integer): Maximum number of terms to be considered. This value is limited to $wgCirrusSearchMoreLikeThisMaxQueryTermsLimit (100).
#   * min_term_freq (integer): Minimum number of times the term must appear in the input document for it to be considered. For small fields (title) this value should be 1.
#   * minimum_should_match (percentage -100% to 100%, or integer number of terms): The percentage of terms to match on. Defaults to 30%.
#   * min_word_len (integer): Minimum length of a term for it to be considered. Defaults to 0.
#   * max_word_len (integer): The maximum word length above which words will be ignored. Defaults to unbounded (0).
#   * fields (comma separated list of values): These are the fields to use. Allowed fields are title, text, auxiliary_text, opening_text, headings.
# Examples of good lines:
# min_doc_freq:2
# max_doc_freq:20000
# max_query_terms:25
# min_term_freq:2
# minimum_should_match:30%
# min_word_len:2
# max_word_len:40
# fields:text,opening_text
#
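The syntax rules described in the message can be illustrated with a small parser. The following Python sketch is not the code CirrusSearch itself uses; it only mirrors the rules stated above. The function name `parse_settings`, the constant names, and the choice to silently skip malformed or unknown lines are assumptions made for illustration. The limit of 100 for max_query_terms is the value quoted in the message text.

```python
import re

# Names and limits taken from the message text above.
ALLOWED_FIELDS = {"title", "text", "auxiliary_text", "opening_text", "headings"}
INTEGER_SETTINGS = {
    "min_doc_freq", "max_doc_freq", "max_query_terms",
    "min_term_freq", "min_word_len", "max_word_len",
}
MAX_QUERY_TERMS_LIMIT = 100  # quoted limit for max_query_terms


def parse_settings(text):
    """Parse "name:value" lines into a dict (hypothetical helper).

    Assumption: malformed lines and unknown setting names are skipped.
    """
    settings = {}
    for raw_line in text.splitlines():
        # Everything from "#" to the end of the line is a comment.
        line = raw_line.split("#", 1)[0].strip()
        if not line:
            continue
        name, sep, value = line.partition(":")
        name, value = name.strip(), value.strip()
        if not sep or not value:
            continue
        if name in INTEGER_SETTINGS:
            if re.fullmatch(r"[0-9]+", value) is None:
                continue
            number = int(value)
            if name == "max_query_terms":
                # Cap the value at the documented limit.
                number = min(number, MAX_QUERY_TERMS_LIMIT)
            settings[name] = number
        elif name == "minimum_should_match":
            # Either a percentage from -100% to 100% or an integer term count.
            match = re.fullmatch(r"(-?[0-9]+)(%?)", value)
            if match is None:
                continue
            if match.group(2) and not -100 <= int(match.group(1)) <= 100:
                continue
            settings[name] = value
        elif name == "fields":
            fields = [f.strip() for f in value.split(",")]
            fields = [f for f in fields if f in ALLOWED_FIELDS]
            if fields:
                settings[name] = fields
    return settings


example = """\
# a comment-only line is ignored
min_doc_freq:2
max_query_terms:250  # above the limit, so it is capped
minimum_should_match:30%
fields:text,opening_text,not_a_field
"""
print(parse_settings(example))
```

In this sketch the over-limit max_query_terms value is capped rather than rejected, and the unrecognized field name is dropped from the list; both behaviors are design choices of the example, not documented behavior of the extension.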