Solr: One-word query does not match three-word indexed value
One of my documents has the title attribute value "poésie pour pouvoir". When I query q=title:poesie, no results are found. q=title:poesie pour finds the document, though.

title is of type text. Excerpt from schema.xml:
<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.ASCIIFoldingFilterFactory"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
    <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
    <filter class="solr.SnowballPorterFilterFactory" language="German2"/>
    <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
    <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0" splitOnCaseChange="1"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
    <filter class="solr.SnowballPorterFilterFactory" language="German2"/>
    <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
  </analyzer>
</fieldType>
The second query isn't searching only the title field: the term "pour" is searched in the default search field. The query is parsed as "title:poesie default_field:pour", and the second clause is what generates the hit.
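To keep every term on the title field, the standard query syntax allows grouping the terms in parentheses or quoting them as a phrase. Here is a minimal Python sketch of building such query strings. The helper name `field_query` is hypothetical, and the sketch does not escape special query characters.

```python
def field_query(field, terms, phrase=False):
    """Build a query string that scopes every term to `field`.

    Hypothetical helper for illustration; it does not escape
    special query characters in the terms.
    """
    joined = " ".join(terms)
    if phrase:
        # Phrase query: terms must appear adjacent and in order.
        return f'{field}:"{joined}"'
    # Field grouping: each term is searched in `field`.
    return f"{field}:({joined})"


print(field_query("title", ["poesie", "pour"]))
print(field_query("title", ["poesie", "pour"], phrase=True))
```

With either form, "pour" no longer falls through to the default search field, so a hit or miss reflects the title field only.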
You can use the debugQuery parameter to see how the query is being parsed. Use the Analysis page in the Solr admin UI to see why the title value doesn't match (enter "poésie pour pouvoir" as the indexed value and "poesie" as the query value).
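As a rough sketch of the debugQuery suggestion, the following Python snippet builds a request URL with debugging enabled. The base URL `http://localhost:8983/solr/select` and the helper name `debug_query_url` are assumptions, so adjust them to your installation.

```python
from urllib.parse import urlencode


def debug_query_url(base_url, q):
    """Return a select URL that asks Solr to include debug output.

    `base_url` is an assumed endpoint; the helper name is hypothetical.
    """
    params = {"q": q, "debugQuery": "true", "wt": "json"}
    return f"{base_url}?{urlencode(params)}"


url = debug_query_url("http://localhost:8983/solr/select", "title:poesie pour")
print(url)
```

Opening that URL should return the normal results plus a debug section. Its parsed query entry shows which field each term was assigned to.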