Do all-stopword queries matter?

Many search engines don’t index “stopwords”, words that are very common and have little meaning by themselves. The stopword list is often just the most frequent words in the language: “the”, “be” (and its inflections), “a”, “of”, and so on.

Search engines that index all words like to show off searches for “to be or not to be”, because stopword elimination can remove every word in the phrase. Of course, no one really searches for “to be or not to be” because we all know where it came from.
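
To make that concrete, here is a minimal sketch of stopword elimination wiping out a whole query. The stopword list and the function names (`remove_stopwords`, `is_all_stopwords`) are made up for illustration; a real engine would use a much longer list and a proper tokenizer.

```python
# Hypothetical, deliberately tiny stopword list (an assumption for this
# example, not the list any particular search engine uses).
STOPWORDS = {"the", "be", "is", "are", "was", "a", "an",
             "of", "to", "or", "not", "and", "in"}


def remove_stopwords(query):
    """Return the tokens of `query` that survive stopword elimination."""
    # Naive tokenization: lowercase and split on whitespace.
    tokens = query.lower().split()
    return [t for t in tokens if t not in STOPWORDS]


def is_all_stopwords(query):
    """True if the query has at least one token and every token is a stopword."""
    tokens = query.lower().split()
    return bool(tokens) and all(t in STOPWORDS for t in tokens)


if __name__ == "__main__":
    for q in ["to be or not to be", "the art of war"]:
        print(repr(q), remove_stopwords(q), is_all_stopwords(q))
```

With this list, "to be or not to be" comes back as an empty token list, so an engine that drops stopwords at index or query time has nothing left to match against.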

Are there any real titles that are all stopwords? Does this matter? I’ve been indexing movie titles, and found more than a few that are 100% stopwords.

The last one isn’t a traditional stopword, but think about the number of “click here” links on the web. It is a web stopword, for sure.

5 thoughts on “Do all-stopword queries matter?”

  1. There was a time when the WordPress search could not handle “The Who”, though there were hundreds of mentions on various sites. Let’s not even go into !!! (an indie band).
