[annotator-dev] default analyzer for tags field

Gergely, Ujvari ujvari at hypothes.is
Tue Jul 30 23:33:28 UTC 2013


I have a theoretical question about how the tag index should work.

The |tags| field is defined like this in annotation.py:

|'tags': {'type': 'string', 'index_name': 'tag'}|

But no analyzer was set up for this field, so ES uses its own default
analyzer, which ignores common stopwords in searches, for example:

|"a", "an", "and", "are", "as", "at", "be", "but", "by",
  "for", "if", "in", "into", "is", "it",
  "no", "not", "of", "on", "or", "such",
  "that", "the", "their", "then", "there", "these",
  "they", "this", "to", "was", "will", "with"|

This means that searching for any of these stopwords does not return
any search results.
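
To illustrate the effect, here is a rough sketch in plain Python. It is
not the ES analyzer itself: the |analyze| function is a hypothetical
stand-in that assumes the default behaviour amounts to lowercasing,
splitting on non-word characters and dropping the stopwords listed
above.

```python
import re

# The stopwords quoted in the list above.
STOPWORDS = {
    "a", "an", "and", "are", "as", "at", "be", "but", "by",
    "for", "if", "in", "into", "is", "it",
    "no", "not", "of", "on", "or", "such",
    "that", "the", "their", "then", "there", "these",
    "they", "this", "to", "was", "will", "with",
}


def analyze(text):
    """Hypothetical stand-in for the default analyzer (an assumption,
    not the real ES code): lowercase, tokenize, drop stopwords."""
    tokens = re.findall(r"\w+", text.lower())
    return [token for token in tokens if token not in STOPWORDS]


# A tag made only of stopwords leaves nothing to index or search for.
print(analyze("the"))
# A mixed tag keeps only its non-stopword tokens.
print(analyze("The Moon"))
```

Under these assumptions a tag such as |the| produces no tokens at all,
so there is nothing in the index for a search to match.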

My question: is this an intentional decision, meant to discourage
trivial tags? If so, wouldn't it make sense to prevent users from
creating such tags in the first place, since they can't be searched for?
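
If it is not intentional, one possible alternative would be to index
tags without analysis. The sketch below is only an assumption about how
that might look: it adds the ES string-field option
|'index': 'not_analyzed'| to the existing entry, and the name
|TAGS_MAPPING| is made up for illustration.

```python
# Hypothetical variant of the existing mapping entry; only the
# 'index' key is new, and it is an untested suggestion.
TAGS_MAPPING = {
    'tags': {
        'type': 'string',
        'index_name': 'tag',
        # Store each tag as a single exact term, so no stopword
        # filtering (and no lowercasing) is applied to it.
        'index': 'not_analyzed',
    }
}
```

The trade-off is that matching would become exact, so a search for
|moon| would presumably no longer find a tag stored as |The Moon|.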

