text analyzer stemmer filter language support #41998
Replies: 3 comments 6 replies
-
|
Wait @aoiasd to comment. |
Beta Was this translation helpful? Give feedback.
-
|
I just discussed with @aoiasd. So far, we don't have the stemming filter for Czech. The milvus analyzer is mainly powered by Tantivy, and the snowball project provides stemmers for tantivy. Czech is not in the list of the stemmers of snowball: https://github.com/snowballstem/snowball/blob/master/libstemmer/modules.txt |
Beta Was this translation helpful? Give feedback.
-
|
hi @yhmo, the czech stemmer got merged in snowball recently snowballstem/snowball#151 |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
are there any plans to include more languages for the stemming filter step in text analyzer? It would greatly improve keyword/hybrid search in my local language (czech). Will it eventually be possible to add custom stemmers?
The roadmap has the following section for CY25:
Analyzer Enhancement
Enhance Analyzer with expanded tokenizer support and improved observability
Perhaps it will be included with this feature?
Beta Was this translation helpful? Give feedback.
All reactions