Package org.apache.lucene.analysis.hunspell
package org.apache.lucene.analysis.hunspell
A Java implementation of Hunspell stemming and
spell-checking algorithms (
Hunspell
), and a stemming
TokenFilter (HunspellStemFilter
) based on it.
For dictionaries, see e.g. LibreOffice repository or Titus Wormer's collection (UTF)
-
ClassDescriptionAn object representing the analysis result of a simple (non-compound) wordAn object representing a prefix or a suffix applied to a word stemAn object representing homonym dictionary entries.An object representing *.dic file entry with its word, flags and morphological data.In-memory structure for the dictionary (.dic) and affix (.aff) data of a hunspell dictionary.Suggestion to add/edit dictionary entries to generate a given list of words created by
WordFormGenerator.compress(java.util.List<java.lang.String>, java.util.Set<java.lang.String>, java.lang.Runnable)
.An oracle for quickly checking that a specific part of a word can never be a valid word.A spell checker based on Hunspell dictionaries.TokenFilter that uses hunspell affix rules and words to stem tokens.TokenFilterFactory that creates instances ofHunspellStemFilter
.AFragmentChecker
based on all character n-grams possible in a certain language, keeping them in a relatively memory-efficient, but probabilistic data structure.A callback for n-gram ranges in wordsThe strategy defining how a Hunspell dictionary should be loaded, with different tradeoffs.A generator for misspelled word corrections based on Hunspell flags.An exception thrown whenHunspell.suggest(java.lang.String)
call takes too long, ifTimeoutPolicy.THROW_EXCEPTION
is used.A strategy determining what to do when Hunspell API calls take too much timeA utility class used for generating possible word forms by adding affixes to stems (WordFormGenerator.getAllWordForms(String, String, Runnable)
), and suggesting stems and flags to generate the given set of words (WordFormGenerator.compress(List, Set, Runnable)
).