Class Token


  • public class Token
    extends Object
    Analyzed token with morphological data from its dictionary.
    • Constructor Detail

    • Method Detail

      • getSurfaceForm

        public char[] getSurfaceForm()
        Returns:
        surfaceForm
      • getOffset

        public int getOffset()
        Returns:
        offset into surfaceForm
      • getLength

        public int getLength()
        Returns:
        length of surfaceForm
      • getSurfaceFormString

        public String getSurfaceFormString()
        Returns:
        surfaceForm as a String
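
        The three accessors above suggest that the surface form lives in a shared character buffer and that the offset and length delimit this token inside it. A minimal sketch of that relationship, assuming getSurfaceFormString() is equivalent to slicing the buffer; the class SurfaceFormSketch and the helper surfaceFormString are hypothetical names and are not part of this API:

```java
public class SurfaceFormSketch {
    // Hypothetical helper: builds the token text from a shared buffer,
    // assuming offset and length delimit the token inside that buffer.
    public static String surfaceFormString(char[] surfaceForm, int offset, int length) {
        return new String(surfaceForm, offset, length);
    }

    public static void main(String[] args) {
        // Romanized placeholder text standing in for the analyzed input.
        char[] buffer = "sushigatabetai".toCharArray();
        System.out.println(surfaceFormString(buffer, 0, 5)); // sushi
        System.out.println(surfaceFormString(buffer, 5, 2)); // ga
    }
}
```

        Under this assumption the array returned by getSurfaceForm() may be longer than the token itself, so callers that read it directly would need getOffset() and getLength() as well.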
      • getReading

        public String getReading()
        Returns:
        reading, or null if the token has no reading
      • getPronunciation

        public String getPronunciation()
        Returns:
        pronunciation, or null if the token has no pronunciation
      • getPartOfSpeech

        public String getPartOfSpeech()
        Returns:
        part of speech.
      • getInflectionType

        public String getInflectionType()
        Returns:
        inflection type or null
      • getInflectionForm

        public String getInflectionForm()
        Returns:
        inflection form or null
      • getBaseForm

        public String getBaseForm()
        Returns:
        base form or null if token is not inflected
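
        Several of the accessors above may return null, so callers usually need a fallback. A minimal sketch of one common choice, using the surface form when no base form is available; the class BaseFormSketch and the helper lemma are hypothetical and are not part of this API:

```java
public class BaseFormSketch {
    // Hypothetical helper: prefers the base form and falls back to the
    // surface form when the base form is null (token is not inflected).
    public static String lemma(String surfaceForm, String baseForm) {
        return baseForm != null ? baseForm : surfaceForm;
    }

    public static void main(String[] args) {
        // Inflected token: the base form is present and wins.
        System.out.println(lemma("tabe", "taberu")); // taberu
        // Uninflected token: the base form is null, so the surface form is used.
        System.out.println(lemma("sushi", null)); // sushi
    }
}
```

        With a real token the arguments would presumably be getSurfaceFormString() and getBaseForm(); the same pattern applies to getReading() and getPronunciation().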
      • getType

        public JapaneseTokenizer.Type getType()
        Returns the type of this token
        Returns:
        token type, not null
      • isKnown

        public boolean isKnown()
        Returns true if this token is a known word.
        Returns:
        true if this token is in the standard dictionary, false otherwise
      • isUnknown

        public boolean isUnknown()
        Returns true if this token is an unknown word.
        Returns:
        true if this token is an unknown word, false otherwise
      • isUser

        public boolean isUser()
        Returns true if this token is defined in the user dictionary.
        Returns:
        true if this token is in the user dictionary, false otherwise
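
        The three predicates above can be read as views of the value returned by getType(). A minimal sketch under that assumption; the nested enum is a stand-in for JapaneseTokenizer.Type with assumed constants KNOWN, UNKNOWN and USER, and the class TokenTypeSketch and the helper source are hypothetical:

```java
public class TokenTypeSketch {
    // Stand-in for JapaneseTokenizer.Type; the constant names are an assumption.
    public enum Type { KNOWN, UNKNOWN, USER }

    // Assumed semantics: each predicate matches exactly one type value.
    public static boolean isKnown(Type type) { return type == Type.KNOWN; }
    public static boolean isUnknown(Type type) { return type == Type.UNKNOWN; }
    public static boolean isUser(Type type) { return type == Type.USER; }

    // Hypothetical helper: describes where a token of the given type came from.
    public static String source(Type type) {
        if (isUser(type)) return "user dictionary";
        if (isKnown(type)) return "standard dictionary";
        return "unknown word";
    }

    public static void main(String[] args) {
        for (Type type : Type.values()) {
            System.out.println(type + " -> " + source(type));
        }
    }
}
```

        If the assumption holds, exactly one of the three predicates is true for any given token.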
      • getPosition

        public int getPosition()
        Gets the index of this token in the input text.
        Returns:
        position of token
      • setPositionLength

        public void setPositionLength(int positionLength)
        Sets the position length (in tokens) of this token. For normal tokens this is 1; for compound tokens it is greater than 1.
      • getPositionLength

        public int getPositionLength()
        Gets the position length (in tokens) of this token. For normal tokens this is 1; for compound tokens it is greater than 1.
        Returns:
        position length of token
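
        Position length is counted in tokens, so a compound token can be pictured as an arc that skips over the single tokens it covers. A minimal sketch of that arithmetic; the class PositionLengthSketch, the helper endNode and the startNode parameter are hypothetical, and startNode is a token-graph position assumed for illustration, not a value taken from getPosition():

```java
public class PositionLengthSketch {
    // Hypothetical helper: the graph node at which a token ends, given the
    // node where it starts and its position length (in tokens).
    public static int endNode(int startNode, int positionLength) {
        // Assumption of this sketch: a position length below 1 is invalid.
        if (positionLength < 1) {
            throw new IllegalArgumentException("positionLength must be >= 1");
        }
        return startNode + positionLength;
    }

    public static void main(String[] args) {
        // A compound token covering three single tokens, starting at node 0.
        System.out.println(endNode(0, 3)); // 3
        // A normal token advances by exactly one node.
        System.out.println(endNode(0, 1)); // 1
    }
}
```

        In this picture a compound token and the last of its parts end at the same node, which is what lets both readings coexist in one token graph.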