Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Expand
titleSample sentence : "My Friends Tigger & Pooh, 2007-2010. 내 친구 티거와 곰돌이 푸우"

my friends tigger & pooh
2007-2010. 내 친구 티거와 곰돌이 푸우

id

  • IdAnalyzer

    • CodeTokenizer

    • Code Block
      @Override
      protected boolean isTokenChar(int c) {
          return ! (Character.isWhitespace(c) || c == ',') ;
      }
Expand
titleSample sentence : "My Friends Tigger & Pooh, 2007-2010. 내 친구 티거와 곰돌이 푸우"

My
Friends
Tigger
&
Pooh
2007-2010.

친구
티거와
곰돌이
푸우

whitespace

  • WhitespaceAnalyzer : lucene-analyzer-common에 존재하는 analyzer로 공백이나 탭등을 기준으로 tokenize를 수행

    • WhitespaceTokenizer : 공백 문자로 텍스트를 분할

    • Code Block
      @Override
      protected boolean isTokenChar(int c) {
        return !Character.isWhitespace(c);
      }

...