|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectedu.udo.cs.wvtool.generic.tokenizer.NGramTokenizer
public class NGramTokenizer
Creates tokens by creating ngrams of the tokens received from an inner tokenizer.
Field Summary | |
---|---|
private java.util.List |
currentTokens
The token, which is currently provided. |
private TokenEnumeration |
input
|
private int |
n
|
private WVTTokenizer |
tokenizer
|
Constructor Summary | |
---|---|
NGramTokenizer(int n,
WVTTokenizer tokenizer)
|
Method Summary | |
---|---|
boolean |
hasMoreTokens()
Determine whether there are tokens left in the Enumeration. |
java.lang.String |
nextToken()
Return the next token from the stream. |
private void |
readNextToken()
Read a token from the character stream and store it into currentToken. |
TokenEnumeration |
tokenize(java.io.Reader source,
WVTDocumentInfo d)
Tokenize a character stream. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
private java.util.List currentTokens
private int n
private TokenEnumeration input
private WVTTokenizer tokenizer
Constructor Detail |
---|
public NGramTokenizer(int n, WVTTokenizer tokenizer)
Method Detail |
---|
public TokenEnumeration tokenize(java.io.Reader source, WVTDocumentInfo d) throws WVToolException
WVTTokenizer
tokenize
in interface WVTTokenizer
source
- the Reader
from which to get the character
streamd
- the WVTDocumentInfo
value, describing the
document being processed
TokenEnumeration
WVToolException
WVTTokenizer.tokenize(Reader,
WVTDocumentInfo)
private void readNextToken() throws WVToolException
WVToolException
public boolean hasMoreTokens()
TokenEnumeration
hasMoreTokens
in interface TokenEnumeration
boolean
valueTokenEnumeration.hasMoreTokens()
public java.lang.String nextToken() throws WVToolException
TokenEnumeration
nextToken
in interface TokenEnumeration
WVToolException
TokenEnumeration.nextToken()
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |