|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
ObjectStdTermFilter
public class StdTermFilter
Performs standard tokenization activities for terms, such as mapping to lowercase, removing apostrophes, etc.
| Nested Class Summary | |
|---|---|
private class |
StdTermFilter.DribbleStream
|
| Field Summary | |
|---|---|
private StdTermFilter.DribbleStream |
dribble
|
private TokenStream |
filter
|
private static String |
SAVE_WILD_QMARK
During tokenization, the '?' |
private static String |
SAVE_WILD_STAR
During tokenization, the '*' wildcard has to be changed to a word to keep it from being removed. |
| Constructor Summary | |
|---|---|
StdTermFilter()
Construct the rewriter |
|
| Method Summary | |
|---|---|
String |
filter(String term)
Apply the standard mapping to the given term. |
protected static String |
restoreWildcards(String s)
Restores wildcards saved by saveWildcards(String). |
protected static String |
saveWildcards(String s)
Converts wildcard characters into word-looking bits that would never occur in real text, so the standard tokenizer will keep them part of words. |
| Methods inherited from class Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
private StdTermFilter.DribbleStream dribble
private TokenStream filter
private static final String SAVE_WILD_STAR
private static final String SAVE_WILD_QMARK
| Constructor Detail |
|---|
public StdTermFilter()
| Method Detail |
|---|
public String filter(String term)
protected static String saveWildcards(String s)
restoreWildcards(String).
protected static String restoreWildcards(String s)
saveWildcards(String).
|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||