java.lang.Object
  pt.tumba.spell.DefaultWordFinder
    pt.tumba.spell.TeXWordFinder
public class TeXWordFinder extends DefaultWordFinder
A word finder for TeX and LaTeX documents. It searches the text for sequences of letters and ignores commands, environments, and math environments.
See Also: DefaultWordFinder
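The class description above can be illustrated with a small standalone sketch. It is not the TeXWordFinder implementation, whose source is not shown on this page. The class name `TeXWordSketch` and the method `words` are hypothetical, and the rules it applies (skip control sequences, skip `\begin{...}` and `\end{...}` names, skip `$...$` and `$$...$$` math) are assumptions about what "ignoring commands and environments" could mean in practice.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of pt.tumba.spell.
class TeXWordSketch {

    /** Returns the letter sequences in text, skipping TeX commands, environment names, and math. */
    static List<String> words(String text) {
        List<String> out = new ArrayList<>();
        int n = text.length();
        int i = 0;
        while (i < n) {
            char c = text.charAt(i);
            if (c == '\\') {
                i++; // step over the backslash
                if (i < n && Character.isLetter(text.charAt(i))) {
                    // Control word: backslash followed by letters.
                    int start = i;
                    while (i < n && Character.isLetter(text.charAt(i))) i++;
                    String name = text.substring(start, i);
                    // Assumption: the environment name after \begin or \end is not a word.
                    if ((name.equals("begin") || name.equals("end"))
                            && i < n && text.charAt(i) == '{') {
                        int close = text.indexOf('}', i);
                        i = (close < 0) ? n : close + 1;
                    }
                } else if (i < n) {
                    i++; // control symbol such as \$ or \%
                }
            } else if (c == '$') {
                // Math: skip to the matching $ or $$ (or to the end if unterminated).
                boolean display = i + 1 < n && text.charAt(i + 1) == '$';
                String delim = display ? "$$" : "$";
                int close = text.indexOf(delim, i + delim.length());
                i = (close < 0) ? n : close + delim.length();
            } else if (Character.isLetter(c)) {
                int start = i;
                while (i < n && Character.isLetter(text.charAt(i))) i++;
                out.add(text.substring(start, i));
            } else {
                i++; // punctuation, braces, whitespace
            }
        }
        return out;
    }
}
```

For the input `The \emph{quick} fox has $x^2$ legs`, this sketch returns the words The, quick, fox, has, legs. Arguments of ordinary commands are still treated as text here, which is a design choice of the sketch and may differ from the real class.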
Field Summary | |
---|---|
private boolean | IGNORE_COMMENTS: Boolean flag indicating whether TeX comments should be ignored. |
static int | REG_EXPR: Constant specifying that user-defined ignores are regular expressions. |
private int | regexUserDefinedIgnores: An integer specifying the type of expression to use. |
static int | STRING_EXPR: Constant specifying that user-defined ignores are plain strings. |
private java.util.Set | userDefinedIgnores: A Set of user-defined ignores. |
Fields inherited from class pt.tumba.spell.DefaultWordFinder |
---|
currentSegmentPos, currentWord, currentWordPos, nextSegmentPos, nextWord, nextWordPos, sentenceIterator, solveHardCases, startsSentence, text |
Constructor Summary | |
---|---|
TeXWordFinder() | Constructor for TeXWordFinder. |
TeXWordFinder(java.lang.String inText) | Constructor for TeXWordFinder. |
Method Summary | |
---|---|
void | addUserDefinedIgnores(java.util.Collection expressions, int regex): Imports a user-defined set of strings or regular expressions to ignore. |
java.lang.String | currentSegment(): Returns the current text segment from the input. |
private int | ignoreUserDefined(int i): User-defined ignore. |
java.lang.String | next(): Scans the text from the end of the last word and returns a String corresponding to the next word. |
void | setIgnoreComments(boolean ignore): Indicates whether TeX comments should be ignored. |
Methods inherited from class pt.tumba.spell.DefaultWordFinder |
---|
current, getText, hasNext, ignore, ignore, ignore, ignore, isWordChar, isWordChar, lookAhead, nextSegment, replace, replaceBigram, replaceSegment, setText, splitSegments, splitWords, startsSentence, toString |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
private boolean IGNORE_COMMENTS
private java.util.Set userDefinedIgnores
private int regexUserDefinedIgnores
public static final int STRING_EXPR
public static final int REG_EXPR
Constructor Detail |
---|
public TeXWordFinder(java.lang.String inText)
inText - A String with the input text to tokenize.
public TeXWordFinder()
Method Detail |
---|
public java.lang.String currentSegment()
Overrides: currentSegment in class DefaultWordFinder
public java.lang.String next()
Overrides: next in class DefaultWordFinder
public void addUserDefinedIgnores(java.util.Collection expressions, int regex)
expressions - a collection of Objects whose toString() value should be the expression. Typically String objects.
regex - an integer specifying the type of expression to use, e.g. REG_EXPR or STRING_EXPR.
private int ignoreUserDefined(int i)
i
-
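How user-defined ignores might be applied at a given text position can be sketched as follows. This is an illustration under stated assumptions, not the library's code: the class `UserIgnoreSketch`, the method `skipIgnored`, and the numeric values given to `STRING_EXPR` and `REG_EXPR` are all hypothetical, and the idea that the private `ignoreUserDefined(int i)` returns the index just past a matched ignore is a guess, since this page does not document its parameter or return value.

```java
import java.util.Collection;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of pt.tumba.spell.
class UserIgnoreSketch {
    // Assumed values; the real constants' values are not given in the documentation.
    static final int STRING_EXPR = 0;
    static final int REG_EXPR = 1;

    /**
     * Returns the index just past the first expression that matches at
     * position i, or i itself when nothing matches there.
     */
    static int skipIgnored(String text, int i, Collection<?> expressions, int mode) {
        for (Object o : expressions) {
            // As documented, each element's toString() value is the expression.
            String expr = o.toString();
            if (mode == REG_EXPR) {
                Matcher m = Pattern.compile(expr).matcher(text);
                m.region(i, text.length());
                // lookingAt() anchors the match at the start of the region.
                if (m.lookingAt() && m.end() > i) {
                    return m.end();
                }
            } else if (!expr.isEmpty() && text.startsWith(expr, i)) {
                return i + expr.length();
            }
        }
        return i;
    }
}
```

With the text `see \cite{knuth84} now`, the regular expression `\\cite\{[^}]*\}`, and `i = 4`, the sketch returns 18, the position right after the closing brace. Compiling each pattern on every call keeps the example short; caching compiled patterns would be the obvious refinement.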
public void setIgnoreComments(boolean ignore)
ignore - true if TeX comments should be ignored, false otherwise.
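What ignoring TeX comments could involve is sketched below. The sketch assumes the usual TeX convention that an unescaped `%` starts a comment running to the end of the line. The class `TeXCommentSketch` and the method `stripComments` are hypothetical names, and the real class may skip comments during scanning instead of removing them from the text.

```java
// Hypothetical illustration only; not part of pt.tumba.spell.
class TeXCommentSketch {

    /**
     * When ignore is true, removes each unescaped '%' and the rest of its
     * line (the newline itself is kept). When false, returns text unchanged.
     */
    static String stripComments(String text, boolean ignore) {
        if (!ignore) {
            return text;
        }
        StringBuilder out = new StringBuilder();
        boolean inComment = false;
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (inComment) {
                if (c == '\n') {
                    inComment = false;
                    out.append(c);
                }
                continue;
            }
            if (c == '\\' && i + 1 < text.length()) {
                // Copy an escaped character such as \% verbatim.
                out.append(c).append(text.charAt(i + 1));
                i++;
            } else if (c == '%') {
                inComment = true;
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }
}
```

Given the two-line input `50\% done % note` followed by `next`, the sketch keeps `50\% done ` and `next` and drops ` note`, because the first percent sign is escaped and the second is not.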