| 
 | |||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectpt.tumba.spell.DefaultWordFinder
pt.tumba.spell.TeXWordFinder
public class TeXWordFinder
A word finder for TeX and LaTeX documents, which searches text for sequences of letters, but ignores any commands and environments as well as Math environments.
DefaultWordFinder| Field Summary | |
|---|---|
| private  boolean | IGNORE_COMMENTSBoolean flag indicating if TeX comments should be ignored. | 
| static int | REG_EXPRConstant value specifying regular expressions on user defined ignores. | 
| private  int | regexUserDefinedIgnoresAn integer specifying the type of expression to use. | 
| static int | STRING_EXPRConstant value specifying strings on user defined ignores. | 
| private  java.util.Set | userDefinedIgnoresA Set of user defined ignores. | 
| Fields inherited from class pt.tumba.spell.DefaultWordFinder | 
|---|
| currentSegmentPos, currentWord, currentWordPos, nextSegmentPos, nextWord, nextWordPos, sentenceIterator, solveHardCases, startsSentence, text | 
| Constructor Summary | |
|---|---|
| TeXWordFinder()Constructor for TexWordFinder. | |
| TeXWordFinder(java.lang.String inText)Constructor for TeXWordFinder. | |
| Method Summary | |
|---|---|
|  void | addUserDefinedIgnores(java.util.Collection expressions,
                      int regex)This method is used to import a user defined set of either strings or regular expressions to ignore. | 
|  java.lang.String | currentSegment()Returns the current text segment from the input. | 
| private  int | ignoreUserDefined(int i)User defined ignore. | 
|  java.lang.String | next()This method scans the text from the end of the last word, and returns a String corresponding to the next word. | 
|  void | setIgnoreComments(boolean ignore)Allows one to indicate if TeX comments should be ignored. | 
| Methods inherited from class pt.tumba.spell.DefaultWordFinder | 
|---|
| current, getText, hasNext, ignore, ignore, ignore, ignore, isWordChar, isWordChar, lookAhead, nextSegment, replace, replaceBigram, replaceSegment, setText, splitSegments, splitWords, startsSentence, toString | 
| Methods inherited from class java.lang.Object | 
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait | 
| Field Detail | 
|---|
private boolean IGNORE_COMMENTS
private java.util.Set userDefinedIgnores
private int regexUserDefinedIgnores
public static final int STRING_EXPR
public static final int REG_EXPR
| Constructor Detail | 
|---|
public TeXWordFinder(java.lang.String inText)
inText - A String with the input text to tokenize.public TeXWordFinder()
| Method Detail | 
|---|
public java.lang.String currentSegment()
currentSegment in class DefaultWordFinderpublic java.lang.String next()
next in class DefaultWordFinder
public void addUserDefinedIgnores(java.util.Collection expressions,
                                  int regex)
expressions - a collection of of Objects whose toString() value
               should be the expression. Typically String objects.regex - is an integer specifying the type of expression to 
               use. e.g. REG_EXPR, STRING_EXPR.private int ignoreUserDefined(int i)
i - 
public void setIgnoreComments(boolean ignore)
ignore - true if TeX comments should be ignored and false otherwise.| 
 | |||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||