org.htmlparser.visitors
Class TextExtractingVisitor
java.lang.Object
org.htmlparser.visitors.NodeVisitor
org.htmlparser.visitors.TextExtractingVisitor
public class TextExtractingVisitor
- extends NodeVisitor
Extracts text from a web page.
Usage:
Parser parser = new Parser(...);
TextExtractingVisitor visitor = new TextExtractingVisitor();
parser.visitAllNodesWith(visitor);
String textInPage = visitor.getExtractedText();
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
TextExtractingVisitor
public TextExtractingVisitor()
getExtractedText
public java.lang.String getExtractedText()
visitStringNode
public void visitStringNode(Text stringNode)
- Overrides:
visitStringNode in class NodeVisitor
visitTag
public void visitTag(Tag tag)
- Overrides:
visitTag in class NodeVisitor
visitEndTag
public void visitEndTag(Tag tag)
- Overrides:
visitEndTag in class NodeVisitor