GrammarScope  4.0.0
Public Member Functions | Static Public Member Functions | Private Member Functions | Private Attributes | Static Private Attributes | List of all members
grammarscope.io.read.TreeReader Class Reference
Collaboration diagram for grammarscope.io.read.TreeReader:
Collaboration graph
[legend]

Public Member Functions

 TreeReader (final StringReader stringReader)
 
AParseTree readTree () throws IOException
 

Static Public Member Functions

static AParseTree make (final String inputString)
 
static void main (final String[] args)
 

Private Member Functions

AParseTree getTreeFromInputStream ()
 

Private Attributes

final Tokenizer< String > tokenizer
 
final TreeNormalizer treeNormalizer
 
MutableAParseTree currentTree
 
ArrayList< MutableAParseTreestack
 

Static Private Attributes

static final String leftParen = "("
 
static final String rightParen = ")"
 

Detailed Description

This class implements the TreeReader interface to read Penn Treebank-style files. The reader is implemented as a pushdown automaton (PDA) that parses the Lisp-style format in which the trees are stored. This reader is compatible with both PTB and PATB trees.

Author
Christopher Manning
Roger Levy
Spence Green
Bernard Bou for adaptation

Constructor & Destructor Documentation

◆ TreeReader()

grammarscope.io.read.TreeReader.TreeReader ( final StringReader  stringReader)

Constructor

Parameters
stringReaderstring reader

Member Function Documentation

◆ getTreeFromInputStream()

AParseTree grammarscope.io.read.TreeReader.getTreeFromInputStream ( )
private

◆ main()

static void grammarscope.io.read.TreeReader.main ( final String[]  args)
static

◆ make()

static AParseTree grammarscope.io.read.TreeReader.make ( final String  inputString)
static

This gives you a tree from a String representation (as a bracketed Tree, of the kind produced by toString(), pennPrint(), or as in the Penn Treebank. It's not the most efficient thing to do for heavy duty usage.

Parameters
inputStringinput string
Returns
parse tree artifact

◆ readTree()

AParseTree grammarscope.io.read.TreeReader.readTree ( ) throws IOException

Reads a single tree in standard Penn Treebank format from the input stream. The method supports additional parentheses around the tree (an unnamed ROOT node) so long as they are balanced. If the token stream ends before the current tree is complete, then the method will throw an IOException.

Note that the method will skip malformed trees and attempt to read additional trees from the input stream. It is possible, however, that a malformed tree will corrupt the token stream. In this case, an IOException will eventually be thrown.

Returns
A single tree, or null at end of token stream.
Exceptions
IOExceptionexception

Member Data Documentation

◆ currentTree

MutableAParseTree grammarscope.io.read.TreeReader.currentTree
private

◆ leftParen

final String grammarscope.io.read.TreeReader.leftParen = "("
staticprivate

◆ rightParen

final String grammarscope.io.read.TreeReader.rightParen = ")"
staticprivate

◆ stack

ArrayList<MutableAParseTree> grammarscope.io.read.TreeReader.stack
private

◆ tokenizer

final Tokenizer<String> grammarscope.io.read.TreeReader.tokenizer
private

◆ treeNormalizer

final TreeNormalizer grammarscope.io.read.TreeReader.treeNormalizer
private

The documentation for this class was generated from the following file: