edu.udo.cs.yale.example
Class SparseFormatDataRowReader

java.lang.Object
  extended by edu.udo.cs.yale.example.AbstractDataRowReader
      extended by edu.udo.cs.yale.example.SparseFormatDataRowReader
All Implemented Interfaces:
DataRowReader, java.util.Iterator<DataRow>

public class SparseFormatDataRowReader
extends AbstractDataRowReader

Reads the data rows in sparse format. The format is specified in the class comment of SparseFormatExampleSource. Attributes may be passed to the reader in its constructor. If they are ommitted, they are generated on the fly. In either case, indices are assigned to the attributes. If an AbstractExampleTable is generated using instances of this class, the constructor of AbstractExampleTable will reassign these indexes.

Version:
$Id: SparseFormatDataRowReader.java,v 2.20 2006/03/27 13:21:58 ingomierswa Exp $
Author:
Simon Fischer, Ingo Mierswa

Field Summary
private  AttributeSet attributeSet
          The attribute set with regular and special attributes.
private  DataRow currentDataRow
          The DataRow that will be returned in the next call to next()
private  int dimension
          The dimension of the examples, i.e. the total number of regular and special attributes.
private  boolean eof
          Remember if an end of file has occured.
private  int format
          One out of FORMAT_XY, FORMAT_YX, FORMAT_PREFIX, FORMAT_SEPARATE_FILE, and FORMAT_NO_LABEL.
static java.lang.String[] FORMAT_NAMES
          Names of the formats.
static int FORMAT_NO_LABEL
          Label is missing.
static int FORMAT_PREFIX
          Label has a prefix specified in the prefix map.
static int FORMAT_SEPARATE_FILE
          Label is in separate file.
static int FORMAT_XY
          Label succeeds attributes.
static int FORMAT_YX
          Label preceeds attributes.
private  java.io.BufferedReader inAttributes
          Reader for the labels.
private  java.io.BufferedReader inLabels
          Reader for the labels.
private  boolean lineRead
          Remember if a line has already been read.
private  int linesRead
          Number of lines already read.
private  int maxNumber
          The maximum number of attributes to read.
private  java.util.Map<java.lang.String,java.lang.String> prefixMap
          Maps prefixes to special attribute names, e.g.
 
Constructor Summary
SparseFormatDataRowReader(DataRowFactory factory, int format, java.util.Map<java.lang.String,java.lang.String> prefixMap, AttributeSet attributeSet, java.io.Reader attributeReader, java.io.Reader labelReader, int sampleSize)
          Creates a new data row reader for sparse format.
 
Method Summary
 boolean hasNext()
          Checks if futher examples exist.
 DataRow next()
          Returns the next Example.
private  boolean readLine()
           
 
Methods inherited from class edu.udo.cs.yale.example.AbstractDataRowReader
getFactory, remove
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

FORMAT_NAMES

public static final java.lang.String[] FORMAT_NAMES
Names of the formats.


FORMAT_XY

public static final int FORMAT_XY
Label succeeds attributes.

See Also:
Constant Field Values


FORMAT_YX

public static final int FORMAT_YX
Label preceeds attributes.

See Also:
Constant Field Values


FORMAT_PREFIX

public static final int FORMAT_PREFIX
Label has a prefix specified in the prefix map.

See Also:
Constant Field Values


FORMAT_SEPARATE_FILE

public static final int FORMAT_SEPARATE_FILE
Label is in separate file.

See Also:
Constant Field Values


FORMAT_NO_LABEL

public static final int FORMAT_NO_LABEL
Label is missing.

See Also:
Constant Field Values


inAttributes

private java.io.BufferedReader inAttributes
Reader for the labels.


inLabels

private java.io.BufferedReader inLabels
Reader for the labels.


attributeSet

private AttributeSet attributeSet
The attribute set with regular and special attributes.


eof

private boolean eof
Remember if an end of file has occured.


lineRead

private boolean lineRead
Remember if a line has already been read.


maxNumber

private int maxNumber
The maximum number of attributes to read.


linesRead

private int linesRead
Number of lines already read.


currentDataRow

private DataRow currentDataRow
The DataRow that will be returned in the next call to next()


format

private int format
One out of FORMAT_XY, FORMAT_YX, FORMAT_PREFIX, FORMAT_SEPARATE_FILE, and FORMAT_NO_LABEL.


dimension

private int dimension
The dimension of the examples, i.e. the total number of regular and special attributes.


prefixMap

private java.util.Map<java.lang.String,java.lang.String> prefixMap
Maps prefixes to special attribute names, e.g. "l:" to "label".

Constructor Detail

SparseFormatDataRowReader

public SparseFormatDataRowReader(DataRowFactory factory,
                                 int format,
                                 java.util.Map<java.lang.String,java.lang.String> prefixMap,
                                 AttributeSet attributeSet,
                                 java.io.Reader attributeReader,
                                 java.io.Reader labelReader,
                                 int sampleSize)
Creates a new data row reader for sparse format. The attributes indices must not be set. If they are, they are reassigned new values when this constructor is called!

Parameters:
factory - Factory used to create DataRow instances.
format - One Out of FORMAT_XY, FORMAT_YX, FORMAT_PREFIX, and FORMAT_SEPARATE_FILE.
prefixMap - Maps prefixes to special attribute names (e.g. "l" to "label").
attributeSet - Set of regular and special attributes.
attributeReader - Reader for the data
labelReader - Reader for the labels. Only necessary if format is FORMAT_SEPARATE_FILE.
sampleSize - sample size, may be -1 for no limit.

Method Detail

hasNext

public boolean hasNext()
Checks if futher examples exist. Returns false if one of the files end.


readLine

private boolean readLine()
                  throws java.io.IOException
Throws:
java.io.IOException

next

public DataRow next()
Returns the next Example.



Copyright © 2001-2006