edu.udo.cs.yale.example
Class YaleLineReader

java.lang.Object
  extended by edu.udo.cs.yale.example.YaleLineReader

public class YaleLineReader
extends java.lang.Object

A simple line converter for reading data from BufferedReaders. Each line is separated into columns using a pattern matcher based on regular expressions. In addition, comments might be also defined. Everything after a comment character is completely ignored. Quotes might also be used. If a columns starts with a quote (") the end of the quoted region is searched and the corresponding columns build a new column which replaces the old ones. Quoting is added for compatibility reasons only. Since parsing is slower if quoting is used, quotes should not be used at all. If possible please use and define a column separator which is not part of your data.

Version:
$Id: YaleLineReader.java,v 2.12 2006/08/03 14:39:27 ingomierswa Exp $
Author:
Ingo Mierswa

Field Summary
private  java.lang.String[] commentChars
          The possible character for comment lines.
private  char decimalPointCharacter
          Decimal point character
private  int lineNumber
          The current line number.
private  java.util.regex.Pattern separatorPattern
          A regular expression pattern which is used for splitting the columns.
private  boolean useQuotes
          Indicates if quotes should be regarded (slower!).
 
Constructor Summary
YaleLineReader(java.lang.String separatorsRegExpr, char[] commentChars, boolean useQuotes)
          Indicates if quoting (") can be used to form.
YaleLineReader(java.lang.String separatorsRegExpr, char[] commentChars, boolean useQuotes, char decimalPointCharacter)
          Indicates if quoting (") can be used to form.
 
Method Summary
private  java.lang.String[] quote(java.lang.String[] columns)
           
 java.lang.String[] readLine(java.io.BufferedReader in, int expectedNumberOfColumns)
          Ignores comment and empty lines and returns the first line not starting with a comment.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

separatorPattern

private java.util.regex.Pattern separatorPattern
A regular expression pattern which is used for splitting the columns.


commentChars

private java.lang.String[] commentChars
The possible character for comment lines.


useQuotes

private boolean useQuotes
Indicates if quotes should be regarded (slower!).


decimalPointCharacter

private char decimalPointCharacter
Decimal point character


lineNumber

private int lineNumber
The current line number.

Constructor Detail

YaleLineReader

public YaleLineReader(java.lang.String separatorsRegExpr,
                      char[] commentChars,
                      boolean useQuotes,
                      char decimalPointCharacter)
Indicates if quoting (") can be used to form.


YaleLineReader

public YaleLineReader(java.lang.String separatorsRegExpr,
                      char[] commentChars,
                      boolean useQuotes)
Indicates if quoting (") can be used to form.

Method Detail

readLine

public java.lang.String[] readLine(java.io.BufferedReader in,
                                   int expectedNumberOfColumns)
                            throws java.io.IOException
Ignores comment and empty lines and returns the first line not starting with a comment. Returns null if no such line exists. Throws an IOException if the line does not provide the given expected number of columns.

Throws:
java.io.IOException


quote

private java.lang.String[] quote(java.lang.String[] columns)
                          throws java.io.IOException
Throws:
java.io.IOException


Copyright © 2001-2006