java.lang.Object
  edu.udo.cs.yale.operator.Operator
    edu.udo.cs.yale.operator.io.SimpleExampleSource
public class SimpleExampleSource extends Operator
This operator reads an example set from a file. The default parameter values will probably work for most file formats (including the format produced by the ExampleSetWriter, CSV, ...). In fact, in many cases this operator is more appropriate for CSV-based file formats than the CSVExampleSource operator itself.
In contrast to the usual ExampleSource operator, this operator is able to read the attribute names from the first line of the data file. However, there are some restrictions: the data can only be read from one file instead of multiple files, all attributes must have the same value type, and currently the only special attribute that is supported is a single label attribute. If you need other functionality, use the more powerful ExampleSource operator.
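The split points can be defined with regular expressions (please refer to the Java API). The default split parameter ",\s*|;\s*|\s+" should work for most file formats. This regular expression describes the following column separators: a comma followed by optional whitespace, a semicolon followed by optional whitespace, or a run of one or more whitespace characters.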
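As a minimal sketch of how the default split expression behaves, the following applies it with the standard `String.split` method. The class and method names (`SplitSketch`, `splitLine`) are hypothetical and not part of the operator's API, and the operator's own parsing code may work differently.

```java
import java.util.Arrays;

// Hypothetical helper, not part of the Yale API: applies the documented
// default split expression to a single data line.
final class SplitSketch {
    // The default split parameter from the text, written as a Java string
    // literal (each backslash has to be doubled).
    static final String DEFAULT_SPLIT = ",\\s*|;\\s*|\\s+";

    // A limit of -1 keeps trailing empty columns instead of dropping them.
    static String[] splitLine(String line) {
        return line.split(DEFAULT_SPLIT, -1);
    }

    public static void main(String[] args) {
        // Comma, semicolon and whitespace separators mixed in one line.
        System.out.println(Arrays.toString(splitLine("5.1, 3.5;1.4  0.2")));
        // prints [5.1, 3.5, 1.4, 0.2]

        // Two adjacent commas yield an empty column between them.
        System.out.println(Arrays.toString(splitLine("1,,3")));
        // prints [1, , 3]
    }
}
```

Because the comma alternative only consumes whitespace that follows the comma, a line such as `a ,b` would produce an extra empty column; this is one reason to keep the separators in a data file consistent.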
Quoting is also possible with double quotes ("). However, using quotes slows down parsing and is therefore not recommended. Ensure that the split characters do not occur within the data columns, so that quotes are not needed. Additionally, you can specify comment characters, which can be used at arbitrary locations of the data lines. Unknown attribute values can be marked with empty strings or a question mark.
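The comment and unknown-value conventions described above can be illustrated with a short sketch. It assumes that a comment runs from the first comment character to the end of the line and uses `#` purely as an example character; the names `LineCleanupSketch`, `stripComment` and `isMissing` are hypothetical, and the operator's actual implementation is not shown in this documentation.

```java
// Hypothetical helpers, not part of the Yale API.
final class LineCleanupSketch {
    // Cuts the line at the first occurrence of any of the given comment
    // characters (assumption: a comment extends to the end of the line).
    static String stripComment(String line, String commentChars) {
        int cut = line.length();
        for (int i = 0; i < commentChars.length(); i++) {
            int pos = line.indexOf(commentChars.charAt(i));
            if (pos >= 0 && pos < cut) {
                cut = pos;
            }
        }
        return line.substring(0, cut);
    }

    // An unknown attribute value is an empty string or a question mark.
    static boolean isMissing(String token) {
        String trimmed = token.trim();
        return trimmed.isEmpty() || trimmed.equals("?");
    }

    public static void main(String[] args) {
        System.out.println(stripComment("1,2,? # measured twice", "#").trim());
        // prints 1,2,?
        System.out.println(isMissing("?") + " " + isMissing("") + " " + isMissing("0"));
        // prints true true false
    }
}
```

This sketch ignores quoting: a comment character inside a quoted column would still cut the line, which is in keeping with the advice to avoid quotes where possible.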
Field Summary | |
---|---|
private static java.lang.Class[] | INPUT_CLASSES |
private static java.lang.Class[] | OUTPUT_CLASSES |
Constructor Summary |
---|
SimpleExampleSource(OperatorDescription description) |
Method Summary | |
---|---|
IOObject[] | apply() Implement this method in subclasses. |
java.lang.Class[] | getInputClasses() Returns the classes that are needed as input. |
java.lang.Class[] | getOutputClasses() Returns the classes that are guaranteed to be returned by apply() as additional output. |
java.util.List<ParameterType> | getParameterTypes() Returns a list of ParameterTypes describing the parameters of this operator. |
private void | resetAttributeType(java.util.List<AttributeDataSource> attributeDataSources, java.lang.String columnName, java.lang.String typeName) |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
private static final java.lang.Class[] INPUT_CLASSES
private static final java.lang.Class[] OUTPUT_CLASSES
Constructor Detail |
---|
public SimpleExampleSource(OperatorDescription description)
Method Detail |
---|
public IOObject[] apply() throws OperatorException
Description copied from class: Operator
Implement this method in subclasses.
apply in class Operator
Throws: OperatorException
private void resetAttributeType(java.util.List<AttributeDataSource> attributeDataSources, java.lang.String columnName, java.lang.String typeName) throws OperatorException
Throws: OperatorException
public java.lang.Class[] getInputClasses()
Description copied from class: Operator
Returns the classes that are needed as input. [...] Operator.getOutputClasses() and Operator.apply() if this is necessary. This default behavior can be changed by overriding Operator.getInputDescription(Class). Subclasses that implement this method should not make use of parameters, since this method is invoked by getParameterTypes(). Parameters are therefore not fully available at this point, which might lead to exceptions. Please use InputDescriptions instead.
getInputClasses in class Operator
public java.lang.Class[] getOutputClasses()
Description copied from class: Operator
Returns the classes that are guaranteed to be returned by apply() as additional output. [...] Operator.getInputDescription(Class) and can be changed by overriding this method. Objects which are not consumed must not be defined as additional output in this method. May be null or an empty array (no additional output is produced).
getOutputClasses in class Operator
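public java.util.List<ParameterType> getParameterTypes()
Description copied from class: Operator
Returns a list of ParameterTypes describing the parameters of this operator.
getParameterTypes in class Operator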