|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectedu.udo.cs.yale.operator.Operator
edu.udo.cs.yale.operator.io.ExampleSource
public class ExampleSource
This operator reads an example set from (a) file(s). Probably you can use the default parameter values for the most file formats (including the format produced by the ExampleSetWriter, CSV, ...). Please refer to section First steps/File formats for details on the attribute description file set by the parameter attributes used to specify attribute types.
This operator supports the reading of data from multiple source files. Each attribute (including special attributes like labels, weights, ...) might be read from another file. Please note that only the minimum number of lines of all files will be read, i.e. if one of the data source files has less lines than the others, only this number of examples will be read.
The split points can be defined with regular expressions (please refer to the Java API). The default split parameter ",\s*|;\s*|\s+" should work for most file formats. This regular expression describes the following column separators
Quoting is also possible with ". However, using quotes slows down parsing and is therefore not recommended. The user should ensure that the split characters are not included in the data columns and that quotes are not needed. Additionally you can specify comment characters which can be used at arbitrary locations of the data lines. Unknown attribute values can be marked with empty strings or a question mark.
Field Summary | |
---|---|
private static java.lang.Class[] |
INPUT_CLASSES
|
private static java.lang.Class[] |
OUTPUT_CLASSES
|
Constructor Summary | |
---|---|
ExampleSource(OperatorDescription description)
|
Method Summary | |
---|---|
IOObject[] |
apply()
Implement this method in subclasses. |
java.lang.Class[] |
getInputClasses()
Returns the classes that are needed as input. |
java.lang.Class[] |
getOutputClasses()
Returns the classes that are guaranteed to be returned by apply() as additional output. |
java.util.List<ParameterType> |
getParameterTypes()
Returns a list of ParameterTypes describing the parameters of this operator. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
private static final java.lang.Class[] INPUT_CLASSES
private static final java.lang.Class[] OUTPUT_CLASSES
Constructor Detail |
---|
public ExampleSource(OperatorDescription description)
Method Detail |
---|
public IOObject[] apply() throws OperatorException
Operator
apply
in class Operator
OperatorException
public java.lang.Class[] getInputClasses()
Operator
Operator.getOutputClasses()
and
Operator.apply()
if this is necessary. This default behavior can be
changed by overriding Operator.getInputDescription(Class)
. Subclasses
which implement this method should not make use of parameters since this
method is invoked by getParameterTypes(). Therefore, parameters are not
fully available at this point of time and this might lead to exceptions.
Please use InputDescriptions instead.
getInputClasses
in class Operator
public java.lang.Class[] getOutputClasses()
Operator
Operator.getInputDescription(Class)
and can be changed by
overwriting this method. Objects which are not consumed must not be
defined as additional output in this method. May be null or an empy array
(no additional output is produced).
getOutputClasses
in class Operator
public java.util.List<ParameterType> getParameterTypes()
Operator
getParameterTypes
in class Operator
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |