edu.udo.cs.yale.operator.preprocessing.discretization
Class MinimalEntropyPartitioning

java.lang.Object
  extended by edu.udo.cs.yale.operator.Operator
      extended by edu.udo.cs.yale.operator.preprocessing.discretization.Discretization
          extended by edu.udo.cs.yale.operator.preprocessing.discretization.MinimalEntropyPartitioning
All Implemented Interfaces:
ConfigurationListener

public class MinimalEntropyPartitioning
extends Discretization

A filter that discretizes all numeric attributes in the dataset into nominal attributes. The discretization is performed by selecting a bin boundary minimizing the entropy in the induced partitions. The method is then applied recursively for both new partitions until the stopping criterion is reached. For Detail see a)Multi-interval discretization of continued-values attributes for classification learning(Fayyad,Irani) b)Supervised and Unsupervized Discretization(Dougherty,Kohavi,Sahami) Skips all special attributes including the label.

Version:
$Id: MinimalEntropyPartitioning.java,v 1.5 2006/04/14 11:42:27 ingomierswa Exp $
Author:
Dirk Dach

Constructor Summary
MinimalEntropyPartitioning(OperatorDescription description)
           
 
Method Summary
 IOObject[] apply()
          Implement this method in subclasses.
 java.lang.Class[] getInputClasses()
          Returns the classes that are needed as input.
private  java.lang.Double getMinEntropySplitpoint(java.util.LinkedList<double[]> truncatedExamples, Attribute label)
           
 java.lang.Class[] getOutputClasses()
          Returns the classes that are guaranteed to be returned by apply() as additional output.
 double[][] getRanges(ExampleSet exampleSet)
          Delivers the maximum range thresholds for all attributes, i.e. the value getRanges()[a][b] is the b-th threshold for the a-th attribute.
private  java.util.ArrayList getSplitpoints(java.util.LinkedList<double[]> startPartition, Attribute label)
           
 double log2(double arg)
           
 
Methods inherited from class edu.udo.cs.yale.operator.Operator
addError, addValue, addWarning, apply, checkDeprecations, checkIO, checkProperties, clearErrorList, cloneOperator, createExperimentTree, createExperimentTree, createFromXML, createMarkedExperimentTree, delete, experimentFinished, experimentStarts, getAddOnlyAdditionalOutput, getApplyCount, getDeliveredOutputClasses, getDeprecationInfo, getDesiredInputClasses, getErrorList, getExperiment, getInnerOperatorsXML, getInput, getInput, getInput, getInputDescription, getIOContainerForInApplyLoopBreakpoint, getName, getNumberOfSteps, getOperatorClassName, getOperatorDescription, getParameter, getParameterAsBoolean, getParameterAsColor, getParameterAsDouble, getParameterAsFile, getParameterAsInt, getParameterAsString, getParameterList, getParameters, getParameterType, getParameterTypes, getParent, getStartTime, getStatus, getUserDescription, getValue, getValues, getXML, hasBreakpoint, hasBreakpoint, hasInput, inApplyLoop, isEnabled, isParameterSet, logMessage, performAdditionalChecks, register, remove, rename, resume, setBreakpoint, setEnabled, setExperiment, setInput, setListParameter, setOperatorParameters, setParameter, setParameters, setParent, setUserDescription, toString, writeXML
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

Constructor Detail

MinimalEntropyPartitioning

public MinimalEntropyPartitioning(OperatorDescription description)
Method Detail

getMinEntropySplitpoint

private java.lang.Double getMinEntropySplitpoint(java.util.LinkedList<double[]> truncatedExamples,
                                                 Attribute label)

getSplitpoints

private java.util.ArrayList getSplitpoints(java.util.LinkedList<double[]> startPartition,
                                           Attribute label)

getRanges

public double[][] getRanges(ExampleSet exampleSet)
Delivers the maximum range thresholds for all attributes, i.e. the value getRanges()[a][b] is the b-th threshold for the a-th attribute.

Specified by:
getRanges in class Discretization


apply

public IOObject[] apply()
                 throws OperatorException
Description copied from class: Operator
Implement this method in subclasses.

Overrides:
apply in class Discretization
Throws:
OperatorException


log2

public double log2(double arg)

getOutputClasses

public java.lang.Class[] getOutputClasses()
Description copied from class: Operator
Returns the classes that are guaranteed to be returned by apply() as additional output. Please note that input object which should not be consumed must also be defined by this method (e.g. for preprocessing operators). The default behavior for input consumation is defined by Operator.getInputDescription(Class) and can be changed by overwriting this method. Objects which are not consumed must not be defined as additional output in this method. May be null or an empy array (no additional output is produced).

Overrides:
getOutputClasses in class Discretization


getInputClasses

public java.lang.Class[] getInputClasses()
Description copied from class: Operator
Returns the classes that are needed as input. May be null or an empty (no desired input). As default, all delivered input objects are consumed and must be also delivered as output in both Operator.getOutputClasses() and Operator.apply() if this is necessary. This default behavior can be changed by overriding Operator.getInputDescription(Class). Subclasses which implement this method should not make use of parameters since this method is invoked by getParameterTypes(). Therefore, parameters are not fully available at this point of time and this might lead to exceptions. Please use InputDescriptions instead.

Overrides:
getInputClasses in class Discretization



Copyright © 2001-2006