For an experimental testbed, we used the ``Arm2D'' problem described
by Cohn [1994]. The task is to learn the kinematics of
a toy 2-degree-of-freedom robot arm (see Figure 4). The
inputs are joint angles , and the outputs are
the Cartesian coordinates of the tip
. One of the implicit
assumptions of both models described here is that the noise is
Gaussian in the output dimensions. To test the robustness of the
algorithm to this assumption, we ran experiments using no noise, using
additive Gaussian noise in the outputs, and using additive Gaussian
noise in the inputs. The results of each were comparable; we report
here the results using additive Gaussian noise in the inputs. Gaussian
input noise corresponds to the case where the arm effectors or joint
angle sensors are noisy, and results in non-Gaussian errors in the
learner's outputs. The input distribution
is assumed to be
uniform.
Figure 4: The arm kinematics problem. The learner
attempts to predict tip position given a set of joint angles
.
We compared the performance of the variance-minimizing criterion by comparing the learning curves of a learner using the criterion with that of one learning from random samples. The learning curves plot the mean squared error and variance of the learner as its training set size increases. The curves are created by starting with an initial sample, measuring the learner's mean squared error or estimated variance on a set of ``reference'' points (independent of the training set), selecting and adding a new example to the training set, retraining the learner on the augmented set, and repeating.
On each step, the variance-minimizing learner chose a set of 64
unlabeled reference points drawn from input distribution . It
then selected a query
that it
estimated would minimize
over
the reference set. In the experiments reported here, the best
was selected from another set of 64 ``candidate'' points
drawn at random on each iteration.