Neuronale Architekturen

The TACOMA (TAsk decomposition, COrrelation Measures and local Attention neurons) Algorithm for Growing Neural Networks

To reduce the engineering efforts for the design of neural network architectures a data driven algorithm is desirable which constructs a network during the training process automatically. For structure adaptation different approaches with evolutionary algorithms, growth algorithms, pruning and others are used. To solve complex classification or function approximation problems successfully it is often necessary to divide the problem into subproblems and to solve them separately. This is a fundamental principle of nature called Task Decomposition. The resulting key questions are: How to divide a problem automatically in structured sub-problems and how to build a neural network consisting of sub-networks reflecting this structure? The answer is quite difficuilt because what "a good structuring of the problem" is depends on the set of possible network structures. The idea TACOMA is based on is to build a feed-forward neural network bottom-up by cyclically inserting cascaded hidden layers. The activation function of a hidden layer unit combines the local characteristics of radial basis function units (or other window functions) with sigmoid units:



sigmoid function image * window function image = activation function image


With each grow step a hidden layer consisting of such local attention neurons will be inserted. The number of units to be inserted and their attention regions, means and ratios are given by an approximation of the mapping of the residual error from output to input space. The attention region of a unit becomes restricted to a region in the input space where the residual error is still high. The attention regions of the hidden units of the same hidden layer do not overlap (no lateral influence) and can be considered as units of different subnetworks. Contrary to the Cascade-Correlation Learning Architecture different correlation measures are used to train the units. The hidden units are trained to maximize the correlation between the units output and the residual error at the network output. At the same time the hidden units are trained to minimize the correlation to the other hidden units of the layer. After training a new hidden layer will be connected to the network using a connection routing algorithm which connects only cooperative units of different layers. Units of different hidden layers are called cooperative if their attention regions in the input space overlap. The TACOMA algorithm genereates over a number of cycles a neural network well adapted to problem to be solved. The training stops if the overall error at the network output is small enough. If we now consider the distribution of the local hidden units in the input space we will find clusters of cooperative units. Those groups of units can be considered as subnetworks, each specialized to a different input region. The number of units per subnetwork expresses the complexity of the subtask in the relating input region and can be quite different depending on the machine-learning problems.

 

 

 

 

Benchmarks

Two Spirals Classification Benchmark: The task is to learn to discriminate between two sets of training points which lie on two distinct spirals in the x-y plane. These spirals coil three times around the origin and around one another. This appears to be a very difficult task for back-propagation networks and their relatives. Problems like this one, whose inputs are points on the 2-D plane, are interesting because we can display the 2-D "receptive field" of any unit in the network. The problem was first published by Scott E. Fahlman, CMU.

 

Two Spirals Data Two Spirals TACOMA results

data set

TACOMA solution

 

Two Twin Spirals Classification Benchmark: The task is to learn to discriminate between two sets of training points which lie on two twin spirals in the x-y plane. This appears to be a very difficult (I've never seen a solution other than by TACOMA) task for back-propagation networks and their relatives. Also the solution by the Cascade Correlation Architecture algorithm gives no well generalization. Problems like this one, whose inputs are points on the 2-D plane, are interesting because we can display the 2-D "receptive field" of any unit in the network. The problem was first published by Jan Matti Lange, GFaI.

 

 

data set

CASCOR solution

TACOMA solution

 

Publications

Related Work

The TACOMA Architecture is a special neural architecture integrated into the Java Neural Networks Simulator at the University of Tübingen. The implementation for the SNNS system has been done by J. Gatter (Thesis). JavaNNS is freely available.


© GFaI 1994-2005    Kontakt: Dr. H.-M. Voigt
 
Gesellschaft zur Förderung angewandter Informatik e.V.
Forschungsbereich
Adaptive Modellierung und Mustererkennung