
Activation Functions


For the use of radial basis functions, three different activation functions h have been implemented. For computational efficiency, the square of the distance is uniformly used as the argument of h; this distance is the Euclidean distance between the vector of activations and the vector of weights of the links leading to the corresponding unit. In addition, an argument p has been defined which represents the bias of the hidden units. The following radial basis functions have been implemented (a sketch in C is given after the list):

  1. Act_RBF_Gaussian --- the Gaussian function

  2. Act_RBF_MultiQuadratic --- the multiquadratic function

  3. Act_RBF_ThinPlateSpline --- the thin plate splines function
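
The formulas of the three base functions are not reproduced in this section. The following C sketch merely illustrates how such functions could be computed from the squared distance r2 and the bias p; the helper names and, in particular, the exact way the bias enters each formula are assumptions made for this illustration and need not match the SNNS kernel.

#include <math.h>

/* Squared Euclidean distance between the vector of incoming activations x
   and the vector of link weights t (n components each). */
static double rbf_squared_distance(const double *x, const double *t, int n)
{
    double r2 = 0.0;
    for (int i = 0; i < n; i++) {
        double d = x[i] - t[i];
        r2 += d * d;
    }
    return r2;
}

/* Act_RBF_Gaussian, assumed form: h(r2, p) = exp(-p * r2) */
static double act_rbf_gaussian(double r2, double p)
{
    return exp(-p * r2);
}

/* Act_RBF_MultiQuadratic, assumed form: h(r2, p) = sqrt(r2 + p) */
static double act_rbf_multiquadratic(double r2, double p)
{
    return sqrt(r2 + p);
}

/* Act_RBF_ThinPlateSpline, assumed form: h(r2, p) = p * r2 * ln(r),
   using r2 * ln(r) = 0.5 * r2 * ln(r2) and defining h = 0 at r = 0. */
static double act_rbf_thin_plate_spline(double r2, double p)
{
    return (r2 > 0.0) ? 0.5 * p * r2 * log(r2) : 0.0;
}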

When constructing three-layered neural networks based on radial basis functions, it is important to use the three activation functions mentioned above only for the neurons of the hidden layer. Only one hidden layer is allowed.

For the output layer two other activation functions are to be used:

  1. Act_IdentityPlusBias

  2. Act_Logistic

  Act_IdentityPlusBias activates the corresponding unit with the weighted sum of all incoming activations plus the bias of the unit. Act_Logistic applies the sigmoid logistic function to the same weighted sum, computed as in Act_IdentityPlusBias. In general, it is necessary to use an activation function for the output layer that takes the bias of the unit into account.
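
As an illustration, the two output activation functions could be computed as in the following C sketch, which follows the description above (weighted sum of the incoming activations plus the unit bias); the helper names are chosen for this example and are not taken from the SNNS kernel.

#include <math.h>

/* Weighted sum of the incoming activations plus the bias of the unit. */
static double net_input_plus_bias(const double *act, const double *w,
                                  int n, double bias)
{
    double sum = bias;
    for (int i = 0; i < n; i++)
        sum += w[i] * act[i];
    return sum;
}

/* Act_IdentityPlusBias: the unit is activated with the biased weighted sum. */
static double act_identity_plus_bias(const double *act, const double *w,
                                     int n, double bias)
{
    return net_input_plus_bias(act, w, n, bias);
}

/* Act_Logistic: the sigmoid logistic function applied to the same sum. */
static double act_logistic(const double *act, const double *w,
                           int n, double bias)
{
    double net = net_input_plus_bias(act, w, n, bias);
    return 1.0 / (1.0 + exp(-net));
}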

Of the three radial basis functions listed above, the last two (the multiquadratic and the thin plate splines function) grow towards infinity with increasing distance, while the Gaussian converges towards zero. Nevertheless, all three are useful as base functions: they fulfill the mathematical preconditions for their use, and their use is backed by practical experience. All three have been implemented as base functions in SNNS.

The most frequently used base function is the Gaussian function. For large distances r, the Gaussian function becomes almost zero. Therefore, the behavior of the net is easy to predict if the input patterns differ strongly from all teaching patterns. Another advantage of the Gaussian function is that the network is able to produce useful results without shortcut connections between the input and output layer.
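
As a rough numerical illustration, assuming the Gaussian form h(r2, p) = exp(-p * r2) from the sketch above with bias p = 1, a squared distance of r2 = 10 already gives an activation of exp(-10), roughly 4.5e-5, so a hidden unit contributes almost nothing to the output for input patterns that lie far from its center.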


