
Alternative Error Functions

Penalize large weights:


\begin{displaymath}E(\vec{w}) \equiv \frac{1}{2}\sum_{d \in D} \sum_{k \in outputs} (t_{kd} -
o_{kd})^2 + \gamma \sum_{i,j}w_{ji}^{2}\end{displaymath}
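A minimal sketch of the penalized error above, in plain Python. The function names and the flat list of weights are illustrative assumptions, not part of the original slides; `gamma` plays the role of the penalty coefficient in the formula.

```python
def penalized_error(targets, outputs, weights, gamma):
    """Squared error over all examples and output units, plus gamma * sum of w^2.

    targets, outputs: one row per training example d, one entry per output unit k.
    weights: flat list of every network weight w_ji (assumed layout).
    """
    squared_error = 0.5 * sum(
        (t - o) ** 2
        for t_row, o_row in zip(targets, outputs)
        for t, o in zip(t_row, o_row)
    )
    penalty = gamma * sum(w ** 2 for w in weights)
    return squared_error + penalty


def penalty_gradient(weights, gamma):
    """Gradient of the penalty term alone: d/dw (gamma * w^2) = 2 * gamma * w."""
    return [2.0 * gamma * w for w in weights]


example_error = penalized_error(
    targets=[[1.0, 0.0]], outputs=[[0.5, 0.5]], weights=[1.0, -2.0], gamma=0.1
)
```

Because the penalty gradient is proportional to each weight, a gradient descent step with this term shrinks every weight toward zero by a constant factor in addition to the usual error-driven update.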


Train on target slopes as well as values:


\begin{displaymath}E(\vec{w}) \equiv \frac{1}{2} \sum_{d \in D} \sum_{k \in outputs} \left[ (t_{kd} - o_{kd})^2 + \mu \sum_{j \in inputs} \left( \frac{\partial t_{kd}}{\partial x^j_d} - \frac{\partial
o_{kd}}{\partial x^j_d}\right)^2 \right] \end{displaymath}
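As a rough illustration of training on slopes as well as values, the sketch below evaluates such an error for an arbitrary model. Everything here is an assumption for illustration: `model` is any callable mapping an input list to an output list, `mu` is an assumed name for the weight on the slope term, and the output slopes are estimated by central finite differences rather than computed analytically.

```python
def slope_aware_error(model, examples, mu, h=1e-5):
    """Squared error on values plus mu times squared error on input slopes.

    examples: list of (x, t, dt_dx) where x is the input list, t the target
    outputs, and dt_dx[k][j] the target slope of output k w.r.t. input j.
    """
    total = 0.0
    for x, t, dt_dx in examples:
        o = model(x)
        for k in range(len(t)):
            term = (t[k] - o[k]) ** 2
            for j in range(len(x)):
                # Central finite difference estimate of d o_k / d x_j.
                x_plus = list(x)
                x_plus[j] += h
                x_minus = list(x)
                x_minus[j] -= h
                do_dx = (model(x_plus)[k] - model(x_minus)[k]) / (2.0 * h)
                term += mu * (dt_dx[k][j] - do_dx) ** 2
            total += term
    return 0.5 * total


# Hypothetical one-input, one-output model o = 2x, with a target slope of 3.
linear_model = lambda x: [2.0 * x[0]]
slope_error = slope_aware_error(linear_model, [([1.0], [2.0], [[3.0]])], mu=0.5)
```

In this toy case the output value matches the target exactly, so the whole error comes from the slope mismatch.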


Tie together weights: e.g., in a phoneme recognition network
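One common way to tie weights, sketched here under stated assumptions, is to update each weight separately and then replace every weight in a tied group by the group mean. The function name, the flat weight list, and the index-based `tied_groups` argument are hypothetical choices for illustration.

```python
def tie_weights(weights, tied_groups):
    """Return a copy of weights where each tied group holds its group mean.

    weights: flat list of weight values after their separate updates.
    tied_groups: list of index lists; indices in one list share a weight.
    """
    tied = list(weights)
    for group in tied_groups:
        mean = sum(weights[i] for i in group) / len(group)
        for i in group:
            tied[i] = mean
    return tied


# Weights 0 and 1 are tied; weight 2 is left untouched.
tied_example = tie_weights([1.0, 3.0, 5.0], [[0, 1]])
```

Applying this after every update step keeps the tied weights identical throughout training, which reduces the number of free parameters.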



Don Patterson 2001-12-13