Penalize large weights:
Train on target slopes as well as values:
Tie together weights: e.g., in phoneme recognition network