- initialize better, constant 0.1 is suboptimal

- portable save/load (currently only works on SBCL and Allegro)

- maybe do a few batch fine-tuning passes once it has converged

- the learning rate magnification effect is very ad hoc; maybe the
   user could supply a function

- tests
