Convergence for bptt grads:
1.5078654e-06
-2.4998343
-2.4998274
-2.499832
-2.4998307
-2.4998364
-2.4998295
-2.4998288
-2.4998322
-2.4998336
-2.4998312
-2.4998286
-2.4998329
-2.4998314
-2.4998264
-2.4998264
-2.4998288
-2.499835
-2.4998317
-2.4998326
-2.4998314
-2.4998293
-2.4998271
-2.49983
-2.4998288
-2.4998286
-2.4998322
-2.4998312
-2.499834
-2.4998312
-2.4998338
-2.4998298
-2.4998312
-2.4998333
-2.4998276
-2.4998295
-2.499834
-2.4998317
-2.4998314
-2.4998329
-2.4998276
Final cosine with grad:	 0.9998312592506409
Final dist with grad:	 0.03210129588842392
