Parallelizing neural networks on one GPU with JAX

from blog Will Whitney, | ↗ original
How you can get a 100x speedup for training small neural networks by making the most of your accelerator.