Basically you're sharding the neural network by "something" that is itself tuned during the learning process.
Basically you're sharding the neural network by "something" that is itself tuned during the learning process.