Which network is faster to train for 100 epochs, given that they have the same total number of neurons n, where one network is wider and the other is deeper?
1) The wider network
2) The deeper network
3) Both networks will take the same time to train
4) Cannot be determined