How to replace AllReduce with Reduce in parameter server mode? #467
Comments
Note the reason parameter server mode takes a mean of the gradients instead of a sum is that |
@reedwm Thanks for getting back to me about |
On a single device, However, with parameter server mode, |
@reedwm
I know |
benchmarks/scripts/tf_cnn_benchmarks/variable_mgr_util.py
Line 575 in 5d03cf8
@reedwm
In parameter server mode, I managed to replace `grad = tf.add_n(grads)` with an NCCL all-reduce. I tried to figure out a way to accomplish the sum without using all-reduce, since I only need one copy of the sum, not number-of-GPUs copies of the sum. In TF, is there a reduce API I can use? Thanks.