Abstract: With the increase in both the model size and dataset size of distributed training (DT) tasks, communication between the workers and parameter servers (PSs) in a cluster has become a ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results