python - Is it possible to split a network across multiple GPUs in tensorflow?

Question

Welcome To Ask or Share your Answers For Others

python - Is it possible to split a network across multiple GPUs in tensorflow?

1 Reply

深蓝 · Answer 1 · 2021-10-23T17:59:00+0000

Splitting a large model across multiple GPUs is certainly possible in TensorFlow, but doing it optimally is a hard research problem. In general, you will need to do the following:

Wrap large contiguous regions of your code in a with tf.device(...): block, naming the different GPUs:

with tf.device("/gpu:0"):
  # Define first layer.

with tf.device("/gpu:1"):
  # Define second layer.

# Define other layers, etc.

When building your optimizer, pass the optional argument colocate_gradients_with_ops=True to the optimizer.minimize() method:

loss = ...
optimizer = tf.train.AdaGradOptimizer(0.01)
train_op = optimizer.minimize(loss, colocate_gradients_with_ops=True)

(Optionally.) You may need to enable "soft placement" in the tf.ConfigProto when you create your tf.Session, if any of the operations in your model cannot run on GPU:
```
config = tf.ConfigProto(allow_soft_placement=True)
sess = tf.Session(config=config)
```

Categories

python - Is it possible to split a network across multiple GPUs in tensorflow?

python - Is it possible to split a network across multiple GPUs in tensorflow?

Please log in or register to add a comment.

Please log in or register to reply this article.

1 Reply

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags