[Rllib] `TorchPolicy` and `TFPolicy` cannot find any GPUs

### What is the problem?

`ray.get_gpu_ids()` gets an empty list on my machine when I'm using `TorchPolicy` with `config['num_gpu']` set. It will get an `IndexError` at `self.devices[0]` when using `TorchPolicy` on GPUs:

https://github.com/ray-project/ray/blob/1f35470560c90de69e7555097a4d2dd85065d6f8/rllib/policy/torch_policy.py#L154-L159

This issue can be reproduced on multiple machines. *Ray version and other system information (Python version, TensorFlow version, OS):*

#### My Runtime Environment

##### Machine 1:
- OS version: Ubuntu 20.04 LTS
- Python version: 3.8.10
- Ray version: 1.5.0 from PyPI (tested with nightly build as well)
- PyTorch version: 1.9.0
- NVIDIA driver version: 470.57.02
- CUDA version: 11.1.1

##### Machine 2:
- OS version: Ubuntu 16.04 LTS
- Python version: 3.7.10
- Ray version: 1.5.0 from PyPI (tested with nightly build as well)
- PyTorch version: 1.4.0
- NVIDIA driver version: 430.64
- CUDA version: 10.0.0

Same issue on Windows: https://discuss.ray.io/t/error-with-torch-policy-and-ray-get-gpu-ids-on-windows/2711

### Reproduction (REQUIRED)
Please provide a short code snippet (less than 50 lines if possible) that can be copy-pasted to reproduce the issue. The snippet should have **no external library dependencies** (i.e., use fake or mock data / environments):

```bash
conda create --name test python=3.8 --yes
conda activate test
pip3 install https://s3-us-west-2.amazonaws.com/ray-wheels/latest/ray-2.0.0.dev0-cp38-cp38-manylinux2014_x86_64.whl
python3 -c 'import ray; print(ray.get_gpu_ids())'
nvidia-smi --list-gpus
```

If the code snippet cannot be run by itself, the issue will be closed with "needs-repro-script".

- [X] I have verified my script runs in a clean environment and reproduces the issue.
- [X] I have verified the issue also occurs with the [latest wheels](https://docs.ray.io/en/master/installation.html).

	gpu_ids = ray.get_gpu_ids()
	self.devices = [
	torch.device("cuda:{}".format(i))
	for i, id_ in enumerate(gpu_ids) if i < config["num_gpus"]
	]
	self.device = self.devices[0]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Rllib] `TorchPolicy` and `TFPolicy` cannot find any GPUs #17397

What is the problem?

My Runtime Environment

Machine 1:

Machine 2:

Reproduction (REQUIRED)

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Rllib] TorchPolicy and TFPolicy cannot find any GPUs #17397

Description

What is the problem?

My Runtime Environment

Machine 1:

Machine 2:

Reproduction (REQUIRED)

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

[Rllib] `TorchPolicy` and `TFPolicy` cannot find any GPUs #17397