[Data] `DefaultAutoscalerV2` doesn't scale nodes from zero #59682

### What happened + What you expected to happen

The new cluster autoscaler works by checking if the global logical utilization is high, and if so, adding one node of each type.
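As a rough illustration of that behavior (not the actual implementation), the scale-up step can be sketched as below. The threshold value, the function name `plan_scale_up`, and the string node-type keys are all hypothetical and only serve the example.

```python
from typing import Dict, Hashable

# Hypothetical threshold; the real value is not stated in this issue.
UTILIZATION_THRESHOLD = 0.75


def plan_scale_up(
    utilization: float,
    node_type_counts: Dict[Hashable, int],
    threshold: float = UTILIZATION_THRESHOLD,
) -> Dict[Hashable, int]:
    """Return the target node count per type after one scale-up step."""
    if utilization < threshold:
        # Utilization is low: keep the current counts.
        return dict(node_type_counts)
    # Utilization is high: request one more node of every *known* type.
    return {node_type: count + 1 for node_type, count in node_type_counts.items()}


print(plan_scale_up(0.9, {"cpu-8": 2, "gpu-1": 1}))
print(plan_scale_up(0.9, {}))
```

Note that the second call has nothing to iterate over: if no node types are known, a high utilization produces no scale-up request at all.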

To determine the different types of nodes, the autoscaler calls `ray.nodes()`: https://github.com/ray-project/ray/blob/6dc66d4b7da62e93093b98190c064d50d5c1905f/python/ray/data/_internal/cluster_autoscaler/default_cluster_autoscaler_v2.py#L124-L140.

The problem with this approach is that it can only detect node types that have at least one alive node. If a node type starts with zero worker nodes, the autoscaler is never aware of it and can't scale it up. To address this issue, we can use the Ray Core API introduced in https://github.com/ray-project/ray/pull/49568.
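To make the failure mode concrete, here is a minimal sketch that mirrors the counting logic from the linked code but takes the node list as an argument, so it runs without a cluster. The function name `count_alive_worker_specs` and the plain tuple standing in for the resource spec are assumptions for illustration; the dict keys (`"Alive"`, `"Resources"`, `"CPU"`, `"GPU"`, `"memory"`, `"node:__internal_head__"`) are the ones the linked code reads from `ray.nodes()`.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

HEAD_NODE_KEY = "node:__internal_head__"

# Stand-in for the real resource spec type: (CPU, GPU, memory).
Spec = Tuple[float, float, float]


def count_alive_worker_specs(nodes: List[dict]) -> Dict[Spec, int]:
    """Count alive, non-head nodes grouped by (CPU, GPU, memory)."""
    counts: Dict[Spec, int] = defaultdict(int)
    for node in nodes:
        resources = node["Resources"]
        # Skip dead nodes and the head node.
        if not node["Alive"] or HEAD_NODE_KEY in resources:
            continue
        spec = (resources["CPU"], resources.get("GPU", 0), resources["memory"])
        counts[spec] += 1
    return dict(counts)


# A cluster that has only a head node, as when starting from zero workers.
head_only = [
    {"Alive": True, "Resources": {"CPU": 4, "memory": 8e9, HEAD_NODE_KEY: 1.0}},
]
print(count_alive_worker_specs(head_only))  # {}
```

With only the head node alive, the result is empty, so there is no node type for the autoscaler to add.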

**Outcome**:

`DefaultAutoscalerV2` should try to scale up node types even if there are currently zero of them.

**Constraints**:

* Use `ray._private.state.state.get_cluster_config()` to discover the node types. You might still need `ray.nodes()` as well, since `get_cluster_config()` doesn't give you node counts.
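One possible way to combine the two sources is sketched below. It is only a sketch under assumptions: the structure returned by `get_cluster_config()` is not shown in this issue, so the example takes already-extracted per-node-type resource dicts as input, and the names `to_spec` and `merge_configured_and_alive` are hypothetical. The idea is to seed every configured node type with a count of zero and then overlay the counts of alive nodes.

```python
from typing import Dict, Iterable, Tuple

# Stand-in for the real resource spec type: (CPU, GPU, memory).
Spec = Tuple[float, float, float]


def to_spec(resources: dict) -> Spec:
    """Reduce a resource dict to the fields the autoscaler compares."""
    return (
        resources.get("CPU", 0),
        resources.get("GPU", 0),
        resources.get("memory", 0),
    )


def merge_configured_and_alive(
    configured_resources: Iterable[dict],
    alive_counts: Dict[Spec, int],
) -> Dict[Spec, int]:
    """Return a count per node type, including types with zero alive nodes."""
    # Every configured type starts at zero, so it is visible to the autoscaler.
    merged = {to_spec(r): 0 for r in configured_resources}
    # Alive counts (for example, derived from `ray.nodes()`) take precedence.
    merged.update(alive_counts)
    return merged


# Assumed input: resource dicts extracted from the cluster config.
configured = [
    {"CPU": 8, "memory": 16},
    {"CPU": 4, "GPU": 1, "memory": 32},
]
alive = {(8, 0, 16): 2}
print(merge_configured_and_alive(configured, alive))
```

Here the GPU node type appears with a count of zero, so a "one more node of each type" step would still request it.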


### Versions / Dependencies

6dc66d4b7da62e93093b98190c064d50d5c1905f

### Reproduction script

TODO

### Issue Severity

None

Code at the permalink above (`default_cluster_autoscaler_v2.py#L124-L140`):

	"""Get the unique node resource specs and their count in the cluster."""
	# Filter out the head node.
	node_resources = [
	    node["Resources"]
	    for node in ray.nodes()
	    if node["Alive"] and "node:__internal_head__" not in node["Resources"]
	]

	nodes_resource_spec_count = defaultdict(int)
	for r in node_resources:
	    node_resource_spec = _NodeResourceSpec.of(
	        cpu=r["CPU"], gpu=r.get("GPU", 0), mem=r["memory"]
	    )
	    nodes_resource_spec_count[node_resource_spec] += 1

	return nodes_resource_spec_count
