[raystrategy] multi-strategy in the worker is not consistent

In the current Ray strategy, the `strategy` object shows up in three places: two are obvious and one is hidden.


The first is in the Ray launcher:
```python
class RayLauncher(_SpawnLauncher):
    def __init__(self, strategy: "RayPlugin") -> None:
        self._strategy = strategy
        self._start_method = "ray"
        self._workers = []
        self._futures = []
        self._master_addr = None
```
https://github.com/JiahaoYao/ray_lightning/blob/2727fd441a62e0e6763fd1f25ed97575dc5a6733/ray_lightning/ray_ddp.py#L38-L48


Later, we use it in `_wrapping_function`:

https://github.com/JiahaoYao/ray_lightning/blob/main/ray_lightning/ray_ddp.py#L241-L242

```python
        self._strategy.set_remote(True)
        self._strategy.set_global_to_local(global_to_local)
```

The second is the `strategy` attribute of the trainer, `trainer.strategy`.

The last, hidden one is in the call that launches the workers:

https://github.com/JiahaoYao/ray_lightning/blob/2727fd441a62e0e6763fd1f25ed97575dc5a6733/ray_lightning/ray_ddp.py#L222-L226


```python
        self._futures = [
            w.execute.remote(self._wrapping_function, i, self._global_to_local,
                             trainer, function, args, kwargs, self.tune_queue)
            for i, w in enumerate(self._workers)
        ]
```

Ray remote calls create a copy of the trainer in each worker, and with it a copy of `trainer.strategy`.


Thus, the `strategy.teardown` that actually gets called is the one on the copied trainer's strategy.
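To make the copy problem concrete, here is a minimal sketch that does not use Ray at all. It assumes each argument of a remote call is copied independently, and uses `copy.deepcopy` as a stand-in for that serialization round trip. `Strategy`, `Trainer`, and `Launcher` are hypothetical stand-ins, not the real `ray_lightning` classes.

```python
import copy


class Strategy:
    """Hypothetical stand-in for the Ray strategy."""

    def __init__(self):
        self.is_remote = False

    def set_remote(self, value):
        self.is_remote = value


class Trainer:
    """Hypothetical stand-in for the trainer; exposes `strategy`."""

    def __init__(self, strategy):
        self.strategy = strategy


class Launcher:
    """Hypothetical stand-in for the launcher; keeps its own reference."""

    def __init__(self, strategy):
        self._strategy = strategy


def ship(obj):
    # Assumption: stands in for sending one argument to a remote worker.
    return copy.deepcopy(obj)


strategy = Strategy()
launcher = Launcher(strategy)
trainer = Trainer(strategy)

# On the driver, both references point at the same object.
same_on_driver = launcher._strategy is trainer.strategy

# Each argument is copied on its own, so the shared identity is lost.
worker_launcher = ship(launcher)
worker_trainer = ship(trainer)
same_on_worker = worker_launcher._strategy is worker_trainer.strategy

# State set through the launcher's copy is invisible to the trainer's copy.
worker_launcher._strategy.set_remote(True)
launcher_copy_remote = worker_launcher._strategy.is_remote
trainer_copy_remote = worker_trainer.strategy.is_remote
```

Under these assumptions, a later `teardown` reached through `worker_trainer.strategy` would not see the state that was set through the launcher's copy.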


Evidence supporting this assumption:

<img width="899" alt="image" src="https://user-images.githubusercontent.com/20907377/174874851-573f52f2-13b3-47c0-b809-b33e3e7d6ec6.png">

<img width="811" alt="image" src="https://user-images.githubusercontent.com/20907377/174874893-1374bfa6-3fb1-45e5-908d-0179e9799954.png">

Printing out the pid of each strategy shows that they are different.


Proposal: consider removing the redundant uses of `strategy`.
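One possible reading of this proposal, as a rough sketch rather than a patch: the worker-side function could reach the strategy only through the trainer it receives, so that setup and teardown act on the same instance. The classes and the free function `wrapping_function` below are hypothetical simplifications, and `copy.deepcopy` again stands in for the copy made by a remote call.

```python
import copy


class Strategy:
    """Hypothetical stand-in for the Ray strategy."""

    def __init__(self):
        self.is_remote = False
        self.global_to_local = None

    def set_remote(self, value):
        self.is_remote = value

    def set_global_to_local(self, mapping):
        self.global_to_local = mapping


class Trainer:
    """Hypothetical stand-in for the trainer; exposes `strategy`."""

    def __init__(self, strategy):
        self.strategy = strategy


def wrapping_function(trainer, global_to_local):
    # Use the one strategy reachable from the trainer copy, instead of a
    # separate reference stored on the launcher.
    strategy = trainer.strategy
    strategy.set_remote(True)
    strategy.set_global_to_local(global_to_local)
    return strategy


# Simulate the copy a worker would receive.
worker_trainer = copy.deepcopy(Trainer(Strategy()))
configured = wrapping_function(worker_trainer, [0, 1])
```

Here the instance that was configured is the same one that `worker_trainer.strategy` refers to, so anything called later through the trainer sees the configured state.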

