Add error handling for forward method in patch_gradient_accumulation #32

Merged
danielhanchen merged 1 commit into unslothai:nightly from Erland366:fix/no_forward_vl on Jan 7, 2025
@Erland366

Collaborator

A user reported the following error:

  File "/mnt/c/code/vllm/pyenv/lib/python3.10/site-packages/unsloth_zoo/compiler.py", line 791, in patch_gradient_accumulation
    forward = module.forward
AttributeError: type object 'Qwen2VLCausalLMOutputWithPast' has no attribute 'forward'

For Qwen, other_classes in patch_gradient_accumulation somehow includes Qwen2VLCausalLMOutputWithPast, which should not be there because it has no forward method:

other_classes = ['Qwen2VLCausalLMOutputWithPast', 'Qwen2VLPreTrainedModel', 'Qwen2VisionTransformerPretrainedModel', 'Qwen2VLModel', 'Qwen2VLForConditionalGeneration']

This PR fixes the error by skipping any class that has no forward method. With this change, patching runs smoothly again on Qwen2VL.
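The skip described above can be sketched as follows. This is a minimal illustration and not the actual unsloth_zoo code: FakeOutputWithPast, FakeModel, and collect_forwards are hypothetical names standing in for an output container like Qwen2VLCausalLMOutputWithPast, a model class, and the loop inside patch_gradient_accumulation.

```python
from dataclasses import dataclass


@dataclass
class FakeOutputWithPast:
    """Hypothetical stand-in for an output container; it has no forward method."""
    loss: float = 0.0


class FakeModel:
    """Hypothetical stand-in for a model class that defines forward."""

    def forward(self, x):
        return x


def collect_forwards(namespace, class_names):
    """Return {class name: forward} and skip classes that lack forward."""
    forwards = {}
    for name in class_names:
        module = namespace[name]
        # getattr with a default avoids the AttributeError from module.forward
        forward = getattr(module, "forward", None)
        if forward is None:
            continue  # e.g. output dataclasses: nothing to patch
        forwards[name] = forward
    return forwards


namespace = {"FakeOutputWithPast": FakeOutputWithPast, "FakeModel": FakeModel}
patched = collect_forwards(namespace, ["FakeOutputWithPast", "FakeModel"])
print(sorted(patched))
```

Using getattr with a default (or an equivalent try/except AttributeError) keeps the loop tolerant of any non-module class that ends up in the list, rather than relying on the list being filtered upstream.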

danielhanchen changed the base branch from main to nightly on January 7, 2025 11:41
danielhanchen merged commit 57724c3 into unslothai:nightly on Jan 7, 2025
danielhanchen added a commit that referenced this pull request Jan 7, 2025
* Update saving_utils.py

* Update compiler_replacements.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Compiler replacements

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update __init__.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update patching_utils.py

* Update patching_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update patching_utils.py

* Update patching_utils.py

* Update patching_utils.py

* Update patching_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update compiler.py

* Fix requires grad

* Update peft_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* _get_dtype

* Update utils.py

* better attribution

* Update compiler.py

* Last layer GC

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Saving, llama.cpp

* Update llama_cpp.py

* Update llama_cpp.py

* Add error handling for forward method in patch_gradient_accumulation (#32)

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
danielhanchen added a commit that referenced this pull request Jan 10, 2025
* Update peft_utils.py

* Update peft_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
danielhanchen added a commit that referenced this pull request Jan 20, 2025
* Update llama_cpp.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update saving_utils.py

* Update __init__.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
danielhanchen added a commit that referenced this pull request Feb 6, 2025
* Create vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Licensing, bug fixes

* Update patching_utils.py

* Update vllm_utils.py

* Update __init__.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* rotary

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* load lora from tensors

* 0.7.1 lora request

* Update vllm_lora_request.py

* Update vllm_lora_request.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update compiler.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
danielhanchen added a commit that referenced this pull request Feb 6, 2025
* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
danielhanchen added a commit that referenced this pull request Feb 13, 2025
* Create logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update vllm_utils.py

* fix_zero_training_loss

* Update dataset_utils.py

* Update training_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
danielhanchen added a commit that referenced this pull request Feb 15, 2025

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update compiler.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update vllm_utils.py

* fix_zero_training_loss

* Update dataset_utils.py

* Update training_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create rl_replacements.py

* Update __init__.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
danielhanchen added a commit that referenced this pull request Feb 20, 2025
* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update __init__.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update patching_utils.py

* Update patching_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update patching_utils.py

* Update patching_utils.py

* Update patching_utils.py

* Update patching_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update compiler.py

* Fix requires grad

* Update peft_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* _get_dtype

* Update utils.py

* better attribution

* Update compiler.py

* Last layer GC

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Saving, llama.cpp

* Update llama_cpp.py

* Update llama_cpp.py

* Add error handling for forward method in patch_gradient_accumulation (#32)

* Update peft_utils.py

* Update peft_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update llama_cpp.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update saving_utils.py

* Update __init__.py

* Create vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Licensing, bug fixes

* Update patching_utils.py

* Update vllm_utils.py

* Update __init__.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* rotary

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* load lora from tensors

* 0.7.1 lora request

* Update vllm_lora_request.py

* Update vllm_lora_request.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update compiler.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update vllm_utils.py

* fix_zero_training_loss

* Update dataset_utils.py

* Update training_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create rl_replacements.py

* Update __init__.py

* Fixes

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update saving_utils.py

* Update saving_utils.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
danielhanchen added a commit that referenced this pull request Feb 20, 2025
* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update patching_utils.py

* Update patching_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update patching_utils.py

* Update patching_utils.py

* Update patching_utils.py

* Update patching_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update compiler.py

* Fix requires grad

* Update peft_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* _get_dtype

* Update utils.py

* better attribution

* Update compiler.py

* Last layer GC

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Saving, llama.cpp

* Update llama_cpp.py

* Update llama_cpp.py

* Add error handling for forward method in patch_gradient_accumulation (#32)

* Update peft_utils.py

* Update peft_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update llama_cpp.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update saving_utils.py

* Update __init__.py

* Create vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Licensing, bug fixes

* Update patching_utils.py

* Update vllm_utils.py

* Update __init__.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* rotary

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* load lora from tensors

* 0.7.1 lora request

* Update vllm_lora_request.py

* Update vllm_lora_request.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update compiler.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update vllm_utils.py

* fix_zero_training_loss

* Update dataset_utils.py

* Update training_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create rl_replacements.py

* Update __init__.py

* Fixes

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update saving_utils.py

* Update saving_utils.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update pyproject.toml

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
danielhanchen added a commit that referenced this pull request Mar 4, 2025
* Update patching_utils.py

* Update patching_utils.py

* Update patching_utils.py

* Update patching_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update compiler.py

* Fix requires grad

* Update peft_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* _get_dtype

* Update utils.py

* better attribution

* Update compiler.py

* Last layer GC

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Saving, llama.cpp

* Update llama_cpp.py

* Update llama_cpp.py

* Add error handling for forward method in patch_gradient_accumulation (#32)

* Update peft_utils.py

* Update peft_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update llama_cpp.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update saving_utils.py

* Update __init__.py

* Create vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Licensing, bug fixes

* Update patching_utils.py

* Update vllm_utils.py

* Update __init__.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* rotary

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* load lora from tensors

* 0.7.1 lora request

* Update vllm_lora_request.py

* Update vllm_lora_request.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update compiler.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update vllm_utils.py

* fix_zero_training_loss

* Update dataset_utils.py

* Update training_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create rl_replacements.py

* Update __init__.py

* Fixes

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update saving_utils.py

* Update saving_utils.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update pyproject.toml

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
danielhanchen added a commit that referenced this pull request Mar 5, 2025
* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update compiler.py

* Fix requires grad

* Update peft_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* _get_dtype

* Update utils.py

* better attribution

* Update compiler.py

* Last layer GC

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Saving, llama.cpp

* Update llama_cpp.py

* Update llama_cpp.py

* Add error handling for forward method in patch_gradient_accumulation (#32)

* Update peft_utils.py

* Update peft_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update llama_cpp.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update saving_utils.py

* Update __init__.py

* Create vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Licensing, bug fixes

* Update patching_utils.py

* Update vllm_utils.py

* Update __init__.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* rotary

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* load lora from tensors

* 0.7.1 lora request

* Update vllm_lora_request.py

* Update vllm_lora_request.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update compiler.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update vllm_utils.py

* fix_zero_training_loss

* Update dataset_utils.py

* Update training_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create rl_replacements.py

* Update __init__.py

* Fixes

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update saving_utils.py

* Update saving_utils.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update pyproject.toml

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

* compiling issues

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* SFT dataset prepare

* Update pyproject.toml

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
danielhanchen added a commit that referenced this pull request Mar 6, 2025
* Update compiler.py

* Update compiler.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update compiler.py

* Fix requires grad

* Update peft_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* _get_dtype

* Update utils.py

* better attribution

* Update compiler.py

* Last layer GC

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Saving, llama.cpp

* Update llama_cpp.py

* Update llama_cpp.py

* Add error handling for forward method in patch_gradient_accumulation (#32)

* Update peft_utils.py

* Update peft_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update llama_cpp.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update saving_utils.py

* Update __init__.py

* Create vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Licensing, bug fixes

* Update patching_utils.py

* Update vllm_utils.py

* Update __init__.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* rotary

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* load lora from tensors

* 0.7.1 lora request

* Update vllm_lora_request.py

* Update vllm_lora_request.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update compiler.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update vllm_utils.py

* fix_zero_training_loss

* Update dataset_utils.py

* Update training_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create rl_replacements.py

* Update __init__.py

* Fixes

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update saving_utils.py

* Update saving_utils.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update pyproject.toml

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

* compiling issues

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* SFT dataset prepare

* Update pyproject.toml

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update compiler.py

* Update __init__.py

* Update compiler.py

* Update compiler.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
danielhanchen added a commit that referenced this pull request Mar 6, 2025
* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update compiler.py

* Update compiler.py

* Fix requires grad

* Update peft_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* Update peft_utils.py

* _get_dtype

* Update utils.py

* better attribution

* Update compiler.py

* Last layer GC

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Saving, llama.cpp

* Update llama_cpp.py

* Update llama_cpp.py

* Add error handling for forward method in patch_gradient_accumulation (#32)

* Update peft_utils.py

* Update peft_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update llama_cpp.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update saving_utils.py

* Update __init__.py

* Create vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Licensing, bug fixes

* Update patching_utils.py

* Update vllm_utils.py

* Update __init__.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* rotary

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* load lora from tensors

* 0.7.1 lora request

* Update vllm_lora_request.py

* Update vllm_lora_request.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update compiler.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update vllm_utils.py

* fix_zero_training_loss

* Update dataset_utils.py

* Update training_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create rl_replacements.py

* Update __init__.py

* Fixes

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update saving_utils.py

* Update saving_utils.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update pyproject.toml

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

* compiling issues

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* SFT dataset prepare

* Update pyproject.toml

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update compiler.py

* Update __init__.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update __init__.py

* Update compiler.py

* Update compiler.py

* Support `image_url` with the `url` field (#57)

* Support `image_url` with the `url` field

* Update vision_utils.py

---------

Co-authored-by: Daniel Han <danielhanchen@gmail.com>

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update utils.py

* Update compiler.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
Co-authored-by: Mehmet Oguz Derin <mehmetoguzderin@mehmetoguzderin.com>
danielhanchen added a commit that referenced this pull request Mar 8, 2025
* better attribution

* Update compiler.py

* Last layer GC

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Saving, llama.cpp

* Update llama_cpp.py

* Update llama_cpp.py

* Add error handling for forward method in patch_gradient_accumulation (#32)

* Update peft_utils.py

* Update peft_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update llama_cpp.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update saving_utils.py

* Update __init__.py

* Create vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Licensing, bug fixes

* Update patching_utils.py

* Update vllm_utils.py

* Update __init__.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* rotary

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* load lora from tensors

* 0.7.1 lora request

* Update vllm_lora_request.py

* Update vllm_lora_request.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update compiler.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update logging_utils.py

* Update vllm_utils.py

* fix_zero_training_loss

* Update dataset_utils.py

* Update training_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_lora_worker_manager.py

* Update vllm_lora_worker_manager.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update vllm_utils.py

* Update __init__.py

* Create rl_replacements.py

* Update __init__.py

* Fixes

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update saving_utils.py

* Update saving_utils.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update pyproject.toml

* Update rl_replacements.py

* Update rl_replacements.py

* Update __init__.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update tokenizer_utils.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update gradient_checkpointing.py

* Update __init__.py

* compiling issues

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* SFT dataset prepare

* Update pyproject.toml

* Update rl_replacements.py

* Update rl_replacements.py

* Update rl_replacements.py

* Update compiler.py

* Update __init__.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update __init__.py

* Update compiler.py

* Update compiler.py

* Support `image_url` with the `url` field (#57)

* Support `image_url` with the `url` field

* Update vision_utils.py

---------

Co-authored-by: Daniel Han <danielhanchen@gmail.com>

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update utils.py

* Update compiler.py

* Update compiler.py

* Fix compiling

* Update patching_utils.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* distributed_function

* Update compiler.py

* Update compiler.py

* distributed

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

* Update compiler.py

---------

Co-authored-by: Edd <68678137+Erland366@users.noreply.github.com>
Co-authored-by: Mehmet Oguz Derin <mehmetoguzderin@mehmetoguzderin.com>
