Automatic safetensors conversion when lacking these files#29390
LysandreJik merged 5 commits into main
Conversation
src/transformers/modeling_utils.py
Outdated
```python
cls._auto_conversion = Thread(
    target=auto_conversion,
    args=(pretrained_model_name_or_path,),
    kwargs=cached_file_kwargs,
)
cls._auto_conversion.start()
```
@Wauplin curious if you have a better idea in mind to have access to the thread started here; I don't need to join it during runtime, I'm only attributing it to the class here so that I can access it within the test files (but not super keen on modifying internals just for the tests to be simpler ...)
@LysandreJik I'm not shocked by having a cls._auto_conversion attribute TBH. Though a solution to get rid of it is to give a name to the thread. Something like that:
```python
Thread(
    target=auto_conversion,
    args=(pretrained_model_name_or_path,),
    kwargs=cached_file_kwargs,
    name="Thread-autoconversion-{<unique id here>}",
).start()
```

and then in the tests:

```python
for thread in threading.enumerate():
    print(thread.name)
# ...
# Thread-autoconversion-0
```

Thread names don't have to be unique BTW (they have a thread id anyway). But I think it's best to at least assign a unique number to the name.
But it's quite hacky IMO. In a simple case it should work fine but if you start to have several threads / parallel tests, it might get harder to be 100% sure the thread you've started is indeed the one you retrieve in the test logic.
yeah here it's really only for testing and I don't want to depend on a flaky time.sleep or something so ensuring that the thread joins first is optimal. The thread name is actually much better IMO, I'll implement that! Thanks a lot!
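The naming pattern settled on above can be sketched end to end. This is a minimal, self-contained illustration, not the actual `transformers` internals: `auto_conversion` here is a hypothetical stand-in for the real conversion work, and the counter-based suffix is one way to get the unique id the reviewers ask for.

```python
import threading
import time
from itertools import count

_conversion_counter = count()  # gives each conversion thread a unique name suffix


def auto_conversion(repo_id):
    # Hypothetical stand-in for the real work (opening a safetensors PR on the Hub).
    time.sleep(0.05)


def start_auto_conversion(repo_id):
    # Name the thread so tests can find it later, without storing it on the class.
    thread = threading.Thread(
        target=auto_conversion,
        args=(repo_id,),
        name=f"Thread-autoconversion-{next(_conversion_counter)}",
    )
    thread.start()
    return thread


start_auto_conversion("some-org/some-model")

# In a test: locate the conversion thread by name and join it,
# instead of relying on a flaky time.sleep.
for t in threading.enumerate():
    if t.name.startswith("Thread-autoconversion"):
        t.join()
```

Joining by name avoids both the flaky sleep and the `cls._auto_conversion` test-only attribute discussed above.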
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
julien-c
left a comment
neat stuff.
UI-wise, let's also think about whether we add some kind of "official HF Staff" tag to the bot's PRs, or something
ArthurZucker
left a comment
Clean and very nice!
```python
# message.

if resolved_archive_file is not None:
    if filename in [WEIGHTS_NAME, WEIGHTS_INDEX_NAME]:
```
one thing I would be wary of is that if we convert a big checkpoint from torch to safetensors and we want to load it in Flax, sharded safetensors are not supported yet
Flax defaults to loading Flax checkpoints, not safetensors, so it won't be affected by a repo that contains sharded safetensors
```diff
 for discussion in discussions:
-    if discussion.author == "SFconvertBot":
+    if discussion.author == "SFconvertbot":
```
➕ on @julien-c's comment, have had feedback that this is not explicit enough.
```diff
-    if discussion.author == "SFconvertbot":
+    if discussion.author == "HuggingFaceOfficialSafetensorConverter":
```
bot is scary for some 😅
we can't change the account name now, but we will think of a way to make it clearer in the UI that it's an "official bot"
Thanks both for the review!

I'll merge it now and will keep monitoring issues to ensure it doesn't break things in the wild.
* Automatic safetensors conversion when lacking these files
* Remove debug
* Thread name
* Typo
* Ensure that raises do not affect the main thread
When a user calls the PyTorch `from_pretrained` on a repository that only contains PyTorch/Flax/TF files, start an auto conversion in the background so that a PR is opened with `safetensors` files.
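The loading decision described in the summary can be sketched roughly as follows. This is a simplified illustration, not the actual `from_pretrained` internals: `resolve_weights` and the returned flag are hypothetical names chosen for the example.

```python
def resolve_weights(available_files):
    """Pick a weights file and report whether a background
    safetensors conversion should be started (hypothetical sketch)."""
    # Prefer safetensors when the repo already has it.
    if "model.safetensors" in available_files:
        return "model.safetensors", False
    # Otherwise fall back to the framework checkpoint and flag
    # that an auto-conversion thread should be kicked off.
    return "pytorch_model.bin", True


filename, needs_conversion = resolve_weights(["pytorch_model.bin", "config.json"])
print(filename, needs_conversion)
```

When the flag comes back `True`, the conversion runs on a named background thread so the main loading path is never blocked, and failures in the thread do not affect the main thread.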