Skip to content

Positional embedding xformers fix#5

Merged
sagadre merged 4 commits intomainfrom
pos_fix
Sep 3, 2023
Merged

Positional embedding xformers fix#5
sagadre merged 4 commits intomainfrom
pos_fix

Conversation

@sagadre
Copy link
Copy Markdown
Collaborator

@sagadre sagadre commented Sep 2, 2023

Green is after fix
Screen Shot 2023-09-01 at 9 09 18 PM

@sagadre sagadre linked an issue Sep 2, 2023 that may be closed by this pull request
'Allows int8 training/inference, etc.'
)
parser.add_argument(
"--xformers-rotary",
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we call this flag something like "--old-xformers-rotary" so that it still makes sense when they fix the bug?

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hmm so when this gets fixed upstream this flag will use the fix so it won’t be “old” i guess we want this to always use the old implementation and will make changes accordingly

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hmm that's a good point..

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just to check: can we set up things so that when the fix happens upstream, we won't need a flag in OpenLM? I think in general the fewer flags a user needs to pass in for regular use, the better.

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

that’s how things are now but i think we do need to support an “old” flag here considering that we have already trained models with the head rotary

@sagadre
Copy link
Copy Markdown
Collaborator Author

sagadre commented Sep 3, 2023

confirmed that this looks good still with most up to date changes
Screen Shot 2023-09-03 at 12 53 24 AM

@sagadre sagadre merged commit 37dd034 into main Sep 3, 2023
@sagadre sagadre deleted the pos_fix branch September 3, 2023 04:55
@mitchellnw
Copy link
Copy Markdown
Contributor

Amazing!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Problem in position embedding

3 participants