Skip to content

Remove structural tokens from Ume tokenizers#101

Merged
karinazad merged 2 commits intomainfrom
k/ume-remove-structure-tokens
Jun 11, 2025
Merged

Remove structural tokens from Ume tokenizers#101
karinazad merged 2 commits intomainfrom
k/ume-remove-structure-tokens

Conversation

@karinazad
Copy link
Collaborator

Since we'll be training with the contrastive objective and using LG embeddings directly, we can remove extra code for processing LG tokens from Ume tokenizers

LG embeddings + tokenization will be added separately once available

@karinazad karinazad requested a review from ncfrey June 11, 2025 00:08
@karinazad karinazad merged commit d21f653 into main Jun 11, 2025
5 checks passed
@karinazad karinazad deleted the k/ume-remove-structure-tokens branch June 11, 2025 02:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants