Tokenizer backend equivalence report

Generated 2026-02-27 20:46 UTC

Models checked: 9631
Mismatched models: 2333 / 9631
Unique diff patterns: 78
Converters affected: 22

Mismatches by converter

BertConverter: 64 models
BigBirdConverter: 33 models
BlenderbotConverter: 40 models
CLIPConverter: 99 models
DebertaConverter: 28 models
DebertaV2Converter: 1 model
GPT2Converter: 293 models
GemmaConverter: 179 models
LlamaConverter: 175 models
MBart50Converter: 1 model
MPNetConverter: 1 model
MarkupLMConverter: 1 model
NllbConverter: 18 models
OpenAIGPTConverter: 29 models
PegasusConverter: 167 models
Qwen2Converter: 456 models
ReformerConverter: 15 models
RobertaConverter: 17 models
SeamlessM4TConverter: 25 models
T5Converter: 379 models
TikTokenConverter: 274 models
XLMRobertaConverter: 38 models
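
The per-converter counts can be cross-checked against the summary figures. The sketch below is only a sanity check on the numbers printed in this report; the variable names are illustrative and not part of the tool that generated it. It assumes each mismatched model is counted under exactly one converter.

```python
# Per-converter mismatch counts, copied from the list above.
counts = {
    "BertConverter": 64, "BigBirdConverter": 33, "BlenderbotConverter": 40,
    "CLIPConverter": 99, "DebertaConverter": 28, "DebertaV2Converter": 1,
    "GPT2Converter": 293, "GemmaConverter": 179, "LlamaConverter": 175,
    "MBart50Converter": 1, "MPNetConverter": 1, "MarkupLMConverter": 1,
    "NllbConverter": 18, "OpenAIGPTConverter": 29, "PegasusConverter": 167,
    "Qwen2Converter": 456, "ReformerConverter": 15, "RobertaConverter": 17,
    "SeamlessM4TConverter": 25, "T5Converter": 379, "TikTokenConverter": 274,
    "XLMRobertaConverter": 38,
}

models_checked = 9631  # summary: models checked
mismatched = 2333      # summary: mismatched models

# The counts should add up to the mismatched total if no model is
# attributed to more than one converter (an assumption of this check).
assert sum(counts.values()) == mismatched
assert len(counts) == 22  # summary: converters affected

rate = mismatched / models_checked
print(f"{len(counts)} converters, {mismatched} mismatched models, {rate:.1%} of checked")
```

With the figures above, the counts sum to 2333 across 22 converters, which is about 24.2% of the 9631 models checked.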

Diff patterns

Pattern #1 (456 models)

-pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}| ?[^\s\p{L}\p{N}]+[\r\n]*|\s*[\r\n]+|\s+..."), behavior=Isolated, invert=False), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=False)])
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}| ?[^\s\p{L}\p{N}]+[\r\n]*|\s*[\r\n]+|\s+..."), behavior=Isolated, invert=False), ByteLevel(add_prefix_space=False, trim_offsets=False, use_regex=False)])
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+decoder:		ByteLevel(add_prefix_space=False, trim_offsets=False, use_regex=False)
Affected models
Converter: Qwen2Converter
Models: ACE-Step/acestep-captioner ACE-Step/acestep-transcriber AGI-Eval/Auto-ATT AI71ai/agrillm-Qwen3-30B-A3B AITRADER/Huihui-Qwen3-Coder-Next-abliterated-mlx-4Bit AITRADER/Huihui-Qwen3-Coder-Next-abliterated-mlx-8Bit AVoCaDO-Captioner/AVoCaDO AlexHung29629/Qwen2.5-Omni-7B-Reasoning Alexjiuqiaoyu/Mirror_Max Alibaba-DAMO-Academy/RynnBrain-30B-A3B Alibaba-DAMO-Academy/RynnBrain-Plan-30B-A3B AudioVisual-Caption/ASID-Captioner-3B AudioVisual-Caption/ASID-Captioner-7B AutisticAF/Qwen3-Coder-Next-mlx-3Bit Bearrr310/train_grpo_1.5B-1230-ckpt100 BellLabs/das-1 BellLabs/das-2 BellLabs/das-3 Benasd/Qwen3-VL-30B-A3B-Instruct-NVFP4 Benasd/Qwen3-VL-30B-A3B-Thinking-NVFP4 Bhuvan77777/quantized-qwen2-audio-4bit BloomBerry/colqwen2-1.0-hf-vllm Cirrascale/Qwen3-Coder-Next-NVFP4 Clevyby/lynn-A2.7B-rp-v1-14.3b-32k-Q5_K_S-GGUF DarthGrampus/Qwen3-Coder-Next-Base-mlx-6Bit DiaDem-Captioner/DiaDem Dongchao/Omni-AutoThink Eldadalbajob/Huihui-Qwen3-Coder-Next-abliterated-mlx-4Bit Enriqueag26/OmniCare-Qwen3-VL-30B-A3B EurekaTian/ROMA FINGU-AI/qwen2.5-omni-3b-merge FunAGI/Qwen2.5-Omni-7B-GPTQ-4bit GadflyII/Qwen3-Coder-Next-NVFP4 GadflyII/Qwen3-VL-235B-A22B-Instruct-NVFP4 GadflyII/Qwen3-VL-235B-A22B-Thinking-NVFP4 Gaie/TA2T_DPO_Base GaleneAI/Qwen3-VL-235B-A22B-Thinking-NVFP4 Guerte/llspch Hcompany/Holo2-235B-A22B Hcompany/Holo2-30B-A3B HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-2-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-1epoch HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-part2 HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-part2-run1 HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-4-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-6-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-3e-3-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-5e-3-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-5e-5-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-remov-aux-only HectorHe/Qwen1.5-MOE-sft-coommonsense15k 
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-3e-5 HectorHe/Qwen1.5-MOE-sft-math14k HectorHe/Qwen1.5-MOE-sft-math7k HectorHe/Qwen1.5-MOE-sft-math7k-sft-epoch1 HectorHe/Qwen1.5-MOE-sft-math7k-sfttest HectorHe/Qwen1.5-MOE-sft-nemotron-code HectorHe/Qwen1.5-MOE-sft-s1K Hui519/speechllm-as-judge-qwen25omni Intel/Qwen3-Coder-Next-int4-AutoRound Intel/Qwen3-Next-80B-A3B-Instruct-int4-mixed-AutoRound Intel/Qwen3-VL-30B-A3B-Instruct-int4-AutoRound KE-Team/Ke-Omni-R KasuleTrevor/QWen-sample KasuleTrevor/Qwen-nyn-intent KasuleTrevor/Qwen2-Luganda-ASR KasuleTrevor/Qwen2-RUNY-Intent Kendamarron/Qwen2.5-1.75B-A1.1B-Instruct-ja LGB666/SageLM LeroyDyer/Mistral-AudioLm Lingzhi-AI/Lingzhi-57B-chat LoukaLacasse/Qualia-7B MaziyarPanahi/Qwen1.5-MoE-A2.7B-Wikihow Memories-ai/UGC-VideoCaptioner Moamen-dcp/Full-Franko-CS-v0 Na0s/Qwen1.5-MoE-A2.7B-Chat-20_experts-L2Norm-Pruning Na0s/Qwen1.5-MoE-A2.7B-Chat-20_experts_Maths_FT_1k_cosine Na0s/Qwen1.5-MoE-A2.7B-LoRA-Exhaustive-FT Nagata99999/Affine-2 OpenGVLab/InternVL3-14B OpenGVLab/InternVL3-14B-AWQ OpenGVLab/InternVL3-14B-Instruct OpenGVLab/InternVL3-14B-hf OpenGVLab/InternVL3-1B OpenGVLab/InternVL3-1B-Instruct OpenGVLab/InternVL3-1B-hf OpenGVLab/InternVL3-2B OpenGVLab/InternVL3-2B-Instruct OpenGVLab/InternVL3-2B-hf OpenGVLab/InternVL3-38B OpenGVLab/InternVL3-38B-AWQ OpenGVLab/InternVL3-38B-Instruct OpenGVLab/InternVL3-38B-hf OpenGVLab/InternVL3-78B OpenGVLab/InternVL3-78B-AWQ OpenGVLab/InternVL3-78B-Instruct OpenGVLab/InternVL3-8B OpenGVLab/InternVL3-8B-Instruct OpenGVLab/InternVL3-8B-hf OpenGVLab/InternVL3_5-30B-A3B-Instruct OptimizeLLM/Qwen3-VL-30B-A3B-Thinking-NVFP4 QuantTrio/Qwen3-Coder-Next-E400 QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ QuantTrio/Qwen3-VL-235B-A22B-Instruct-FP8 QuantTrio/Qwen3-VL-235B-A22B-Thinking-AWQ QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ QuantTrio/Qwen3-VL-30B-A3B-Thinking-AWQ Qwen/Qwen1.5-MoE-A2.7B Qwen/Qwen1.5-MoE-A2.7B-Chat Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4 Qwen/Qwen2-57B-A14B 
Qwen/Qwen2-57B-A14B-Instruct Qwen/Qwen2-57B-A14B-Instruct-GPTQ-Int4 Qwen/Qwen2-Audio-7B Qwen/Qwen2-Audio-7B-Instruct Qwen/Qwen2.5-Omni-3B Qwen/Qwen2.5-Omni-7B Qwen/Qwen2.5-Omni-7B-AWQ Qwen/Qwen2.5-Omni-7B-GPTQ-Int4 Qwen/Qwen3-Coder-Next Qwen/Qwen3-Coder-Next-Base Qwen/Qwen3-Coder-Next-FP8 Qwen/Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct-FP8 Qwen/Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking-FP8 Qwen/Qwen3-VL-235B-A22B-Instruct Qwen/Qwen3-VL-235B-A22B-Instruct-FP8 Qwen/Qwen3-VL-235B-A22B-Thinking Qwen/Qwen3-VL-235B-A22B-Thinking-FP8 Qwen/Qwen3-VL-30B-A3B-Instruct Qwen/Qwen3-VL-30B-A3B-Instruct-FP8 Qwen/Qwen3-VL-30B-A3B-Thinking Qwen/Qwen3-VL-30B-A3B-Thinking-FP8 R0mAI/51cea115-9b0c-4fca-8ddb-dc69e56a10dc RESMP-DEV/Qwen3-Next-80B-A3B-Instruct-NVFP4 RESMP-DEV/Qwen3-Next-80B-A3B-Thinking-NVFP4 RMSnow/SpeechJudge-GRM RUC-NLPIR/OmniAtlas-Qwen2.5-3B RUC-NLPIR/OmniAtlas-Qwen2.5-7B RedHatAI/Qwen2-57B-A14B-Instruct-FP8 RedHatAI/Qwen3-Next-80B-A3B-Instruct-quantized.w4a16 RedHatAI/Qwen3-Next-80B-A3B-Thinking-FP8-dynamic RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4 RichardErkhov/Qwen_-_Qwen1.5-MoE-A2.7B-Chat-4bits RoxanneWsyw/gsm_qwen1.5_full_lr1e-6_frozen RoxanneWsyw/qwen1.5_gsm_esft_gate_lr5e-6 RoxanneWsyw/qwen1.5_gsm_esft_token_lr5e-6 SAA-Lab/Qwen2-Audio-7B-Instruct-Ultrasuite SAA-Lab/Qwen2-Audio-7B-Instruct-Ultrasuite-woA SAA-Lab/Qwen2.5-Omni-3B-SelfEvolve-iter_1 SAA-Lab/Qwen2.5-Omni-3B-SelfEvolve-iter_2 SAA-Lab/Qwen2.5-Omni-3B-SelfEvolve-iter_3 SAA-Lab/Qwen2.5-Omni-3B-SelfEvolve-iter_4 SAA-Lab/Qwen2.5-Omni-3B-SelfEvolve-iter_5 SAA-Lab/Qwen2.5-Omni-3B-UltraSuite SAA-Lab/Qwen2.5-Omni-3B-UltraSuite-woA SAA-Lab/Qwen2.5-Omni-7B-UltraSuite-woA Sahil-Kabir/colqwen2.5-v0.2-hf Sakalti/SakaMoe-3x1.6B-Instruct Sakalti/SakaMoe-3x14B-Instruct SeaLLMs/SeaLLMs-Audio-7B SejmofDejected/Qwen2.5-Omni-7B Sergei6000/Qwen2-Audio-7B-Instruct-Int4 Shifusen/Qwen3-VL-30B-A3B-Instruct-abliterated-NVFP4 
SoarAILabs/breeze-3b Sophia-AI/RegTech-14B-Instruct Sophia-AI/RegTech-32B-Instruct Sophia-AI/RegTech-4B-Instruct Sophia-AI/RegTech-7B-Instruct SoundMind-RL/SoundMindModel TeamPV/0.5B-qwen-x16_v2_cp_3000 TheClusterDev/Qwen3-Next-80B-A3B-Instruct-FP8-Dynamic TomLucidor/Qwen3-Coder-Next-REAM-mlx-3Bit Trelis/song-birds Urabewe/Ace-Step-Captioner-fp8 Yuuta208/Qwen2.5-Coder-1.5B-Qwen2.5-Math-1.5B-Merged-moe Yuuta208/Qwen2.5-Coder-7B-Qwen2.5-Math-7B-Merged-moe-16 aeon37/Qwen3-VL-30B-A3B-Instruct-heretic alicekyting/Qwen2-Audio-7B-Instruct-4bit allenai/Molmo-72B-0924 allenai/Molmo-7B-D-0924 allenai/Molmo2-4B allenai/Molmo2-8B allenai/Molmo2-VideoPoint-4B allenai/MolmoAct-7B-D-0812 allenai/MolmoAct-7B-D-LIBERO-Goal-0812 allenai/MolmoAct-7B-D-LIBERO-Long-0812 allenai/MolmoAct-7B-D-LIBERO-Object-0812 allenai/MolmoAct-7B-D-LIBERO-Spatial-0812 allenai/MolmoAct-7B-D-Pretrain-0812 allenai/MolmoAct-7B-D-Pretrain-RT-1-0812 amd/Qwen3-Coder-Next-MXFP4 anonymousICML/OmniGuard-3B anonymousICML/OmniGuard-7B antgroup/HumanSense_Omni_Reasoning arkiven4/Qwen2.5-7B-SFT-NT aryashah00/survey-finetuned-Qwen1.5-MoE-A2.7B bbytxt/727c22bb-a499-4951-978f-841369ce2042 boyuzhuGPT/checkpoint-3921 boyuzhuGPT/omniguard-video boyuzhuGPT/qwen2_5_omni_all_1015_reverse boyuzhuGPT/qwen2_5_omni_all_1016 boyuzhuGPT/qwen2_5_omni_audio_only boyuzhuGPT/qwen2_5_omni_audio_only_1014 boyuzhuGPT/qwen2_5_omni_guardrail boyuzhuGPT/qwen2_5_omni_image_only boyuzhuGPT/qwen2_5_omni_image_only_3 boyuzhuGPT/qwen2_5_omni_text_image_half boyuzhuGPT/qwen2_5_omni_text_only boyuzhuGPT/qwen2_5_omni_text_only_cleaned boyuzhuGPT/qwen2_5_omni_video_only boyuzhuGPT/qwen2_5_omni_video_only_backup boyuzhuGPT/qwen2_5_omni_wo_image_1016 browser-use/bu-30b-a3b-preview bullpoint/Qwen3-Coder-Next-AWQ-4bit catplusplus/Qwen3-VL-30B-A3B-Instruct-Heretic-NVFP4 catplusplus/Qwen3-VL-30B-A3B-Thinking-Heretic catplusplus/Qwen3-VL-30B-A3B-Thinking-Heretic-NVFP4 chaitnya26/Qwen2.5-Omni-3B-Fork chaitnya26/Qwen2.5-Omni-7B-fork 
chauhoang/5a9f2ec1-88d6-6b14-efe5-27299e1af90e chenhaodev/qwen2-audio-7b-aishell1 chunping-m/transcriber coughmedicine/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated-W4A16 coughmedicine/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated-nvfp4 cyankiwi/Qwen3-Coder-Next-AWQ-4bit cyankiwi/Qwen3-Coder-Next-AWQ-8bit cyankiwi/Qwen3-Coder-Next-REAM-AWQ-4bit cyankiwi/Qwen3-Next-80B-A3B-Instruct-AWQ-4bit cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-4bit cyankiwi/Qwen3-VL-30B-A3B-Instruct-AWQ-4bit cyankiwi/Qwen3-VL-30B-A3B-Instruct-AWQ-8bit cyankiwi/Qwen3-VL-30B-A3B-Thinking-AWQ-4bit cyankiwi/Qwen3-VL-30B-A3B-Thinking-AWQ-8bit cyankiwi/bu-30b-a3b-preview-AWQ-4bit dazipe/Qwen3-Coder-Next-GPTQ-Int4A16 dazipe/Qwen3-Next-80B-A3B-Instruct-GPTQ-Int4A16 ddvd233/OmniSapiens-7B-RL-Full ddvd233/Qwen2.5-Omni-7B ddwang2000/EmotionThinker dimasik2987/c499b3ec-738a-4496-a788-f85f963cb5b2 ehristoforu/Testrumoe-2x1.5b-instruct ehristoforu/tmoe ehristoforu/tmoe-v2 eve1f/cp2 eve1f/cp3 fauzanazz/qwen2-audio-indo-fraud-7b-merged faychu/test femiari/Qwen2-1.5Moe filipesantoscv11/f5866ce8-292c-450f-9d85-6807fe1ba0b1 flozi00/gerqwen-audio fractaldactal/Qwen2.5-Omni-7B gguichard/qwen15_moe_finetuning_json_cvfull_model gguichard/qwen15_moe_finetuning_json_cvfull_model_fp giangndm/qwen2.5-omni-3b-mlx-4bit giangndm/qwen2.5-omni-3b-mlx-8bit giangndm/qwen2.5-omni-7b-mlx-4bit giangndm/qwen2.5-omni-7b-mlx-8bit havinash-ai/2fe5536e-17d3-4296-b792-1dab8bad3b6b hf-internal-testing/tiny-random-Qwen2MoeForCausalLM hf-internal-testing/tiny-random-Qwen2_5OmniForConditionalGeneration horiguchidotconf/qwen57b_gptq_20240923 huihui-ai/Huihui-Qwen3-Coder-Next-abliterated huihui-ai/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated huihui-ai/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated-mlx-4bit huihui-ai/Huihui-Qwen3-Next-80B-A3B-Thinking-abliterated huihui-ai/Huihui-Qwen3-VL-30B-A3B-Instruct-abliterated huihui-ai/Huihui-Qwen3-VL-30B-A3B-Thinking-abliterated ig1/Qwen3-VL-30B-A3B-Instruct-NVFP4 
imkebe/Qwen2.5-Omni-7B-rk3588-1.2.0 inclusionAI/UI-Venus-1.5-30B-A3B introvoyz041/Huihui-Qwen3-VL-30B-A3B-Thinking-abliterated-qx86-hi-mlx-mlx-4Bit iteshxt/dia-convo-v1.2c jackboot/quill-57b-tokenizer janhq/Jan-v2-VL-max-FP8 jart25/Qwen3-Next-80B-A3B-Instruct-Int4-GPTQ jart25/Qwen3-VL-30B-A3B-Instruct-AWQ-8bit jayzou3773/Qwen1.5-MOE-sft-ESFT-intent jayzou3773/Qwen1.5-MOE-sft-ESFT-translation jayzou3773/Qwen1.5-MOE-sft-coommonsense15k jayzou3773/Qwen1.5-MOE-sft-gsm8k jayzou3773/commonsense-15k-ss20-step30-int5-plantrue-tau1.0-beta2-keep10-budget30-ep2 jayzou3773/commonsense-15k-ss20-step30-int5-plantrue-tau2.0-beta1-keep10-budget30-ep2 jayzou3773/commonsense-15k-ss20-step30-int5-plantrue-tau2.0-beta2-keep10-budget30-ep2 jayzou3773/merging-step15-mweight-tau0.1-ep2 jayzou3773/merging-step30-mweight-tau0.05-ep2 jayzou3773/qwen1.5-step30-withoutplan-commonsense15k-ep2-in5 jcPatrick/Qwen2.5-omni-3B-Open-R1-GRPO jinaai/jina-vlm jinaai/jina-vlm-mlx jli56/cp_320_nothinker jli56/cp_471_nothinker jongwooko/Flex-Omni-7B katuni4ka/tiny-random-qwen1.5-moe kd1729/Qwen2-Audio-7B-Instruct kokovova/56a16866-d85e-41fa-8f42-e2192ee7ac3e ldhldh/merged-qwen-omni-dare ldhldh/merged-qwen-omni-dare-3 ldhldh/merged-qwen-omni-dare-3B liangjh2001/Qwen2-Audio-7B-Instruct-train-all-full-new liangjh2001/fuseties-and-train-audio_deepfake liangjh2001/fuseties-and-train-audio_emotion liangjh2001/fuseties-and-train-audio_speaker liangjh2001/qwen_audio_ties-full-audio_deepfake_val_new_2w-full liangjh2001/qwen_audio_ties-full-audio_emotion_train_1w5_wo_happy-full liangjh2001/qwen_audio_ties-full-audio_speaker_recognition_random_order_train-full liangjh2001/qwen_audio_ties_new lmstudio-community/Qwen3-Next-80B-A3B-Instruct-MLX-4bit lmstudio-community/Qwen3-Next-80B-A3B-Instruct-MLX-5bit lmstudio-community/Qwen3-Next-80B-A3B-Instruct-MLX-6bit lmstudio-community/Qwen3-Next-80B-A3B-Instruct-MLX-8bit lmstudio-community/Qwen3-VL-30B-A3B-Instruct-MLX-4bit 
lmstudio-community/Qwen3-VL-30B-A3B-Instruct-MLX-5bit lmstudio-community/Qwen3-VL-30B-A3B-Instruct-MLX-6bit lmstudio-community/Qwen3-VL-30B-A3B-Instruct-MLX-8bit lmstudio-community/Qwen3-VL-30B-A3B-Thinking-MLX-4bit lmstudio-community/Qwen3-VL-30B-A3B-Thinking-MLX-5bit lmstudio-community/Qwen3-VL-30B-A3B-Thinking-MLX-6bit lmstudio-community/Qwen3-VL-30B-A3B-Thinking-MLX-8bit mclemcrew/Qwen-Audio-Mix-Instruct michaelfeil/Qwen2-57B-A14B-Instructfp8_tllm michaelfeil/Qwen2-57B-A14B-Instructint4_awq_tllm michaellin/Qwen3-Coder-Next-mlx-4Bit mii-llm/nesso-4B mispeech/r1-aqa mlfoundations/Gelato-30B-A3B mlinmg/Qwen-2-Audio-Instruct-dynamic-fp8 mlx-community/Molmo-7B-D-0924-4bit mlx-community/Qwen1.5-MoE-A2.7B-4bit mlx-community/Qwen1.5-MoE-A2.7B-Chat-4bit mlx-community/Qwen2-57B-A14B-4bit mlx-community/Qwen2-57B-A14B-8bit mlx-community/Qwen2-57B-A14B-Instruct-4bit mlx-community/Qwen2-57B-A14B-Instruct-8bit mlx-community/Qwen2.5-2X32B-CoderInstruct-OlympicCoder-87B-v1.1-4bit mlx-community/Qwen3-Next-80B-A3B-Instruct-4bit mlx-community/Qwen3-Next-80B-A3B-Instruct-8bit mlx-community/Qwen3-Next-80B-A3B-Thinking-4bit mlx-community/Qwen3-Next-80B-A3B-Thinking-8bit mlx-community/Qwen3-VL-235B-A22B-Instruct-3bit mlx-community/Qwen3-VL-235B-A22B-Thinking-3bit mlx-community/Qwen3-VL-30B-A3B-Instruct-3bit mlx-community/Qwen3-VL-30B-A3B-Instruct-4bit mlx-community/Qwen3-VL-30B-A3B-Instruct-6bit mlx-community/Qwen3-VL-30B-A3B-Instruct-8bit mlx-community/Qwen3-VL-30B-A3B-Instruct-bf16 mlx-community/Qwen3-VL-30B-A3B-Thinking-4bit mlx-community/Qwen3-VL-30B-A3B-Thinking-8bit mlx-community/Qwen3-VL-30B-A3B-Thinking-bf16 mncai/hunmin_vlm_235b_v0.11_merged_cua mrtoots/unsloth-Qwen3-Coder-Next-mlx-8bit naveennagar009/qwen2_5_3B_omni_7k_v2 naveennagar009/qwen2_5_7B_omni_13k naveennagar009/qwen2_5_7B_omni_9k nguyenvulebinh/af3 ngxson/qwen3_next_tiny_test nightmedia/Huihui-Qwen3-VL-30B-A3B-Thinking-abliterated-qx86-hi-mlx 
nightmedia/Qwen2.5-2X7B-Coder-Soar-qwen-Coder-Instruct-OlympicCoder-19B-dq68-128k-mlx nkwbtb/OmniEmbed-v0.1 nm-testing/Qwen1.5-MoE-A2.7B-Chat-quantized.w4a16 nm-testing/Qwen3-Next-80B-A3B-Instruct-NVFP4 nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4 nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4 nvidia/Qwen3-VL-235B-A22B-Instruct-NVFP4-MLPerf-Inference-Closed-V6.0 nvidia/music-flamingo-hf openinterx/UGC-VideoCaptioner optimum-intel-internal-testing/tiny-random-qwen1.5-moe peft-internal-testing/tiny-random-qwen-1.5-MoE pluto6272/Qwen3-VL-30B-Medical-V3-Precision prithivMLmods/Qwen3-VL-30B-A3B-Instruct-abliterated-v1 puwaer/Qwen3-Next-80B-A3B-Thinking-GRPO-Uncensored qzkiyoshi/finetune_jun_qwen rajthakkar123/qwen2.5-omni-7b-q8_0 recursal/QRWKV6-7B-Base reinforce20001/Qwen3-VL-30B-A3B-Thinking-NVFP4 rrbale/pruned-qwen-moe samaritan-ai/LightOnOCR-2-1B-sam-44-mss-alb scrunter/Qwen3-VL-235B-A22B-Thinking-heretic shivak/Qwen3-VL-235B-A22B-Thinking-W4A16 shuyuej/Qwen2-57B-A14B-GPTQ shuyuej/Qwen2-57B-A14B-Instruct-GPTQ sirus/Qwen3-VL-30B-A3B-Instruct-sovereign-beta sugiv/octopus-omni-embed the-qa-company-official/Qwen3-VL-30B-A3B-Thinking-NVFP4 thisisiron/Ovis2-1B-hf thisisiron/Ovis2-2B-hf thoddnn/colqwen2-v1.0-hf thoddnn/colqwen2-v1.0-mlx thoddnn/colqwen2-v1.0-mlx-4bit thoddnn/colqwen2-v1.0-mlx-8bit thucdangvan020999/qwen2-audio-instruct-ep10-ckpt550-1000samples thucdangvan020999/qwen2-audio-instruct-ep15-ckpt900-1000samples thucdangvan020999/qwen2-audio-instruct-ep20-ckp1220-1000samples thucdangvan020999/qwen2-audio-instruct-ep5-ckpt305-1000samples thucdangvan020999/qwen2-audio-instruct-iemocap-ckpt200 thucdangvan020999/qwen2-audio-instruct-iemocap-ckpt400 thucdangvan020999/qwen2-audio-instruct-iemocap-ckpt600 thucdangvan020999/qwen2-audio-instruct-iemocap-ckpt800 thucdangvan020999/qwen2-audio-instruct-iemocap-v2-ckpt100 thucdangvan020999/qwen2-audio-instruct-iemocap-v2-ckpt200 thucdangvan020999/qwen2-audio-instruct-iemocap-v2-ckpt300 
thucdangvan020999/qwen2-audio-instruct-iemocap-v2-ckpt400 thucdangvan020999/qwen2-audio-instruct-iemocap-v2-ckpt500 thucdangvan020999/qwen2-audio-instruct-iemocap-v2-ckpt600 tiny-random/qwen2.5-omni tiny-random/qwen3-next-moe tiny-random/qwen3-vl-moe tranquangchung/qwen2-audio-dialogue tuanna08go/4157a659-35c7-43b8-ac2f-40ae4da17214 tuanna08go/743d2a68-4596-4ef4-b0b2-d57af29bb021 unsloth/Qwen2.5-Omni-3B unsloth/Qwen2.5-Omni-7B unsloth/Qwen3-Coder-Next unsloth/Qwen3-Coder-Next-Base unsloth/Qwen3-Coder-Next-FP8 unsloth/Qwen3-Coder-Next-FP8-Dynamic unsloth/Qwen3-Next-80B-A3B-Instruct unsloth/Qwen3-Next-80B-A3B-Instruct-bnb-4bit unsloth/Qwen3-Next-80B-A3B-Thinking unsloth/Qwen3-VL-235B-A22B-Instruct unsloth/Qwen3-VL-235B-A22B-Instruct-FP8 unsloth/Qwen3-VL-30B-A3B-Instruct unsloth/Qwen3-VL-30B-A3B-Instruct-FP8 unsloth/Qwen3-VL-30B-A3B-Thinking unsloth/Qwen3-VL-30B-A3B-Thinking-FP8 vidore/colqwen2-v1.0-hf vincentzed-hf/Qwen3-Coder-Next-NVFP4 wolfofbackstreet/Qwen2-Audio-7B-Instruct-Onnx wolfofbackstreet/Qwen2-Audio-7B-Instruct-Openvino-4Bit wolfofbackstreet/Qwen2.5-Omni-3B-4Bit wolfofbackstreet/Qwen2.5-Omni-3B-4Bit-Openvino yaolily/TimeChat-Captioner-GRPO-7B yasinarafatbd/Qwen2_Audio yasinarafatbd/Qwen2_Audio_Engine_Sound ybkim95-ai/VocalAgents2_snuh_cv_fold1 ybkim95-ai/VocalAgents2_snuh_cv_fold2 ybkim95-ai/VocalAgents2_snuh_cv_fold3 ybkim95-ai/VocalAgents2_snuh_cv_fold4 ybkim95-ai/VocalAgents2_snuh_cv_fold5 yhcao/sft_base_3x_with_pt_extra yhcao/sft_base_3x_with_pt_extra_continue_silence yhcao/sft_final yogkul2000/AVATAR yuhui1038/SpeechRole-Agent yujiepan/qwen1.5-moe-tiny-random yujiepan/qwen2-audio-tiny-random yujiepan/qwen2.5-omni-tiny-random yujiepan/qwen3-next-moe-tiny-random yujiepan/qwen3-vl-moe-tiny-random zenlm/zen-omni zh-liu799/56565 zh-liu799/789564 zhifeixie/Audio-Reasoner

Pattern #2 (285 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁"), Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A...")])
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+normalizer:		Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A...")
+pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
Affected models
Converter: T5Converter
Models: 12313Max/musicgen-small-custom AleBurzio/long-t5-base-govreport Archana99/musicgen-lora-testing Archeane/first_tldr ArmandoRockYourCloud/MusicaMDM Blaise-g/longt5_tglobal_large_sumpubmed DONG19/code-search-net-codemoe-base DevPanda004/musicgen-melody-indian DevPanda004/musicgen-small-ft1 Guyfromvillage/musicgen-stereo-lora-test Joemgu/long-t5-base-sumstew Karim-Gamal/switch-base-8-finetuned-SemEval-2018-emojis-IID-Fed Karim-Gamal/switch-base-8-finetuned-SemEval-2018-emojis-cen-1 Karim-Gamal/switch-base-8-finetuned-SemEval-2018-emojis-cen-2 LegolasTheElf/Long-T5-Booksum LoopersClub/musicgen-small Mursel/switch-base-8-xsum MusicBuddy/MusicGen_Large NitzanBar/t5_long QuangHuy54/long-t5-tglobal-base-multinews RMWeerasinghe/long-t5-tglobal-base-finetuned-govReport-4096 Razavipour/musicgen-persian-finetuned Razavipour/musicgen-persian-finetuned_setar Razavipour/musicgen-persian-finetuned_setar_test Razavipour/musicgen-persian-finetuned_setar_with_meta Razavipour/musicgen-persian-traditional-Santoor-40-samples Razavipour/musicgen-persian-traditional-Tonbak-40-samples Razavipour/musicgen-persian-traditional-instruments Razavipour/musicgen-persian-traditional-instruments-mini Razavipour/musicgen-persian-traditional-instruments-tiny Razavipour/musicgen-persian-traditional-instruments-tiny_3 Razavipour/musicgen-persian-traditional-kamancheh-40-samples RikiLin/musicgen-melody-lora-piano RuaYiii/musicgen Shobhank-iiitdwd/long-t5-tglobal-base-16384-book-summary Splend1dchan/long-t5lephone-5000 Stancld/longt5-tglobal-large-16384-pubmed-3k_steps TrulySenya/musicgen-melody-lora-punk TrulySenya/musicgen-synths-test VasilyMorzhakov/output WelfCrozzo/belarussian-switch-translator-L512 WelfCrozzo/belarussian-switch-ul2 Xenova/long-t5-encodec-tglobal-base Xenova/long-t5-local-base Xenova/long-t5-tglobal-base Xenova/long-t5-tglobal-base-16384-book-summary Xenova/musicgen-small Yoga26/longT5 Z3K3/musicgen-large acmc/summarizer_google_long-t5-local-base_keybert_unfaceted 
acmc/summarizer_google_long-t5-tglobal-base_keywords_unfaceted agkochuev19/music alex2awesome/city_council_gpt3_silver_standard_summaries__long_t5_local_base annamoerman/music-gen-test artovv/musicgen-melody-my-adapters-v3 artovv/musicgen-melody-mystyle-adapters-v2 artovv/musicgen-mystyle_v3 artovv/musicgen-mystyle_v5 asach/lognt5-xsum-icsi-5 avasaz/avasaz-large avasaz/avasaz-webgl awkyu/musicgen-small ayrmoney/musicgen-large birgermoell/musicgen-melody-lora-punk bloominho/my_awesome_opus_books_model cjinghong/musicgen-melody-lora-punk contemmcm/19b9a99d360126bde69d42d263b160bc crumb/switch-base-8-arxiv-abstraction csc-unipd/tasty-musicgen-small daniel-was-taken/long-t5-scisumm-accelerate-v2 danieladeeko/t5_research derrickdso/samplegen-small diegopdlv5/musicgen-melody-lora-punk dthomas84/musicgen-large emre/switch-base-8-finetuned-samsum facebook/musicgen-large facebook/musicgen-medium facebook/musicgen-melody facebook/musicgen-melody facebook/musicgen-melody-large facebook/musicgen-melody-large facebook/musicgen-small facebook/musicgen-stereo-large facebook/musicgen-stereo-medium facebook/musicgen-stereo-melody facebook/musicgen-stereo-melody facebook/musicgen-stereo-melody-large facebook/musicgen-stereo-melody-large facebook/musicgen-stereo-small freddy913/FRDYV2 freddy913/FRDYV2_32 freddy913/FRDYV2_33 freddy913/FRDYV2_34 freddy913/FRDYV2_35 freddy913/FRDYV2_36 freddy913/FRDYV2_37 freddy913/FRDYV2_40 freddy913/FRDYV3_31 freddy913/FRDYV4 fxmarty/tiny-random-working-LongT5Model glamprou/switch-base-8-sst2 google/long-t5-local-base google/long-t5-local-large google/long-t5-tglobal-base google/long-t5-tglobal-large google/long-t5-tglobal-xl google/switch-base-128 google/switch-base-16 google/switch-base-256 google/switch-base-32 google/switch-base-64 google/switch-base-8 google/switch-c-2048 google/switch-large-128 hangzeli/musicgen-melody-lora-punk harisnaeem/musicgen-small-ONNX heboya8/facebook-musicgen-small-not-lora-280 
heboya8/facebook-musicgen-small-not-lora-40 heboya8/facebook-musicgen-small-not-lora-400 heboya8/facebook-musicgen-small-not-lora-420 heboya8/facebook-musicgen-small-not-lora-440 heboya8/facebook-musicgen-small-not-lora-450 heboya8/facebook-musicgen-small-not-lora-470 heboya8/facebook-musicgen-small-not-lora-50 heboya8/facebook-musicgen-small-not-lora-500 heboya8/facebook-musicgen-small-not-lora-510 heboya8/facebook-musicgen-small-not-lora-530 heboya8/facebook-musicgen-small-not-lora-570 heboya8/facebook-musicgen-small-not-lora-60 heboya8/facebook-musicgen-small-not-lora-610 heboya8/facebook-musicgen-small-not-lora-620 heboya8/facebook-musicgen-small-not-lora-640 heboya8/facebook-musicgen-small-not-lora-660 heboya8/facebook-musicgen-small-not-lora-680 heboya8/facebook-musicgen-small-not-lora-700 heboya8/facebook-musicgen-small-not-lora-90 heslil/msmall hf-internal-testing/tiny-random-LongT5ForConditionalGeneration hf-internal-testing/tiny-random-LongT5Model hf-internal-testing/tiny-random-MusicgenForConditionalGeneration hf-internal-testing/tiny-random-MusicgenMelodyForConditionalGeneration hf-internal-testing/tiny-random-SwitchTransformersForConditionalGeneration hf-internal-testing/tiny-random-SwitchTransformersModel hf-tiny-model-private/tiny-random-SwitchTransformersForConditionalGeneration hf-tiny-model-private/tiny-random-SwitchTransformersModel hmueller25/LT5-finetuned hmueller25/long-t5-tglobal-base-german-law huyquoctrinh/musicgen-melody-lora-punk hyeongii/musicgen-melody-lora-punk jackmedda/google-long-t5-tglobal-base_finetuned_augmented_augmented_llama3.3_70b jamesdon/audiogen-medium-endpoint jane102350/musicgen-melody-lora-punk jauntybrain/musicgen-small jihong008/musicgen-melody-lora-punk jihong008/musicgen-melody-lora-punk2 jihong008/musicgen-melody-lora-punk3 jpe9596/musicgen-large junhaoxjtu/musicgen-melody-lora-punk kresnandika/long-t5-tglobal-base-samsum kylielee505/mymgm kylielee505/mymgm laurasi/aimusic learn3r/longt5_xl_gov_5 
learn3r/longt5_xl_gov_memsum_bp_5 learn3r/longt5_xl_govreport_4096_e10 learn3r/longt5_xl_govreport_4096_memsum_e10 learn3r/longt5_xl_sfd_20 learn3r/longt5_xl_sfd_4096_e10 learn3r/longt5_xl_sfd_bp_15 learn3r/longt5_xl_sfd_bp_20 learn3r/longt5_xl_sfd_memsum_30 lusciniaweldmou/musicgen-melody-lora-punk marv1nnnnn/musicgen-songstarter mclemcrew/musicgen-melody-ravi memepottaboah/musicgen-80snewwave-tiny memepottaboah/musicgen-POPMUSIC1981-melody merve/musicgen-small mrm8488/switch-base-16-finetuned-samsum mrm8488/switch-base-16-finetuned-xsum mrm8488/switch-base-16-finetuned-xsum-2 mrm8488/switch-base-8-finetuned-samsum nbroad/longt5-base-global-mediasum ogbanugot/musicgen-melody-lora-afrobeats ogbanugot/musicgen-melody-lora-afrobeats-with-vocals ogbanugot/musicgen-melody-lora-afrobeats-with-vocals-long ogbanugot/musicgen-small-lora-afrobeats omarimc/musicgen-large omarimc/musicgen-medium omarimc/musicgen-melody omarimc/musicgen-melody omarimc/musicgen-small onurio/musicgen-large originstory/holisleigh originstory/holisleigh2 pbotsaris/musicgen-small pharoAIsanders420/micro-musicgen-jungle pharoAIsanders420/musicgen-tiny-jungle-onnx pingzhili/switch-base-32-finetuned-copa pingzhili/switch-base-32-finetuned-hotpotqa pingzhili/switch-base-32-finetuned-mrpc pingzhili/switch-base-32-finetuned-multirc pingzhili/switch-base-32-finetuned-squad pingzhili/switch-base-32-finetuned-sst2 pingzhili/switch-base-32-finetuned-wikiqa pingzhili/switch-base-32-finetuned-winogrande pszemraj/long-t5-tglobal-base-16384-book-summary pszemraj/long-t5-tglobal-base-16384-booksum-V11-big_patent-V2 pszemraj/long-t5-tglobal-base-sci-simplify pszemraj/long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP pszemraj/long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP17 pszemraj/long-t5-tglobal-xl-16384-book-summary pszemraj/long-t5-tglobal-xl-16384-book-summary-8bit reach-vb/musicgen-large-endpoint reach-vb/musicgen-large-fp16-endpoint reach-vb/musicgen-small reach-vb/musicgen-small-endpoint 
reach-vb/musicgen-small-test ronaldseoh/long-t5-tglobal-large rooftopcoder/long-t5-tglobal-base-16384-book-summary-finetuned-dialogsum rooftopcoder/longt5-dialogsum-2048 rrodrigu3z/long-t5-tglobal-base-joint-dg rubentito/longt5-tglobal-base-mpdocvqa sanchit-gandhi/musicgen-small satyanshu404/long-t5-local-base-finetuned-justification-v10 seckmaster/musicgen-large shahzebnaveed/moe_switch_transformer_summarization shrg7/musicgen-melody-lora-punk shrg7/musicgen-melody-lora-punk-base skroed/musicgen-medium slavocado/musicgen-large smarters/musicgen-large-csi smarters/musicgen-small-csi sweet-dreambooths/black-eyed-peas-v1-autotuned sweet-dreambooths/black-eyed-peas-v1-crafted-prompt sweet-dreambooths/black-eyed-peas-v1-crafted-prompt-1-epoch sweet-dreambooths/black-eyed-peas-v1-crafted-prompt-3-epochs sweet-dreambooths/black-eyed-peas-v1-crafted-prompt-3-epochs-piano-prompts sweet-dreambooths/black-eyed-peas-v1-crafted-prompt-3-epochs-text-only sweet-dreambooths/black-eyed-peas-v1-crafted-prompt-3-epochs-text-only-no-instance sweet-dreambooths/black-eyed-peas-v1-crafted-prompt-3-epochs-text-only-piano-prompts sweet-dreambooths/black-eyed-peas-v1-crafted-variable-prompt-16-epochs-piano-prompts sweet-dreambooths/black-eyed-peas-v1-crafted-variable-prompt-16-epochs-text-only-piano-prompts sweet-dreambooths/black-eyed-peas-v1-crafted-variable-prompt-3-epochs-piano-prompts sweet-dreambooths/black-eyed-peas-v1-crafted-variable-prompt-8-epochs-piano-prompts sweet-dreambooths/black-eyed-peas-v1-crafted-variable-prompt-8-epochs-text-only-piano-prompts sweet-dreambooths/black-eyed-peas-v1-lower-lr sweet-dreambooths/black-eyed-peas-v1-unprompted sweet-dreambooths/black-eyed-peas-v1-unprompted-lower-lr sweet-dreambooths/combined-artists-text-only-1-epochs sweet-dreambooths/combined-artists-text-only-3-epochs talargv/musicgen-finetune-aav talargv/musicgen-finetune-phonk taufiqsyed/musicgen-melody-lora-punk taufiqsyed/salami-data-clean-model taufiqsyed/salami_neural_demo_model 
taufiqsyed/salami_truncsplit_dora_model taufiqsyed/salami_truncsplit_finetune_model taufiqsyed/salami_truncsplit_legit1_model taufiqsyed/salami_truncsplit_model taufiqsyed/salami_truncsplit_model_mid taufiqsyed/salami_truncsplit_model_smol taufiqsyed/salami_truncsplit_model_trial2 tboucher/piano-mono-melody tryolabs/long-t5-tglobal-base-blogpost-cqa-onnx tsuyuan/long-t5-encodec-tglobal-base tvergho/t5-cards vinnie329/musicgen-lora-emotive-ambient vinnie329/musicgen-lora-emotive-ambient-2 vinnie329/musicgen-melody-lora-alt-hip-hop whaleloops/longt5-tglobal-large-16384-pubmed-10k_steps xkristian/long5-LegalDocumentSummarization ybelkada/switch-base-8-xsum ylacombe/musicgen-melody ylacombe/musicgen-melody-large ylacombe/musicgen-melody-large-punk-lora ylacombe/musicgen-melody-lora-punk ylacombe/musicgen-melody-punk-lora ylacombe/musicgen-stereo-melody ylacombe/musicgen-stereo-melody-large yuthrb/musicgen-custom zcode/seq2seq-parseg zera09/Word-selector zera09/long_t5 zera09/long_t5_4 zubes01/switch-base-8-imdb-text-classification

Pattern #3 (263 models)

-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+decoder:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
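
When a diff touches a single flat component, as in the decoder lines above, the changed keyword arguments can be extracted mechanically. The following is a minimal sketch using only the Python standard library; `parse_component` and `changed_fields` are hypothetical helper names, not functions of the report generator or of the `tokenizers` library. It assumes the `Name(key=value, ...)` text form printed in this report and does not handle nested components such as `Sequence(...)` or values that contain commas.

```python
import re


def parse_component(text):
    """Split 'Name(k=v, ...)' into (name, {k: v}). Flat components only."""
    match = re.fullmatch(r"(\w+)\((.*)\)", text.strip())
    if match is None:
        raise ValueError(f"not a flat component: {text!r}")
    name, body = match.groups()
    kwargs = {}
    for item in filter(None, (part.strip() for part in body.split(","))):
        key, _, value = item.partition("=")
        kwargs[key.strip()] = value.strip()
    return name, kwargs


def changed_fields(old, new):
    """Return {field: (old_value, new_value)} for every field that differs."""
    old_name, old_kwargs = parse_component(old)
    new_name, new_kwargs = parse_component(new)
    if old_name != new_name:
        return {"<type>": (old_name, new_name)}
    keys = sorted(set(old_kwargs) | set(new_kwargs))
    return {
        key: (old_kwargs.get(key), new_kwargs.get(key))
        for key in keys
        if old_kwargs.get(key) != new_kwargs.get(key)
    }


# Decoder lines of Pattern #3, copied from the diff above.
pattern_3 = changed_fields(
    "ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)",
    "ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)",
)
print(pattern_3)  # {'add_prefix_space': ('True', 'False')}
```

For Pattern #3 the only field that differs is `add_prefix_space`. Running the same helper on the decoder lines of Pattern #1 would report all three fields (`add_prefix_space`, `trim_offsets`, `use_regex`) as changed from `True` to `False`.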
Affected models
Converter: TikTokenConverter
Models: 1024m/OLMoE-1B-7B-0924-Base 1024m/OLMoE-1B-7B-0924-Instruct-Base 4bit/mpt-7b-storywriter-4bit-128g Alchan/mpt-7b-chat AntonV/mamba2-1.3b-av AntonV/mamba2-1.3b-hf AntonV/mamba2-130m-av AntonV/mamba2-130m-hf AntonV/mamba2-2.7b-av AntonV/mamba2-2.7b-hf AntonV/mamba2-370m-av AntonV/mamba2-370m-hf AntonV/mamba2-780m-av AntonV/mamba2-780m-hf ArthurZ/mamba-1.4b ArthurZ/mamba-130m Codemaster67/OLMo-7B-USPTO-1k-ZINC DanielAWrightGabrielAI/mpt-7b-storywriter-4bit-128g-65kTokens-CPU EleutherAI/Hermes-RWKV-v4-3B Fizzarolli/OLMoE-1B-7B-0924-extended-pos-emb Intel/neural-chat-7b-v1-1 KnutJaegersberg/RWKV-4-PilePlus-169M-20230520-done-ctx4096 KnutJaegersberg/RWKV-4-PilePlus-1B5-20230520-2942-486Gtokens-ctx4096 KnutJaegersberg/RWKV-4-PilePlus-3B-20230520-3147-520Gtokens-ctx4096 KnutJaegersberg/RWKV-4-PilePlus-430M-20230520-6162-1018Gtokens-ctx4098 KnutJaegersberg/RWKV-pileplus-1B5-evol_instruct_v2 Lansechen/OLMoE-1B-7B-012-Distill-or-math220k-batch32-epoch3-8192 Lansechen/OLMoE-1B-7B-0125-Distill-bs17k-batch32-epoch1-8192 Lansechen/OLMoE-1B-7B-0125-Distill-bs17k-batch32-epoch5-8192 Lansechen/OLMoE-1B-7B-0125-Distill-or-math220k-batch32-epoch1-8192 Lansechen/OLMoE-1B-7B-0125-Distill-ot114k-batch32-epoch1-8192 Lansechen/OLMoE-1B-7B-0125-Distill-ot114k-batch32-epoch3-8192 Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-ot114k-batch32 Litzy619/OLMoE-1B-7B-0924-step490000-tokens2055B-qlora Nethermind/Mpt-Instruct-DotNet-S NickNickGo/pocket_olmoe OccamRazor/mpt-7b-storywriter-4bit-128g OuteAI/OuteTTS-0.3-1B P1ayer-1/mpt-7b-instruct-base Pratye/mpt-7b-chat Q-bert/Mamba-130M Q-bert/Mamba-1B Q-bert/Mamba-3B RWKV/RWKV7-Goose-Pile-168M-HF RWKV/rwkv-4-14b-pile RWKV/rwkv-4-169m-pile RWKV/rwkv-4-1b5-pile RWKV/rwkv-4-3b-pile RWKV/rwkv-4-430m-pile RWKV/rwkv-4-7b-pile RWKV/rwkv-raven-14b RWKV/rwkv-raven-1b5 RWKV/rwkv-raven-3b RWKV/rwkv-raven-7b RichardErkhov/DeepMount00_-_mamba_790_hf_qa-4bits RichardErkhov/allenai_-_OLMoE-1B-7B-0924-4bits 
RichardErkhov/allenai_-_OLMoE-1B-7B-0924-8bits RichardErkhov/state-spaces_-_mamba-130m-hf-8bits RichardErkhov/tsavage68_-_mpt_1000_STEPS_1e5_SFT_SFT-8bits RtaForge/mamba2-2.7b-gurukul-instruct RtaForge/mamba2-2.7b-gurukul-instruct StarRing2022/RWKV-4-Raven-3B-v11-zh StarRing2022/RWKV-430M-Pile-Alpaca TRI-ML/mamba-7b-rw TehVenom/MPT-7b-Chat-Instruct-LongCTX-Merge TehVenom/MPT-7b-InstructAndStorywriting-50_50-Merge TehVenom/MPT-7b-WizardLM_Uncensored-Storywriter-Merge TehVenom/MPT-7b-storywriter-Apache-2.0 TehVenom/mpt-7b-InstructAndStorywriting-75_25-Merge Tomasal/OLMoE-1B-7B-0125-1-epoch-enron Xenova/tiny-mamba-onnx ZySec-AI/Mamba-2.8B-CyberSec agr505/fine_tuned_mamba_causal_1_19_run2 agr505/fine_tuned_mamba_causal_1_19_run3 allenai/MolmoE-1B-0924 allenai/MolmoE-1B-0924 allenai/OLMo-1B-0724-hf allenai/OLMo-1B-hf allenai/OLMo-7B-0424-Instruct-hf allenai/OLMo-7B-0424-hf allenai/OLMo-7B-0724-Instruct-hf allenai/OLMo-7B-0724-SFT-hf allenai/OLMo-7B-0724-hf allenai/OLMo-7B-Instruct-hf allenai/OLMo-7B-Twin-2T-hf allenai/OLMo-7B-hf allenai/OLMoE-1B-7B-0125 allenai/OLMoE-1B-7B-0125 allenai/OLMoE-1B-7B-0125-DPO allenai/OLMoE-1B-7B-0125-Instruct allenai/OLMoE-1B-7B-0125-SFT allenai/OLMoE-1B-7B-0924 allenai/OLMoE-1B-7B-0924 allenai/OLMoE-1B-7B-0924-Instruct allenai/OLMoE-1B-7B-0924-Instruct allenai/OLMoE-1B-7B-0924-SFT allenai/OLMoE-1B-7B-0924-SFT allura-org/MoE-Girl-1BA-7BT amd/AMD-OLMo-1B amd/AMD-OLMo-1B-SFT amd/AMD-OLMo-1B-SFT-DPO anas-awadalla/mpt-7b autoprogrammer/OLMoE-1B-7B-0125_lr2e-05_epoch4_freeze autoprogrammer/gsm_OLMoE-1B-7B-0125_lr2e-05_epoch4_epoch_3 autoprogrammer/olmoe_densebackward0125 autoprogrammer/olmoe_densebackward0125_v1 avidoavid/RWKV-1b5-finetuned-overfit ayoubkirouane/Mamba-Chat-2.8B binh230/mamba2-370m breadlicker45/MuseBan breadlicker45/MuseRWKV breadlicker45/MuseRift breadlicker45/MuseRizz breadlicker45/muse-test-36 breadlicker45/muse-test-37 breadlicker45/muse-test-38 breadlicker45/muse-test35 breadlicker45/museRWKV-test 
breadlicker45/music-rwkv-v4 breadlicker45/music-rwkv2-v4 breadlicker45/rwkv-4-169m-pile-5120 breadlicker45/rwkv-4-169m-pile-6144 breadlicker45/rwkv-4-430m-2048 breadlicker45/rwkv-4-430m-3072 breadlicker45/rwkv-4-430m-4096 breadlicker45/rwkv-4-430m-5120 breadlicker45/rwkv-4-430m-6144 breadlicker45/rwkv-music3 breadlicker45/token-music cahya/rwkv-1B5-instruction cg666/OLMoE-1B-7B-0125-Instruct-grpo-E6-D100 cg666/OLMoE-1B-7B-0125-Instruct-grpo-E6-D8000-L4096 echarlaix/tiny-mpt-random-remote-code efederici/ipt-125m estrogen/olmoe-upscale estrogen/olmoe-upscale-attempt1 estrogen/olmoe-upscale-inkmixv3-ep1 estrogen/olmoe-upscale-inkmixv3-ep2 ethzanalytics/mpt-7b-storywriter-sharded feliperodriguezborquez/OLMoE-0125-attn-rnd-V1 feliperodriguezborquez/OLMoE-0125-base-V1 feliperodriguezborquez/OLMoE-0125-base-rnd-V1 feliperodriguezborquez/OLMoE-0924-attn-rnd-V1 feliperodriguezborquez/OLMoE-0924-base-V1 feliperodriguezborquez/OLMoE-0924-base-rnd-V1 feliperodriguezborquez/OLMoE-0924-my-V1 feliperodriguezborquez/OLMoE-0924-my-rnd-V1 finnstrom3693/rwkv-raven-1.5b fla-hub/mamba-7B fla-hub/rwkv7-168M-pile foilfoilfoil/RWKV-pileplus-HF-169M fxmarty/tiny-mpt-random-remote-code gl198976/mpt-7b gl198976/mpt-7b-instruct gretelai/mpt-7b han1997/mamba-2.8b-slimpj-hf harindhar10/OLMo-7B-USPTO-1k-ZINC harindhar10/OLMo-7B-ZINC20-10k harindhar10/OLMo-7B-ZINC20-50k harindhar10/OLMo-7B-ZINC20-50k-USPTO-50k hf-internal-testing/tiny-random-MambaForCausalLM hf-internal-testing/tiny-random-MambaModel hf-internal-testing/tiny-random-MptForCausalLM hf-internal-testing/tiny-random-MptForQuestionAnswering hf-internal-testing/tiny-random-MptForSequenceClassification hf-internal-testing/tiny-random-MptForTokenClassification hf-internal-testing/tiny-random-MptModel hf-internal-testing/tiny-random-OlmoForCausalLM hf-internal-testing/tiny-random-OlmoeForCausalLM hf-internal-testing/tiny-random-RwkvForCausalLM hf-internal-testing/tiny-random-RwkvModel huxiang088/OLMoE-1B-7B-0924-Instruct-NVFP4 
hyungtae/mpt-30b interview-eval/olmoe-depthqa-test-4 interview-eval/olmoe-depthqa-test-instruct-6 interview-eval/olmoe-depthqa-test-train-5 interview-eval/olmoe-depthqa-train-1 interview-eval/olmoe-gsm8k-3 interview-eval/olmoe-math-test-4 interview-eval/olmoe-math-test-gsm8k-5 interview-eval/olmoe-math-test-instruct-6 interview-eval/olmoe-math-test-train-5 interview-eval/olmoe-math-train-1 interview-eval/olmoe-math-train-gsm8k-2 iwalton3/rwkv-14b-wizardlm jmichaelov/parc-rwkv-seed2 jploski/mpt-mini-shakespeare jprafael/mpt-7b-instruct-sharded katuni4ka/tiny-random-olmo-hf lennyhans/OLMoE-1B-7B-0125-Instruct-bnb-4bit lennyhans/OLMoE-1B-7B-0125-bnb-4bit lennyhans/OLMoE-1B-7B-0125-bnb-4bit lentan/mpt-125m lightblue/japanese-mpt-7b manojpreveen/mpt-30b-v5 mesh-ops/OLMoE-1B-7B-0924-step1140000-tokens4781B mesh-ops/OLMoE-1B-7B-0924-step1200000-tokens5033B mesh-ops/OLMoE-1B-7B-0924-step1215000-tokens5096B mesh-ops/OLMoE-1B-7B-0924-step1220000-tokens5117B mesh-ops/OLMoE-1B-7B-0924-step900000-tokens3774B mlx-community/OLMoE-1B-7B-0125 mlx-community/OLMoE-1B-7B-0125-4bit mlx-community/OLMoE-1B-7B-0125-6bit mlx-community/OLMoE-1B-7B-0125-6bit mlx-community/OLMoE-1B-7B-0125-8bit mlx-community/OLMoE-1B-7B-0125-Instruct mlx-community/OLMoE-1B-7B-0125-Instruct-4bit mlx-community/OLMoE-1B-7B-0125-Instruct-6bit mlx-community/OLMoE-1B-7B-0125-Instruct-8bit mlx-community/mamba-130m-hf-f32 modularai/replit-code-1.5 motionlabs/OLMoE-1B-5B nm-testing/OLMoE-1B-7B-0924-Instruct-FP8 nomic-ai/gpt4all-mpt nomic-ai/gpt4all-mpt-2 onnx-community/tiny-random-olmo-hf openaccess-ai-collective/mpt-7b-wizardlm optimum-intel-internal-testing/tiny-mamba optimum-intel-internal-testing/tiny-random-MptForCausalLM optimum-intel-internal-testing/tiny-random-olmo-hf petkopetkov/mamba2-1.3b-hf petkopetkov/mamba2-130m-hf petkopetkov/mamba2-2.7b-hf petkopetkov/mamba2-370m-hf petkopetkov/mamba2-780m-hf porcu-pine/mamba-detoxer ragunath-ravi/mamba-akkadian-translator 
rdabin/OLMoE-1B-7B-0924-Instruct-all_components rdabin/OLMoE-1B-7B-0924-Instruct-attention_and_experts rdabin/OLMoE-1B-7B-0924-Instruct-attention_only rdabin/OLMoE-1B-7B-0924-Instruct-experts_only rdabin/OLMoE-1B-7B-0924-Instruct-router_and_attention rdabin/OLMoE-1B-7B-0924-Instruct-router_and_experts rdabin/OLMoE-1B-7B-0924-Instruct-router_only replit/replit-code-v1_5-3b rwl4/mpt-7b-chat-extended scottsus/mamba-1.4b-instruct-hf scottsus/mamba-2.8b-papers-trained scottsus/mamba-2.8b-wdc-trained-v2 sgugger/rwkv-430M-pile sgugger/rwkv-7b-pile state-spaces/mamba-1.4b-hf state-spaces/mamba-130m-hf state-spaces/mamba-2.8b-hf state-spaces/mamba-370m-hf state-spaces/mamba-790m-hf taylodl1/possum1_8k_hf tbmod/OLMo-7B-Instruct-hf team-lucid/mptk-1b telecomadm1145/mamba2_exp5 ucmp137538/rwkv-4-169m-pile-finetuned-sst2 umuthopeyildirim/fin-rwkv-169M umuthopeyildirim/fin-rwkv-1b5 umuthopeyildirim/fin-rwkv-430m whaleloops/clinicalmamba-130m-hf whaleloops/clinicalmamba-2.8b-hf wtang06/mpt-125m-c4 yujiepan/mamba-tiny-random yujiepan/mpt-tiny-random yusx-swapp/ofm-mamba-1.4b-lambda-hf zary0/mamba-2.7b-ja-sft zhangtaolab/plant-dnamamba-BPE zhangtaolab/plant-dnamamba-BPE-promoter

Pattern #4 (178 models)

-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}{1,3}| ?[^\s\p{L}\p{N}]+[\r\n]*|\s*[\r\n]..."), behavior=Removed, invert=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=False)])
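
Note on the diff above: one visible effect of the "+" side is its `\p{N}{1,3}` branch, which caps digit pieces at three characters, whereas the regex built into `ByteLevel(use_regex=True)` has a ` ?\p{N}+` branch that keeps a whole digit run together. The sketch below is only an illustration under assumptions: both patterns are ASCII stand-ins for those single branches (the report's full regex is truncated and is not reproduced), `keep_matches` is a hypothetical helper, and `Split(behavior=Removed, invert=True)` is read as "keep the regex matches, drop everything between them".

```python
import re

# ASCII stand-ins for one branch of each pattern; NOT the full regexes.
WHOLE_RUN_DIGITS = re.compile(r" ?[0-9]+")   # stands in for ` ?\p{N}+`
CHUNKED_DIGITS = re.compile(r"[0-9]{1,3}")   # stands in for `\p{N}{1,3}`


def keep_matches(pattern: re.Pattern, text: str) -> list:
    """Assumed reading of Split(behavior=Removed, invert=True):
    keep each regex match as a piece and drop the text between matches."""
    return [m.group() for m in pattern.finditer(text)]


print(keep_matches(WHOLE_RUN_DIGITS, "1234567"))  # ['1234567']
print(keep_matches(CHUNKED_DIGITS, "1234567"))    # ['123', '456', '7']
```

If this reading holds, the two backends would be expected to diverge mainly on inputs with digit runs longer than three characters.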
Affected models
Converter	Models
GPT2Converter	0x88844451/5abc9f1a-cf97-4fe2-93c2-22ad01b8e0ea Aivesa/03c4f17f-c58f-4d99-9fa6-723e83ce2289 Apolo81/granite-4-350m-map-commands-gguf Boojum/blue-moe Boojum/blue-moe-6b-it ClarenceDan/0dc7f291-d101-46d9-a564-231611885d6e ClarenceDan/e93909e1-3ea7-49ac-89b0-3a6358376513 Etherll/Tashkeel-350M-v2 ExaltedSlayer/ibm-granite-4.0-h-small-mlx-mxfp4 Goekdeniz-Guelmez/Josiefied-granite-4.0-micro-abliterated-v1 ModelCloud/dbrx-base-converted-v2 ModelCloud/dbrx-instruct-converted-v2 Open4bits/granite-4.0-h-tiny-mlx-fp16 Open4bits/granite-4.0-micro-mlx-3Bit OpenMOSE/RWKV-Reka-3.1-Flash R0mAI/331883f4-b627-4999-a74e-d33fc5fafdd2 R0mAI/3d737383-c3fa-4c8f-ad38-b5af93247aca RedHatAI/granite-4.0-h-small-FP8-block RedHatAI/granite-4.0-h-small-FP8-dynamic RedHatAI/granite-4.0-h-tiny-FP8-dynamic RichardErkhov/katuni4ka_-_tiny-random-dbrx-4bits RichardErkhov/katuni4ka_-_tiny-random-dbrx-8bits Rocketknight1/dbrx-tiny-random SystemAdmin123/tiny-random-dbrx adammandic87/05f441bc-4409-4188-a62a-44e7d3d95c8a allenai/Flex-code-2x7B-1T allenai/Flex-code-2x7B-1T allenai/Flex-creative-2x7B-1T allenai/Flex-creative-2x7B-1T allenai/Flex-math-2x7B-1T allenai/Flex-math-2x7B-1T allenai/Flex-news-2x7B-1T allenai/Flex-news-2x7B-1T allenai/Flex-pes2o-2x7B-1T allenai/Flex-pes2o-2x7B-1T allenai/Flex-reddit-2x7B-1T allenai/Flex-reddit-2x7B-1T allenai/Molmo-7B-O-0924 allenai/Molmo2-O-7B allenai/MolmoAct-7B-O-0812 amd/dbrx-instruct-FP8-KV badmadrad/alm-granite-4.0-tiny-h-finetuned bbytxt/7c8b28fa-995f-45c1-8c76-ee14295c6be7 bdambrosio/dbrx-instruct-7.0bpw-h8-exl2 bowilleatyou/6932964f-b4ba-4d87-9bde-aee96d2216a5 cyankiwi/granite-4.0-h-micro-AWQ-4bit cyankiwi/granite-4.0-h-micro-AWQ-8bit cyankiwi/granite-4.0-h-small-AWQ-4bit cyankiwi/granite-4.0-h-small-AWQ-8bit cyankiwi/granite-4.0-h-tiny-AWQ-4bit daniel40/086ec58e-d093-49c7-9efb-0b6d27efa875 diaenra/be30be8d-506f-4ccb-923b-3fbfcef79427 dimasik1987/ff134de0-010b-48b0-8471-a1047e70f02f dimasik2987/f427b45e-0117-4127-9cce-1a55987b38c4 
drdreaddd/8398b0d6-4a31-499c-94e3-4279c5e9fa74 drewbenson/granite-4.0-h-micro-Q4-mxfp4-MLX fedovtt/e63674ec-1155-4808-aeb9-c00d2f68f6ed filipesantoscv11/160bad19-cf3c-40d3-83ee-c8349ef2e991 filipesantoscv11/710c5c72-b571-4714-830c-59baed18da52 filipesantoscv11/f8a073c0-d3ed-4c99-8a44-d111d6fc700b hf-internal-testing/tiny-random-FlexOlmoForCausalLM huihui-ai/Huihui-granite-4.0-micro-abliterated ibm-granite/granite-4.0-1b ibm-granite/granite-4.0-1b-base ibm-granite/granite-4.0-350m ibm-granite/granite-4.0-350m-base ibm-granite/granite-4.0-h-1b ibm-granite/granite-4.0-h-1b-base ibm-granite/granite-4.0-h-350m ibm-granite/granite-4.0-h-350m-base ibm-granite/granite-4.0-h-micro ibm-granite/granite-4.0-h-micro-base ibm-granite/granite-4.0-h-small ibm-granite/granite-4.0-h-small-FP8 ibm-granite/granite-4.0-h-small-base ibm-granite/granite-4.0-h-tiny ibm-granite/granite-4.0-h-tiny-base ibm-granite/granite-4.0-micro ibm-granite/granite-4.0-micro-base inference-optimization/granite-4.0-h-tiny-FP8-block introvoyz041/ibm-granite-4.0-h-small-mlx-mxfp4-mlx-4Bit irishprancer/caa2d38c-dc86-4198-afd3-1a599519e6bd katuni4ka/tiny-random-dbrx lmstudio-community/granite-4.0-h-small-MLX-4bit lmstudio-community/granite-4.0-h-small-MLX-5bit lmstudio-community/granite-4.0-h-small-MLX-6bit lmstudio-community/granite-4.0-h-small-MLX-8bit lmstudio-community/granite-4.0-h-tiny-MLX-4bit lmstudio-community/granite-4.0-h-tiny-MLX-5bit lmstudio-community/granite-4.0-h-tiny-MLX-6bit lmstudio-community/granite-4.0-h-tiny-MLX-8bit magiccodingman/Granite-4.0-H-1B-Unsloth-MXFP4-Hybrid-GGUF magiccodingman/Granite-4.0-H-350M-Unsloth-MXFP4-Hybrid-GGUF magiccodingman/Granite-4.0-H-350M-Unsloth-MagicQuant-Hybrid-GGUF marialvsantiago/e83503c0-748b-4fb7-9d6c-5faa6535a694 mlx-community/Granite-4.0-H-Tiny-4bit-DWQ mlx-community/granite-4.0-1b-4bit mlx-community/granite-4.0-h-1b-3bit mlx-community/granite-4.0-h-1b-4bit mlx-community/granite-4.0-h-1b-6bit mlx-community/granite-4.0-h-1b-8bit 
mlx-community/granite-4.0-h-350m-4bit mlx-community/granite-4.0-h-350m-8bit mlx-community/granite-4.0-h-micro-4bit mlx-community/granite-4.0-h-micro-8bit mlx-community/granite-4.0-h-tiny-3bit-MLX mlx-community/granite-4.0-h-tiny-3bit-MLX mlx-community/granite-4.0-h-tiny-4bit mlx-community/granite-4.0-h-tiny-5bit-MLX mlx-community/granite-4.0-h-tiny-6bit-MLX mlx-community/granite-4.0-h-tiny-6bit-MLX mlx-community/granite-4.0-micro-8bit mrferr3t/098d7e0d-f18a-4f53-a5d8-5ce2370b3e54 mrferr3t/0a4b6146-bc64-48be-afac-5923d9f127d6 mrferr3t/2dfb4f3e-a033-4718-8d94-7e6e02e17ab9 mrferr3t/5c21cdd1-d5ac-48c8-b859-55f27d47371b mrferr3t/94697a59-2f6e-4920-9a51-48640bb5e678 mrferr3t/9a05be10-930f-49e2-8668-dc3d1c7facbc mrferr3t/aa07cd73-f9f1-4665-9565-e5e2752236b0 mrferr3t/ddfccb92-64d3-4fb2-a8db-243e0595faca nttx/26d1fec9-df17-4915-a9e6-6bb9488f30fa nttx/30a115b0-dd24-4bdc-b54a-3f823984babb nttx/32d5a43b-9ec3-4ff1-9561-49166dc39e00 nttx/540e706b-b84b-44b1-a4ff-cd09597559fb numerouno01/85b9c394-bbd3-40d6-99c6-a40e8a403146 numerouno01/d6e3e57d-cb21-45df-ae20-208268435bca onnx-community/granite-4.0-1b-ONNX-web onnx-community/granite-4.0-350m-ONNX-web onnx-community/granite-4.0-micro-ONNX-web optimum-intel-internal-testing/tiny-random-dbrx optimum-intel-internal-testing/tiny-random-granitemoehybrid prxy5605/40933b89-85aa-42fa-a544-f4b3f4da613b prxy5605/c7a0a965-e520-490c-b575-2cb0e842d0fb prxy5606/263d1fdb-e97d-4666-9aad-257eba4d228c qgallouedec/tiny-DbrxForCausalLM ramendik/miki-pebble-20260131-safetensors seblaku/2329ff78-bb12-41e9-b193-cf014c9dfcba sergioalves/0c12ba50-1288-403c-b3f8-d9a452c7f0ce shanearora/Flex-reddit-2x7B-1T taopanda/test-tiny-random-dbrx tiny-random/granite-moe-hybrid trl-internal-testing/tiny-DbrxForCausalLM trl-internal-testing/tmp-tiny-DbrxForCausalLM unsloth/granite-4.0-1b unsloth/granite-4.0-1b-unsloth-bnb-4bit unsloth/granite-4.0-350m unsloth/granite-4.0-350m-base unsloth/granite-4.0-350m-unsloth-bnb-4bit unsloth/granite-4.0-h-1b 
unsloth/granite-4.0-h-1b-unsloth-bnb-4bit unsloth/granite-4.0-h-350m unsloth/granite-4.0-h-350m-unsloth-bnb-4bit unsloth/granite-4.0-h-micro unsloth/granite-4.0-h-micro-base-unsloth-bnb-4bit unsloth/granite-4.0-h-micro-unsloth-bnb-4bit unsloth/granite-4.0-h-small unsloth/granite-4.0-h-small-FP8-Dynamic unsloth/granite-4.0-h-small-bnb-4bit unsloth/granite-4.0-h-small-unsloth-bnb-4bit unsloth/granite-4.0-h-tiny unsloth/granite-4.0-h-tiny-FP8-Dynamic unsloth/granite-4.0-h-tiny-base unsloth/granite-4.0-h-tiny-base-unsloth-bnb-4bit unsloth/granite-4.0-micro unsloth/granite-4.0-micro-base unsloth/granite-4.0-micro-base-unsloth-bnb-4bit unsloth/granite-4.0-micro-unsloth-bnb-4bit vermoney/cdc1a8da-3025-4663-a070-219f81140542 vertings6/a434d1b9-2850-4b62-a2b9-3efc8fac06ec vertings6/bd5521cc-e201-47c5-bbf9-64fe3343cdf8 vmpsergio/dc6170db-fe75-4268-b721-53f18159aa2c whiteapple8222/26792c56-d5ae-4cbd-b70f-e16bae2c539c whiteapple8222/8a526cc6-dac2-4461-b95b-6aa9599ef5a8_private whiteapple8222/caf66720-6973-4d4f-8328-2befc929adba yujiepan/dbrx-tiny-random yujiepan/dbrx-tiny256-random yujiepan/granite-4.0-h-tiny-random yujiepan/granite-moe-hybrid-tiny-random

Pattern #5 (157 models)

-AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
Affected models
Converter	Models
GemmaConverter	0xroyce/silent-voice-multimodal DeadlyHug/gemma3nE4b-it_expv7_8k_r EpistemeAI/Audiogemma-3N-finetune EsotericsEnjoyer/BROKEN-t5gemma-2b-2b-Steiner-Esoterics-Merged FrancescoCaracciolo/chisrtina-e4b GaborMadarasz/gemma-3n-E2B-it_hun_ASR-finetuned Hayloo9838/siglip2-vision-only MLGResearch/cleaver_t5g_ss Mfusenig/large-t5gemma-finetuned-checkpoint-2400 Mfusenig/large-t5gemma-finetuned-checkpoint-2800 Mfusenig/large-t5gemma-finetuned-checkpoint-3200 Mfusenig/large-t5gemma-finetuned-checkpoint-3600 Mfusenig/large-t5gemma-finetuned-final-best-model Mfusenig/small-t5gemma-finetuned-checkpoint-14000 Mfusenig/small-t5gemma-finetuned-checkpoint-21000 Mfusenig/small-t5gemma-finetuned-checkpoint-7000 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-13000 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-14000 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-19500 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-21000 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-26000 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-6500 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-7000 Minthy/t5gemma-2b-2b-ul2-encoder-only MuXodious/gemma-3n-E4B-it-absolute-heresy-MPOA Nadhari/gemma-3n-swahili-E4B-it Nayana-cognitivelab/NayanaSectionOCR Nozim6690/hugging-face_shieldgemma-2-4b-it Qwe1325/Huihui-gemma-3n-E4B-it-abliterated-bnb-4bit RedHatAI/gemma-3n-E2B-it-FP8-dynamic RedHatAI/gemma-3n-E4B-it-FP8-dynamic RedHatAI/gemma-3n-E4B-it-quantized.w4a16 RuteNL/ViT-SO400M-16-SigLIP2-384-ONNX RuteNL/ViT-gopt-16-SigLIP2-384-ONNX attilasir/AttilaAI blind-assist/gemma-3n-2b-finetune-e1-8500 blind-assist/gemma-3n-4b-finetune-8500 brunopio/recurrentgemma-2b-it-nbits4-GS64-Axis1-HQQ-T brunopio/recurrentgemma-2b-it-nbits4-GSNone-Axis0-HQQ-T chimbiwide/Gemma3NPC-it-float16 cp500/mece dogma-black/transformers_t5gemma_2b-prefixlm_v1 google/gemma-3n-E2B google/gemma-3n-E2B-it google/gemma-3n-E4B google/gemma-3n-E4B-it 
google/recurrentgemma-2b google/recurrentgemma-2b-it google/recurrentgemma-9b google/recurrentgemma-9b-it google/shieldgemma-2-4b-it google/t5gemma-2b-2b-prefixlm google/t5gemma-2b-2b-prefixlm-it google/t5gemma-2b-2b-ul2 google/t5gemma-2b-2b-ul2-it google/t5gemma-9b-2b-prefixlm google/t5gemma-9b-2b-prefixlm-it google/t5gemma-9b-2b-ul2 google/t5gemma-9b-2b-ul2-it google/t5gemma-9b-9b-prefixlm google/t5gemma-9b-9b-prefixlm-it google/t5gemma-9b-9b-ul2 google/t5gemma-9b-9b-ul2-it google/t5gemma-b-b-prefixlm google/t5gemma-b-b-prefixlm-it google/t5gemma-b-b-ul2 google/t5gemma-b-b-ul2-it google/t5gemma-l-l-prefixlm google/t5gemma-l-l-prefixlm-it google/t5gemma-l-l-ul2 google/t5gemma-l-l-ul2-it google/t5gemma-ml-ml-prefixlm google/t5gemma-ml-ml-prefixlm-it google/t5gemma-ml-ml-ul2 google/t5gemma-ml-ml-ul2-it google/t5gemma-s-s-prefixlm google/t5gemma-s-s-prefixlm-it google/t5gemma-s-s-ul2 google/t5gemma-s-s-ul2-it google/t5gemma-xl-xl-prefixlm google/t5gemma-xl-xl-prefixlm-it google/t5gemma-xl-xl-ul2 google/t5gemma-xl-xl-ul2-it govnejri/Estimin3n harisarang/t5gemma-2b-2b-prefixlm-lora-pretrained-full harisarang/t5gemma-2b-2b-prefixlm-lora-sft-full harshaljanjani/tiny-t5gemma-test hf-internal-testing/namespace-google-repo_name-gemma-3n-E4B-it huihui-ai/Huihui-gemma-3n-E4B-it-abliterated igorktech/gemma-3n-E2B-it-language igorktech/gemma-3n-e2b-it-language-pruned jordimas/t5gemma-s-s-ul2 lmstudio-community/gemma-3n-E2B-it-MLX-4bit lmstudio-community/gemma-3n-E2B-it-MLX-6bit lmstudio-community/gemma-3n-E2B-it-MLX-8bit lmstudio-community/gemma-3n-E2B-it-MLX-bf16 lmstudio-community/gemma-3n-E4B-it-MLX-4bit lmstudio-community/gemma-3n-E4B-it-MLX-6bit lmstudio-community/gemma-3n-E4B-it-MLX-8bit lmstudio-community/gemma-3n-E4B-it-MLX-bf16 lyimo/gemma-3n-swahili mlx-community/Huihui-gemma-3n-E4B-it-abliterated-lm-4bit mlx-community/Huihui-gemma-3n-E4B-it-abliterated-lm-6bit mlx-community/Huihui-gemma-3n-E4B-it-abliterated-lm-8bit mlx-community/MedraN-E4B-Uncensored-Q4 
mlx-community/gemma-3-12b-it-qat-4bit mlx-community/gemma-3-27b-it-qat-4bit mlx-community/gemma-3-4b-it-qat-4bit mlx-community/gemma-3n-E2B-4bit mlx-community/gemma-3n-E2B-it-4bit mlx-community/gemma-3n-E2B-it-lm-4bit mlx-community/gemma-3n-E2B-it-lm-bf16 mlx-community/gemma-3n-E2B-it-text-4bit-dwq mlx-community/gemma-3n-E4B-bf16 mlx-community/gemma-3n-E4B-it-4bit mlx-community/gemma-3n-E4B-it-8bit mlx-community/gemma-3n-E4B-it-bf16 mlx-community/gemma-3n-E4B-it-lm-4bit mlx-community/gemma-3n-E4B-it-lm-bf16 mshojaei77/gemma-3n-E4B-persian nehmeailabs-org/nehme-flashcheck-270m nightknocker/recurrent-t5gemma-l-l-ul2-encoder oddadmix/MasriSwitch-Gemma3n-Transcriber-v1 onnx-community/gemma-3n-E2B-it-ONNX rizkysulaeman/Gemma3N-4B-Conv-MM-Img-Audio-Text-HealthCare sil-ai/t5gemma-swh-nih silma-ai/SILMA-Kashif-2B-Instruct-v1.0 sugarquark/sd15-text-encoder-t5g-2b-ul2-it thivy/embeddinggemma-300m-norwegian-health timm/ViT-B-16-SigLIP2 timm/ViT-B-16-SigLIP2-256 timm/ViT-B-16-SigLIP2-384 timm/ViT-B-16-SigLIP2-512 timm/ViT-B-32-SigLIP2-256 timm/ViT-L-16-SigLIP2-256 timm/ViT-L-16-SigLIP2-384 timm/ViT-L-16-SigLIP2-512 timm/ViT-SO400M-14-SigLIP2 timm/ViT-SO400M-14-SigLIP2-378 timm/ViT-SO400M-16-SigLIP2-256 timm/ViT-SO400M-16-SigLIP2-384 timm/ViT-SO400M-16-SigLIP2-512 timm/ViT-gopt-16-SigLIP2-256 timm/ViT-gopt-16-SigLIP2-384 tiny-random/gemma-3n unsloth/gemma-3n-E2B unsloth/gemma-3n-E2B-it unsloth/gemma-3n-E2B-it-unsloth-bnb-4bit unsloth/gemma-3n-E2B-unsloth-bnb-4bit unsloth/gemma-3n-E4B unsloth/gemma-3n-E4B-it unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit unsloth/gemma-3n-E4B-unsloth-bnb-4bit varshu23/gemma3-e1b-sliced-4bit yasserrmd/GemmaECG-Vision yujiepan/gemma-3n-tiny-random yujiepan/gemma-3n-tiny-random-dim4

Pattern #6 (148 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("\n"), content=" "), Replace(pattern=Regex(" {2,}"), content=" ")])
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
+pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
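
Note on the diff above: the "-" side handles newlines in its normalizer, with two `Replace` steps that turn each newline into a space and then collapse runs of spaces. A minimal sketch of just those two steps follows; `collapse_newlines_and_spaces` is a hypothetical helper name, and the sketch says nothing about what the `Precompiled` charsmap on the "+" side does.

```python
import re


def collapse_newlines_and_spaces(text: str) -> str:
    # Step 1, mirroring Replace(pattern=Regex("\n"), content=" ")
    text = text.replace("\n", " ")
    # Step 2, mirroring Replace(pattern=Regex(" {2,}"), content=" ")
    return re.sub(r" {2,}", " ", text)


print(repr(collapse_newlines_and_spaces("first line\n\nsecond  line")))
```

Since the "+" side has no newline `Replace`, inputs containing line breaks are probably the first thing to try when reproducing this mismatch.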
Affected models
Converter	Models
PegasusConverter	Aexeos/gbp-large-pubmed-ft BEE-spoke-data/pegasus-x-base-synthsumm_open-16k BritnyB/summarizer Deena123/pegasus-x-base Feluda/pegasus-samsum Haribaskar2594/bigbird-pegasus-med-v2 Haribaskar2594/google-pegasus-large Haribaskar2594/google-pegasus-med-full Joemgu/pegasus-x-sumstew Kevincp560/bigbird-pegasus-large-arxiv-finetuned-pubmed Kevincp560/bigbird-pegasus-large-bigpatent-finetuned-pubMed ManqingLiu/pegasus-samsum MarketingHHM/autotrain-sumituptestv4-60050134312 MikaSie/PegasusX_no_extraction_V1 MikaSie/RoBERTa_PegasusX_dependent_V1 NazzX1/pegasus-Finetuned-sum-full-note-v1 Nourrr/pegasus-x-base Nourrr/pegasus-x-base-OldVersion PergaZuZ/cdc_influenza_pagasus-x-large PoseyATX/Bronze_Buffalo_89 QuickRead/fine-tune-Pegasus RichardErkhov/pszemraj_-_bigbird-pegasus-large-K-booksum-4bits RichardErkhov/pszemraj_-_bigbird-pegasus-large-K-booksum-8bits Shaelois/MeetingScript UNIST-Eunchan/pegasus-x-booksum-chapter Venkatesh4342/pegasus-samsum acmc/summarizer_google_bigbird-pegasus-large-pubmed_base_faceted acmc/summarizer_google_bigbird-pegasus-large-pubmed_keybert_faceted acmc/summarizer_google_bigbird-pegasus-large-pubmed_mesh_faceted acmc/summarizer_google_bigbird-pegasus-large-pubmed_mesh_unfaceted acmc/summarizer_google_bigbird-pegasus-large-pubmed_most_frequent_faceted acmc/summarizer_google_bigbird-pegasus-large-pubmed_most_frequent_unfaceted acmc/summarizer_google_bigbird-pegasus-large-pubmed_tf_idf_unfaceted alex2awesome/pegasus-x-large alk/pegasus-scitldr alphahg/pegasus-x-base-finetuned-paper alvinwatner/pegasus-large-qg-squad alvinwatner/pegasus-large-qg-squad-alpha-interro alwaysaditi/pegasus_X_pacsum alwaysaditi/pegasus_hiporank_final aplnestrella/pegasus-x-arxiv-cord19 aplnestrella/pegasus-x-cord19 aplnestrella/pegasus-x-cord19-ENC_16-DEC_24-b_4-e_8-g_1 aplnestrella/pegasus-x-cord19-ENC_16-DEC_8-b_8-e_8-g_1 aplnestrella/pegasus-x-cord19-extended aruca/pegasus_x-meeting-summarizer aruca/pegasus_x-meeting-summarizer-gpt3.5 
aruca/pegasusx-AMI-text-summarizer bruehle/BigBirdPegasus_Chemtagger bruehle/BigBirdPegasus_Llama budhwant/big-bird-hindi-sumarization ccdv/lsg-pegasus-large-4096 chinhon/pegasus-multi_news-commentaries_hdwriter chinhon/pegasus-multi_news-headline csquin/pegasus-x-cord19-ENC_16-DEC_24-b_4-e_8-g_1_v2 deltayrn/pegasusBase farleyknight/patent-summarization-google-bigbird-pegasus-large-arxiv-2022-09-20 farleyknight/patent-summarization-pegasus-2022-09-16 gcesare/pegasus-x-base-finetuned-pubmed gigant/pegasusx_tib google/bigbird-pegasus-large-arxiv google/bigbird-pegasus-large-bigpatent google/bigbird-pegasus-large-pubmed google/pegasus-x-base google/pegasus-x-large grenmon/pegasus-x-large-finetuned-summarization haesun/pegasus-samsum hafez1412/pegasus-x-merged hf-internal-testing/tiny-random-BigBirdPegasusForCausalLM hf-internal-testing/tiny-random-BigBirdPegasusForConditionalGeneration hf-internal-testing/tiny-random-BigBirdPegasusForQuestionAnswering hf-internal-testing/tiny-random-BigBirdPegasusForSequenceClassification hf-internal-testing/tiny-random-BigBirdPegasusModel hf-internal-testing/tiny-random-PegasusForCausalLM hf-internal-testing/tiny-random-PegasusForConditionalGeneration hf-internal-testing/tiny-random-PegasusModel hf-internal-testing/tiny-random-PegasusXForConditionalGeneration hf-internal-testing/tiny-random-PegasusXModel hf-internal-testing/tiny-random-bigbird_pegasus hf-internal-testing/tiny-random-pegasus hf-tiny-model-private/tiny-random-BigBirdPegasusForCausalLM hf-tiny-model-private/tiny-random-BigBirdPegasusForConditionalGeneration hf-tiny-model-private/tiny-random-BigBirdPegasusForQuestionAnswering hf-tiny-model-private/tiny-random-BigBirdPegasusForSequenceClassification hf-tiny-model-private/tiny-random-BigBirdPegasusModel hf-tiny-model-private/tiny-random-PegasusForCausalLM hf-tiny-model-private/tiny-random-PegasusModel hf-tiny-model-private/tiny-random-PegasusXForConditionalGeneration hf-tiny-model-private/tiny-random-PegasusXModel 
himanimaheshwari3/my_h_billsum_model ireneli1024/bigbird-pegasus-large-pubmed-elife-finetuned ireneli1024/bigbird-pegasus-large-pubmed-plos-finetuned junyinc/LING-575-WI-SUM katarinajoanne/bigbird_fine_tuned kmfoda/output_dir li-lab/ascle-bigbird-pegasus-large-pubmed-elife-finetuned li-lab/ascle-bigbird-pegasus-large-pubmed-plos-finetuned luojingbao/pegasus_output mayu0007/pegasus_large_covid minjingzhu/bigbird-pegasus-large-pubmed-finetuned-legal minjingzhu/bigbird-pegasus-large-pubmed-finetuned-legal-2 natanmb/pegasus-x-base-finetuned-multi-news onnx-internal-testing/tiny-random-BigBirdPegasusForCausalLM-ONNX onnx-internal-testing/tiny-random-BigBirdPegasusForConditionalGeneration-ONNX onnx-internal-testing/tiny-random-BigBirdPegasusForQuestionAnswering-ONNX onnx-internal-testing/tiny-random-BigBirdPegasusForSequenceClassification-ONNX onnx-internal-testing/tiny-random-BigBirdPegasusModel-ONNX optimum-intel-internal-testing/tiny-random-bigbird_pegasus optimum-intel-internal-testing/tiny-random-pegasus priyankrathore/Pegasus-Lay-Final pszemraj/bigbird-pegasus-large-K-booksum pszemraj/bigbird-pegasus-large-K-booksum pszemraj/pegasus-large-book-summary pszemraj/pegasus-large-summary-explain pszemraj/pegasus-x-large-book-summary pszemraj/pegasus-x-large-book_synthsumm-bf16 quanganh22/pegasus-x-cui quanganh22/pegasus-x-epoch_1 quanganh22/pegasus-x-finetune-subtitleonly quanganh22/pegasus-x-finetuned-final quanganh22/pegasus-x-finetuned-final-v2 seonglae/resrer-pegasus-x sharmadhruv/my_awesome_qa_model sohamchougule/pegasus-large-finetuned-samsum sohamchougule/pegasus-x-base-finetuned-samsum starcatmeow/autotrain-cybersecurity-summarization-pegasus-x-book-43369110299 tanvi-junankar/summify-pegasus-x theojolliffe/bigbird-pegasus-large-arxiv-finetuned-roundup-280922 theojolliffe/distill-pegasus-cnn-16-4-finetuned-arxiv twigs/bigbird-pegasus-large twigs/bigbird-pegasus-large-4096-arxiv twigs/bigbird-pegasus-large-4096-govreport twigs/bigbird-pegasus-large-4096-pubmed 
twigs/pegasus-x-large-4096-arxiv twigs/pegasus-x-large-4096-govreport twigs/pegasus-x-large-4096-pubmed twigs/pegasus-x-large-8192-arxiv twigs/pegasus-x-large-8192-govreport twigs/pegasus-x-large-8192-pubmed ubaada/pegasus-x-large-booksum-16k vatsalinfodesk/pegasus-samsum vickt/LLM_Teached_PEGASUS_CNNDM wanyu/IteraTeR-PEGASUS-Revision-Generator zakerous/pegasus-x-large-finetuned-samsum1000 zakerous/pegasus-x-large-finetuned-samsum1000-1 zakerous/pegasus-x-large-finetuned-samsum5000 zedfum/arman-longformer-8k-finetuned-ensani zphang/pegasus-x-large

Pattern #7 (119 models)

-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
+pre_tokenizer:		Sequence(pretokenizers=[Digits(individual_digits=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)])
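
Note on the diff above: the extra `Digits(individual_digits=True)` stage on the "+" side isolates every digit before the byte-level step runs, so digits can no longer be grouped into multi-digit pieces. A rough pure-Python sketch of that first stage only; `split_individual_digits` is a hypothetical helper, and `str.isnumeric` is an approximation of the library's numeric-character test (the two agree on ASCII digits).

```python
from itertools import groupby


def split_individual_digits(text: str) -> list:
    """Approximate Digits(individual_digits=True): every numeric character
    becomes its own piece; runs of other characters stay together."""
    pieces = []
    for is_numeric, group in groupby(text, key=str.isnumeric):
        chars = list(group)
        if is_numeric:
            pieces.extend(chars)            # one piece per digit
        else:
            pieces.append("".join(chars))   # keep the non-digit run intact
    return pieces


print(split_individual_digits("call 911 now"))  # ['call ', '9', '1', '1', ' now']
```

Inputs without digits pass through this stage as a single piece, so any difference between the two backends for these models should show up only on text that contains numbers.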
Affected models
Converter	Models
GPT2Converter	DJLougen/granite4-tax Danielbrdz/CodeBarcenas-1b HuggingFaceH4/starchat-alpha HuggingFaceH4/starchat-beta InfosysEnterprise/NT-Java-1.1B JoydeepC/trueGL LoupGarou/WizardCoder-Guanaco-15B-V1.0 LoupGarou/WizardCoder-Guanaco-15B-V1.1 Multi-Domain-Expert-Learning/osiris_12b RaymondLi/sc2-3b-test RichardErkhov/allura-org_-_MoE-Girl-800MA-3BT-8bits RichardErkhov/ibm-granite_-_granite-3.0-1b-a400m-base-4bits RichardErkhov/ibm-granite_-_granite-3.0-1b-a400m-base-8bits RichardErkhov/ibm-granite_-_granite-3.0-3b-a800m-base-4bits RichardErkhov/ibm-granite_-_granite-3.0-3b-a800m-base-8bits RichardErkhov/ibm-research_-_PowerMoE-3b-8bits RichardErkhov/ibm_-_PowerMoE-3b-4bits RichardErkhov/ibm_-_PowerMoE-3b-8bits RichardErkhov/nuprl_-_MultiPL-T-StarCoderBase_1b-4bits TevunahAi/Granite-34B-Code-Instruct-8k-2048-Calibration-FP8 TevunahAi/granite-34b-code-instruct-8k-FP8 TheBloke/Octocoder-GPTQ TheBloke/WizardCoder-15B-1.0-GPTQ TheBloke/sqlcoder-GPTQ TheBloke/sqlcoder2-GPTQ TheBloke/starchat-beta-GPTQ TheBloke/starcoder-GPTQ V-YangXu/StarCoder-Alpaca WizardLMTeam/WizardCoder-15B-V1.0 Xenova/WizardCoder-1B-V1.0 Xenova/starcoderbase-1b Xenova/tiny_starcoder_py allura-org/MoE-Girl-800MA-3BT allura-org/MoE-Girl_400MA_1BT arjunguha/notstarcoder-1b aurora-m/aurora-m-biden-harris-redteamed bigcode/gpt_bigcode-santacoder bigcode/santacoderpack bigcode/starcoder bigcode/starcoder-co-format bigcode/starcoder-cxo bigcode/starcoder-cxso bigcode/starcoder-o bigcode/tiny_starcoder_py bugdaryan/WizardCoderSQL-15B-V1.0 cobrakenji/granite-20b-code-base-GGUF codeparrot/starcoder-self-instruct defog/sqlcoder defog/sqlcoder2 fals3/bigcode-starcoderbase-unit-test-fine-tuning hf-internal-testing/tiny-random-GraniteForCausalLM hyper-accel/tiny-random-gpt_bigcode ibm-granite/granite-20b-code-base-8k ibm-granite/granite-20b-code-base-8k ibm-granite/granite-20b-code-base-r1.1 ibm-granite/granite-20b-code-instruct-8k ibm-granite/granite-20b-code-instruct-8k ibm-granite/granite-20b-functioncalling 
ibm-granite/granite-3.0-1b-a400m-base ibm-granite/granite-3.0-2b-base ibm-granite/granite-3.0-3b-a800m-base ibm-granite/granite-3.0-8b-base ibm-granite/granite-3.1-1b-a400m-base ibm-granite/granite-3.1-2b-base ibm-granite/granite-3.1-3b-a800m-base ibm-granite/granite-3.1-8b-base ibm-granite/granite-3.3-2b-base ibm-granite/granite-3.3-8b-base ibm-granite/granite-34b-code-base-8k ibm-granite/granite-34b-code-instruct-8k ibm-granite/granite-3b-code-base-128k ibm-granite/granite-3b-code-base-2k ibm-granite/granite-3b-code-instruct-128k ibm-granite/granite-3b-code-instruct-2k ibm-granite/granite-4.0-tiny-preview ibm-granite/granite-8b-code-base-4k ibm-granite/granite-8b-code-instruct-128k ibm-granite/granite-8b-code-instruct-4k ibm-research/PowerLM-3b ibm-research/PowerMoE-3b ibm-research/moe-7b-1b-active-shared-experts iterateai/Interplay-AppCoder jinaai/starcoder-1b-textbook kollecter/granite-3.1-1b-a400m-base kollecter/granite-3.1-3b-a800m-base mdouglas/granite-3.1-3b-a800m-base-bnb-4bit michaelfeil/ct2fast-starcoder mkdir700/v2-starcoderbase1b-personal-copilot-A100-40GB-colab mlx-community/granite-20b-code-instruct-4bit mlx-community/granite-20b-code-instruct-8bit mlx-community/granite-34b-code-base-4bit mlx-community/granite-34b-code-base-8bit mlx-community/granite-34b-code-instruct-8bit mrm8488/santacoder-finetuned-the-stack-bash-shell mrm8488/santacoder-finetuned-the-stack-clojure muhtasham/santacoder-finetuned-the-stack-assembly muhtasham/santacoder-finetuned-the-stack-cobol nlpguy/granite-3.0-1b-a400m-base nlpguy/granite-3.0-3b-a800m-base nuprl/MultiPL-T-StarCoderBase_15b nuprl/MultiPL-T-StarCoderBase_1b openaccess-ai-collective/minotaur-15b patrickbdevaney/WizardLM-1b-GGUF rahuldshetty/tiny-starcoder-instruct refactai/starcoderbase-1b richardr1126/spider-natsql-wizard-coder-merged richardr1126/spider-skeleton-wizard-coder-merged seeklhy/codes-15b seeklhy/codes-15b-bird seeklhy/codes-1b seeklhy/codes-3b seeklhy/codes-3b-bird-with-evidence seeklhy/codes-7b 
seeklhy/codes-7b-merged sky-2002/tiny-starcoder-ft tdoehmen/starcoder-schemapile-fk umm-maybe/StarCoder-1B-R2 umm-maybe/StarCoder-1B-StackStar yujiepan/starcoder-tiny-random

Pattern #8 (104 models)

-normalizer:		None
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=first, split=False)
+normalizer:		Sequence(normalizers=[Prepend(prepend="▁"), Replace(pattern=String(" "), content="▁")])
+pre_tokenizer:		None
Affected models
Converter: LlamaConverter
Models: BEE-spoke-data/beecoder-220M-python Bharanidharan07/idefics_2_finetuned_copy GeorgeBredis/ruIdefics2-ruLLaVA-merged Guilherme34/Samantha-multimodal-v2-model HuggingFaceM4/Sightseer HuggingFaceM4/idefics2 HuggingFaceM4/idefics2-8b HuggingFaceM4/idefics2-8b-AWQ HuggingFaceM4/idefics2-8b-base HuggingFaceM4/idefics2-8b-base-AWQ HuggingFaceM4/idefics2-8b-chatty HuggingFaceM4/idefics2-8b-chatty-AWQ HuggingFaceM4/idefics2-tfrm-compatible HuggingFaceM4/idefics2_raven_finetuned HuggingFaceM4/tr_272_bis_opt_step_15000_merge Mantis-VL/idefics2-8b-video-eval-refined-40k_4096_generation Mantis-VL/idefics2-8b-video-eval-refined-40k_4096_regression Mantis-VL/mantis-8b-idefics2-classification-example_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-20k-mantis-2epoch_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-20k_2048 Mantis-VL/mantis-8b-idefics2-video-eval-40k-2epoch_4096_generation Mantis-VL/mantis-8b-idefics2-video-eval-40k-mantis-2epoch_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-50k-2epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-50k-mantis-2epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-50k-mantis_4096 Mantis-VL/mantis-8b-idefics2-video-eval-50k_4096 Mantis-VL/mantis-8b-idefics2-video-eval-95k-2epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-95k-batch32_4096 Mantis-VL/mantis-8b-idefics2-video-eval-95k-mantis-2epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-95k-mantis_4096 Mantis-VL/mantis-8b-idefics2-video-eval-95k_4096 Mantis-VL/mantis-8b-idefics2-video-eval-anno-real_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-debug_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-high-res-20k-mantis-3epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-high-res-35k-mantis-2epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-high-res-40k-mantis-2epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-refined-40k-ablation-anno_4096_generation 
Mantis-VL/mantis-8b-idefics2-video-eval-refined-40k-ablation-anno_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-refined-40k-sora_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-refined-40k_4096_generation Mantis-VL/mantis-8b-idefics2-video-eval-refined-40k_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval_5184_regression Mantis-VL/mantis-8b-idefics2-video-eval_6144_regression Mantis-VL/mantis-8b-idefics2_8192 Nanbeige/ToolMind-Web-3B OpenGVLab/Mini-InternVL2-4B-DA-DriveLM OpenWebVoyager/OpenWebVoyager-opt-1 Pavithra2910/09thmay Pavithra2910/finetuningidefics Reverb/Idefics2-8b-docVQA-finetuned SalmanFaroz/idefics2-8b-DocVQA-SP Shashank91097/Idefic Shashank91097/Idefic_medical_VQA_merged11 StevenHH2000/Finedefics Syed-Hasan-8503/Idefics2-8B-SFT TD788432/IDEFICS-n.2-FT-DocVQA TIGER-Lab/Mantis-8B-Idefics2 TIGER-Lab/VISTA-Mantis TIGER-Lab/VideoScore TIGER-Lab/VideoScore-v1.1 Trelis/idefics2-8b-chatty-bf16 andrew-together/idefics2-8b-finetune-combined-50k_8192 edbeeching/vsft-idefics2 enghamdiali/idefics-9b-merge enghamdiali/idefics-9b-qt_f enghamdiali/idfc-m1 francepfl/DriveLM-mantis-8b-idefics2_8192-cot francepfl/mantis-8b-idefics2_exp10_italian_8192 francepfl/mantis-8b-idefics2_exp_italian_8192 giobin/idefics2_random_connector_v2 huz-relay/idefics2-8b-ocr instructlab/granite-7b-lab jancuhel/idefics2-8b-img-text-relevancy jihadzakki/idefics2-8b-medvqa jihadzakki/idefics2-8b-roco-slake jihadzakki/idefics2-8b-vqarad-delta lamm-mit/Cephalo-Idefics-2-vision-10b-alpha lamm-mit/Cephalo-Idefics-2-vision-10b-beta lamm-mit/Cephalo-Idefics-2-vision-12b-alpha lamm-mit/Cephalo-Idefics-2-vision-8b-alpha lamm-mit/Cephalo-Idefics-2-vision-8b-beta matbee/idefics2-weblinx-20500 mlx-community/idefics2-8b-4bit mlx-community/idefics2-8b-8bit mlx-community/idefics2-8b-chatty-4bit mlx-community/idefics2-8b-chatty-8bit mqliu/mantis-8b-idefics2_1024 pallavibiswas/idefics2-finetuned-re-id perceptorLLM/idefics2-8b-4bit-bf16 perceptorLLM/idefics2-8b-4bit-fp16 
qgallouedec/tiny-Idefics2ForConditionalGeneration smishr-18/Idefics-PokeCards smishr-18/Idefics2-PokemonCards tctrautman/20240709-kibbe-training-gen-1x-merged tiiuae/viscon-contextual-captioner trl-internal-testing/tiny-Idefics2ForConditionalGeneration vctmk/mantis-8b-idefics2-classification-example_2048_regression vctmk/mantis-8b-idefics2-classification-tedED_4096_regression vctmk/mantis-8b-idefics2-classification-tedEDself_8g_4096_regression vctmk/mantis-8b-idefics2-classification-tedEDself_v2_3_8g_4096_regression worldboss/idefics-9b-doodles-v1 wyu1/Leopard-Idefics2 zesquirrelnator/idefics2-8b-docvqa-finetuned-tutorial zixianma/mma_idefics2_293k-toolp-seq_length_8192-lr_1e-5

Pattern #9 (98 models)

-pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("<\|startoftext\|>|<\|endoftext\|>|'s|'t|'re|'ve|'m|'ll|'d|[\p{L}]+|[\p{N}]|[^\s\p{L}\p{N}]+"), behavior=Removed, invert=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)])
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("'s|'t|'re|'ve|'m|'ll|'d|[\p{L}]+|[\p{N}]|[^\s\p{L}\p{N}]+"), behavior=Removed, invert=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)])
Affected models
Converter: CLIPConverter
Models: BAAI/BGE-VL-large Marqo/marqo-fashionCLIP RaviKush/clipseg_finetuned_dice_bce RaviKush/clipseg_focal_loss_v0 RaviKush/clipseg_focal_loss_v1 Xenova/clip-vit-base-patch16 Xenova/clipseg-rd16 Xenova/clipseg-rd64 Xenova/clipseg-rd64-refined Xenova/owlv2-base-patch16 Xenova/owlv2-base-patch16-ensemble Xenova/owlv2-base-patch16-finetuned Xenova/owlvit-base-patch16 Xenova/owlvit-base-patch32 Xenova/owlvit-large-patch14 apple/DFN2B-CLIP-ViT-L-14 apple/DFN5B-CLIP-ViT-H-14 apple/DFN5B-CLIP-ViT-H-14-378 apple/MobileCLIP-S1-OpenCLIP apple/MobileCLIP-S2-OpenCLIP apple/aimv2-large-patch14-224-lit beyazitkelceoglu/owlv2-large-patch14-ONNX codyliu20032003/oneformer-parkseg12k-test dokutoshi/owlvit-base-patch32_FT_cppe5 gj5520/KoalaSeg hf-internal-testing/tiny-random-CLIPSegModel hf-internal-testing/tiny-random-OneFormerForUniversalSegmentation hf-internal-testing/tiny-random-OneFormerModel hf-internal-testing/tiny-random-OwlViTForObjectDetection hf-internal-testing/tiny-random-OwlViTModel hf-internal-testing/tiny-random-Owlv2ForObjectDetection hf-internal-testing/tiny-random-Owlv2Model hf-internal-testing/tiny-random-owlvit hf-internal-testing/tiny-random-owlvit-object-detection hf-tiny-model-private/tiny-random-CLIPSegModel hf-tiny-model-private/tiny-random-OneFormerForUniversalSegmentation hf-tiny-model-private/tiny-random-OneFormerModel hf-tiny-model-private/tiny-random-OwlViTForObjectDetection hf-tiny-model-private/tiny-random-OwlViTModel hiendang7613/oneformer_190725_swinT imageomics/bioclip imageomics/bioclip-2 laion/CLIP-ViT-L-14-CommonPool.XL-s13B-b90K laion/CLIP-ViT-L-14-DataComp.XL-s13B-b90K laion/CLIP-ViT-g-14-laion2B-s34B-b88K laion/CLIP-convnext_base_w-laion2B-s13B-b82K laion/CLIP-convnext_base_w-laion2B-s13B-b82K-augreg laion/CLIP-convnext_base_w_320-laion_aesthetic-s13B-b82K-augreg laion/CLIP-convnext_large_d_320.laion2B-s29B-b131K-ft-soup laion/CLIP-convnext_xxlarge-laion2B-s34B-b82K-augreg-soup mayank0621/owlvit-base-patch32_FT_cppe5 
mieszkok/oneformer_ade20k_swin_large_geopose3k_original_900_E5 mieszkok/shi-labs_oneformer_ade20k_swin_large_geopose3k_original_images900_epochs5 onnx-community/owlv2-base-patch16-ONNX onnx-community/owlv2-base-patch16-ensemble-ONNX onnx-community/owlv2-base-patch16-finetuned-ONNX onnx-community/owlv2-large-patch14-ensemble-ONNX onnx-community/owlv2-large-patch14-finetuned-ONNX onnx-community/owlvit-base-patch32-ONNX onnx-internal-testing/tiny-random-OwlViTForObjectDetection-ONNX onnx-internal-testing/tiny-random-OwlViTModel-ONNX onnx-internal-testing/tiny-random-Owlv2ForObjectDetection-ONNX onnx-internal-testing/tiny-random-Owlv2Model-ONNX openai/clip-vit-base-patch16 openai/clip-vit-large-patch14 pooya-mohammadi/oneformer_ade20k_swin_tiny_clothes rathi2023/owlvit-base-patch32 rathi2023/owlvit-base-patch32_FT_cppe5 redlessone/DermLIP_ViT-B-16 suinleelab/monet timm/MobileCLIP2-S0-OpenCLIP timm/MobileCLIP2-S3-OpenCLIP timm/PE-Core-B-16 timm/PE-Core-L-14-336 timm/PE-Core-bigG-14-448 timm/eva02_base_patch16_clip_224.merged2b_s8b_b131k timm/eva02_enormous_patch14_plus_clip_224.laion2b_s9b_b144k timm/eva02_large_patch14_clip_224.merged2b_s4b_b131k timm/eva02_large_patch14_clip_336.merged2b_s6b_b61k timm/eva_giant_patch14_plus_clip_224.merged2b_s11b_b114k timm/resnet101_clip.openai timm/resnet50_clip.openai timm/vit_base_patch16_clip_224.laion400m_e32 timm/vit_base_patch16_plus_clip_240.laion400m_e31 timm/vit_base_patch32_clip_224.laion2b_e16 timm/vit_base_patch32_clip_224.laion400m_e31 timm/vit_base_patch32_clip_224.laion400m_e32 timm/vit_huge_patch14_clip_224.metaclip_2pt5b timm/vit_large_patch14_clip_224.laion400m_e32 timm/vit_large_patch14_clip_224.metaclip_2pt5b timm/vit_large_patch14_clip_336.openai wisdomik/QuiltNet-B-16 wisdomik/QuiltNet-B-32 woweenie/open-clip-vit-h-nsfw-finetune zer0int/CLIP-GmP-ViT-L-14 zer0int/LongCLIP-GmP-ViT-L-14 zer0int/LongCLIP-L-Diffusers zerosandones/owlv2-large-patch14-ensemble-ONNX

Pattern #10 (82 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Replace(pattern=Regex(" {2,}"), content=" ")])
Affected models
Converter: T5Converter
Models: Dmjdxb/deplot Joemgu/mlong-t5-base-sumstew Joemgu/mlong-t5-large-sumstew KennethTM/pix2struct-base-table2html Shubham-Awasthi/pix2struct_infovqa TeeA/DEPLOT-ViChart TeeA/ViMATCHA TomasFAV/Pix2StructCzechInvoice TomasFAV/Pix2StructCzechInvoiceLarge Xenova/deplot Xenova/pix2struct-ai2d-base Xenova/pix2struct-chartqa-base Xenova/pix2struct-docvqa-base Xenova/pix2struct-infographics-vqa-base Xenova/pix2struct-screen2words-base Xenova/pix2struct-tiny-random Xenova/pix2struct-widget-captioning-base am-infoweb/pix2struct-7.3K-model_12_08-new aravind-selvam/deplot_v0 aravind-selvam/pix2struct_chart bollscoasts/pix2act-onnx brainventures/deplot_kr darksensei/pix2struct-cord fxmarty/pix2struct-tiny-random gitlost-murali/pix2struct-refexp-base gitlost-murali/pix2struct-refexp-large giulioderasmo/Pix2struct-sroie-10k google/deplot google/matcha-base google/matcha-chart2text-pew google/matcha-chart2text-statista google/matcha-chartqa google/matcha-plotqa-v1 google/pix2struct-ai2d-base google/pix2struct-ai2d-large google/pix2struct-base google/pix2struct-chartqa-base google/pix2struct-docvqa-base google/pix2struct-docvqa-large google/pix2struct-infographics-vqa-base google/pix2struct-infographics-vqa-large google/pix2struct-large google/pix2struct-ocrvqa-base google/pix2struct-ocrvqa-large google/pix2struct-screen2words-base google/pix2struct-screen2words-large google/pix2struct-textcaps-base google/pix2struct-textcaps-large google/pix2struct-widget-captioning-base google/pix2struct-widget-captioning-large habibi26/ocr_struk hoangphu7122002ai/pix2struct_v0 juanivazquez/id_card-pix2struct-model-v3 optimum-intel-internal-testing/pix2struct-tiny-random oroikon/ft_pix2struct_chart_captioning paturi1710/pix2Struct-base-table-parsing-json-v2.0 paturi1710/pix2Struct-base-table-parsing-v1.0 pierretokns/pix2act-weblinx-base-onnx pierretokns/pix2act-weblinx-large-onnx prajwalJumde/pix2struct-test-model_08_08-new santiagoperezs/comunicacion-aviso-pix2struct-cord 
ssh1419/deplot-batch-1-new-loss-only-token ssh1419/deplot-batch-3-token-freeze-curri ssh1419/indi-deplot ssh1419/indi-deplot-1-freeze ssh1419/indi-deplot-200 ssh1419/indi-deplot-3-final ssh1419/indi-deplot-batch-16 ssh1419/indi-deplot-freeze-norm ssh1419/indi-deplot-lr-half-half ssh1419/test-deplot-1 sujr/pix2struct-base teamapocalypseml/regben2ipa-umt5base to-be/Pix2StructGhega turgutguvercin/pix2struct-turkish-receipts warshakhan/pix2struct-base-docvqa-change warshakhan/pix2struct-base-docvqa-public xcodemind/UICoder xcodemind/uicopilot_structure xcodemind/webcoder ybelkada/pix2struct-base-football zirui3/pix2struct-cord-v2
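
To make the Pattern #10 diff concrete, the sketch below emulates both normalizer chains with Python's standard `re` module instead of the `tokenizers` library. The function names are hypothetical, and the emulation assumes that Strip(strip_right=True) behaves like `str.rstrip()` and that Replace substitutes every match of its pattern. It is an illustration of how the two sides can diverge on the same input, not the converters' actual code.

```python
import re


def normalize_minus(text: str) -> str:
    """Emulate the '-' side: strip trailing whitespace, then turn each
    run of two or more spaces into the metaspace character."""
    return re.sub(r" {2,}", "\u2581", text.rstrip())


def normalize_plus(text: str) -> str:
    """Emulate the '+' side: collapse each run of two or more spaces
    into a single space, with no stripping."""
    return re.sub(r" {2,}", " ", text)


sample = "a  b "
print(repr(normalize_minus(sample)))
print(repr(normalize_plus(sample)))
```

Under these assumptions, an input with a double space or a trailing space reaches the model in a different form on each side, so the resulting token IDs may differ.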

Pattern #11 (57 models)

-normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=False)
+normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=True)
Affected models
Converter: BertConverter
Models: AlIshaq/DPR-question_encoder-faq-pesantren DataHammer/scidpr-question-encoder Mjollnir1996/dpr-question_encoder-bert-base-multilingual_mod NAACL2022/spider NAACL2022/spider-nq-question-encoder NAACL2022/spider-trivia-ctx-encoder NAACL2022/spider-trivia-question-encoder PrimeQA/XOR-TyDi_monolingual_DPR_qry_encoder aubmindlab/araelectra-base-discriminator castorini/ance-dpr-context-multi castorini/ance-dpr-question-multi castorini/bpr-nq-ctx-encoder castorini/bpr-nq-question-encoder datasetsANDmodels/image2text deepset/bert-small-mm_retrieval-passage_encoder deepset/bert-small-mm_retrieval-question_encoder deepset/bert-small-mm_retrieval-table_encoder dsksd/dpr-ctx_encoder-single-qrecc-model-base facebook/dpr-ctx_encoder-multiset-base facebook/dpr-ctx_encoder-single-nq-base facebook/dpr-question_encoder-multiset-base facebook/dpr-question_encoder-single-nq-base facebook/dpr-reader-multiset-base facebook/dpr-reader-single-nq-base firqaaa/indo-dpr-question_encoder-single-squad-base google/mobilebert-uncased hf-internal-testing/tiny-random-DPRQuestionEncoder hf-internal-testing/tiny-random-dpr hf-tiny-model-private/tiny-random-DPRQuestionEncoder hfl/chinese-electra-180g-base-discriminator hfl/chinese-electra-180g-large-discriminator hfl/chinese-electra-180g-small-discriminator hfl/chinese-electra-180g-small-ex-discriminator lmz/candle-blip norwoodsystems/image-caption seduerr/paiintent soheeyang/dpr-ctx_encoder-single-trivia-base soheeyang/dpr-question_encoder-single-trivia-base soheeyang/rdr-ctx_encoder-single-nq-base soheeyang/rdr-ctx_encoder-single-trivia-base soheeyang/rdr-question_encoder-single-nq-base soheeyang/rdr-question_encoder-single-trivia-base squeezebert/squeezebert-mnli squeezebert/squeezebert-mnli-headless squeezebert/squeezebert-uncased tau/spider tau/spider-nq-ctx-encoder tau/spider-nq-question-encoder tau/spider-trivia-ctx-encoder tau/spider-trivia-question-encoder typeform/squeezebert-mnli vblagoje/dpr-ctx_encoder-single-lfqa-base 
vblagoje/dpr-ctx_encoder-single-lfqa-wiki vblagoje/dpr-question_encoder-single-lfqa-base vblagoje/dpr-question_encoder-single-lfqa-wiki zhiweitong/dpr-answer_encoder-single-nq-base zhiweitong/dpr-ctx_encoder-single-nq-base
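
The Pattern #11 diff changes a single flag, but the effect on text can be larger than lowercasing alone. The sketch below uses only the Python standard library; the function name is hypothetical. It assumes, following the documented behaviour of BertNormalizer, that strip_accents=None defers to the value of lowercase, and it leaves out the clean_text and handle_chinese_chars steps, which are identical on both sides of the diff.

```python
import unicodedata


def bert_like_normalize(text: str, lowercase: bool) -> str:
    """Emulate the lowercase / strip_accents=None part of BertNormalizer.

    Assumption: with strip_accents unset, accents are stripped exactly
    when lowercase is True.
    """
    if not lowercase:
        return text
    # Decompose characters, then drop non-spacing combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(
        ch for ch in decomposed if unicodedata.category(ch) != "Mn"
    )
    return stripped.lower()


sample = "H\u00e9llo World"
print(bert_like_normalize(sample, lowercase=False))  # '-' side of the diff
print(bert_like_normalize(sample, lowercase=True))   # '+' side of the diff
```

If that assumption holds, cased or accented input would be mapped to different vocabulary entries on the two sides.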

Pattern #12 (53 models)

-normalizer:		None
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=False)
+normalizer:		Sequence(normalizers=[Prepend(prepend="▁"), Replace(pattern=String(" "), content="▁")])
+pre_tokenizer:		None
Affected models
Converter: LlamaConverter
Models: 1093212290a/idefics-9b-doodles A2Amir/SF_A68_IDEFICS_9B_IDL_SFT Abhaykoul/idefics-9b-doodles Alvi12/idefics-9b-doodles Aman8252/idefics-9b-doodles ArthurFischel/custom-tiny-random-idefics ArthurFischel/tiny-random-idefics-smw_10k-300steps HuggingFaceM4/idefics-80b HuggingFaceM4/idefics-80b-instruct HuggingFaceM4/idefics-9b HuggingFaceM4/idefics-9b-instruct HuggingFaceM4/tiny-random-idefics KadirErturk/image_info OpenGVLab/InternVL2-40B-AWQ Salmamoori/idefics-9b-doodles a8nova/tiny-random-idefics areegtarek/idefics-9b-all areegtarek/idefics-9b-doodles areegtarek/idefics-9b-instruct-3batchesoneepoch areegtarek/idefics-9b-instruct-3batchesoneepoch-1-2 areegtarek/idefics-9b-instruct-3batchesoneepoch-1-2-3 areegtarek/idefics-9b-instruct-3batchesoneepoch-1-2-3-abnormal2epochsfreeze areegtarek/idefics-9b-instruct-abnormal3epochs areegtarek/idefics-9b-instruct-all areegtarek/idefics-9b-instruct-all-v2 areegtarek/idefics-9b-instruct-all-v3 areegtarek/idefics-9b-instruct-stage-1 areegtarek/idefics-9b-instruct-stage-1-stage-2 areegtarek/idefics-9b-instruct-stage-1-stage-2-stage-3 areegtarek/idefics-9b-instruct-threesplitsthreeepochs-1 areegtarek/idefics-9b-instruct-threesplitsthreeepochs-1-2 areegtarek/idefics-9b-instruct-threesplitsthreeepochs-1-2-3 areegtarek/idefics-9b-randomsampleNIH areegtarek/idefics-9b-split1-v1 areegtarek/idefics-9b-split1-v1-split1.2-v1 areegtarek/idefics-9b-stage1-v1 areegtarek/idefics-9b-stage1-v1-stage2-v1 areegtarek/idefics-9b-stage1-v1-stage2-v1-stage3-v1 areegtarek/idefics-9b-threebatchestenepochs dawoz/IDEFICS-frozenlake enghamdiali/idefics-9b-fn gauthamk28/idefics-9b-doodles jacky892/idefics-9b-doodles justinkarlin/idefics-9b-faces machinev/idefics-9b-LPU_model mattzhang/idefics-9b-doodles mervinpraison/idefics-9b-doodles mervinpraison/idefics-9b-pokemon-blip mychen76/idefics-9b-doodles ntust0/idefics-9b-bayc samim2024/Image-Text-To-Text turing-motors/Heron-Idefics2-8B-v0.1 worldboss/idefics-9b-doodles

Pattern #13 (40 models)

-post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 1), trim_offsets=True, add_prefix_space=True)
+post_processor:		TemplateProcessing(single=[Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[Sequence(id=A, type_id=0), Sequence(id=B, type_id=1)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"])})
Affected models
Converter: BlenderbotConverter
Models: Adapting/dialogue_agent_nlplab2022 BOUDABOUS/ai-univ-chatbot BOUDABOUS/fine-tuned-chatbot Bbrown44/aas_nlp_v1 Danieldor/Baldor-Assist DriveMyScream/Blenderbot_ChatBot Grendar/blenderbot-400M-distill-Shiro Megareyka/blenderbot-400M-FineTuned Ruthu1/skincare Saima0/mental-health-chatbot Xenova/blenderbot-400M-distill abhijitgayen/cogo-blenderbot-slow azkamannan2004/MindEase-CD-CPU azkamannan2004/MindEase-CD-fp16 breadlicker45/autotrain-blender-50601120822 facebook/blenderbot-1B-distill facebook/blenderbot-3B facebook/blenderbot-400M-distill hf-internal-testing/tiny-random-BlenderbotForCausalLM hf-internal-testing/tiny-random-BlenderbotForConditionalGeneration hf-internal-testing/tiny-random-BlenderbotModel hf-tiny-model-private/tiny-random-BlenderbotForCausalLM hf-tiny-model-private/tiny-random-BlenderbotForConditionalGeneration hf-tiny-model-private/tiny-random-BlenderbotModel jonggul2/finetuning lyubomirr/GAIA onnx-internal-testing/tiny-random-BlenderbotForConditionalGeneration-ONNX onnx-internal-testing/tiny-random-BlenderbotModel-ONNX optimum-intel-internal-testing/tiny-random-BlenderbotModel scriptkidd196883/ytp-engage-model-advanced scriptkidd196883/ytp-engage-model-beginner scriptkidd196883/ytp-engage-model-intermediate sir-evil/my-chat-model stanleychu2/system_400M stanleychu2/user_400M tgoktug/my_awesome_blendersum_model tgoktug/my_awesome_meeting_blendersum_model theastro/starkbot venkatavivekanandareddy/my-blenderbot-transformer-model vpadaraju/newest

Pattern #14 (31 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=False), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Replace(pattern=Regex(" {2,}"), content=" ")])
Affected models
Converter: BigBirdConverter
Models: GBaker/clinical-bigbird-medqa-usmle-nocontext LucasS/bigbirdABSA Mahmoud8/bigbird-roberta-base Shaier/bigbird-roberta-base ShengdingHu/sst2 alex2awesome/quote-attribution__qa-model-v2 alex2awesome/quote-attribution__qa-model-v3 hf-internal-testing/tiny-random-BigBirdForCausalLM hf-internal-testing/tiny-random-BigBirdForMaskedLM hf-internal-testing/tiny-random-BigBirdForQuestionAnswering hf-internal-testing/tiny-random-BigBirdForSequenceClassification hf-internal-testing/tiny-random-BigBirdForTokenClassification hf-internal-testing/tiny-random-BigBirdModel hf-internal-testing/tiny-random-big_bird hf-tiny-model-private/tiny-random-BigBirdForCausalLM hf-tiny-model-private/tiny-random-BigBirdForMultipleChoice hf-tiny-model-private/tiny-random-BigBirdForPreTraining hf-tiny-model-private/tiny-random-BigBirdForQuestionAnswering hf-tiny-model-private/tiny-random-BigBirdForSequenceClassification hf-tiny-model-private/tiny-random-BigBirdForTokenClassification hf-tiny-model-private/tiny-random-BigBirdModel ilos-vigil/bigbird-small-indonesian ilos-vigil/bigbird-small-indonesian-nli nsi319/bigbird-roberta-base-finetuned-app pepa/bigbird-roberta-base-fever pepa/bigbird-roberta-base-snli pepa/bigbird-roberta-large-fever pepa/bigbird-roberta-large-snli rubentito/bigbird-base-itc-mpdocvqa tgood/bigbird-roberta-base yikuan8/Clinical-BigBird

Pattern #15 (29 models)

-normalizer:		Sequence(normalizers=[NFD(), Lowercase(), StripAccents()])
+normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=True)
Affected models
Converter: OpenAIGPTConverter
Models: 4stack/gpt-finetuned CaoTrungHieu/GPT_Entailment Dave12121/chatFsentiment LRSR/gpt-finetuned MichaelHu03/CS6220-GPT akiraaqira/career-full anezatra/gpt1-openassistant-117M-instruct dqwdqweqw/gpt-finetuned folklore1000/lyric001s512 goktug14/gpt1_sst2_left goktug14/gpt1_sst2_right hf-internal-testing/tiny-random-OpenAIGPTForSequenceClassification hf-internal-testing/tiny-random-OpenAIGPTLMHeadModel hf-internal-testing/tiny-random-OpenAIGPTModel hf-tiny-model-private/tiny-random-OpenAIGPTForSequenceClassification hf-tiny-model-private/tiny-random-OpenAIGPTLMHeadModel hf-tiny-model-private/tiny-random-OpenAIGPTModel hojjatkarami/ehr_gpt2 jeonghyeon97/gpt-finetuned karanzrk/essayl0 lgaalves/gpt1 model-attribution-challenge/openai-gpt obov/gpt-finetuned openai-community/openai-gpt soonbob/gpt-finetuned tmnam20/test_pretrain_pipeline vietnhatthai/test_pretrain_gpt_pipeline vietnhatthai/test_pretrain_pipeline vietnhatthai/viet_news_pretrain_pipeline

Pattern #16 (23 models)

-post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=0), ...], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[1], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[1], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
Affected models
Converter: DebertaConverter
Models: AnkitAI/deberta-xlarge-base-emotions-classifier Denyol/FakeNews-deberta-large KoalaAI/OffensiveSpeechDetector KoalaAI/Text-Moderation PORTULAN/albertina-100m-portuguese-ptbr-encoder PORTULAN/albertina-100m-portuguese-ptpt-encoder PleIAs/KaribuAI djagatiya/ner-deberta-base-ontonotesv5-englishv4 garrettbaber/twitter-roberta-base-joy-intensity h2oai/deberta_finetuned_pii hf-internal-testing/tiny-random-DebertaForMaskedLM hf-internal-testing/tiny-random-DebertaForQuestionAnswering hf-internal-testing/tiny-random-DebertaForSequenceClassification hf-internal-testing/tiny-random-DebertaForTokenClassification hf-internal-testing/tiny-random-DebertaModel jammmmmm/pii lakshyakh93/deberta_finetuned_pii matejmicek/autotrain-crender2.0-39012102367 protectai/lakshyakh93-deberta_finetuned_pii-onnx raj-tomar001/LLM-DetectAIve_deberta-base s-nlp/deberta-large-formality-ranker sagawa/PubChem-10m-deberta sagawa/ZINC-deberta

Pattern #17 (17 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=String(" {2,}"), content="▁")])
-pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
+pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), ...], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
Affected models
Converter: XLMRobertaConverter
Models: VerboVision/MetaCLIP2-Distil-60-PCV facebook/metaclip-2-worldwide-b16 facebook/metaclip-2-worldwide-b16-384 facebook/metaclip-2-worldwide-b32 facebook/metaclip-2-worldwide-b32-384 facebook/metaclip-2-worldwide-giant facebook/metaclip-2-worldwide-giant-378 facebook/metaclip-2-worldwide-huge-378 facebook/metaclip-2-worldwide-huge-quickgelu facebook/metaclip-2-worldwide-huge-quickgelu facebook/metaclip-2-worldwide-l14 facebook/metaclip-2-worldwide-l14 facebook/metaclip-2-worldwide-m16 facebook/metaclip-2-worldwide-m16-384 facebook/metaclip-2-worldwide-s16 facebook/metaclip-2-worldwide-s16-384 onnx-community/metaclip-2-worldwide-huge-378-ONNX

Pattern #18 (15 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("\s{2,}|[\n\r\t]"), content=" "), NFC(), Strip(strip_left=False, strip_right=True)])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
Affected models
Converter: ReformerConverter
Models: Nick1899/reformer-biological-papers-finetuned Nick1899/reformer-biological-papers-finetuned1 google/reformer-crime-and-punishment hf-internal-testing/tiny-random-ReformerForMaskedLM hf-internal-testing/tiny-random-ReformerForQuestionAnswering hf-internal-testing/tiny-random-ReformerForSequenceClassification hf-internal-testing/tiny-random-ReformerModel hf-internal-testing/tiny-random-reformer hf-tiny-model-private/tiny-random-ReformerForMaskedLM hf-tiny-model-private/tiny-random-ReformerForQuestionAnswering hf-tiny-model-private/tiny-random-ReformerForSequenceClassification hf-tiny-model-private/tiny-random-ReformerModel mwesner/reformer-clm nadellaroshni/reformer_model robingeibel/reformer-finetuned-big_patent-16384

Pattern #19 (14 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Replace(pattern=Regex(" {2,}"), content=" ")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="eng_Latn", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="eng_Latn", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "eng_Latn":SpecialToken(id="eng_Latn", ids=[256047], tokens=["eng_Latn"])})
+post_processor:		TemplateProcessing(single=[Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="<unk>", type_id=0)], pair=[Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="<unk>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<unk>":SpecialToken(id="<unk>", ids=[3], tokens=["<unk>"])})
Affected models
Converter: NllbConverter
Models: AfriNLP/AfriNLLB-12enc-12dec-full-ft-kd AfriNLP/AfriNLLB-8enc-8dec-iterative-498m-ft JustFrederik/nllb-200-3.3B-ct2-float16 JustFrederik/nllb-200-distilled-1.3B-ct2-float16 JustFrederik/nllb-200-distilled-1.3B-ct2-int8 JustFrederik/nllb-200-distilled-600M-ct2 JustFrederik/nllb-200-distilled-600M-ct2-float16 JustFrederik/nllb-200-distilled-600M-ct2-int8 KomorebiAI/nllb-200-3.3B-float16-ct2 KomorebiAI/nllb-200-3.3B-int8-ct2 entai2965/nllb-200-3.3B-ctranslate2 entai2965/nllb-200-3.3B-ctranslate2-float16 entai2965/nllb-200-distilled-1.3B-ctranslate2 entai2965/nllb-200-distilled-600M-ctranslate2

Pattern #20 (12 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=String(" {2,}"), content="▁")])
-pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
+pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), ...], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
Affected models
Converter: XLMRobertaConverter
Models: Coder-Dragon/kosmos-finetuned-DocLayNet DhananjayNahata24/Kosmos-2-DJ1 Mit1208/Kosmos-2-PokemonCards-trl-merged MoonstoneF/kosm-checkpoint MoonstoneF/kosmos-finetuned-DocLayNet ShivamExto/Kosmos-2-Furnas-trl-2 ShivamExto/Kosmos-2-Furnas-trl-2-1 hf-internal-testing/tiny-random-Kosmos2ForConditionalGeneration hf-internal-testing/tiny-random-Kosmos2Model ishaangupta293/kosmos-2-patch14-24-dup-ms microsoft/kosmos-2-patch14-224 sutantowilliam/kosmos-finetuned-DocLayNet

Pattern #21 (12 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("\n"), content=" "), Replace(pattern=Regex(" {2,}"), content=" ")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
Affected models
Converter: PegasusConverter
Models: CLARA-MeD/pegasus-xsum ChaniM/tst-summarization Einmalumdiewelt/PegasusXSUM_GNAD RajSang/pegasus-sports-titles allenai/pegasus-multi_lexsum-long-short allenai/pegasus-multi_lexsum-long-tiny allenai/pegasus-multi_lexsum-short-tiny eilamc14/pegasus-xsum-text-simplification google/pegasus-xsum summarizationnnnn/Pegasus the-hir0/pegasus-detoxify wgcv/tidy-tab-model-pegasus-xsum

Pattern #22 (10 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=String(" {2,}"), content="▁")])
+normalizer:		Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A...")
-post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), ...], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
Affected models
Converter	Models
XLMRobertaConverter	Chow05/fine-tune-embedding-v1 Chow05/fine-tune-embedding-v2 Chow05/fine-tune-embedding-v3 Chow05/fine-tune-embedding-v4 Chow05/fine-tune-embedding-v5 Chow05/fine-tune-embedding-v6 dangvantuan/vietnamese-document-embedding jinaai/jina-clip-v2 longsteel/embedding visheratin/mexma-siglip2

Pattern #23 (8 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("\n"), content=" "), Replace(pattern=Regex(" {2,}"), content=" ")])
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
+pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
Affected models
Converter	Models
PegasusConverter	Nicovis/ConvSum hyperchancellor07/pegasus-samsum-dialogue-summarizer mariam16elgohary/pegasus_arxiv_mit_lectures6 mohitskaushal/legal-pegasus-layman-legal-summarizer seanduffy/arxiv_summarizer seanduffy/govreport_summarizer seanduffy/pubmed_summarizer takao3548/pegasus-samsum

Pattern #24 (8 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
Affected models
Converter	Models
T5Converter	KETI-AIR-Downstream/long-ke-t5-base-translation-aihub-bidirection KETI-AIR-Downstream/long-ke-t5-base-translation-aihub-en2ko KETI-AIR-Downstream/long-ke-t5-base-translation-aihub-ko2en KETI-AIR/long-ke-t5-base KETI-AIR/long-ke-t5-small ding-diri-ding-dong/long-ke-t5-base-translation-aihub-ko2en pellucid/my_awesome_opus100_model pigeon01/sungju-finetuned-ko-to-en_ver3

Pattern #25 (8 models)

-pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=String("SPL1T-TH1S-Pl3A5E"), behavior=Removed, invert=False), Digits(individual_digits=True), Split(pattern=String("[\(\)\[\]\{\}]|([!\"#\$%\&'\*\+,\-\./:;<=>\?\\\^_`\|\~])\1*"), behavior=Isolated, invert=False), Split(pattern=String("
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=String("SPL1T-TH1S-Pl3A5E"), behavior=Removed, invert=False), Digits(individual_digits=True), Split(pattern=Regex("[\(\)\[\]\{\}]|([!"\#\$%\&'\*\+,\-\./:;<=>\?\\\^_`\|\~])\1*"), behavior=Isolated, invert=False), Split(pattern=String("
Affected models
Converter	Models
TikTokenConverter	HongxuanLi/nougat-base-deploy Xenova/nougat-base facebook/nougat-base jjreif/nougat-base-fork kevin-pek/nougat-api mzbac/nougat-base-8bit-mlx pszemraj/nougat-base-onnx pszemraj/nougat-base-onnx-quant_avx2

Pattern #26 (8 models)

-pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=String("SPL1T-TH1S-Pl3A5E"), behavior=Removed, invert=False), Digits(individual_digits=True), Split(pattern=String("[\(\)\[\]\{\}]|([!\"#\$%\&'\*\+,\-\./:;<=>\?\\\^_`\|\~])\1*"), behavior=Isolated, invert=False), Split(pattern=String("
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=String("SPL1T-TH1S-Pl3A5E"), behavior=Removed, invert=False), Digits(individual_digits=True), Split(pattern=Regex("[\(\)\[\]\{\}]|([!"\#\$%\&'\*\+,\-\./:;<=>\?\\\^_`\|\~])\1*"), behavior=Isolated, invert=False), Split(pattern=String("
-truncation:		{'max_length': 4096, 'stride': 0, 'strategy': 'longest_first', 'direction': 'right'}
+truncation:		{'max_length': 3584, 'stride': 0, 'strategy': 'longest_first', 'direction': 'right'}
Affected models
Converter	Models
TikTokenConverter	CuiSiwei/nougat-for-formula Xenova/nougat-small facebook/nougat-small mzbac/nougat-small-8bit-mlx onnx-community/nougat-small-ONNX pszemraj/nougat-small-onnx pszemraj/nougat-small-onnx-quant_avx2 pszemraj/nougat-small-onnx-quant_avx512_vnni

Pattern #27 (7 models)

-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=first, split=False)
+pre_tokenizer:		None
-decoder:		Sequence(decoders=[Replace(pattern=String("▁"), content=" "), ByteFallback(), Fuse(), Strip(content=" ", start=1, stop=0)])
+decoder:		None
Affected models
Converter	Models
LlamaConverter	BEE-spoke-data/smol_llama-101M-GQA-python HuggingFaceM4/tiny-random-idefics-m4 OpenGVLab/InternVL2-4B Wanfq/FuseLLM-7B nahidalam/llava-aimv2 philschmid/tiny-random-idefics-m4 screenmate/idefics_50_25_25_merged

Pattern #28 (7 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
Affected models
Converter	Models
T5Converter	timm/ViT-B-16-SigLIP timm/ViT-B-16-SigLIP-256 timm/ViT-B-16-SigLIP-512 timm/ViT-B-16-SigLIP-i18n-256 timm/ViT-L-16-SigLIP-256 timm/ViT-SO400M-14-SigLIP timm/ViT-SO400M-14-SigLIP-384

Pattern #29 (7 models)

-AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
-pre_tokenizer:		Split(pattern=String(" "), behavior=MergedWithPrevious, invert=False)
+pre_tokenizer:		None
Affected models
Converter	Models
GemmaConverter	RichardErkhov/google_-_recurrentgemma-2b-4bits RichardErkhov/google_-_recurrentgemma-2b-8bits RichardErkhov/google_-_recurrentgemma-2b-it-4bits RichardErkhov/google_-_recurrentgemma-2b-it-8bits monology/recurrentgemma-9b-it-8bit qihoo360/fg-clip2-so400m theo77186/recurrentgemma-9b-it-bnb-4bit

Pattern #30 (6 models)

-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=first, split=False)
+pre_tokenizer:		Sequence(pretokenizers=[ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True), Metaspace(replacement="▁", prepend_scheme=first, split=False)])
Affected models
Converter	Models
LlamaConverter	CanisAI/teach-generalist-ministral-3b-r2 CanisAI/teach-humanities-ministral-3b-r2 CanisAI/teach-language-ministral-3b-r2 CanisAI/teach-math-ministral-3b-r2 CanisAI/teach-science-ministral-3b-r2 thestarfarer/Ministral-3-14B-1x1

Pattern #31 (6 models)

-AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
-pre_tokenizer:		Split(pattern=String(" "), behavior=MergedWithPrevious, invert=False)
+pre_tokenizer:		None
Affected models
Converter	Models
GemmaConverter	alpindale/recurrentgemma-9b alpindale/recurrentgemma-9b-it gg-hf/recurrentgemma-9b gg-hf/recurrentgemma-9b-it gg-hf/recurrentgemma-9b-it-pytorch gg-hf/recurrentgemma-9b-pytorch

Pattern #32 (6 models)

-model:			AddedToken("<pad>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-AddedToken("<unk>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+model:			AddedToken("<unk>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<pad>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=first, split=True)
-post_processor:		TemplateProcessing(single=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__fra__":SpecialToken(id="__fra__", ids=[256026], tokens=["__fra__"])})
-decoder:		Metaspace(replacement="▁", prepend_scheme=first, split=True)
+normalizer:		None
+pre_tokenizer:		None
+post_processor:		TemplateProcessing(single=[Sequence(id=A, type_id=0)], pair=[Sequence(id=A, type_id=0), Sequence(id=B, type_id=1)], special_tokens={})
+decoder:		None
Affected models
Converter	Models
SeamlessM4TConverter	audo/seamless-m4t-v2-large facebook/seamless-m4t-v2-large jaman21/seamless-m4t-v2-t2st jaman21/seamless-m4t-v2-t2tt jaman21/seamless-m4t-v2-t2tt-t2st osanseviero/seamless-copy

Pattern #33 (5 models)

-post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=0), ...], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[1], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="[SEP]", type_id=0)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[1], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
Affected models
Converter	Models
DebertaConverter	KISTI-AI/scideberta-cs cross-encoder/nli-deberta-base geckos/deberta-base-fine-tuned-ner hf-internal-testing/tiny-random-deberta optimum-intel-internal-testing/tiny-random-deberta

Pattern #34 (5 models)

-pre_tokenizer:		Split(pattern=String(" "), behavior=MergedWithPrevious, invert=False)
+pre_tokenizer:		None
Affected models
Converter	Models
GemmaConverter	FlatFootInternational/gemma-3n-E4B-it-bf16 MuXodious/gemma-3n-E4B-it-absolute-heresy-MPOA-mlx-8Bit blind-assist/google-gemma-3n-2b-e3 tomaarsen/t5gemma-s-gooaq-cmnrl wdaniel00763n/gemma-3N-news-finetune

Pattern #35 (5 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
Affected models
Converter	Models
SeamlessM4TConverter	ThivyanRR/english_seamlessm4t_medium ThivyanRR/gujarathi_seamlessm4t_medium elego/ss-hmong-v3 lukmanaj/hf-seamless-m4t-medium-en-tw-10-ep lukmanaj/hf-seamless-m4t-medium-en-tw-3-ep

Pattern #36 (5 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
Affected models
Converter	Models
SeamlessM4TConverter	AnasAber/seamless-darija-eng RafatK/SMT-AZR ThivyanRR/indic_seamlessm4t_v2_large UBC-NLP/Simba-S xun/seamless-m4t-v2-large-8bit-bnb

Pattern #37 (4 models)

-pre_tokenizer:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
-post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=True)
+pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
+post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)
Affected models
Converter	Models
RobertaConverter	DmitrySpartak/layoutlm-invoices faisalraza/layoutlm-invoices impira/layoutlm-document-qa impira/layoutlm-invoices

Pattern #38 (3 models)

+AddedToken("<|fim_prefix|>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<|fim_middle|>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<|fim_suffix|>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<|endofprompt|>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}{1,3}| ?[^\s\p{L}\p{N}]+[\r\n]*|\s*[\r\n]..."), behavior=Removed, invert=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=False)])
Affected models
Converter	Models
GPT2Converter	ChuckMcSneed/dolphin-2.9.1-dbrx-llamacppfixed imi2/dbrx-base-2.5bpw-h6-exl2 nicoboss/dbrx-base

Pattern #39 (3 models)

-post_processor:		RobertaProcessing(sep=("<sep>", 50265), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)
+post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)
Affected models
Converter	Models
RobertaConverter	dtorber/BioNLP-2024-dtorber-baseline-eLife dtorber/BioNLP-intro-disc-eLife dtorber/BioNLP-tech-intro-disc-eLife

Pattern #40 (3 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Replace(pattern=Regex(" {2,}"), content=" ")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
Affected models
Converter	Models
NllbConverter	KnutJaegersberg/nllb-moe-54b-4bit Maxime-Bakunzi/twigane-en-kin-translation madatnlp/nllb-moe-54b-8bit

Pattern #41 (3 models)

-normalizer:		None
-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
+normalizer:		NFKC()
+pre_tokenizer:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
Affected models
Converter	Models
GPT2Converter	UBC-NLP/Jasmine-350M VietAI/gpt-j-6B-vietnamese-news VietAI/gpt-neo-1.3B-vietnamese-news

Pattern #42 (3 models)

-post_processor:		TemplateProcessing(single=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__fra__":SpecialToken(id="__fra__", ids=[256026], tokens=["__fra__"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="__eng__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="__eng__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__eng__":SpecialToken(id="__eng__", ids=[256022], tokens=["__eng__"])})
Affected models
Converter	Models
SeamlessM4TConverter	Geneline-X/seamless-m4t-v2-sunbird-multilingual-v1 KDiallo/seamless_sunbird_finetune KDiallo/seamless_sunbird_finetune_v2

Pattern #43 (3 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__fra__":SpecialToken(id="__fra__", ids=[256026], tokens=["__fra__"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="__eng__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="__eng__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__eng__":SpecialToken(id="__eng__", ids=[256022], tokens=["__eng__"])})
Affected models
Converter	Models
SeamlessM4TConverter	EricTydd/SpeechtoText-Burmese Marialab/finetuned-seamless-m4T-large-1000-step blakenp/english_to_spanish_model

Pattern #44 (2 models)

-normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=False, lowercase=False)
-pre_tokenizer:		BertPreTokenizer()
-post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[2], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[3], tokens=["[SEP]"])})
-decoder:		WordPiece(prefix="##", cleanup=True)
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
+pre_tokenizer:		Sequence(pretokenizers=[BertPreTokenizer(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
+post_processor:		BertProcessing(sep=("[SEP]", 3), cls=("[CLS]", 2))
+decoder:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
Affected models
Converter	Models
BertConverter	Icelandic-lt/convbert-small-igc-is jonfd/convbert-small-igc-is

Pattern #45 (2 models)

-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=first, split=False)
+pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=never, split=False)
Affected models
Converter	Models
LlamaConverter	OpenGVLab/InternVL2_5-2B-MPO-hf OpenGVLab/InternVL2_5-8B-MPO-hf

Pattern #46 (2 models)

-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=False)
+pre_tokenizer:		None
-decoder:		Sequence(decoders=[Replace(pattern=String("▁"), content=" "), ByteFallback(), Fuse(), Strip(content=" ", start=1, stop=0)])
+decoder:		None
Affected models
Converter	Models
LlamaConverter	OpenGVLab/InternVL-Chat-V1-2 OpenGVLab/InternVL2-40B

Pattern #47 (2 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+normalizer:		None
+pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
Affected models
Converter	Models
T5Converter	freddy913/FRDYV2_38 freddy913/FRDYV2_39

Pattern #48 (2 models)

-normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=True)
-pre_tokenizer:		BertPreTokenizer()
-post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[0], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
-decoder:		WordPiece(prefix="##", cleanup=True)
+normalizer:		Sequence(normalizers=[NFKC(), Lowercase()])
+pre_tokenizer:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+post_processor:		BertProcessing(sep=("[SEP]", 2), cls=("[CLS]", 0))
+decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
Affected models
Converter	Models
BertConverter	Bingsu/mobilebert_ko_mlm_1 Bingsu/my_mobilebert_untrained

Pattern #49 (2 models)

-pre_tokenizer:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
-post_processor:		RobertaProcessing(sep=("</s>", 36745), cls=("<s>", 36744), trim_offsets=True, add_prefix_space=True)
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+pre_tokenizer:		Whitespace()
+post_processor:		TemplateProcessing(single=[Sequence(id=A, type_id=0)], pair=[Sequence(id=A, type_id=0), Sequence(id=B, type_id=1)], special_tokens={})
+decoder:		None
Affected models
Converter	Models
RobertaConverter	rpii2023/Je_baat rpii2023/naya_token

Pattern #50 (2 models)

-normalizer:		None
+normalizer:		Lowercase()
Affected models
Converter	Models
RobertaConverter	cambridge-climb/baseline-roberta_pre_layer_norm-model climb-mao/climb-roberta_pre_layer_norm-model

Pattern #51 (2 models)

-AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-pre_tokenizer:		Split(pattern=String(" "), behavior=MergedWithPrevious, invert=False)
+pre_tokenizer:		None
Affected models
Converter	Models
GemmaConverter	RichardErkhov/voidful_-_recurrentgemma-2b-base-4bits RichardErkhov/voidful_-_recurrentgemma-2b-base-8bits

Pattern #52 (2 models)

-AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
-normalizer:		Replace(pattern=String(" "), content="▁")
-pre_tokenizer:		Split(pattern=String(" "), behavior=MergedWithPrevious, invert=False)
+normalizer:		None
+pre_tokenizer:		None
-decoder:		Sequence(decoders=[Replace(pattern=String("▁"), content=" "), ByteFallback(), Fuse()])
+decoder:		None
Affected models
Converter	Models
GemmaConverter	aimagelab/LLaVA_MORE-gemma_2_2b-finetuning aimagelab/LLaVA_MORE-gemma_2_9b-finetuning

Pattern #53 (1 model)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
Affected models
Converter	Models
T5Converter	teamapocalypseml/regben2ipa-mt5-base

Pattern #54 (1 model)

-AddedToken("<SEP>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-AddedToken("<CLS>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-normalizer:		NFC()
-pre_tokenizer:		Sequence(pretokenizers=[Digits(individual_digits=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)])
+normalizer:		None
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("\d{1,3}(?=(?:\d{3})*\b)"), behavior=Isolated, invert=False), Split(pattern=Regex("[^\r\n\p{L}\p{N}]?[\p{Lu}\p{Lt}\p{Lm}\p{Lo}\p{M}]*[\p{Ll}\p{Lm}\p{Lo}\p{M}]+(?i:'s|'t|'re|'ve|'m|'ll..."), behavior=Isolated, invert=False), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=False)])
Affected models
Converter	Models
TikTokenConverter	optimum-intel-internal-testing/tiny-random-aya-base

Pattern #55 (1 model)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=False), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
Affected models
Converter	Models
BigBirdConverter	pszemraj/bigbird-roberta-base-edu-classifier

Pattern #56 (1 model)

-normalizer:		None
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=never, split=False)
+normalizer:		Sequence(normalizers=[Replace(pattern=String(" "), content="▁")])
+pre_tokenizer:		None
Affected models
Converter	Models
LlamaConverter	logsyc/failure-aware-ernie-4.5

Pattern #57 (1 model)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=False), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
Affected models
Converter	Models
BigBirdConverter	kimsan0622/bigbird-base

Pattern #58 (1 model)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("\s{2,}|[\n\r\t]"), content=" "), NFC(), Strip(strip_left=False, strip_right=True)])
+normalizer:		Sequence(normalizers=[Strip(strip_left=True, strip_right=True), Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
Affected models
Converter	Models
DebertaV2Converter	DataFog/pii-small-en

Pattern #59 (1 model)

-post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
+post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)
Affected models
Converter	Models
MarkupLMConverter	SaulLu/markuplm-base

Pattern #60 (1 model)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+normalizer:		Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A...")
+pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
Affected models
Converter	Models
XLMRobertaConverter	microsoft/layoutxlm-base

Pattern #61 (1 model)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
Affected models
Converter	Models
MBart50Converter	dlucidone/kumaoni-mbart-lora

Pattern #62 (1 model)

-post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[1], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
+post_processor:		BertProcessing(sep=("[SEP]", 2), cls=("[CLS]", 1))
Affected models
Converter	Models
BertConverter	KBLab/bert-base-swedish-cased-reallysimple-ner

Pattern #63 (1 model)

-normalizer:		None
-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
-post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+normalizer:		Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A...")
+pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
+post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), ...], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
+decoder:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
Affected models
Converter	Models
RobertaConverter	vgaraujov/led-base-16384-spanish

Pattern #64 (1 model)

-post_processor:		RobertaProcessing(sep=("</s>", 25905), cls=("<s>", 25904), trim_offsets=True, add_prefix_space=False)
+post_processor:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
Affected models
Converter	Models
RobertaConverter	Mwnthai/bodo-legal-led-summ

Pattern #65 (1 model)

-normalizer:		NFC()
+normalizer:		None
Affected models
Converter	Models
TikTokenConverter	NYTK/PULI-HuBA-mamba-130M
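
Pattern #65 drops an `NFC()` normalizer. The sketch below uses Python's standard `unicodedata` module (not the tokenizers library) to show why that can matter: a decomposed accented character and its precomposed form are different code point sequences unless NFC is applied. The helper name `normalize` and the sample string are illustrative assumptions.

```python
import unicodedata


def normalize(text, use_nfc):
    """Apply NFC when use_nfc is True; otherwise return the text unchanged."""
    return unicodedata.normalize("NFC", text) if use_nfc else text


decomposed = "e\u0301"  # 'e' followed by COMBINING ACUTE ACCENT
composed = "\u00e9"     # precomposed 'é'

# '-' side (NFC): both spellings collapse to the same code points.
print(normalize(decomposed, use_nfc=True) == composed)
# '+' side (None): the decomposed spelling is passed through as two code points.
print(len(normalize(decomposed, use_nfc=False)))
```

Whether this changes token IDs for a given model depends on its vocabulary and on how often decomposed input occurs, which this sketch does not measure.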

Pattern #66 (1 model)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Replace(pattern=Regex(" {2,}"), content=" ")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="eng_Latn", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="eng_Latn", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "eng_Latn":SpecialToken(id="eng_Latn", ids=[256047], tokens=["eng_Latn"])})
+post_processor:		TemplateProcessing(single=[Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="eng_Latn", type_id=0)], pair=[Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="eng_Latn", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "eng_Latn":SpecialToken(id="eng_Latn", ids=[256047], tokens=["eng_Latn"])})
Affected models
Converter	Models
NllbConverter	facebook/nllb-moe-54b
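
The two `single` templates in Pattern #66 place the `eng_Latn` language token on opposite ends of the sequence. The following is a small sketch in plain Python of how such a template expands into IDs. The helper `apply_single` and the input IDs `[10, 11]` are hypothetical; the special-token IDs (2 and 256047) are taken from the diff above.

```python
SPECIAL_IDS = {"</s>": 2, "eng_Latn": 256047}

# 'single' templates from Pattern #66, with "A" standing for Sequence(id=A).
MINUS_SINGLE = ["eng_Latn", "A", "</s>"]
PLUS_SINGLE = ["A", "</s>", "eng_Latn"]


def apply_single(template, ids_a, special_ids):
    """Expand a single-sequence template into a flat list of token IDs."""
    out = []
    for piece in template:
        if piece == "A":
            out.extend(ids_a)                # the encoded input sequence
        else:
            out.append(special_ids[piece])   # a special token from the template
    return out


ids_a = [10, 11]  # hypothetical IDs for the input text
print(apply_single(MINUS_SINGLE, ids_a, SPECIAL_IDS))
print(apply_single(PLUS_SINGLE, ids_a, SPECIAL_IDS))
```

Under these assumptions the `-` side yields the language token first, and the `+` side yields it after `</s>`.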

Pattern #67 (1 model)

-normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=True)
+normalizer:		Sequence(normalizers=[NFKD(), StripAccents(), Lowercase()])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[2], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[3], tokens=["[SEP]"])})
+post_processor:		TemplateProcessing(single=[Sequence(id=A, type_id=0)], pair=[Sequence(id=A, type_id=0), Sequence(id=B, type_id=1)], special_tokens={})
Affected models
Converter	Models
BertConverter	novelcore/gem-electra
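
The `+` normalizer in Pattern #67 is a chain of `NFKD()`, `StripAccents()` and `Lowercase()`. A rough stand-in using only Python's `unicodedata` is sketched below. Treating "accents" as characters whose Unicode general category starts with `M` is an assumption made for illustration, and the function name `plus_side_normalize` is hypothetical.

```python
import unicodedata


def plus_side_normalize(text):
    """Approximate Sequence([NFKD(), StripAccents(), Lowercase()]) (illustrative)."""
    # NFKD: compatibility decomposition, e.g. 'É' -> 'E' + combining acute.
    decomposed = unicodedata.normalize("NFKD", text)
    # StripAccents stand-in: drop characters in the Mark categories (Mn, Mc, Me).
    without_marks = "".join(
        ch for ch in decomposed if not unicodedata.category(ch).startswith("M")
    )
    # Lowercase stand-in.
    return without_marks.lower()


print(plus_side_normalize("\u00c9lan \ufb01"))  # 'Élan' plus the 'fi' ligature
```

The sketch does not model the `-` side's `BertNormalizer` options (`clean_text`, `handle_chinese_chars`), so it shows only what the `+` chain does, not the full difference between the two sides.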

Pattern #68 (1 model)

-normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=False, lowercase=False)
+normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=False, lowercase=True)
Affected models
Converter	Models
BertConverter	Seznam/small-e-czech
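
In long component reprs, the one argument that differs between the `-` and `+` lines can be hard to spot. The sketch below is one way to extract it for flat reprs such as the `BertNormalizer(...)` pair in Pattern #68. The helpers `flat_kwargs` and `changed_kwargs` are hypothetical, and nested reprs such as `Sequence(...)` or `TemplateProcessing(...)` are out of scope.

```python
import re


def flat_kwargs(repr_text):
    """Parse 'Name(k=v, k=v)' into (name, {k: v}); flat argument lists only."""
    match = re.fullmatch(r"(\w+)\((.*)\)", repr_text.strip())
    if match is None:
        raise ValueError(f"not a flat component repr: {repr_text!r}")
    name, body = match.groups()
    pairs = (item.split("=", 1) for item in body.split(", ") if item)
    return name, {key: value for key, value in pairs}


def changed_kwargs(old, new):
    """Return {argument: (old_value, new_value)} for arguments that differ."""
    old_name, old_args = flat_kwargs(old)
    new_name, new_args = flat_kwargs(new)
    if old_name != new_name:
        return {"<type>": (old_name, new_name)}
    keys = sorted(set(old_args) | set(new_args))
    return {
        key: (old_args.get(key), new_args.get(key))
        for key in keys
        if old_args.get(key) != new_args.get(key)
    }


old = "BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=False, lowercase=False)"
new = "BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=False, lowercase=True)"
print(changed_kwargs(old, new))
```

For Pattern #68 this isolates `lowercase` as the only argument that changes.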

Pattern #69 (1 model)

-AddedToken("<s>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-AddedToken("</s>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-normalizer:		None
-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
-post_processor:		RobertaProcessing(sep=("[SEP]", 102), cls=("[CLS]", 101), trim_offsets=True, add_prefix_space=False)
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=True)
+pre_tokenizer:		BertPreTokenizer()
+post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[101], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[102], tokens=["[SEP]"])})
+decoder:		WordPiece(prefix="##", cleanup=True)
Affected models
Converter	Models
RobertaConverter	thunlp/Lawformer

Pattern #70 (1 model)

-post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)
+post_processor:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
Affected models
Converter	Models
RobertaConverter	mrm8488/longformer-base-4096-spanish

Pattern #71 (1 model)

-AddedToken("<s>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-AddedToken("</s>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-normalizer:		None
-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
-post_processor:		RobertaProcessing(sep=("[SEP]", 3), cls=("[CLS]", 2), trim_offsets=True, add_prefix_space=False)
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=False, lowercase=True)
+pre_tokenizer:		BertPreTokenizer()
+post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[2], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[3], tokens=["[SEP]"])})
+decoder:		WordPiece(prefix="##", cleanup=True)
Affected models
Converter	Models
RobertaConverter	UWB-AIR/MQDD-pretrained

Pattern #72 (1 model)

-normalizer:		None
-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
-post_processor:		RobertaProcessing(sep=("[SEP]", 2), cls=("[CLS]", 0), trim_offsets=True, add_prefix_space=False)
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=False)
+pre_tokenizer:		BertPreTokenizer()
+post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="[SEP]", type_id=0)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[0], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
+decoder:		WordPiece(prefix="##", cleanup=True)
Affected models
Converter	Models
RobertaConverter	theSOL1/kolongformer-base-4096

Pattern #73 (1 model)

-normalizer:		Sequence(normalizers=[NFC(), Replace(pattern=Regex("\s+"), content=" "), Lowercase()])
-pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("<\|startoftext\|>|<\|endoftext\|>|'s|'t|'re|'ve|'m|'ll|'d|[\p{L}]+|[\p{N}]|[^\s\p{L}\p{N}]+"), behavior=Removed, invert=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)])
-post_processor:		RobertaProcessing(sep=("<|endoftext|>", 1), cls=("<|startoftext|>", 0), trim_offsets=False, add_prefix_space=False)
+normalizer:		None
+pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
+post_processor:		ByteLevel(add_prefix_space=True, trim_offsets=False, use_regex=True)
Affected models
Converter	Models
CLIPConverter	hf-internal-testing/tiny-random-clip

Pattern #74 (1 model)

-pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?[\p{L}\p{M}]+|\p{N}| ?[^\s\p{L}\p{M}\p{N}]+[\r\n]*|\s..."), behavior=Isolated, invert=False), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=False)])
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?[\p{L}\p{M}]+|\p{N}| ?[^\s\p{L}\p{M}\p{N}]+[\r\n]*|\s..."), behavior=Isolated, invert=False), ByteLevel(add_prefix_space=False, trim_offsets=False, use_regex=False)])
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+decoder:		ByteLevel(add_prefix_space=False, trim_offsets=False, use_regex=False)
Affected models
Converter	Models
TikTokenConverter	inferencerlabs/Qwen3.5-397B-A17B-MLX-4.1bit

Pattern #75 (1 model)

-post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=1), ...], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
+post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)
Affected models
Converter	Models
MPNetConverter	mukaj/fin-mpnet-base

Pattern #76 (1 model)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__fra__":SpecialToken(id="__fra__", ids=[256057], tokens=["__fra__"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="__dan__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="__dan__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__dan__":SpecialToken(id="__dan__", ids=[256041], tokens=["__dan__"])})
Affected models
Converter	Models
SeamlessM4TConverter	mosesdaudu/Dyula_French

Pattern #77 (1 model)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__fra__":SpecialToken(id="__fra__", ids=[256057], tokens=["__fra__"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="__eng__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="__eng__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__eng__":SpecialToken(id="__eng__", ids=[256047], tokens=["__eng__"])})
Affected models
Converter	Models
SeamlessM4TConverter	Marialab/finetuned-seamless-m4T-medium-1000-step

Pattern #78 (1 model)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
-decoder:		Metaspace(replacement="▁", prepend_scheme=first, split=True)
+decoder:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
Affected models
Converter	Models
SeamlessM4TConverter	panoyo9829/seamless-m4t-v2-large-fp16