Tokenizer backend equivalence report

Pattern #1 (456 models)

-pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}| ?[^\s\p{L}\p{N}]+[\r\n]*|\s*[\r\n]+|\s+..."), behavior=Isolated, invert=False), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=False)])
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}| ?[^\s\p{L}\p{N}]+[\r\n]*|\s*[\r\n]+|\s+..."), behavior=Isolated, invert=False), ByteLevel(add_prefix_space=False, trim_offsets=False, use_regex=False)])
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+decoder:		ByteLevel(add_prefix_space=False, trim_offsets=False, use_regex=False)
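
One way to read the diff above is to compare the boolean flags of each component side by side. The following is a minimal sketch using only the Python standard library. `flag_changes` is a hypothetical helper written for this illustration, not part of any tokenizer library, and the input strings are the ByteLevel entries copied from this pattern.

```python
import re

# Hypothetical helper: extracts name=True/False pairs from a component's
# printed form and reports the ones whose value differs.
FLAG = re.compile(r"(\w+)=(True|False)")

def flag_changes(old: str, new: str) -> dict:
    """Return {flag: (old_value, new_value)} for flags present in both strings that differ."""
    old_flags = dict(FLAG.findall(old))
    new_flags = dict(FLAG.findall(new))
    return {
        name: (old_flags[name], new_flags[name])
        for name in old_flags
        if name in new_flags and old_flags[name] != new_flags[name]
    }

# ByteLevel step of the pre_tokenizer, as printed in Pattern #1.
old_pre = "ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=False)"
new_pre = "ByteLevel(add_prefix_space=False, trim_offsets=False, use_regex=False)"

# Decoder, as printed in Pattern #1.
old_dec = "ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)"
new_dec = "ByteLevel(add_prefix_space=False, trim_offsets=False, use_regex=False)"

pre_changes = flag_changes(old_pre, new_pre)
dec_changes = flag_changes(old_dec, new_dec)
print(pre_changes)
print(dec_changes)
```

Under these assumptions, the pre-tokenizer differs only in `trim_offsets`, while all three decoder flags flip. The sketch compares printed flags only; it does not establish whether the two backends produce different tokens.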

Affected models

Converter	Models
Qwen2Converter	ACE-Step/acestep-captioner ACE-Step/acestep-transcriber AGI-Eval/Auto-ATT AI71ai/agrillm-Qwen3-30B-A3B AITRADER/Huihui-Qwen3-Coder-Next-abliterated-mlx-4Bit AITRADER/Huihui-Qwen3-Coder-Next-abliterated-mlx-8Bit AVoCaDO-Captioner/AVoCaDO AlexHung29629/Qwen2.5-Omni-7B-Reasoning Alexjiuqiaoyu/Mirror_Max Alibaba-DAMO-Academy/RynnBrain-30B-A3B Alibaba-DAMO-Academy/RynnBrain-Plan-30B-A3B AudioVisual-Caption/ASID-Captioner-3B AudioVisual-Caption/ASID-Captioner-7B AutisticAF/Qwen3-Coder-Next-mlx-3Bit Bearrr310/train_grpo_1.5B-1230-ckpt100 BellLabs/das-1 BellLabs/das-2 BellLabs/das-3 Benasd/Qwen3-VL-30B-A3B-Instruct-NVFP4 Benasd/Qwen3-VL-30B-A3B-Thinking-NVFP4 Bhuvan77777/quantized-qwen2-audio-4bit BloomBerry/colqwen2-1.0-hf-vllm Cirrascale/Qwen3-Coder-Next-NVFP4 Clevyby/lynn-A2.7B-rp-v1-14.3b-32k-Q5_K_S-GGUF DarthGrampus/Qwen3-Coder-Next-Base-mlx-6Bit DiaDem-Captioner/DiaDem Dongchao/Omni-AutoThink Eldadalbajob/Huihui-Qwen3-Coder-Next-abliterated-mlx-4Bit Enriqueag26/OmniCare-Qwen3-VL-30B-A3B EurekaTian/ROMA FINGU-AI/qwen2.5-omni-3b-merge FunAGI/Qwen2.5-Omni-7B-GPTQ-4bit GadflyII/Qwen3-Coder-Next-NVFP4 GadflyII/Qwen3-VL-235B-A22B-Instruct-NVFP4 GadflyII/Qwen3-VL-235B-A22B-Thinking-NVFP4 Gaie/TA2T_DPO_Base GaleneAI/Qwen3-VL-235B-A22B-Thinking-NVFP4 Guerte/llspch Hcompany/Holo2-235B-A22B Hcompany/Holo2-30B-A3B HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-2-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-1epoch HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-part2 HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-part2-run1 HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-4-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-6-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-3e-3-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-5e-3-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-5e-5-gamma HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-remov-aux-only HectorHe/Qwen1.5-MOE-sft-coommonsense15k 
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-3e-5 HectorHe/Qwen1.5-MOE-sft-math14k HectorHe/Qwen1.5-MOE-sft-math7k HectorHe/Qwen1.5-MOE-sft-math7k-sft-epoch1 HectorHe/Qwen1.5-MOE-sft-math7k-sfttest HectorHe/Qwen1.5-MOE-sft-nemotron-code HectorHe/Qwen1.5-MOE-sft-s1K Hui519/speechllm-as-judge-qwen25omni Intel/Qwen3-Coder-Next-int4-AutoRound Intel/Qwen3-Next-80B-A3B-Instruct-int4-mixed-AutoRound Intel/Qwen3-VL-30B-A3B-Instruct-int4-AutoRound KE-Team/Ke-Omni-R KasuleTrevor/QWen-sample KasuleTrevor/Qwen-nyn-intent KasuleTrevor/Qwen2-Luganda-ASR KasuleTrevor/Qwen2-RUNY-Intent Kendamarron/Qwen2.5-1.75B-A1.1B-Instruct-ja LGB666/SageLM LeroyDyer/Mistral-AudioLm Lingzhi-AI/Lingzhi-57B-chat LoukaLacasse/Qualia-7B MaziyarPanahi/Qwen1.5-MoE-A2.7B-Wikihow Memories-ai/UGC-VideoCaptioner Moamen-dcp/Full-Franko-CS-v0 Na0s/Qwen1.5-MoE-A2.7B-Chat-20_experts-L2Norm-Pruning Na0s/Qwen1.5-MoE-A2.7B-Chat-20_experts_Maths_FT_1k_cosine Na0s/Qwen1.5-MoE-A2.7B-LoRA-Exhaustive-FT Nagata99999/Affine-2 OpenGVLab/InternVL3-14B OpenGVLab/InternVL3-14B-AWQ OpenGVLab/InternVL3-14B-Instruct OpenGVLab/InternVL3-14B-hf OpenGVLab/InternVL3-1B OpenGVLab/InternVL3-1B-Instruct OpenGVLab/InternVL3-1B-hf OpenGVLab/InternVL3-2B OpenGVLab/InternVL3-2B-Instruct OpenGVLab/InternVL3-2B-hf OpenGVLab/InternVL3-38B OpenGVLab/InternVL3-38B-AWQ OpenGVLab/InternVL3-38B-Instruct OpenGVLab/InternVL3-38B-hf OpenGVLab/InternVL3-78B OpenGVLab/InternVL3-78B-AWQ OpenGVLab/InternVL3-78B-Instruct OpenGVLab/InternVL3-8B OpenGVLab/InternVL3-8B-Instruct OpenGVLab/InternVL3-8B-hf OpenGVLab/InternVL3_5-30B-A3B-Instruct OptimizeLLM/Qwen3-VL-30B-A3B-Thinking-NVFP4 QuantTrio/Qwen3-Coder-Next-E400 QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ QuantTrio/Qwen3-VL-235B-A22B-Instruct-FP8 QuantTrio/Qwen3-VL-235B-A22B-Thinking-AWQ QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ QuantTrio/Qwen3-VL-30B-A3B-Thinking-AWQ Qwen/Qwen1.5-MoE-A2.7B Qwen/Qwen1.5-MoE-A2.7B-Chat Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4 Qwen/Qwen2-57B-A14B 
Qwen/Qwen2-57B-A14B-Instruct Qwen/Qwen2-57B-A14B-Instruct-GPTQ-Int4 Qwen/Qwen2-Audio-7B Qwen/Qwen2-Audio-7B-Instruct Qwen/Qwen2.5-Omni-3B Qwen/Qwen2.5-Omni-7B Qwen/Qwen2.5-Omni-7B-AWQ Qwen/Qwen2.5-Omni-7B-GPTQ-Int4 Qwen/Qwen3-Coder-Next Qwen/Qwen3-Coder-Next-Base Qwen/Qwen3-Coder-Next-FP8 Qwen/Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct-FP8 Qwen/Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking-FP8 Qwen/Qwen3-VL-235B-A22B-Instruct Qwen/Qwen3-VL-235B-A22B-Instruct-FP8 Qwen/Qwen3-VL-235B-A22B-Thinking Qwen/Qwen3-VL-235B-A22B-Thinking-FP8 Qwen/Qwen3-VL-30B-A3B-Instruct Qwen/Qwen3-VL-30B-A3B-Instruct-FP8 Qwen/Qwen3-VL-30B-A3B-Thinking Qwen/Qwen3-VL-30B-A3B-Thinking-FP8 R0mAI/51cea115-9b0c-4fca-8ddb-dc69e56a10dc RESMP-DEV/Qwen3-Next-80B-A3B-Instruct-NVFP4 RESMP-DEV/Qwen3-Next-80B-A3B-Thinking-NVFP4 RMSnow/SpeechJudge-GRM RUC-NLPIR/OmniAtlas-Qwen2.5-3B RUC-NLPIR/OmniAtlas-Qwen2.5-7B RedHatAI/Qwen2-57B-A14B-Instruct-FP8 RedHatAI/Qwen3-Next-80B-A3B-Instruct-quantized.w4a16 RedHatAI/Qwen3-Next-80B-A3B-Thinking-FP8-dynamic RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4 RichardErkhov/Qwen_-_Qwen1.5-MoE-A2.7B-Chat-4bits RoxanneWsyw/gsm_qwen1.5_full_lr1e-6_frozen RoxanneWsyw/qwen1.5_gsm_esft_gate_lr5e-6 RoxanneWsyw/qwen1.5_gsm_esft_token_lr5e-6 SAA-Lab/Qwen2-Audio-7B-Instruct-Ultrasuite SAA-Lab/Qwen2-Audio-7B-Instruct-Ultrasuite-woA SAA-Lab/Qwen2.5-Omni-3B-SelfEvolve-iter_1 SAA-Lab/Qwen2.5-Omni-3B-SelfEvolve-iter_2 SAA-Lab/Qwen2.5-Omni-3B-SelfEvolve-iter_3 SAA-Lab/Qwen2.5-Omni-3B-SelfEvolve-iter_4 SAA-Lab/Qwen2.5-Omni-3B-SelfEvolve-iter_5 SAA-Lab/Qwen2.5-Omni-3B-UltraSuite SAA-Lab/Qwen2.5-Omni-3B-UltraSuite-woA SAA-Lab/Qwen2.5-Omni-7B-UltraSuite-woA Sahil-Kabir/colqwen2.5-v0.2-hf Sakalti/SakaMoe-3x1.6B-Instruct Sakalti/SakaMoe-3x14B-Instruct SeaLLMs/SeaLLMs-Audio-7B SejmofDejected/Qwen2.5-Omni-7B Sergei6000/Qwen2-Audio-7B-Instruct-Int4 Shifusen/Qwen3-VL-30B-A3B-Instruct-abliterated-NVFP4 
SoarAILabs/breeze-3b Sophia-AI/RegTech-14B-Instruct Sophia-AI/RegTech-32B-Instruct Sophia-AI/RegTech-4B-Instruct Sophia-AI/RegTech-7B-Instruct SoundMind-RL/SoundMindModel TeamPV/0.5B-qwen-x16_v2_cp_3000 TheClusterDev/Qwen3-Next-80B-A3B-Instruct-FP8-Dynamic TomLucidor/Qwen3-Coder-Next-REAM-mlx-3Bit Trelis/song-birds Urabewe/Ace-Step-Captioner-fp8 Yuuta208/Qwen2.5-Coder-1.5B-Qwen2.5-Math-1.5B-Merged-moe Yuuta208/Qwen2.5-Coder-7B-Qwen2.5-Math-7B-Merged-moe-16 aeon37/Qwen3-VL-30B-A3B-Instruct-heretic alicekyting/Qwen2-Audio-7B-Instruct-4bit allenai/Molmo-72B-0924 allenai/Molmo-7B-D-0924 allenai/Molmo2-4B allenai/Molmo2-8B allenai/Molmo2-VideoPoint-4B allenai/MolmoAct-7B-D-0812 allenai/MolmoAct-7B-D-LIBERO-Goal-0812 allenai/MolmoAct-7B-D-LIBERO-Long-0812 allenai/MolmoAct-7B-D-LIBERO-Object-0812 allenai/MolmoAct-7B-D-LIBERO-Spatial-0812 allenai/MolmoAct-7B-D-Pretrain-0812 allenai/MolmoAct-7B-D-Pretrain-RT-1-0812 amd/Qwen3-Coder-Next-MXFP4 anonymousICML/OmniGuard-3B anonymousICML/OmniGuard-7B antgroup/HumanSense_Omni_Reasoning arkiven4/Qwen2.5-7B-SFT-NT aryashah00/survey-finetuned-Qwen1.5-MoE-A2.7B bbytxt/727c22bb-a499-4951-978f-841369ce2042 boyuzhuGPT/checkpoint-3921 boyuzhuGPT/omniguard-video boyuzhuGPT/qwen2_5_omni_all_1015_reverse boyuzhuGPT/qwen2_5_omni_all_1016 boyuzhuGPT/qwen2_5_omni_audio_only boyuzhuGPT/qwen2_5_omni_audio_only_1014 boyuzhuGPT/qwen2_5_omni_guardrail boyuzhuGPT/qwen2_5_omni_image_only boyuzhuGPT/qwen2_5_omni_image_only_3 boyuzhuGPT/qwen2_5_omni_text_image_half boyuzhuGPT/qwen2_5_omni_text_only boyuzhuGPT/qwen2_5_omni_text_only_cleaned boyuzhuGPT/qwen2_5_omni_video_only boyuzhuGPT/qwen2_5_omni_video_only_backup boyuzhuGPT/qwen2_5_omni_wo_image_1016 browser-use/bu-30b-a3b-preview bullpoint/Qwen3-Coder-Next-AWQ-4bit catplusplus/Qwen3-VL-30B-A3B-Instruct-Heretic-NVFP4 catplusplus/Qwen3-VL-30B-A3B-Thinking-Heretic catplusplus/Qwen3-VL-30B-A3B-Thinking-Heretic-NVFP4 chaitnya26/Qwen2.5-Omni-3B-Fork chaitnya26/Qwen2.5-Omni-7B-fork 
chauhoang/5a9f2ec1-88d6-6b14-efe5-27299e1af90e chenhaodev/qwen2-audio-7b-aishell1 chunping-m/transcriber coughmedicine/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated-W4A16 coughmedicine/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated-nvfp4 cyankiwi/Qwen3-Coder-Next-AWQ-4bit cyankiwi/Qwen3-Coder-Next-AWQ-8bit cyankiwi/Qwen3-Coder-Next-REAM-AWQ-4bit cyankiwi/Qwen3-Next-80B-A3B-Instruct-AWQ-4bit cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-4bit cyankiwi/Qwen3-VL-30B-A3B-Instruct-AWQ-4bit cyankiwi/Qwen3-VL-30B-A3B-Instruct-AWQ-8bit cyankiwi/Qwen3-VL-30B-A3B-Thinking-AWQ-4bit cyankiwi/Qwen3-VL-30B-A3B-Thinking-AWQ-8bit cyankiwi/bu-30b-a3b-preview-AWQ-4bit dazipe/Qwen3-Coder-Next-GPTQ-Int4A16 dazipe/Qwen3-Next-80B-A3B-Instruct-GPTQ-Int4A16 ddvd233/OmniSapiens-7B-RL-Full ddvd233/Qwen2.5-Omni-7B ddwang2000/EmotionThinker dimasik2987/c499b3ec-738a-4496-a788-f85f963cb5b2 ehristoforu/Testrumoe-2x1.5b-instruct ehristoforu/tmoe ehristoforu/tmoe-v2 eve1f/cp2 eve1f/cp3 fauzanazz/qwen2-audio-indo-fraud-7b-merged faychu/test femiari/Qwen2-1.5Moe filipesantoscv11/f5866ce8-292c-450f-9d85-6807fe1ba0b1 flozi00/gerqwen-audio fractaldactal/Qwen2.5-Omni-7B gguichard/qwen15_moe_finetuning_json_cvfull_model gguichard/qwen15_moe_finetuning_json_cvfull_model_fp giangndm/qwen2.5-omni-3b-mlx-4bit giangndm/qwen2.5-omni-3b-mlx-8bit giangndm/qwen2.5-omni-7b-mlx-4bit giangndm/qwen2.5-omni-7b-mlx-8bit havinash-ai/2fe5536e-17d3-4296-b792-1dab8bad3b6b hf-internal-testing/tiny-random-Qwen2MoeForCausalLM hf-internal-testing/tiny-random-Qwen2_5OmniForConditionalGeneration horiguchidotconf/qwen57b_gptq_20240923 huihui-ai/Huihui-Qwen3-Coder-Next-abliterated huihui-ai/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated huihui-ai/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated-mlx-4bit huihui-ai/Huihui-Qwen3-Next-80B-A3B-Thinking-abliterated huihui-ai/Huihui-Qwen3-VL-30B-A3B-Instruct-abliterated huihui-ai/Huihui-Qwen3-VL-30B-A3B-Thinking-abliterated ig1/Qwen3-VL-30B-A3B-Instruct-NVFP4 
imkebe/Qwen2.5-Omni-7B-rk3588-1.2.0 inclusionAI/UI-Venus-1.5-30B-A3B introvoyz041/Huihui-Qwen3-VL-30B-A3B-Thinking-abliterated-qx86-hi-mlx-mlx-4Bit iteshxt/dia-convo-v1.2c jackboot/quill-57b-tokenizer janhq/Jan-v2-VL-max-FP8 jart25/Qwen3-Next-80B-A3B-Instruct-Int4-GPTQ jart25/Qwen3-VL-30B-A3B-Instruct-AWQ-8bit jayzou3773/Qwen1.5-MOE-sft-ESFT-intent jayzou3773/Qwen1.5-MOE-sft-ESFT-translation jayzou3773/Qwen1.5-MOE-sft-coommonsense15k jayzou3773/Qwen1.5-MOE-sft-gsm8k jayzou3773/commonsense-15k-ss20-step30-int5-plantrue-tau1.0-beta2-keep10-budget30-ep2 jayzou3773/commonsense-15k-ss20-step30-int5-plantrue-tau2.0-beta1-keep10-budget30-ep2 jayzou3773/commonsense-15k-ss20-step30-int5-plantrue-tau2.0-beta2-keep10-budget30-ep2 jayzou3773/merging-step15-mweight-tau0.1-ep2 jayzou3773/merging-step30-mweight-tau0.05-ep2 jayzou3773/qwen1.5-step30-withoutplan-commonsense15k-ep2-in5 jcPatrick/Qwen2.5-omni-3B-Open-R1-GRPO jinaai/jina-vlm jinaai/jina-vlm-mlx jli56/cp_320_nothinker jli56/cp_471_nothinker jongwooko/Flex-Omni-7B katuni4ka/tiny-random-qwen1.5-moe kd1729/Qwen2-Audio-7B-Instruct kokovova/56a16866-d85e-41fa-8f42-e2192ee7ac3e ldhldh/merged-qwen-omni-dare ldhldh/merged-qwen-omni-dare-3 ldhldh/merged-qwen-omni-dare-3B liangjh2001/Qwen2-Audio-7B-Instruct-train-all-full-new liangjh2001/fuseties-and-train-audio_deepfake liangjh2001/fuseties-and-train-audio_emotion liangjh2001/fuseties-and-train-audio_speaker liangjh2001/qwen_audio_ties-full-audio_deepfake_val_new_2w-full liangjh2001/qwen_audio_ties-full-audio_emotion_train_1w5_wo_happy-full liangjh2001/qwen_audio_ties-full-audio_speaker_recognition_random_order_train-full liangjh2001/qwen_audio_ties_new lmstudio-community/Qwen3-Next-80B-A3B-Instruct-MLX-4bit lmstudio-community/Qwen3-Next-80B-A3B-Instruct-MLX-5bit lmstudio-community/Qwen3-Next-80B-A3B-Instruct-MLX-6bit lmstudio-community/Qwen3-Next-80B-A3B-Instruct-MLX-8bit lmstudio-community/Qwen3-VL-30B-A3B-Instruct-MLX-4bit 
lmstudio-community/Qwen3-VL-30B-A3B-Instruct-MLX-5bit lmstudio-community/Qwen3-VL-30B-A3B-Instruct-MLX-6bit lmstudio-community/Qwen3-VL-30B-A3B-Instruct-MLX-8bit lmstudio-community/Qwen3-VL-30B-A3B-Thinking-MLX-4bit lmstudio-community/Qwen3-VL-30B-A3B-Thinking-MLX-5bit lmstudio-community/Qwen3-VL-30B-A3B-Thinking-MLX-6bit lmstudio-community/Qwen3-VL-30B-A3B-Thinking-MLX-8bit mclemcrew/Qwen-Audio-Mix-Instruct michaelfeil/Qwen2-57B-A14B-Instructfp8_tllm michaelfeil/Qwen2-57B-A14B-Instructint4_awq_tllm michaellin/Qwen3-Coder-Next-mlx-4Bit mii-llm/nesso-4B mispeech/r1-aqa mlfoundations/Gelato-30B-A3B mlinmg/Qwen-2-Audio-Instruct-dynamic-fp8 mlx-community/Molmo-7B-D-0924-4bit mlx-community/Qwen1.5-MoE-A2.7B-4bit mlx-community/Qwen1.5-MoE-A2.7B-Chat-4bit mlx-community/Qwen2-57B-A14B-4bit mlx-community/Qwen2-57B-A14B-8bit mlx-community/Qwen2-57B-A14B-Instruct-4bit mlx-community/Qwen2-57B-A14B-Instruct-8bit mlx-community/Qwen2.5-2X32B-CoderInstruct-OlympicCoder-87B-v1.1-4bit mlx-community/Qwen3-Next-80B-A3B-Instruct-4bit mlx-community/Qwen3-Next-80B-A3B-Instruct-8bit mlx-community/Qwen3-Next-80B-A3B-Thinking-4bit mlx-community/Qwen3-Next-80B-A3B-Thinking-8bit mlx-community/Qwen3-VL-235B-A22B-Instruct-3bit mlx-community/Qwen3-VL-235B-A22B-Thinking-3bit mlx-community/Qwen3-VL-30B-A3B-Instruct-3bit mlx-community/Qwen3-VL-30B-A3B-Instruct-4bit mlx-community/Qwen3-VL-30B-A3B-Instruct-6bit mlx-community/Qwen3-VL-30B-A3B-Instruct-8bit mlx-community/Qwen3-VL-30B-A3B-Instruct-bf16 mlx-community/Qwen3-VL-30B-A3B-Thinking-4bit mlx-community/Qwen3-VL-30B-A3B-Thinking-8bit mlx-community/Qwen3-VL-30B-A3B-Thinking-bf16 mncai/hunmin_vlm_235b_v0.11_merged_cua mrtoots/unsloth-Qwen3-Coder-Next-mlx-8bit naveennagar009/qwen2_5_3B_omni_7k_v2 naveennagar009/qwen2_5_7B_omni_13k naveennagar009/qwen2_5_7B_omni_9k nguyenvulebinh/af3 ngxson/qwen3_next_tiny_test nightmedia/Huihui-Qwen3-VL-30B-A3B-Thinking-abliterated-qx86-hi-mlx 
nightmedia/Qwen2.5-2X7B-Coder-Soar-qwen-Coder-Instruct-OlympicCoder-19B-dq68-128k-mlx nkwbtb/OmniEmbed-v0.1 nm-testing/Qwen1.5-MoE-A2.7B-Chat-quantized.w4a16 nm-testing/Qwen3-Next-80B-A3B-Instruct-NVFP4 nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4 nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4 nvidia/Qwen3-VL-235B-A22B-Instruct-NVFP4-MLPerf-Inference-Closed-V6.0 nvidia/music-flamingo-hf openinterx/UGC-VideoCaptioner optimum-intel-internal-testing/tiny-random-qwen1.5-moe peft-internal-testing/tiny-random-qwen-1.5-MoE pluto6272/Qwen3-VL-30B-Medical-V3-Precision prithivMLmods/Qwen3-VL-30B-A3B-Instruct-abliterated-v1 puwaer/Qwen3-Next-80B-A3B-Thinking-GRPO-Uncensored qzkiyoshi/finetune_jun_qwen rajthakkar123/qwen2.5-omni-7b-q8_0 recursal/QRWKV6-7B-Base reinforce20001/Qwen3-VL-30B-A3B-Thinking-NVFP4 rrbale/pruned-qwen-moe samaritan-ai/LightOnOCR-2-1B-sam-44-mss-alb scrunter/Qwen3-VL-235B-A22B-Thinking-heretic shivak/Qwen3-VL-235B-A22B-Thinking-W4A16 shuyuej/Qwen2-57B-A14B-GPTQ shuyuej/Qwen2-57B-A14B-Instruct-GPTQ sirus/Qwen3-VL-30B-A3B-Instruct-sovereign-beta sugiv/octopus-omni-embed the-qa-company-official/Qwen3-VL-30B-A3B-Thinking-NVFP4 thisisiron/Ovis2-1B-hf thisisiron/Ovis2-2B-hf thoddnn/colqwen2-v1.0-hf thoddnn/colqwen2-v1.0-mlx thoddnn/colqwen2-v1.0-mlx-4bit thoddnn/colqwen2-v1.0-mlx-8bit thucdangvan020999/qwen2-audio-instruct-ep10-ckpt550-1000samples thucdangvan020999/qwen2-audio-instruct-ep15-ckpt900-1000samples thucdangvan020999/qwen2-audio-instruct-ep20-ckp1220-1000samples thucdangvan020999/qwen2-audio-instruct-ep5-ckpt305-1000samples thucdangvan020999/qwen2-audio-instruct-iemocap-ckpt200 thucdangvan020999/qwen2-audio-instruct-iemocap-ckpt400 thucdangvan020999/qwen2-audio-instruct-iemocap-ckpt600 thucdangvan020999/qwen2-audio-instruct-iemocap-ckpt800 thucdangvan020999/qwen2-audio-instruct-iemocap-v2-ckpt100 thucdangvan020999/qwen2-audio-instruct-iemocap-v2-ckpt200 thucdangvan020999/qwen2-audio-instruct-iemocap-v2-ckpt300 
thucdangvan020999/qwen2-audio-instruct-iemocap-v2-ckpt400 thucdangvan020999/qwen2-audio-instruct-iemocap-v2-ckpt500 thucdangvan020999/qwen2-audio-instruct-iemocap-v2-ckpt600 tiny-random/qwen2.5-omni tiny-random/qwen3-next-moe tiny-random/qwen3-vl-moe tranquangchung/qwen2-audio-dialogue tuanna08go/4157a659-35c7-43b8-ac2f-40ae4da17214 tuanna08go/743d2a68-4596-4ef4-b0b2-d57af29bb021 unsloth/Qwen2.5-Omni-3B unsloth/Qwen2.5-Omni-7B unsloth/Qwen3-Coder-Next unsloth/Qwen3-Coder-Next-Base unsloth/Qwen3-Coder-Next-FP8 unsloth/Qwen3-Coder-Next-FP8-Dynamic unsloth/Qwen3-Next-80B-A3B-Instruct unsloth/Qwen3-Next-80B-A3B-Instruct-bnb-4bit unsloth/Qwen3-Next-80B-A3B-Thinking unsloth/Qwen3-VL-235B-A22B-Instruct unsloth/Qwen3-VL-235B-A22B-Instruct-FP8 unsloth/Qwen3-VL-30B-A3B-Instruct unsloth/Qwen3-VL-30B-A3B-Instruct-FP8 unsloth/Qwen3-VL-30B-A3B-Thinking unsloth/Qwen3-VL-30B-A3B-Thinking-FP8 vidore/colqwen2-v1.0-hf vincentzed-hf/Qwen3-Coder-Next-NVFP4 wolfofbackstreet/Qwen2-Audio-7B-Instruct-Onnx wolfofbackstreet/Qwen2-Audio-7B-Instruct-Openvino-4Bit wolfofbackstreet/Qwen2.5-Omni-3B-4Bit wolfofbackstreet/Qwen2.5-Omni-3B-4Bit-Openvino yaolily/TimeChat-Captioner-GRPO-7B yasinarafatbd/Qwen2_Audio yasinarafatbd/Qwen2_Audio_Engine_Sound ybkim95-ai/VocalAgents2_snuh_cv_fold1 ybkim95-ai/VocalAgents2_snuh_cv_fold2 ybkim95-ai/VocalAgents2_snuh_cv_fold3 ybkim95-ai/VocalAgents2_snuh_cv_fold4 ybkim95-ai/VocalAgents2_snuh_cv_fold5 yhcao/sft_base_3x_with_pt_extra yhcao/sft_base_3x_with_pt_extra_continue_silence yhcao/sft_final yogkul2000/AVATAR yuhui1038/SpeechRole-Agent yujiepan/qwen1.5-moe-tiny-random yujiepan/qwen2-audio-tiny-random yujiepan/qwen2.5-omni-tiny-random yujiepan/qwen3-next-moe-tiny-random yujiepan/qwen3-vl-moe-tiny-random zenlm/zen-omni zh-liu799/56565 zh-liu799/789564 zhifeixie/Audio-Reasoner

Pattern #2 (285 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁"), Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A...")])
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+normalizer:		Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A...")
+pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
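
The diff above moves whitespace handling out of the normalizer (Strip and Replace) and into the pre-tokenizer (WhitespaceSplit). The sketch below approximates only those plain-text steps with the Python standard library so the shift is easier to see. It is an illustration under stated assumptions: the function names are hypothetical, Python's `rstrip` and `split` stand in for the library's own whitespace definitions, and the Precompiled and Metaspace steps are deliberately left out.

```python
import re

def old_whitespace_steps(text: str) -> str:
    # Approximates Strip(strip_left=False, strip_right=True) followed by
    # Replace(pattern=Regex(" {2,}"), content="▁") from the removed normalizer.
    return re.sub(r" {2,}", "▁", text.rstrip())

def new_whitespace_steps(text: str) -> list:
    # Approximates WhitespaceSplit() from the added pre_tokenizer sequence:
    # split on runs of whitespace and drop the whitespace itself.
    return text.split()

# Arbitrary sample with repeated and trailing spaces (not taken from the report).
sample = "long  t5   input  "

old_result = old_whitespace_steps(sample)
new_result = new_whitespace_steps(sample)
print(repr(old_result))
print(new_result)
```

The old steps return a single string in which each run of two or more spaces has become "▁", whereas the new step returns a list of whitespace-free pieces that Metaspace then processes individually. Whether the full pipelines yield identical tokens for a given input is not something this sketch establishes.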

Affected models

Converter	Models
T5Converter	12313Max/musicgen-small-custom AleBurzio/long-t5-base-govreport Archana99/musicgen-lora-testing Archeane/first_tldr ArmandoRockYourCloud/MusicaMDM Blaise-g/longt5_tglobal_large_sumpubmed DONG19/code-search-net-codemoe-base DevPanda004/musicgen-melody-indian DevPanda004/musicgen-small-ft1 Guyfromvillage/musicgen-stereo-lora-test Joemgu/long-t5-base-sumstew Karim-Gamal/switch-base-8-finetuned-SemEval-2018-emojis-IID-Fed Karim-Gamal/switch-base-8-finetuned-SemEval-2018-emojis-cen-1 Karim-Gamal/switch-base-8-finetuned-SemEval-2018-emojis-cen-2 LegolasTheElf/Long-T5-Booksum LoopersClub/musicgen-small Mursel/switch-base-8-xsum MusicBuddy/MusicGen_Large NitzanBar/t5_long QuangHuy54/long-t5-tglobal-base-multinews RMWeerasinghe/long-t5-tglobal-base-finetuned-govReport-4096 Razavipour/musicgen-persian-finetuned Razavipour/musicgen-persian-finetuned_setar Razavipour/musicgen-persian-finetuned_setar_test Razavipour/musicgen-persian-finetuned_setar_with_meta Razavipour/musicgen-persian-traditional-Santoor-40-samples Razavipour/musicgen-persian-traditional-Tonbak-40-samples Razavipour/musicgen-persian-traditional-instruments Razavipour/musicgen-persian-traditional-instruments-mini Razavipour/musicgen-persian-traditional-instruments-tiny Razavipour/musicgen-persian-traditional-instruments-tiny_3 Razavipour/musicgen-persian-traditional-kamancheh-40-samples RikiLin/musicgen-melody-lora-piano RuaYiii/musicgen Shobhank-iiitdwd/long-t5-tglobal-base-16384-book-summary Splend1dchan/long-t5lephone-5000 Stancld/longt5-tglobal-large-16384-pubmed-3k_steps TrulySenya/musicgen-melody-lora-punk TrulySenya/musicgen-synths-test VasilyMorzhakov/output WelfCrozzo/belarussian-switch-translator-L512 WelfCrozzo/belarussian-switch-ul2 Xenova/long-t5-encodec-tglobal-base Xenova/long-t5-local-base Xenova/long-t5-tglobal-base Xenova/long-t5-tglobal-base-16384-book-summary Xenova/musicgen-small Yoga26/longT5 Z3K3/musicgen-large acmc/summarizer_google_long-t5-local-base_keybert_unfaceted 
acmc/summarizer_google_long-t5-tglobal-base_keywords_unfaceted agkochuev19/music alex2awesome/city_council_gpt3_silver_standard_summaries__long_t5_local_base annamoerman/music-gen-test artovv/musicgen-melody-my-adapters-v3 artovv/musicgen-melody-mystyle-adapters-v2 artovv/musicgen-mystyle_v3 artovv/musicgen-mystyle_v5 asach/lognt5-xsum-icsi-5 avasaz/avasaz-large avasaz/avasaz-webgl awkyu/musicgen-small ayrmoney/musicgen-large birgermoell/musicgen-melody-lora-punk bloominho/my_awesome_opus_books_model cjinghong/musicgen-melody-lora-punk contemmcm/19b9a99d360126bde69d42d263b160bc crumb/switch-base-8-arxiv-abstraction csc-unipd/tasty-musicgen-small daniel-was-taken/long-t5-scisumm-accelerate-v2 danieladeeko/t5_research derrickdso/samplegen-small diegopdlv5/musicgen-melody-lora-punk dthomas84/musicgen-large emre/switch-base-8-finetuned-samsum facebook/musicgen-large facebook/musicgen-medium facebook/musicgen-melody facebook/musicgen-melody facebook/musicgen-melody-large facebook/musicgen-melody-large facebook/musicgen-small facebook/musicgen-stereo-large facebook/musicgen-stereo-medium facebook/musicgen-stereo-melody facebook/musicgen-stereo-melody facebook/musicgen-stereo-melody-large facebook/musicgen-stereo-melody-large facebook/musicgen-stereo-small freddy913/FRDYV2 freddy913/FRDYV2_32 freddy913/FRDYV2_33 freddy913/FRDYV2_34 freddy913/FRDYV2_35 freddy913/FRDYV2_36 freddy913/FRDYV2_37 freddy913/FRDYV2_40 freddy913/FRDYV3_31 freddy913/FRDYV4 fxmarty/tiny-random-working-LongT5Model glamprou/switch-base-8-sst2 google/long-t5-local-base google/long-t5-local-large google/long-t5-tglobal-base google/long-t5-tglobal-large google/long-t5-tglobal-xl google/switch-base-128 google/switch-base-16 google/switch-base-256 google/switch-base-32 google/switch-base-64 google/switch-base-8 google/switch-c-2048 google/switch-large-128 hangzeli/musicgen-melody-lora-punk harisnaeem/musicgen-small-ONNX heboya8/facebook-musicgen-small-not-lora-280 
heboya8/facebook-musicgen-small-not-lora-40 heboya8/facebook-musicgen-small-not-lora-400 heboya8/facebook-musicgen-small-not-lora-420 heboya8/facebook-musicgen-small-not-lora-440 heboya8/facebook-musicgen-small-not-lora-450 heboya8/facebook-musicgen-small-not-lora-470 heboya8/facebook-musicgen-small-not-lora-50 heboya8/facebook-musicgen-small-not-lora-500 heboya8/facebook-musicgen-small-not-lora-510 heboya8/facebook-musicgen-small-not-lora-530 heboya8/facebook-musicgen-small-not-lora-570 heboya8/facebook-musicgen-small-not-lora-60 heboya8/facebook-musicgen-small-not-lora-610 heboya8/facebook-musicgen-small-not-lora-620 heboya8/facebook-musicgen-small-not-lora-640 heboya8/facebook-musicgen-small-not-lora-660 heboya8/facebook-musicgen-small-not-lora-680 heboya8/facebook-musicgen-small-not-lora-700 heboya8/facebook-musicgen-small-not-lora-90 heslil/msmall hf-internal-testing/tiny-random-LongT5ForConditionalGeneration hf-internal-testing/tiny-random-LongT5Model hf-internal-testing/tiny-random-MusicgenForConditionalGeneration hf-internal-testing/tiny-random-MusicgenMelodyForConditionalGeneration hf-internal-testing/tiny-random-SwitchTransformersForConditionalGeneration hf-internal-testing/tiny-random-SwitchTransformersModel hf-tiny-model-private/tiny-random-SwitchTransformersForConditionalGeneration hf-tiny-model-private/tiny-random-SwitchTransformersModel hmueller25/LT5-finetuned hmueller25/long-t5-tglobal-base-german-law huyquoctrinh/musicgen-melody-lora-punk hyeongii/musicgen-melody-lora-punk jackmedda/google-long-t5-tglobal-base_finetuned_augmented_augmented_llama3.3_70b jamesdon/audiogen-medium-endpoint jane102350/musicgen-melody-lora-punk jauntybrain/musicgen-small jihong008/musicgen-melody-lora-punk jihong008/musicgen-melody-lora-punk2 jihong008/musicgen-melody-lora-punk3 jpe9596/musicgen-large junhaoxjtu/musicgen-melody-lora-punk kresnandika/long-t5-tglobal-base-samsum kylielee505/mymgm kylielee505/mymgm laurasi/aimusic learn3r/longt5_xl_gov_5 
learn3r/longt5_xl_gov_memsum_bp_5 learn3r/longt5_xl_govreport_4096_e10 learn3r/longt5_xl_govreport_4096_memsum_e10 learn3r/longt5_xl_sfd_20 learn3r/longt5_xl_sfd_4096_e10 learn3r/longt5_xl_sfd_bp_15 learn3r/longt5_xl_sfd_bp_20 learn3r/longt5_xl_sfd_memsum_30 lusciniaweldmou/musicgen-melody-lora-punk marv1nnnnn/musicgen-songstarter mclemcrew/musicgen-melody-ravi memepottaboah/musicgen-80snewwave-tiny memepottaboah/musicgen-POPMUSIC1981-melody merve/musicgen-small mrm8488/switch-base-16-finetuned-samsum mrm8488/switch-base-16-finetuned-xsum mrm8488/switch-base-16-finetuned-xsum-2 mrm8488/switch-base-8-finetuned-samsum nbroad/longt5-base-global-mediasum ogbanugot/musicgen-melody-lora-afrobeats ogbanugot/musicgen-melody-lora-afrobeats-with-vocals ogbanugot/musicgen-melody-lora-afrobeats-with-vocals-long ogbanugot/musicgen-small-lora-afrobeats omarimc/musicgen-large omarimc/musicgen-medium omarimc/musicgen-melody omarimc/musicgen-melody omarimc/musicgen-small onurio/musicgen-large originstory/holisleigh originstory/holisleigh2 pbotsaris/musicgen-small pharoAIsanders420/micro-musicgen-jungle pharoAIsanders420/musicgen-tiny-jungle-onnx pingzhili/switch-base-32-finetuned-copa pingzhili/switch-base-32-finetuned-hotpotqa pingzhili/switch-base-32-finetuned-mrpc pingzhili/switch-base-32-finetuned-multirc pingzhili/switch-base-32-finetuned-squad pingzhili/switch-base-32-finetuned-sst2 pingzhili/switch-base-32-finetuned-wikiqa pingzhili/switch-base-32-finetuned-winogrande pszemraj/long-t5-tglobal-base-16384-book-summary pszemraj/long-t5-tglobal-base-16384-booksum-V11-big_patent-V2 pszemraj/long-t5-tglobal-base-sci-simplify pszemraj/long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP pszemraj/long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP17 pszemraj/long-t5-tglobal-xl-16384-book-summary pszemraj/long-t5-tglobal-xl-16384-book-summary-8bit reach-vb/musicgen-large-endpoint reach-vb/musicgen-large-fp16-endpoint reach-vb/musicgen-small reach-vb/musicgen-small-endpoint 
reach-vb/musicgen-small-test ronaldseoh/long-t5-tglobal-large rooftopcoder/long-t5-tglobal-base-16384-book-summary-finetuned-dialogsum rooftopcoder/longt5-dialogsum-2048 rrodrigu3z/long-t5-tglobal-base-joint-dg rubentito/longt5-tglobal-base-mpdocvqa sanchit-gandhi/musicgen-small satyanshu404/long-t5-local-base-finetuned-justification-v10 seckmaster/musicgen-large shahzebnaveed/moe_switch_transformer_summarization shrg7/musicgen-melody-lora-punk shrg7/musicgen-melody-lora-punk-base skroed/musicgen-medium slavocado/musicgen-large smarters/musicgen-large-csi smarters/musicgen-small-csi sweet-dreambooths/black-eyed-peas-v1-autotuned sweet-dreambooths/black-eyed-peas-v1-crafted-prompt sweet-dreambooths/black-eyed-peas-v1-crafted-prompt-1-epoch sweet-dreambooths/black-eyed-peas-v1-crafted-prompt-3-epochs sweet-dreambooths/black-eyed-peas-v1-crafted-prompt-3-epochs-piano-prompts sweet-dreambooths/black-eyed-peas-v1-crafted-prompt-3-epochs-text-only sweet-dreambooths/black-eyed-peas-v1-crafted-prompt-3-epochs-text-only-no-instance sweet-dreambooths/black-eyed-peas-v1-crafted-prompt-3-epochs-text-only-piano-prompts sweet-dreambooths/black-eyed-peas-v1-crafted-variable-prompt-16-epochs-piano-prompts sweet-dreambooths/black-eyed-peas-v1-crafted-variable-prompt-16-epochs-text-only-piano-prompts sweet-dreambooths/black-eyed-peas-v1-crafted-variable-prompt-3-epochs-piano-prompts sweet-dreambooths/black-eyed-peas-v1-crafted-variable-prompt-8-epochs-piano-prompts sweet-dreambooths/black-eyed-peas-v1-crafted-variable-prompt-8-epochs-text-only-piano-prompts sweet-dreambooths/black-eyed-peas-v1-lower-lr sweet-dreambooths/black-eyed-peas-v1-unprompted sweet-dreambooths/black-eyed-peas-v1-unprompted-lower-lr sweet-dreambooths/combined-artists-text-only-1-epochs sweet-dreambooths/combined-artists-text-only-3-epochs talargv/musicgen-finetune-aav talargv/musicgen-finetune-phonk taufiqsyed/musicgen-melody-lora-punk taufiqsyed/salami-data-clean-model taufiqsyed/salami_neural_demo_model 
taufiqsyed/salami_truncsplit_dora_model taufiqsyed/salami_truncsplit_finetune_model taufiqsyed/salami_truncsplit_legit1_model taufiqsyed/salami_truncsplit_model taufiqsyed/salami_truncsplit_model_mid taufiqsyed/salami_truncsplit_model_smol taufiqsyed/salami_truncsplit_model_trial2 tboucher/piano-mono-melody tryolabs/long-t5-tglobal-base-blogpost-cqa-onnx tsuyuan/long-t5-encodec-tglobal-base tvergho/t5-cards vinnie329/musicgen-lora-emotive-ambient vinnie329/musicgen-lora-emotive-ambient-2 vinnie329/musicgen-melody-lora-alt-hip-hop whaleloops/longt5-tglobal-large-16384-pubmed-10k_steps xkristian/long5-LegalDocumentSummarization ybelkada/switch-base-8-xsum ylacombe/musicgen-melody ylacombe/musicgen-melody-large ylacombe/musicgen-melody-large-punk-lora ylacombe/musicgen-melody-lora-punk ylacombe/musicgen-melody-punk-lora ylacombe/musicgen-stereo-melody ylacombe/musicgen-stereo-melody-large yuthrb/musicgen-custom zcode/seq2seq-parseg zera09/Word-selector zera09/long_t5 zera09/long_t5_4 zubes01/switch-base-8-imdb-text-classification

Pattern #3 (263 models)

-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+decoder:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)

Affected models

Converter	Models
TikTokenConverter	1024m/OLMoE-1B-7B-0924-Base 1024m/OLMoE-1B-7B-0924-Instruct-Base 4bit/mpt-7b-storywriter-4bit-128g Alchan/mpt-7b-chat AntonV/mamba2-1.3b-av AntonV/mamba2-1.3b-hf AntonV/mamba2-130m-av AntonV/mamba2-130m-hf AntonV/mamba2-2.7b-av AntonV/mamba2-2.7b-hf AntonV/mamba2-370m-av AntonV/mamba2-370m-hf AntonV/mamba2-780m-av AntonV/mamba2-780m-hf ArthurZ/mamba-1.4b ArthurZ/mamba-130m Codemaster67/OLMo-7B-USPTO-1k-ZINC DanielAWrightGabrielAI/mpt-7b-storywriter-4bit-128g-65kTokens-CPU EleutherAI/Hermes-RWKV-v4-3B Fizzarolli/OLMoE-1B-7B-0924-extended-pos-emb Intel/neural-chat-7b-v1-1 KnutJaegersberg/RWKV-4-PilePlus-169M-20230520-done-ctx4096 KnutJaegersberg/RWKV-4-PilePlus-1B5-20230520-2942-486Gtokens-ctx4096 KnutJaegersberg/RWKV-4-PilePlus-3B-20230520-3147-520Gtokens-ctx4096 KnutJaegersberg/RWKV-4-PilePlus-430M-20230520-6162-1018Gtokens-ctx4098 KnutJaegersberg/RWKV-pileplus-1B5-evol_instruct_v2 Lansechen/OLMoE-1B-7B-012-Distill-or-math220k-batch32-epoch3-8192 Lansechen/OLMoE-1B-7B-0125-Distill-bs17k-batch32-epoch1-8192 Lansechen/OLMoE-1B-7B-0125-Distill-bs17k-batch32-epoch5-8192 Lansechen/OLMoE-1B-7B-0125-Distill-or-math220k-batch32-epoch1-8192 Lansechen/OLMoE-1B-7B-0125-Distill-ot114k-batch32-epoch1-8192 Lansechen/OLMoE-1B-7B-0125-Distill-ot114k-batch32-epoch3-8192 Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-ot114k-batch32 Litzy619/OLMoE-1B-7B-0924-step490000-tokens2055B-qlora Nethermind/Mpt-Instruct-DotNet-S NickNickGo/pocket_olmoe OccamRazor/mpt-7b-storywriter-4bit-128g OuteAI/OuteTTS-0.3-1B P1ayer-1/mpt-7b-instruct-base Pratye/mpt-7b-chat Q-bert/Mamba-130M Q-bert/Mamba-1B Q-bert/Mamba-3B RWKV/RWKV7-Goose-Pile-168M-HF RWKV/rwkv-4-14b-pile RWKV/rwkv-4-169m-pile RWKV/rwkv-4-1b5-pile RWKV/rwkv-4-3b-pile RWKV/rwkv-4-430m-pile RWKV/rwkv-4-7b-pile RWKV/rwkv-raven-14b RWKV/rwkv-raven-1b5 RWKV/rwkv-raven-3b RWKV/rwkv-raven-7b RichardErkhov/DeepMount00_-_mamba_790_hf_qa-4bits RichardErkhov/allenai_-_OLMoE-1B-7B-0924-4bits 
RichardErkhov/allenai_-_OLMoE-1B-7B-0924-8bits RichardErkhov/state-spaces_-_mamba-130m-hf-8bits RichardErkhov/tsavage68_-_mpt_1000_STEPS_1e5_SFT_SFT-8bits RtaForge/mamba2-2.7b-gurukul-instruct RtaForge/mamba2-2.7b-gurukul-instruct StarRing2022/RWKV-4-Raven-3B-v11-zh StarRing2022/RWKV-430M-Pile-Alpaca TRI-ML/mamba-7b-rw TehVenom/MPT-7b-Chat-Instruct-LongCTX-Merge TehVenom/MPT-7b-InstructAndStorywriting-50_50-Merge TehVenom/MPT-7b-WizardLM_Uncensored-Storywriter-Merge TehVenom/MPT-7b-storywriter-Apache-2.0 TehVenom/mpt-7b-InstructAndStorywriting-75_25-Merge Tomasal/OLMoE-1B-7B-0125-1-epoch-enron Xenova/tiny-mamba-onnx ZySec-AI/Mamba-2.8B-CyberSec agr505/fine_tuned_mamba_causal_1_19_run2 agr505/fine_tuned_mamba_causal_1_19_run3 allenai/MolmoE-1B-0924 allenai/MolmoE-1B-0924 allenai/OLMo-1B-0724-hf allenai/OLMo-1B-hf allenai/OLMo-7B-0424-Instruct-hf allenai/OLMo-7B-0424-hf allenai/OLMo-7B-0724-Instruct-hf allenai/OLMo-7B-0724-SFT-hf allenai/OLMo-7B-0724-hf allenai/OLMo-7B-Instruct-hf allenai/OLMo-7B-Twin-2T-hf allenai/OLMo-7B-hf allenai/OLMoE-1B-7B-0125 allenai/OLMoE-1B-7B-0125 allenai/OLMoE-1B-7B-0125-DPO allenai/OLMoE-1B-7B-0125-Instruct allenai/OLMoE-1B-7B-0125-SFT allenai/OLMoE-1B-7B-0924 allenai/OLMoE-1B-7B-0924 allenai/OLMoE-1B-7B-0924-Instruct allenai/OLMoE-1B-7B-0924-Instruct allenai/OLMoE-1B-7B-0924-SFT allenai/OLMoE-1B-7B-0924-SFT allura-org/MoE-Girl-1BA-7BT amd/AMD-OLMo-1B amd/AMD-OLMo-1B-SFT amd/AMD-OLMo-1B-SFT-DPO anas-awadalla/mpt-7b autoprogrammer/OLMoE-1B-7B-0125_lr2e-05_epoch4_freeze autoprogrammer/gsm_OLMoE-1B-7B-0125_lr2e-05_epoch4_epoch_3 autoprogrammer/olmoe_densebackward0125 autoprogrammer/olmoe_densebackward0125_v1 avidoavid/RWKV-1b5-finetuned-overfit ayoubkirouane/Mamba-Chat-2.8B binh230/mamba2-370m breadlicker45/MuseBan breadlicker45/MuseRWKV breadlicker45/MuseRift breadlicker45/MuseRizz breadlicker45/muse-test-36 breadlicker45/muse-test-37 breadlicker45/muse-test-38 breadlicker45/muse-test35 breadlicker45/museRWKV-test 
breadlicker45/music-rwkv-v4 breadlicker45/music-rwkv2-v4 breadlicker45/rwkv-4-169m-pile-5120 breadlicker45/rwkv-4-169m-pile-6144 breadlicker45/rwkv-4-430m-2048 breadlicker45/rwkv-4-430m-3072 breadlicker45/rwkv-4-430m-4096 breadlicker45/rwkv-4-430m-5120 breadlicker45/rwkv-4-430m-6144 breadlicker45/rwkv-music3 breadlicker45/token-music cahya/rwkv-1B5-instruction cg666/OLMoE-1B-7B-0125-Instruct-grpo-E6-D100 cg666/OLMoE-1B-7B-0125-Instruct-grpo-E6-D8000-L4096 echarlaix/tiny-mpt-random-remote-code efederici/ipt-125m estrogen/olmoe-upscale estrogen/olmoe-upscale-attempt1 estrogen/olmoe-upscale-inkmixv3-ep1 estrogen/olmoe-upscale-inkmixv3-ep2 ethzanalytics/mpt-7b-storywriter-sharded feliperodriguezborquez/OLMoE-0125-attn-rnd-V1 feliperodriguezborquez/OLMoE-0125-base-V1 feliperodriguezborquez/OLMoE-0125-base-rnd-V1 feliperodriguezborquez/OLMoE-0924-attn-rnd-V1 feliperodriguezborquez/OLMoE-0924-base-V1 feliperodriguezborquez/OLMoE-0924-base-rnd-V1 feliperodriguezborquez/OLMoE-0924-my-V1 feliperodriguezborquez/OLMoE-0924-my-rnd-V1 finnstrom3693/rwkv-raven-1.5b fla-hub/mamba-7B fla-hub/rwkv7-168M-pile foilfoilfoil/RWKV-pileplus-HF-169M fxmarty/tiny-mpt-random-remote-code gl198976/mpt-7b gl198976/mpt-7b-instruct gretelai/mpt-7b han1997/mamba-2.8b-slimpj-hf harindhar10/OLMo-7B-USPTO-1k-ZINC harindhar10/OLMo-7B-ZINC20-10k harindhar10/OLMo-7B-ZINC20-50k harindhar10/OLMo-7B-ZINC20-50k-USPTO-50k hf-internal-testing/tiny-random-MambaForCausalLM hf-internal-testing/tiny-random-MambaModel hf-internal-testing/tiny-random-MptForCausalLM hf-internal-testing/tiny-random-MptForQuestionAnswering hf-internal-testing/tiny-random-MptForSequenceClassification hf-internal-testing/tiny-random-MptForTokenClassification hf-internal-testing/tiny-random-MptModel hf-internal-testing/tiny-random-OlmoForCausalLM hf-internal-testing/tiny-random-OlmoeForCausalLM hf-internal-testing/tiny-random-RwkvForCausalLM hf-internal-testing/tiny-random-RwkvModel huxiang088/OLMoE-1B-7B-0924-Instruct-NVFP4 
hyungtae/mpt-30b interview-eval/olmoe-depthqa-test-4 interview-eval/olmoe-depthqa-test-instruct-6 interview-eval/olmoe-depthqa-test-train-5 interview-eval/olmoe-depthqa-train-1 interview-eval/olmoe-gsm8k-3 interview-eval/olmoe-math-test-4 interview-eval/olmoe-math-test-gsm8k-5 interview-eval/olmoe-math-test-instruct-6 interview-eval/olmoe-math-test-train-5 interview-eval/olmoe-math-train-1 interview-eval/olmoe-math-train-gsm8k-2 iwalton3/rwkv-14b-wizardlm jmichaelov/parc-rwkv-seed2 jploski/mpt-mini-shakespeare jprafael/mpt-7b-instruct-sharded katuni4ka/tiny-random-olmo-hf lennyhans/OLMoE-1B-7B-0125-Instruct-bnb-4bit lennyhans/OLMoE-1B-7B-0125-bnb-4bit lennyhans/OLMoE-1B-7B-0125-bnb-4bit lentan/mpt-125m lightblue/japanese-mpt-7b manojpreveen/mpt-30b-v5 mesh-ops/OLMoE-1B-7B-0924-step1140000-tokens4781B mesh-ops/OLMoE-1B-7B-0924-step1200000-tokens5033B mesh-ops/OLMoE-1B-7B-0924-step1215000-tokens5096B mesh-ops/OLMoE-1B-7B-0924-step1220000-tokens5117B mesh-ops/OLMoE-1B-7B-0924-step900000-tokens3774B mlx-community/OLMoE-1B-7B-0125 mlx-community/OLMoE-1B-7B-0125-4bit mlx-community/OLMoE-1B-7B-0125-6bit mlx-community/OLMoE-1B-7B-0125-6bit mlx-community/OLMoE-1B-7B-0125-8bit mlx-community/OLMoE-1B-7B-0125-Instruct mlx-community/OLMoE-1B-7B-0125-Instruct-4bit mlx-community/OLMoE-1B-7B-0125-Instruct-6bit mlx-community/OLMoE-1B-7B-0125-Instruct-8bit mlx-community/mamba-130m-hf-f32 modularai/replit-code-1.5 motionlabs/OLMoE-1B-5B nm-testing/OLMoE-1B-7B-0924-Instruct-FP8 nomic-ai/gpt4all-mpt nomic-ai/gpt4all-mpt-2 onnx-community/tiny-random-olmo-hf openaccess-ai-collective/mpt-7b-wizardlm optimum-intel-internal-testing/tiny-mamba optimum-intel-internal-testing/tiny-random-MptForCausalLM optimum-intel-internal-testing/tiny-random-olmo-hf petkopetkov/mamba2-1.3b-hf petkopetkov/mamba2-130m-hf petkopetkov/mamba2-2.7b-hf petkopetkov/mamba2-370m-hf petkopetkov/mamba2-780m-hf porcu-pine/mamba-detoxer ragunath-ravi/mamba-akkadian-translator 
rdabin/OLMoE-1B-7B-0924-Instruct-all_components rdabin/OLMoE-1B-7B-0924-Instruct-attention_and_experts rdabin/OLMoE-1B-7B-0924-Instruct-attention_only rdabin/OLMoE-1B-7B-0924-Instruct-experts_only rdabin/OLMoE-1B-7B-0924-Instruct-router_and_attention rdabin/OLMoE-1B-7B-0924-Instruct-router_and_experts rdabin/OLMoE-1B-7B-0924-Instruct-router_only replit/replit-code-v1_5-3b rwl4/mpt-7b-chat-extended scottsus/mamba-1.4b-instruct-hf scottsus/mamba-2.8b-papers-trained scottsus/mamba-2.8b-wdc-trained-v2 sgugger/rwkv-430M-pile sgugger/rwkv-7b-pile state-spaces/mamba-1.4b-hf state-spaces/mamba-130m-hf state-spaces/mamba-2.8b-hf state-spaces/mamba-370m-hf state-spaces/mamba-790m-hf taylodl1/possum1_8k_hf tbmod/OLMo-7B-Instruct-hf team-lucid/mptk-1b telecomadm1145/mamba2_exp5 ucmp137538/rwkv-4-169m-pile-finetuned-sst2 umuthopeyildirim/fin-rwkv-169M umuthopeyildirim/fin-rwkv-1b5 umuthopeyildirim/fin-rwkv-430m whaleloops/clinicalmamba-130m-hf whaleloops/clinicalmamba-2.8b-hf wtang06/mpt-125m-c4 yujiepan/mamba-tiny-random yujiepan/mpt-tiny-random yusx-swapp/ofm-mamba-1.4b-lambda-hf zary0/mamba-2.7b-ja-sft zhangtaolab/plant-dnamamba-BPE zhangtaolab/plant-dnamamba-BPE-promoter

Pattern #4 (178 models)

-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}{1,3}| ?[^\s\p{L}\p{N}]+[\r\n]*|\s*[\r\n]..."), behavior=Removed, invert=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=False)])

Affected models

Converter	Models
GPT2Converter	0x88844451/5abc9f1a-cf97-4fe2-93c2-22ad01b8e0ea Aivesa/03c4f17f-c58f-4d99-9fa6-723e83ce2289 Apolo81/granite-4-350m-map-commands-gguf Boojum/blue-moe Boojum/blue-moe-6b-it ClarenceDan/0dc7f291-d101-46d9-a564-231611885d6e ClarenceDan/e93909e1-3ea7-49ac-89b0-3a6358376513 Etherll/Tashkeel-350M-v2 ExaltedSlayer/ibm-granite-4.0-h-small-mlx-mxfp4 Goekdeniz-Guelmez/Josiefied-granite-4.0-micro-abliterated-v1 ModelCloud/dbrx-base-converted-v2 ModelCloud/dbrx-instruct-converted-v2 Open4bits/granite-4.0-h-tiny-mlx-fp16 Open4bits/granite-4.0-micro-mlx-3Bit OpenMOSE/RWKV-Reka-3.1-Flash R0mAI/331883f4-b627-4999-a74e-d33fc5fafdd2 R0mAI/3d737383-c3fa-4c8f-ad38-b5af93247aca RedHatAI/granite-4.0-h-small-FP8-block RedHatAI/granite-4.0-h-small-FP8-dynamic RedHatAI/granite-4.0-h-tiny-FP8-dynamic RichardErkhov/katuni4ka_-_tiny-random-dbrx-4bits RichardErkhov/katuni4ka_-_tiny-random-dbrx-8bits Rocketknight1/dbrx-tiny-random SystemAdmin123/tiny-random-dbrx adammandic87/05f441bc-4409-4188-a62a-44e7d3d95c8a allenai/Flex-code-2x7B-1T allenai/Flex-code-2x7B-1T allenai/Flex-creative-2x7B-1T allenai/Flex-creative-2x7B-1T allenai/Flex-math-2x7B-1T allenai/Flex-math-2x7B-1T allenai/Flex-news-2x7B-1T allenai/Flex-news-2x7B-1T allenai/Flex-pes2o-2x7B-1T allenai/Flex-pes2o-2x7B-1T allenai/Flex-reddit-2x7B-1T allenai/Flex-reddit-2x7B-1T allenai/Molmo-7B-O-0924 allenai/Molmo2-O-7B allenai/MolmoAct-7B-O-0812 amd/dbrx-instruct-FP8-KV badmadrad/alm-granite-4.0-tiny-h-finetuned bbytxt/7c8b28fa-995f-45c1-8c76-ee14295c6be7 bdambrosio/dbrx-instruct-7.0bpw-h8-exl2 bowilleatyou/6932964f-b4ba-4d87-9bde-aee96d2216a5 cyankiwi/granite-4.0-h-micro-AWQ-4bit cyankiwi/granite-4.0-h-micro-AWQ-8bit cyankiwi/granite-4.0-h-small-AWQ-4bit cyankiwi/granite-4.0-h-small-AWQ-8bit cyankiwi/granite-4.0-h-tiny-AWQ-4bit daniel40/086ec58e-d093-49c7-9efb-0b6d27efa875 diaenra/be30be8d-506f-4ccb-923b-3fbfcef79427 dimasik1987/ff134de0-010b-48b0-8471-a1047e70f02f dimasik2987/f427b45e-0117-4127-9cce-1a55987b38c4 
drdreaddd/8398b0d6-4a31-499c-94e3-4279c5e9fa74 drewbenson/granite-4.0-h-micro-Q4-mxfp4-MLX fedovtt/e63674ec-1155-4808-aeb9-c00d2f68f6ed filipesantoscv11/160bad19-cf3c-40d3-83ee-c8349ef2e991 filipesantoscv11/710c5c72-b571-4714-830c-59baed18da52 filipesantoscv11/f8a073c0-d3ed-4c99-8a44-d111d6fc700b hf-internal-testing/tiny-random-FlexOlmoForCausalLM huihui-ai/Huihui-granite-4.0-micro-abliterated ibm-granite/granite-4.0-1b ibm-granite/granite-4.0-1b-base ibm-granite/granite-4.0-350m ibm-granite/granite-4.0-350m-base ibm-granite/granite-4.0-h-1b ibm-granite/granite-4.0-h-1b-base ibm-granite/granite-4.0-h-350m ibm-granite/granite-4.0-h-350m-base ibm-granite/granite-4.0-h-micro ibm-granite/granite-4.0-h-micro-base ibm-granite/granite-4.0-h-small ibm-granite/granite-4.0-h-small-FP8 ibm-granite/granite-4.0-h-small-base ibm-granite/granite-4.0-h-tiny ibm-granite/granite-4.0-h-tiny-base ibm-granite/granite-4.0-micro ibm-granite/granite-4.0-micro-base inference-optimization/granite-4.0-h-tiny-FP8-block introvoyz041/ibm-granite-4.0-h-small-mlx-mxfp4-mlx-4Bit irishprancer/caa2d38c-dc86-4198-afd3-1a599519e6bd katuni4ka/tiny-random-dbrx lmstudio-community/granite-4.0-h-small-MLX-4bit lmstudio-community/granite-4.0-h-small-MLX-5bit lmstudio-community/granite-4.0-h-small-MLX-6bit lmstudio-community/granite-4.0-h-small-MLX-8bit lmstudio-community/granite-4.0-h-tiny-MLX-4bit lmstudio-community/granite-4.0-h-tiny-MLX-5bit lmstudio-community/granite-4.0-h-tiny-MLX-6bit lmstudio-community/granite-4.0-h-tiny-MLX-8bit magiccodingman/Granite-4.0-H-1B-Unsloth-MXFP4-Hybrid-GGUF magiccodingman/Granite-4.0-H-350M-Unsloth-MXFP4-Hybrid-GGUF magiccodingman/Granite-4.0-H-350M-Unsloth-MagicQuant-Hybrid-GGUF marialvsantiago/e83503c0-748b-4fb7-9d6c-5faa6535a694 mlx-community/Granite-4.0-H-Tiny-4bit-DWQ mlx-community/granite-4.0-1b-4bit mlx-community/granite-4.0-h-1b-3bit mlx-community/granite-4.0-h-1b-4bit mlx-community/granite-4.0-h-1b-6bit mlx-community/granite-4.0-h-1b-8bit 
mlx-community/granite-4.0-h-350m-4bit mlx-community/granite-4.0-h-350m-8bit mlx-community/granite-4.0-h-micro-4bit mlx-community/granite-4.0-h-micro-8bit mlx-community/granite-4.0-h-tiny-3bit-MLX mlx-community/granite-4.0-h-tiny-3bit-MLX mlx-community/granite-4.0-h-tiny-4bit mlx-community/granite-4.0-h-tiny-5bit-MLX mlx-community/granite-4.0-h-tiny-6bit-MLX mlx-community/granite-4.0-h-tiny-6bit-MLX mlx-community/granite-4.0-micro-8bit mrferr3t/098d7e0d-f18a-4f53-a5d8-5ce2370b3e54 mrferr3t/0a4b6146-bc64-48be-afac-5923d9f127d6 mrferr3t/2dfb4f3e-a033-4718-8d94-7e6e02e17ab9 mrferr3t/5c21cdd1-d5ac-48c8-b859-55f27d47371b mrferr3t/94697a59-2f6e-4920-9a51-48640bb5e678 mrferr3t/9a05be10-930f-49e2-8668-dc3d1c7facbc mrferr3t/aa07cd73-f9f1-4665-9565-e5e2752236b0 mrferr3t/ddfccb92-64d3-4fb2-a8db-243e0595faca nttx/26d1fec9-df17-4915-a9e6-6bb9488f30fa nttx/30a115b0-dd24-4bdc-b54a-3f823984babb nttx/32d5a43b-9ec3-4ff1-9561-49166dc39e00 nttx/540e706b-b84b-44b1-a4ff-cd09597559fb numerouno01/85b9c394-bbd3-40d6-99c6-a40e8a403146 numerouno01/d6e3e57d-cb21-45df-ae20-208268435bca onnx-community/granite-4.0-1b-ONNX-web onnx-community/granite-4.0-350m-ONNX-web onnx-community/granite-4.0-micro-ONNX-web optimum-intel-internal-testing/tiny-random-dbrx optimum-intel-internal-testing/tiny-random-granitemoehybrid prxy5605/40933b89-85aa-42fa-a544-f4b3f4da613b prxy5605/c7a0a965-e520-490c-b575-2cb0e842d0fb prxy5606/263d1fdb-e97d-4666-9aad-257eba4d228c qgallouedec/tiny-DbrxForCausalLM ramendik/miki-pebble-20260131-safetensors seblaku/2329ff78-bb12-41e9-b193-cf014c9dfcba sergioalves/0c12ba50-1288-403c-b3f8-d9a452c7f0ce shanearora/Flex-reddit-2x7B-1T taopanda/test-tiny-random-dbrx tiny-random/granite-moe-hybrid trl-internal-testing/tiny-DbrxForCausalLM trl-internal-testing/tmp-tiny-DbrxForCausalLM unsloth/granite-4.0-1b unsloth/granite-4.0-1b-unsloth-bnb-4bit unsloth/granite-4.0-350m unsloth/granite-4.0-350m-base unsloth/granite-4.0-350m-unsloth-bnb-4bit unsloth/granite-4.0-h-1b 
unsloth/granite-4.0-h-1b-unsloth-bnb-4bit unsloth/granite-4.0-h-350m unsloth/granite-4.0-h-350m-unsloth-bnb-4bit unsloth/granite-4.0-h-micro unsloth/granite-4.0-h-micro-base-unsloth-bnb-4bit unsloth/granite-4.0-h-micro-unsloth-bnb-4bit unsloth/granite-4.0-h-small unsloth/granite-4.0-h-small-FP8-Dynamic unsloth/granite-4.0-h-small-bnb-4bit unsloth/granite-4.0-h-small-unsloth-bnb-4bit unsloth/granite-4.0-h-tiny unsloth/granite-4.0-h-tiny-FP8-Dynamic unsloth/granite-4.0-h-tiny-base unsloth/granite-4.0-h-tiny-base-unsloth-bnb-4bit unsloth/granite-4.0-micro unsloth/granite-4.0-micro-base unsloth/granite-4.0-micro-base-unsloth-bnb-4bit unsloth/granite-4.0-micro-unsloth-bnb-4bit vermoney/cdc1a8da-3025-4663-a070-219f81140542 vertings6/a434d1b9-2850-4b62-a2b9-3efc8fac06ec vertings6/bd5521cc-e201-47c5-bbf9-64fe3343cdf8 vmpsergio/dc6170db-fe75-4268-b721-53f18159aa2c whiteapple8222/26792c56-d5ae-4cbd-b70f-e16bae2c539c whiteapple8222/8a526cc6-dac2-4461-b95b-6aa9599ef5a8_private whiteapple8222/caf66720-6973-4d4f-8328-2befc929adba yujiepan/dbrx-tiny-random yujiepan/dbrx-tiny256-random yujiepan/granite-4.0-h-tiny-random yujiepan/granite-moe-hybrid-tiny-random

Pattern #5 (157 models)

-AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
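To spot which field differs between two such lines without reading them character by character, a small helper can compare the key=value pairs. This is a minimal sketch, not part of the report's tooling: the function name changed_kwargs is hypothetical, and it assumes the values are simple words (True, False, identifiers), which holds for the two AddedToken lines above but not for nested or quoted values.

```python
import re


def changed_kwargs(old: str, new: str) -> dict:
    """Return {name: (old_value, new_value)} for simple key=value pairs that differ.

    Hypothetical helper for illustration; it only understands word-like values.
    """
    pair = re.compile(r"(\w+)=(\w+)")
    old_kw = dict(pair.findall(old))
    new_kw = dict(pair.findall(new))
    return {
        key: (old_kw.get(key), new_kw.get(key))
        for key in sorted(old_kw.keys() | new_kw.keys())
        if old_kw.get(key) != new_kw.get(key)
    }


# The two lines of the diff, with the leading -/+ markers removed.
old = 'AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)'
new = 'AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)'

print(changed_kwargs(old, new))  # {'special': ('True', 'False')}
```

For this pattern the only difference is the special flag on the <mask> token.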

Affected models

Converter	Models
GemmaConverter	0xroyce/silent-voice-multimodal DeadlyHug/gemma3nE4b-it_expv7_8k_r EpistemeAI/Audiogemma-3N-finetune EsotericsEnjoyer/BROKEN-t5gemma-2b-2b-Steiner-Esoterics-Merged FrancescoCaracciolo/chisrtina-e4b GaborMadarasz/gemma-3n-E2B-it_hun_ASR-finetuned Hayloo9838/siglip2-vision-only MLGResearch/cleaver_t5g_ss Mfusenig/large-t5gemma-finetuned-checkpoint-2400 Mfusenig/large-t5gemma-finetuned-checkpoint-2800 Mfusenig/large-t5gemma-finetuned-checkpoint-3200 Mfusenig/large-t5gemma-finetuned-checkpoint-3600 Mfusenig/large-t5gemma-finetuned-final-best-model Mfusenig/small-t5gemma-finetuned-checkpoint-14000 Mfusenig/small-t5gemma-finetuned-checkpoint-21000 Mfusenig/small-t5gemma-finetuned-checkpoint-7000 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-13000 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-14000 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-19500 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-21000 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-26000 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-6500 Mfusenig/t5gemma-finetuned_full_dataset_small-checkpoint-7000 Minthy/t5gemma-2b-2b-ul2-encoder-only MuXodious/gemma-3n-E4B-it-absolute-heresy-MPOA Nadhari/gemma-3n-swahili-E4B-it Nayana-cognitivelab/NayanaSectionOCR Nozim6690/hugging-face_shieldgemma-2-4b-it Qwe1325/Huihui-gemma-3n-E4B-it-abliterated-bnb-4bit RedHatAI/gemma-3n-E2B-it-FP8-dynamic RedHatAI/gemma-3n-E4B-it-FP8-dynamic RedHatAI/gemma-3n-E4B-it-quantized.w4a16 RuteNL/ViT-SO400M-16-SigLIP2-384-ONNX RuteNL/ViT-gopt-16-SigLIP2-384-ONNX attilasir/AttilaAI blind-assist/gemma-3n-2b-finetune-e1-8500 blind-assist/gemma-3n-4b-finetune-8500 brunopio/recurrentgemma-2b-it-nbits4-GS64-Axis1-HQQ-T brunopio/recurrentgemma-2b-it-nbits4-GSNone-Axis0-HQQ-T chimbiwide/Gemma3NPC-it-float16 cp500/mece dogma-black/transformers_t5gemma_2b-prefixlm_v1 google/gemma-3n-E2B google/gemma-3n-E2B-it google/gemma-3n-E4B google/gemma-3n-E4B-it 
google/recurrentgemma-2b google/recurrentgemma-2b-it google/recurrentgemma-9b google/recurrentgemma-9b-it google/shieldgemma-2-4b-it google/t5gemma-2b-2b-prefixlm google/t5gemma-2b-2b-prefixlm-it google/t5gemma-2b-2b-ul2 google/t5gemma-2b-2b-ul2-it google/t5gemma-9b-2b-prefixlm google/t5gemma-9b-2b-prefixlm-it google/t5gemma-9b-2b-ul2 google/t5gemma-9b-2b-ul2-it google/t5gemma-9b-9b-prefixlm google/t5gemma-9b-9b-prefixlm-it google/t5gemma-9b-9b-ul2 google/t5gemma-9b-9b-ul2-it google/t5gemma-b-b-prefixlm google/t5gemma-b-b-prefixlm-it google/t5gemma-b-b-ul2 google/t5gemma-b-b-ul2-it google/t5gemma-l-l-prefixlm google/t5gemma-l-l-prefixlm-it google/t5gemma-l-l-ul2 google/t5gemma-l-l-ul2-it google/t5gemma-ml-ml-prefixlm google/t5gemma-ml-ml-prefixlm-it google/t5gemma-ml-ml-ul2 google/t5gemma-ml-ml-ul2-it google/t5gemma-s-s-prefixlm google/t5gemma-s-s-prefixlm-it google/t5gemma-s-s-ul2 google/t5gemma-s-s-ul2-it google/t5gemma-xl-xl-prefixlm google/t5gemma-xl-xl-prefixlm-it google/t5gemma-xl-xl-ul2 google/t5gemma-xl-xl-ul2-it govnejri/Estimin3n harisarang/t5gemma-2b-2b-prefixlm-lora-pretrained-full harisarang/t5gemma-2b-2b-prefixlm-lora-sft-full harshaljanjani/tiny-t5gemma-test hf-internal-testing/namespace-google-repo_name-gemma-3n-E4B-it huihui-ai/Huihui-gemma-3n-E4B-it-abliterated igorktech/gemma-3n-E2B-it-language igorktech/gemma-3n-e2b-it-language-pruned jordimas/t5gemma-s-s-ul2 lmstudio-community/gemma-3n-E2B-it-MLX-4bit lmstudio-community/gemma-3n-E2B-it-MLX-6bit lmstudio-community/gemma-3n-E2B-it-MLX-8bit lmstudio-community/gemma-3n-E2B-it-MLX-bf16 lmstudio-community/gemma-3n-E4B-it-MLX-4bit lmstudio-community/gemma-3n-E4B-it-MLX-6bit lmstudio-community/gemma-3n-E4B-it-MLX-8bit lmstudio-community/gemma-3n-E4B-it-MLX-bf16 lyimo/gemma-3n-swahili mlx-community/Huihui-gemma-3n-E4B-it-abliterated-lm-4bit mlx-community/Huihui-gemma-3n-E4B-it-abliterated-lm-6bit mlx-community/Huihui-gemma-3n-E4B-it-abliterated-lm-8bit mlx-community/MedraN-E4B-Uncensored-Q4 
mlx-community/gemma-3-12b-it-qat-4bit mlx-community/gemma-3-27b-it-qat-4bit mlx-community/gemma-3-4b-it-qat-4bit mlx-community/gemma-3n-E2B-4bit mlx-community/gemma-3n-E2B-it-4bit mlx-community/gemma-3n-E2B-it-lm-4bit mlx-community/gemma-3n-E2B-it-lm-bf16 mlx-community/gemma-3n-E2B-it-text-4bit-dwq mlx-community/gemma-3n-E4B-bf16 mlx-community/gemma-3n-E4B-it-4bit mlx-community/gemma-3n-E4B-it-8bit mlx-community/gemma-3n-E4B-it-bf16 mlx-community/gemma-3n-E4B-it-lm-4bit mlx-community/gemma-3n-E4B-it-lm-bf16 mshojaei77/gemma-3n-E4B-persian nehmeailabs-org/nehme-flashcheck-270m nightknocker/recurrent-t5gemma-l-l-ul2-encoder oddadmix/MasriSwitch-Gemma3n-Transcriber-v1 onnx-community/gemma-3n-E2B-it-ONNX rizkysulaeman/Gemma3N-4B-Conv-MM-Img-Audio-Text-HealthCare sil-ai/t5gemma-swh-nih silma-ai/SILMA-Kashif-2B-Instruct-v1.0 sugarquark/sd15-text-encoder-t5g-2b-ul2-it thivy/embeddinggemma-300m-norwegian-health timm/ViT-B-16-SigLIP2 timm/ViT-B-16-SigLIP2-256 timm/ViT-B-16-SigLIP2-384 timm/ViT-B-16-SigLIP2-512 timm/ViT-B-32-SigLIP2-256 timm/ViT-L-16-SigLIP2-256 timm/ViT-L-16-SigLIP2-384 timm/ViT-L-16-SigLIP2-512 timm/ViT-SO400M-14-SigLIP2 timm/ViT-SO400M-14-SigLIP2-378 timm/ViT-SO400M-16-SigLIP2-256 timm/ViT-SO400M-16-SigLIP2-384 timm/ViT-SO400M-16-SigLIP2-512 timm/ViT-gopt-16-SigLIP2-256 timm/ViT-gopt-16-SigLIP2-384 tiny-random/gemma-3n unsloth/gemma-3n-E2B unsloth/gemma-3n-E2B-it unsloth/gemma-3n-E2B-it-unsloth-bnb-4bit unsloth/gemma-3n-E2B-unsloth-bnb-4bit unsloth/gemma-3n-E4B unsloth/gemma-3n-E4B-it unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit unsloth/gemma-3n-E4B-unsloth-bnb-4bit varshu23/gemma3-e1b-sliced-4bit yasserrmd/GemmaECG-Vision yujiepan/gemma-3n-tiny-random yujiepan/gemma-3n-tiny-random-dim4

Pattern #6 (148 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("\n"), content=" "), Replace(pattern=Regex(" {2,}"), content=" ")])
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
+pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
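The removed normalizer chain is simple enough to emulate in plain Python, which can help when checking by hand how an input string was treated before the change. The sketch below is an approximation under stated assumptions: the function name old_normalize is hypothetical, and Python's re module stands in for the regex engine used by the Replace normalizer (the two agree for these two patterns). The Precompiled charsmap on the added side is truncated in the report and is not reproduced here.

```python
import re


def old_normalize(text: str) -> str:
    """Approximate the removed normalizer Sequence (hypothetical helper).

    Step 1: Replace(pattern=Regex("\\n"), content=" ")
    Step 2: Replace(pattern=Regex(" {2,}"), content=" ")
    """
    text = re.sub(r"\n", " ", text)     # every newline becomes one space
    text = re.sub(r" {2,}", " ", text)  # runs of two or more spaces collapse to one
    return text


print(repr(old_normalize("a\n\nb   c")))  # 'a b c'
```

On the added side the newline Replace is gone and the pre_tokenizer gains a WhitespaceSplit() step before Metaspace; this sketch does not model that side.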

Affected models

Converter	Models
PegasusConverter	Aexeos/gbp-large-pubmed-ft BEE-spoke-data/pegasus-x-base-synthsumm_open-16k BritnyB/summarizer Deena123/pegasus-x-base Feluda/pegasus-samsum Haribaskar2594/bigbird-pegasus-med-v2 Haribaskar2594/google-pegasus-large Haribaskar2594/google-pegasus-med-full Joemgu/pegasus-x-sumstew Kevincp560/bigbird-pegasus-large-arxiv-finetuned-pubmed Kevincp560/bigbird-pegasus-large-bigpatent-finetuned-pubMed ManqingLiu/pegasus-samsum MarketingHHM/autotrain-sumituptestv4-60050134312 MikaSie/PegasusX_no_extraction_V1 MikaSie/RoBERTa_PegasusX_dependent_V1 NazzX1/pegasus-Finetuned-sum-full-note-v1 Nourrr/pegasus-x-base Nourrr/pegasus-x-base-OldVersion PergaZuZ/cdc_influenza_pagasus-x-large PoseyATX/Bronze_Buffalo_89 QuickRead/fine-tune-Pegasus RichardErkhov/pszemraj_-_bigbird-pegasus-large-K-booksum-4bits RichardErkhov/pszemraj_-_bigbird-pegasus-large-K-booksum-8bits Shaelois/MeetingScript UNIST-Eunchan/pegasus-x-booksum-chapter Venkatesh4342/pegasus-samsum acmc/summarizer_google_bigbird-pegasus-large-pubmed_base_faceted acmc/summarizer_google_bigbird-pegasus-large-pubmed_keybert_faceted acmc/summarizer_google_bigbird-pegasus-large-pubmed_mesh_faceted acmc/summarizer_google_bigbird-pegasus-large-pubmed_mesh_unfaceted acmc/summarizer_google_bigbird-pegasus-large-pubmed_most_frequent_faceted acmc/summarizer_google_bigbird-pegasus-large-pubmed_most_frequent_unfaceted acmc/summarizer_google_bigbird-pegasus-large-pubmed_tf_idf_unfaceted alex2awesome/pegasus-x-large alk/pegasus-scitldr alphahg/pegasus-x-base-finetuned-paper alvinwatner/pegasus-large-qg-squad alvinwatner/pegasus-large-qg-squad-alpha-interro alwaysaditi/pegasus_X_pacsum alwaysaditi/pegasus_hiporank_final aplnestrella/pegasus-x-arxiv-cord19 aplnestrella/pegasus-x-cord19 aplnestrella/pegasus-x-cord19-ENC_16-DEC_24-b_4-e_8-g_1 aplnestrella/pegasus-x-cord19-ENC_16-DEC_8-b_8-e_8-g_1 aplnestrella/pegasus-x-cord19-extended aruca/pegasus_x-meeting-summarizer aruca/pegasus_x-meeting-summarizer-gpt3.5 
aruca/pegasusx-AMI-text-summarizer bruehle/BigBirdPegasus_Chemtagger bruehle/BigBirdPegasus_Llama budhwant/big-bird-hindi-sumarization ccdv/lsg-pegasus-large-4096 chinhon/pegasus-multi_news-commentaries_hdwriter chinhon/pegasus-multi_news-headline csquin/pegasus-x-cord19-ENC_16-DEC_24-b_4-e_8-g_1_v2 deltayrn/pegasusBase farleyknight/patent-summarization-google-bigbird-pegasus-large-arxiv-2022-09-20 farleyknight/patent-summarization-pegasus-2022-09-16 gcesare/pegasus-x-base-finetuned-pubmed gigant/pegasusx_tib google/bigbird-pegasus-large-arxiv google/bigbird-pegasus-large-bigpatent google/bigbird-pegasus-large-pubmed google/pegasus-x-base google/pegasus-x-large grenmon/pegasus-x-large-finetuned-summarization haesun/pegasus-samsum hafez1412/pegasus-x-merged hf-internal-testing/tiny-random-BigBirdPegasusForCausalLM hf-internal-testing/tiny-random-BigBirdPegasusForConditionalGeneration hf-internal-testing/tiny-random-BigBirdPegasusForQuestionAnswering hf-internal-testing/tiny-random-BigBirdPegasusForSequenceClassification hf-internal-testing/tiny-random-BigBirdPegasusModel hf-internal-testing/tiny-random-PegasusForCausalLM hf-internal-testing/tiny-random-PegasusForConditionalGeneration hf-internal-testing/tiny-random-PegasusModel hf-internal-testing/tiny-random-PegasusXForConditionalGeneration hf-internal-testing/tiny-random-PegasusXModel hf-internal-testing/tiny-random-bigbird_pegasus hf-internal-testing/tiny-random-pegasus hf-tiny-model-private/tiny-random-BigBirdPegasusForCausalLM hf-tiny-model-private/tiny-random-BigBirdPegasusForConditionalGeneration hf-tiny-model-private/tiny-random-BigBirdPegasusForQuestionAnswering hf-tiny-model-private/tiny-random-BigBirdPegasusForSequenceClassification hf-tiny-model-private/tiny-random-BigBirdPegasusModel hf-tiny-model-private/tiny-random-PegasusForCausalLM hf-tiny-model-private/tiny-random-PegasusModel hf-tiny-model-private/tiny-random-PegasusXForConditionalGeneration hf-tiny-model-private/tiny-random-PegasusXModel 
himanimaheshwari3/my_h_billsum_model ireneli1024/bigbird-pegasus-large-pubmed-elife-finetuned ireneli1024/bigbird-pegasus-large-pubmed-plos-finetuned junyinc/LING-575-WI-SUM katarinajoanne/bigbird_fine_tuned kmfoda/output_dir li-lab/ascle-bigbird-pegasus-large-pubmed-elife-finetuned li-lab/ascle-bigbird-pegasus-large-pubmed-plos-finetuned luojingbao/pegasus_output mayu0007/pegasus_large_covid minjingzhu/bigbird-pegasus-large-pubmed-finetuned-legal minjingzhu/bigbird-pegasus-large-pubmed-finetuned-legal-2 natanmb/pegasus-x-base-finetuned-multi-news onnx-internal-testing/tiny-random-BigBirdPegasusForCausalLM-ONNX onnx-internal-testing/tiny-random-BigBirdPegasusForConditionalGeneration-ONNX onnx-internal-testing/tiny-random-BigBirdPegasusForQuestionAnswering-ONNX onnx-internal-testing/tiny-random-BigBirdPegasusForSequenceClassification-ONNX onnx-internal-testing/tiny-random-BigBirdPegasusModel-ONNX optimum-intel-internal-testing/tiny-random-bigbird_pegasus optimum-intel-internal-testing/tiny-random-pegasus priyankrathore/Pegasus-Lay-Final pszemraj/bigbird-pegasus-large-K-booksum pszemraj/bigbird-pegasus-large-K-booksum pszemraj/pegasus-large-book-summary pszemraj/pegasus-large-summary-explain pszemraj/pegasus-x-large-book-summary pszemraj/pegasus-x-large-book_synthsumm-bf16 quanganh22/pegasus-x-cui quanganh22/pegasus-x-epoch_1 quanganh22/pegasus-x-finetune-subtitleonly quanganh22/pegasus-x-finetuned-final quanganh22/pegasus-x-finetuned-final-v2 seonglae/resrer-pegasus-x sharmadhruv/my_awesome_qa_model sohamchougule/pegasus-large-finetuned-samsum sohamchougule/pegasus-x-base-finetuned-samsum starcatmeow/autotrain-cybersecurity-summarization-pegasus-x-book-43369110299 tanvi-junankar/summify-pegasus-x theojolliffe/bigbird-pegasus-large-arxiv-finetuned-roundup-280922 theojolliffe/distill-pegasus-cnn-16-4-finetuned-arxiv twigs/bigbird-pegasus-large twigs/bigbird-pegasus-large-4096-arxiv twigs/bigbird-pegasus-large-4096-govreport twigs/bigbird-pegasus-large-4096-pubmed 
twigs/pegasus-x-large-4096-arxiv twigs/pegasus-x-large-4096-govreport twigs/pegasus-x-large-4096-pubmed twigs/pegasus-x-large-8192-arxiv twigs/pegasus-x-large-8192-govreport twigs/pegasus-x-large-8192-pubmed ubaada/pegasus-x-large-booksum-16k vatsalinfodesk/pegasus-samsum vickt/LLM_Teached_PEGASUS_CNNDM wanyu/IteraTeR-PEGASUS-Revision-Generator zakerous/pegasus-x-large-finetuned-samsum1000 zakerous/pegasus-x-large-finetuned-samsum1000-1 zakerous/pegasus-x-large-finetuned-samsum5000 zedfum/arman-longformer-8k-finetuned-ensani zphang/pegasus-x-large

Pattern #7 (119 models)

-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
+pre_tokenizer:		Sequence(pretokenizers=[Digits(individual_digits=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)])
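
The added Digits(individual_digits=True) stage can be illustrated with a minimal sketch in plain Python. It does not call the tokenizers library: split_individual_digits is a hypothetical helper that approximates the stage with the standard re module, whose \d class may not match the library's notion of a numeric character exactly.

```python
import re


def split_individual_digits(text: str) -> list[str]:
    """Approximate Digits(individual_digits=True).

    Hypothetical helper for illustration only: every digit becomes its own
    piece, and each maximal run of non-digit characters stays together.
    """
    # \d matches one digit at a time; \D+ matches a maximal non-digit run.
    return re.findall(r"\d|\D+", text)


pieces = split_individual_digits("x = 2024")
print(pieces)  # ['x = ', '2', '0', '2', '4']
```

With ByteLevel alone, a digit run such as 2024 can reach the model as a single pre-token. With the extra stage each digit is already isolated before ByteLevel runs, so the token IDs produced for numbers may differ between the two backends.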

Affected models

Converter	Models
GPT2Converter	DJLougen/granite4-tax Danielbrdz/CodeBarcenas-1b HuggingFaceH4/starchat-alpha HuggingFaceH4/starchat-beta InfosysEnterprise/NT-Java-1.1B JoydeepC/trueGL LoupGarou/WizardCoder-Guanaco-15B-V1.0 LoupGarou/WizardCoder-Guanaco-15B-V1.1 Multi-Domain-Expert-Learning/osiris_12b RaymondLi/sc2-3b-test RichardErkhov/allura-org_-_MoE-Girl-800MA-3BT-8bits RichardErkhov/ibm-granite_-_granite-3.0-1b-a400m-base-4bits RichardErkhov/ibm-granite_-_granite-3.0-1b-a400m-base-8bits RichardErkhov/ibm-granite_-_granite-3.0-3b-a800m-base-4bits RichardErkhov/ibm-granite_-_granite-3.0-3b-a800m-base-8bits RichardErkhov/ibm-research_-_PowerMoE-3b-8bits RichardErkhov/ibm_-_PowerMoE-3b-4bits RichardErkhov/ibm_-_PowerMoE-3b-8bits RichardErkhov/nuprl_-_MultiPL-T-StarCoderBase_1b-4bits TevunahAi/Granite-34B-Code-Instruct-8k-2048-Calibration-FP8 TevunahAi/granite-34b-code-instruct-8k-FP8 TheBloke/Octocoder-GPTQ TheBloke/WizardCoder-15B-1.0-GPTQ TheBloke/sqlcoder-GPTQ TheBloke/sqlcoder2-GPTQ TheBloke/starchat-beta-GPTQ TheBloke/starcoder-GPTQ V-YangXu/StarCoder-Alpaca WizardLMTeam/WizardCoder-15B-V1.0 Xenova/WizardCoder-1B-V1.0 Xenova/starcoderbase-1b Xenova/tiny_starcoder_py allura-org/MoE-Girl-800MA-3BT allura-org/MoE-Girl_400MA_1BT arjunguha/notstarcoder-1b aurora-m/aurora-m-biden-harris-redteamed bigcode/gpt_bigcode-santacoder bigcode/santacoderpack bigcode/starcoder bigcode/starcoder-co-format bigcode/starcoder-cxo bigcode/starcoder-cxso bigcode/starcoder-o bigcode/tiny_starcoder_py bugdaryan/WizardCoderSQL-15B-V1.0 cobrakenji/granite-20b-code-base-GGUF codeparrot/starcoder-self-instruct defog/sqlcoder defog/sqlcoder2 fals3/bigcode-starcoderbase-unit-test-fine-tuning hf-internal-testing/tiny-random-GraniteForCausalLM hyper-accel/tiny-random-gpt_bigcode ibm-granite/granite-20b-code-base-8k ibm-granite/granite-20b-code-base-8k ibm-granite/granite-20b-code-base-r1.1 ibm-granite/granite-20b-code-instruct-8k ibm-granite/granite-20b-code-instruct-8k 
ibm-granite/granite-20b-functioncalling ibm-granite/granite-3.0-1b-a400m-base ibm-granite/granite-3.0-2b-base ibm-granite/granite-3.0-3b-a800m-base ibm-granite/granite-3.0-8b-base ibm-granite/granite-3.1-1b-a400m-base ibm-granite/granite-3.1-2b-base ibm-granite/granite-3.1-3b-a800m-base ibm-granite/granite-3.1-8b-base ibm-granite/granite-3.3-2b-base ibm-granite/granite-3.3-8b-base ibm-granite/granite-34b-code-base-8k ibm-granite/granite-34b-code-instruct-8k ibm-granite/granite-3b-code-base-128k ibm-granite/granite-3b-code-base-2k ibm-granite/granite-3b-code-instruct-128k ibm-granite/granite-3b-code-instruct-2k ibm-granite/granite-4.0-tiny-preview ibm-granite/granite-8b-code-base-4k ibm-granite/granite-8b-code-instruct-128k ibm-granite/granite-8b-code-instruct-4k ibm-research/PowerLM-3b ibm-research/PowerMoE-3b ibm-research/moe-7b-1b-active-shared-experts iterateai/Interplay-AppCoder jinaai/starcoder-1b-textbook kollecter/granite-3.1-1b-a400m-base kollecter/granite-3.1-3b-a800m-base mdouglas/granite-3.1-3b-a800m-base-bnb-4bit michaelfeil/ct2fast-starcoder mkdir700/v2-starcoderbase1b-personal-copilot-A100-40GB-colab mlx-community/granite-20b-code-instruct-4bit mlx-community/granite-20b-code-instruct-8bit mlx-community/granite-34b-code-base-4bit mlx-community/granite-34b-code-base-8bit mlx-community/granite-34b-code-instruct-8bit mrm8488/santacoder-finetuned-the-stack-bash-shell mrm8488/santacoder-finetuned-the-stack-clojure muhtasham/santacoder-finetuned-the-stack-assembly muhtasham/santacoder-finetuned-the-stack-cobol nlpguy/granite-3.0-1b-a400m-base nlpguy/granite-3.0-3b-a800m-base nuprl/MultiPL-T-StarCoderBase_15b nuprl/MultiPL-T-StarCoderBase_1b openaccess-ai-collective/minotaur-15b patrickbdevaney/WizardLM-1b-GGUF rahuldshetty/tiny-starcoder-instruct refactai/starcoderbase-1b richardr1126/spider-natsql-wizard-coder-merged richardr1126/spider-skeleton-wizard-coder-merged seeklhy/codes-15b seeklhy/codes-15b-bird seeklhy/codes-1b seeklhy/codes-3b 
seeklhy/codes-3b-bird-with-evidence seeklhy/codes-7b seeklhy/codes-7b-merged sky-2002/tiny-starcoder-ft tdoehmen/starcoder-schemapile-fk umm-maybe/StarCoder-1B-R2 umm-maybe/StarCoder-1B-StackStar yujiepan/starcoder-tiny-random

Pattern #8 (104 models)

-normalizer:		None
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=first, split=False)
+normalizer:		Sequence(normalizers=[Prepend(prepend="▁"), Replace(pattern=String(" "), content="▁")])
+pre_tokenizer:		None
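
For ordinary input the two configurations should yield the same string, but they can diverge when the text already begins with a space. The sketch below emulates both in plain Python under two assumptions that are not stated in this report: that Metaspace skips the prepend when the text already starts with "▁" after space replacement, and that Prepend always prepends to non-empty input. Both function names are hypothetical. Empty input and the "first section only" aspect of prepend_scheme=first are not modeled.

```python
REPLACEMENT = "\u2581"  # the "▁" character used by both configurations


def metaspace_first(text: str) -> str:
    """Emulate Metaspace(replacement="▁", prepend_scheme=first, split=False).

    Assumption: the marker is only prepended when the text does not already
    start with it after spaces have been replaced.
    """
    out = text.replace(" ", REPLACEMENT)
    if not out.startswith(REPLACEMENT):
        out = REPLACEMENT + out
    return out


def prepend_then_replace(text: str) -> str:
    """Emulate Sequence([Prepend("▁"), Replace(" ", "▁")]) on non-empty text."""
    return (REPLACEMENT + text).replace(" ", REPLACEMENT)


for sample in ["hello world", " hello"]:
    print(repr(sample), metaspace_first(sample), prepend_then_replace(sample))
```

Under these assumptions "hello world" maps to the same string in both cases, while " hello" gains one leading marker in the first case and two in the second.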

Affected models

Converter	Models
LlamaConverter	BEE-spoke-data/beecoder-220M-python Bharanidharan07/idefics_2_finetuned_copy GeorgeBredis/ruIdefics2-ruLLaVA-merged Guilherme34/Samantha-multimodal-v2-model HuggingFaceM4/Sightseer HuggingFaceM4/idefics2 HuggingFaceM4/idefics2-8b HuggingFaceM4/idefics2-8b-AWQ HuggingFaceM4/idefics2-8b-base HuggingFaceM4/idefics2-8b-base-AWQ HuggingFaceM4/idefics2-8b-chatty HuggingFaceM4/idefics2-8b-chatty-AWQ HuggingFaceM4/idefics2-tfrm-compatible HuggingFaceM4/idefics2_raven_finetuned HuggingFaceM4/tr_272_bis_opt_step_15000_merge Mantis-VL/idefics2-8b-video-eval-refined-40k_4096_generation Mantis-VL/idefics2-8b-video-eval-refined-40k_4096_regression Mantis-VL/mantis-8b-idefics2-classification-example_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-20k-mantis-2epoch_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-20k_2048 Mantis-VL/mantis-8b-idefics2-video-eval-40k-2epoch_4096_generation Mantis-VL/mantis-8b-idefics2-video-eval-40k-mantis-2epoch_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-50k-2epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-50k-mantis-2epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-50k-mantis_4096 Mantis-VL/mantis-8b-idefics2-video-eval-50k_4096 Mantis-VL/mantis-8b-idefics2-video-eval-95k-2epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-95k-batch32_4096 Mantis-VL/mantis-8b-idefics2-video-eval-95k-mantis-2epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-95k-mantis_4096 Mantis-VL/mantis-8b-idefics2-video-eval-95k_4096 Mantis-VL/mantis-8b-idefics2-video-eval-anno-real_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-debug_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-high-res-20k-mantis-3epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-high-res-35k-mantis-2epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-high-res-40k-mantis-2epoch_4096 Mantis-VL/mantis-8b-idefics2-video-eval-refined-40k-ablation-anno_4096_generation 
Mantis-VL/mantis-8b-idefics2-video-eval-refined-40k-ablation-anno_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-refined-40k-sora_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval-refined-40k_4096_generation Mantis-VL/mantis-8b-idefics2-video-eval-refined-40k_4096_regression Mantis-VL/mantis-8b-idefics2-video-eval_5184_regression Mantis-VL/mantis-8b-idefics2-video-eval_6144_regression Mantis-VL/mantis-8b-idefics2_8192 Nanbeige/ToolMind-Web-3B OpenGVLab/Mini-InternVL2-4B-DA-DriveLM OpenWebVoyager/OpenWebVoyager-opt-1 Pavithra2910/09thmay Pavithra2910/finetuningidefics Reverb/Idefics2-8b-docVQA-finetuned SalmanFaroz/idefics2-8b-DocVQA-SP Shashank91097/Idefic Shashank91097/Idefic_medical_VQA_merged11 StevenHH2000/Finedefics Syed-Hasan-8503/Idefics2-8B-SFT TD788432/IDEFICS-n.2-FT-DocVQA TIGER-Lab/Mantis-8B-Idefics2 TIGER-Lab/VISTA-Mantis TIGER-Lab/VideoScore TIGER-Lab/VideoScore-v1.1 Trelis/idefics2-8b-chatty-bf16 andrew-together/idefics2-8b-finetune-combined-50k_8192 edbeeching/vsft-idefics2 enghamdiali/idefics-9b-merge enghamdiali/idefics-9b-qt_f enghamdiali/idfc-m1 francepfl/DriveLM-mantis-8b-idefics2_8192-cot francepfl/mantis-8b-idefics2_exp10_italian_8192 francepfl/mantis-8b-idefics2_exp_italian_8192 giobin/idefics2_random_connector_v2 huz-relay/idefics2-8b-ocr instructlab/granite-7b-lab jancuhel/idefics2-8b-img-text-relevancy jihadzakki/idefics2-8b-medvqa jihadzakki/idefics2-8b-roco-slake jihadzakki/idefics2-8b-vqarad-delta lamm-mit/Cephalo-Idefics-2-vision-10b-alpha lamm-mit/Cephalo-Idefics-2-vision-10b-beta lamm-mit/Cephalo-Idefics-2-vision-12b-alpha lamm-mit/Cephalo-Idefics-2-vision-8b-alpha lamm-mit/Cephalo-Idefics-2-vision-8b-beta matbee/idefics2-weblinx-20500 mlx-community/idefics2-8b-4bit mlx-community/idefics2-8b-8bit mlx-community/idefics2-8b-chatty-4bit mlx-community/idefics2-8b-chatty-8bit mqliu/mantis-8b-idefics2_1024 pallavibiswas/idefics2-finetuned-re-id perceptorLLM/idefics2-8b-4bit-bf16 perceptorLLM/idefics2-8b-4bit-fp16 
qgallouedec/tiny-Idefics2ForConditionalGeneration smishr-18/Idefics-PokeCards smishr-18/Idefics2-PokemonCards tctrautman/20240709-kibbe-training-gen-1x-merged tiiuae/viscon-contextual-captioner trl-internal-testing/tiny-Idefics2ForConditionalGeneration vctmk/mantis-8b-idefics2-classification-example_2048_regression vctmk/mantis-8b-idefics2-classification-tedED_4096_regression vctmk/mantis-8b-idefics2-classification-tedEDself_8g_4096_regression vctmk/mantis-8b-idefics2-classification-tedEDself_v2_3_8g_4096_regression worldboss/idefics-9b-doodles-v1 wyu1/Leopard-Idefics2 zesquirrelnator/idefics2-8b-docvqa-finetuned-tutorial zixianma/mma_idefics2_293k-toolp-seq_length_8192-lr_1e-5

Pattern #9 (98 models)

-pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("<\|startoftext\|>|<\|endoftext\|>|'s|'t|'re|'ve|'m|'ll|'d|[\p{L}]+|[\p{N}]|[^\s\p{L}\p{N}]+"), behavior=Removed, invert=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)])
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("'s|'t|'re|'ve|'m|'ll|'d|[\p{L}]+|[\p{N}]|[^\s\p{L}\p{N}]+"), behavior=Removed, invert=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)])
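
The only change in this pattern is that the two special-token literals were dropped from the Split regex. The sketch below shows what that does to a raw string, using the standard re module. Because re has no \p{L} or \p{N}, the character classes are ASCII-only stand-ins, and keep_matches is a hypothetical helper that mimics behavior=Removed with invert=True by keeping only the matched spans.

```python
import re

# ASCII-only stand-ins for \p{L} and \p{N}; an approximation for illustration.
CORE = r"'s|'t|'re|'ve|'m|'ll|'d|[a-zA-Z]+|[0-9]|[^\sa-zA-Z0-9]+"
OLD_PATTERN = re.compile(r"<\|startoftext\|>|<\|endoftext\|>|" + CORE)
NEW_PATTERN = re.compile(CORE)


def keep_matches(pattern: re.Pattern, text: str) -> list[str]:
    """Keep the spans the pattern matches and drop everything in between."""
    # Neither pattern contains capture groups, so findall returns full matches.
    return pattern.findall(text)


text = "<|startoftext|>a photo"
print(keep_matches(OLD_PATTERN, text))  # ['<|startoftext|>', 'a', 'photo']
print(keep_matches(NEW_PATTERN, text))  # ['<|', 'startoftext', '|>', 'a', 'photo']
```

Whether this difference is visible in practice depends on whether the special tokens are extracted as added tokens before the pre-tokenizer runs, which this report does not state.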

Affected models

Converter	Models
CLIPConverter	BAAI/BGE-VL-large Marqo/marqo-fashionCLIP RaviKush/clipseg_finetuned_dice_bce RaviKush/clipseg_focal_loss_v0 RaviKush/clipseg_focal_loss_v1 Xenova/clip-vit-base-patch16 Xenova/clipseg-rd16 Xenova/clipseg-rd64 Xenova/clipseg-rd64-refined Xenova/owlv2-base-patch16 Xenova/owlv2-base-patch16-ensemble Xenova/owlv2-base-patch16-finetuned Xenova/owlvit-base-patch16 Xenova/owlvit-base-patch32 Xenova/owlvit-large-patch14 apple/DFN2B-CLIP-ViT-L-14 apple/DFN5B-CLIP-ViT-H-14 apple/DFN5B-CLIP-ViT-H-14-378 apple/MobileCLIP-S1-OpenCLIP apple/MobileCLIP-S2-OpenCLIP apple/aimv2-large-patch14-224-lit beyazitkelceoglu/owlv2-large-patch14-ONNX codyliu20032003/oneformer-parkseg12k-test dokutoshi/owlvit-base-patch32_FT_cppe5 gj5520/KoalaSeg hf-internal-testing/tiny-random-CLIPSegModel hf-internal-testing/tiny-random-OneFormerForUniversalSegmentation hf-internal-testing/tiny-random-OneFormerModel hf-internal-testing/tiny-random-OwlViTForObjectDetection hf-internal-testing/tiny-random-OwlViTModel hf-internal-testing/tiny-random-Owlv2ForObjectDetection hf-internal-testing/tiny-random-Owlv2Model hf-internal-testing/tiny-random-owlvit hf-internal-testing/tiny-random-owlvit-object-detection hf-tiny-model-private/tiny-random-CLIPSegModel hf-tiny-model-private/tiny-random-OneFormerForUniversalSegmentation hf-tiny-model-private/tiny-random-OneFormerModel hf-tiny-model-private/tiny-random-OwlViTForObjectDetection hf-tiny-model-private/tiny-random-OwlViTModel hiendang7613/oneformer_190725_swinT imageomics/bioclip imageomics/bioclip-2 laion/CLIP-ViT-L-14-CommonPool.XL-s13B-b90K laion/CLIP-ViT-L-14-DataComp.XL-s13B-b90K laion/CLIP-ViT-g-14-laion2B-s34B-b88K laion/CLIP-convnext_base_w-laion2B-s13B-b82K laion/CLIP-convnext_base_w-laion2B-s13B-b82K-augreg laion/CLIP-convnext_base_w_320-laion_aesthetic-s13B-b82K-augreg laion/CLIP-convnext_large_d_320.laion2B-s29B-b131K-ft-soup laion/CLIP-convnext_xxlarge-laion2B-s34B-b82K-augreg-soup mayank0621/owlvit-base-patch32_FT_cppe5 
mieszkok/oneformer_ade20k_swin_large_geopose3k_original_900_E5 mieszkok/shi-labs_oneformer_ade20k_swin_large_geopose3k_original_images900_epochs5 onnx-community/owlv2-base-patch16-ONNX onnx-community/owlv2-base-patch16-ensemble-ONNX onnx-community/owlv2-base-patch16-finetuned-ONNX onnx-community/owlv2-large-patch14-ensemble-ONNX onnx-community/owlv2-large-patch14-finetuned-ONNX onnx-community/owlvit-base-patch32-ONNX onnx-internal-testing/tiny-random-OwlViTForObjectDetection-ONNX onnx-internal-testing/tiny-random-OwlViTModel-ONNX onnx-internal-testing/tiny-random-Owlv2ForObjectDetection-ONNX onnx-internal-testing/tiny-random-Owlv2Model-ONNX openai/clip-vit-base-patch16 openai/clip-vit-large-patch14 pooya-mohammadi/oneformer_ade20k_swin_tiny_clothes rathi2023/owlvit-base-patch32 rathi2023/owlvit-base-patch32_FT_cppe5 redlessone/DermLIP_ViT-B-16 suinleelab/monet timm/MobileCLIP2-S0-OpenCLIP timm/MobileCLIP2-S3-OpenCLIP timm/PE-Core-B-16 timm/PE-Core-L-14-336 timm/PE-Core-bigG-14-448 timm/eva02_base_patch16_clip_224.merged2b_s8b_b131k timm/eva02_enormous_patch14_plus_clip_224.laion2b_s9b_b144k timm/eva02_large_patch14_clip_224.merged2b_s4b_b131k timm/eva02_large_patch14_clip_336.merged2b_s6b_b61k timm/eva_giant_patch14_plus_clip_224.merged2b_s11b_b114k timm/resnet101_clip.openai timm/resnet50_clip.openai timm/vit_base_patch16_clip_224.laion400m_e32 timm/vit_base_patch16_plus_clip_240.laion400m_e31 timm/vit_base_patch32_clip_224.laion2b_e16 timm/vit_base_patch32_clip_224.laion400m_e31 timm/vit_base_patch32_clip_224.laion400m_e32 timm/vit_huge_patch14_clip_224.metaclip_2pt5b timm/vit_large_patch14_clip_224.laion400m_e32 timm/vit_large_patch14_clip_224.metaclip_2pt5b timm/vit_large_patch14_clip_336.openai wisdomik/QuiltNet-B-16 wisdomik/QuiltNet-B-32 woweenie/open-clip-vit-h-nsfw-finetune zer0int/CLIP-GmP-ViT-L-14 zer0int/LongCLIP-GmP-ViT-L-14 zer0int/LongCLIP-L-Diffusers zerosandones/owlv2-large-patch14-ensemble-ONNX

Pattern #10 (82 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Replace(pattern=Regex(" {2,}"), content=" ")])
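
Two things change here: the right-strip disappears, and runs of spaces collapse to a plain space instead of "▁". A minimal sketch in plain Python follows, with hypothetical function names and str.rstrip standing in for Strip(strip_right=True). It is an emulation, not a call into the tokenizers library.

```python
import re

MULTI_SPACE = re.compile(r" {2,}")  # same regex as in both configurations


def old_normalize(text: str) -> str:
    """Emulate Strip(strip_left=False, strip_right=True) then Replace -> "▁"."""
    return MULTI_SPACE.sub("\u2581", text.rstrip())


def new_normalize(text: str) -> str:
    """Emulate Replace(" {2,}" -> " ") with no stripping."""
    return MULTI_SPACE.sub(" ", text)


sample = "a  b  "
print(repr(old_normalize(sample)))  # 'a▁b'
print(repr(new_normalize(sample)))  # 'a b '
```

In this emulation the outputs differ both in the separator character and in whether trailing whitespace survives, so inputs with repeated or trailing spaces are the ones worth comparing across backends.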

Affected models

Converter	Models
T5Converter	Dmjdxb/deplot Joemgu/mlong-t5-base-sumstew Joemgu/mlong-t5-large-sumstew KennethTM/pix2struct-base-table2html Shubham-Awasthi/pix2struct_infovqa TeeA/DEPLOT-ViChart TeeA/ViMATCHA TomasFAV/Pix2StructCzechInvoice TomasFAV/Pix2StructCzechInvoiceLarge Xenova/deplot Xenova/pix2struct-ai2d-base Xenova/pix2struct-chartqa-base Xenova/pix2struct-docvqa-base Xenova/pix2struct-infographics-vqa-base Xenova/pix2struct-screen2words-base Xenova/pix2struct-tiny-random Xenova/pix2struct-widget-captioning-base am-infoweb/pix2struct-7.3K-model_12_08-new aravind-selvam/deplot_v0 aravind-selvam/pix2struct_chart bollscoasts/pix2act-onnx brainventures/deplot_kr darksensei/pix2struct-cord fxmarty/pix2struct-tiny-random gitlost-murali/pix2struct-refexp-base gitlost-murali/pix2struct-refexp-large giulioderasmo/Pix2struct-sroie-10k google/deplot google/matcha-base google/matcha-chart2text-pew google/matcha-chart2text-statista google/matcha-chartqa google/matcha-plotqa-v1 google/pix2struct-ai2d-base google/pix2struct-ai2d-large google/pix2struct-base google/pix2struct-chartqa-base google/pix2struct-docvqa-base google/pix2struct-docvqa-large google/pix2struct-infographics-vqa-base google/pix2struct-infographics-vqa-large google/pix2struct-large google/pix2struct-ocrvqa-base google/pix2struct-ocrvqa-large google/pix2struct-screen2words-base google/pix2struct-screen2words-large google/pix2struct-textcaps-base google/pix2struct-textcaps-large google/pix2struct-widget-captioning-base google/pix2struct-widget-captioning-large habibi26/ocr_struk hoangphu7122002ai/pix2struct_v0 juanivazquez/id_card-pix2struct-model-v3 optimum-intel-internal-testing/pix2struct-tiny-random oroikon/ft_pix2struct_chart_captioning paturi1710/pix2Struct-base-table-parsing-json-v2.0 paturi1710/pix2Struct-base-table-parsing-v1.0 pierretokns/pix2act-weblinx-base-onnx pierretokns/pix2act-weblinx-large-onnx prajwalJumde/pix2struct-test-model_08_08-new santiagoperezs/comunicacion-aviso-pix2struct-cord 
ssh1419/deplot-batch-1-new-loss-only-token ssh1419/deplot-batch-3-token-freeze-curri ssh1419/indi-deplot ssh1419/indi-deplot-1-freeze ssh1419/indi-deplot-200 ssh1419/indi-deplot-3-final ssh1419/indi-deplot-batch-16 ssh1419/indi-deplot-freeze-norm ssh1419/indi-deplot-lr-half-half ssh1419/test-deplot-1 sujr/pix2struct-base teamapocalypseml/regben2ipa-umt5base to-be/Pix2StructGhega turgutguvercin/pix2struct-turkish-receipts warshakhan/pix2struct-base-docvqa-change warshakhan/pix2struct-base-docvqa-public xcodemind/UICoder xcodemind/uicopilot_structure xcodemind/webcoder ybelkada/pix2struct-base-football zirui3/pix2struct-cord-v2

Pattern #11 (57 models)

-normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=False)
+normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=True)
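
Flipping lowercase to True may change more than letter case, because strip_accents is left as None in both configurations. The sketch below models only these two flags in plain Python, under the assumption (not stated in this report) that an unset strip_accents follows the lowercase flag. bert_like_normalize is a hypothetical helper, and the clean_text and handle_chinese_chars steps are omitted.

```python
import unicodedata


def bert_like_normalize(text: str, lowercase: bool, strip_accents=None) -> str:
    """Model the lowercase / strip_accents interaction only (illustrative)."""
    # Assumption: when strip_accents is None it takes the value of lowercase.
    if strip_accents is None:
        strip_accents = lowercase
    if strip_accents:
        # Decompose, then drop combining marks (Unicode category Mn).
        decomposed = unicodedata.normalize("NFD", text)
        text = "".join(c for c in decomposed if unicodedata.category(c) != "Mn")
    if lowercase:
        text = text.lower()
    return text


print(bert_like_normalize("Caf\u00e9", lowercase=False))  # Café
print(bert_like_normalize("Caf\u00e9", lowercase=True))   # cafe
```

Under that assumption, cased or accented input is the place where the two backends would be expected to disagree for the models listed below.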

Affected models

Converter	Models
BertConverter	AlIshaq/DPR-question_encoder-faq-pesantren DataHammer/scidpr-question-encoder Mjollnir1996/dpr-question_encoder-bert-base-multilingual_mod NAACL2022/spider NAACL2022/spider-nq-question-encoder NAACL2022/spider-trivia-ctx-encoder NAACL2022/spider-trivia-question-encoder PrimeQA/XOR-TyDi_monolingual_DPR_qry_encoder aubmindlab/araelectra-base-discriminator castorini/ance-dpr-context-multi castorini/ance-dpr-question-multi castorini/bpr-nq-ctx-encoder castorini/bpr-nq-question-encoder datasetsANDmodels/image2text deepset/bert-small-mm_retrieval-passage_encoder deepset/bert-small-mm_retrieval-question_encoder deepset/bert-small-mm_retrieval-table_encoder dsksd/dpr-ctx_encoder-single-qrecc-model-base facebook/dpr-ctx_encoder-multiset-base facebook/dpr-ctx_encoder-single-nq-base facebook/dpr-question_encoder-multiset-base facebook/dpr-question_encoder-single-nq-base facebook/dpr-reader-multiset-base facebook/dpr-reader-single-nq-base firqaaa/indo-dpr-question_encoder-single-squad-base google/mobilebert-uncased hf-internal-testing/tiny-random-DPRQuestionEncoder hf-internal-testing/tiny-random-dpr hf-tiny-model-private/tiny-random-DPRQuestionEncoder hfl/chinese-electra-180g-base-discriminator hfl/chinese-electra-180g-large-discriminator hfl/chinese-electra-180g-small-discriminator hfl/chinese-electra-180g-small-ex-discriminator lmz/candle-blip norwoodsystems/image-caption seduerr/paiintent soheeyang/dpr-ctx_encoder-single-trivia-base soheeyang/dpr-question_encoder-single-trivia-base soheeyang/rdr-ctx_encoder-single-nq-base soheeyang/rdr-ctx_encoder-single-trivia-base soheeyang/rdr-question_encoder-single-nq-base soheeyang/rdr-question_encoder-single-trivia-base squeezebert/squeezebert-mnli squeezebert/squeezebert-mnli-headless squeezebert/squeezebert-uncased tau/spider tau/spider-nq-ctx-encoder tau/spider-nq-question-encoder tau/spider-trivia-ctx-encoder tau/spider-trivia-question-encoder typeform/squeezebert-mnli vblagoje/dpr-ctx_encoder-single-lfqa-base 
vblagoje/dpr-ctx_encoder-single-lfqa-wiki vblagoje/dpr-question_encoder-single-lfqa-base vblagoje/dpr-question_encoder-single-lfqa-wiki zhiweitong/dpr-answer_encoder-single-nq-base zhiweitong/dpr-ctx_encoder-single-nq-base

Pattern #12 (53 models)

-normalizer:		None
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=False)
+normalizer:		Sequence(normalizers=[Prepend(prepend="▁"), Replace(pattern=String(" "), content="▁")])
+pre_tokenizer:		None

Affected models

Converter	Models
LlamaConverter	1093212290a/idefics-9b-doodles A2Amir/SF_A68_IDEFICS_9B_IDL_SFT Abhaykoul/idefics-9b-doodles Alvi12/idefics-9b-doodles Aman8252/idefics-9b-doodles ArthurFischel/custom-tiny-random-idefics ArthurFischel/tiny-random-idefics-smw_10k-300steps HuggingFaceM4/idefics-80b HuggingFaceM4/idefics-80b-instruct HuggingFaceM4/idefics-9b HuggingFaceM4/idefics-9b-instruct HuggingFaceM4/tiny-random-idefics KadirErturk/image_info OpenGVLab/InternVL2-40B-AWQ Salmamoori/idefics-9b-doodles a8nova/tiny-random-idefics areegtarek/idefics-9b-all areegtarek/idefics-9b-doodles areegtarek/idefics-9b-instruct-3batchesoneepoch areegtarek/idefics-9b-instruct-3batchesoneepoch-1-2 areegtarek/idefics-9b-instruct-3batchesoneepoch-1-2-3 areegtarek/idefics-9b-instruct-3batchesoneepoch-1-2-3-abnormal2epochsfreeze areegtarek/idefics-9b-instruct-abnormal3epochs areegtarek/idefics-9b-instruct-all areegtarek/idefics-9b-instruct-all-v2 areegtarek/idefics-9b-instruct-all-v3 areegtarek/idefics-9b-instruct-stage-1 areegtarek/idefics-9b-instruct-stage-1-stage-2 areegtarek/idefics-9b-instruct-stage-1-stage-2-stage-3 areegtarek/idefics-9b-instruct-threesplitsthreeepochs-1 areegtarek/idefics-9b-instruct-threesplitsthreeepochs-1-2 areegtarek/idefics-9b-instruct-threesplitsthreeepochs-1-2-3 areegtarek/idefics-9b-randomsampleNIH areegtarek/idefics-9b-split1-v1 areegtarek/idefics-9b-split1-v1-split1.2-v1 areegtarek/idefics-9b-stage1-v1 areegtarek/idefics-9b-stage1-v1-stage2-v1 areegtarek/idefics-9b-stage1-v1-stage2-v1-stage3-v1 areegtarek/idefics-9b-threebatchestenepochs dawoz/IDEFICS-frozenlake enghamdiali/idefics-9b-fn gauthamk28/idefics-9b-doodles jacky892/idefics-9b-doodles justinkarlin/idefics-9b-faces machinev/idefics-9b-LPU_model mattzhang/idefics-9b-doodles mervinpraison/idefics-9b-doodles mervinpraison/idefics-9b-pokemon-blip mychen76/idefics-9b-doodles ntust0/idefics-9b-bayc samim2024/Image-Text-To-Text turing-motors/Heron-Idefics2-8B-v0.1 worldboss/idefics-9b-doodles

Pattern #13 (40 models)

-post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 1), trim_offsets=True, add_prefix_space=True)
+post_processor:		TemplateProcessing(single=[Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[Sequence(id=A, type_id=0), Sequence(id=B, type_id=1)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"])})

Affected models

Converter	Models
BlenderbotConverter	Adapting/dialogue_agent_nlplab2022 BOUDABOUS/ai-univ-chatbot BOUDABOUS/fine-tuned-chatbot Bbrown44/aas_nlp_v1 Danieldor/Baldor-Assist DriveMyScream/Blenderbot_ChatBot Grendar/blenderbot-400M-distill-Shiro Megareyka/blenderbot-400M-FineTuned Ruthu1/skincare Saima0/mental-health-chatbot Xenova/blenderbot-400M-distill abhijitgayen/cogo-blenderbot-slow azkamannan2004/MindEase-CD-CPU azkamannan2004/MindEase-CD-fp16 breadlicker45/autotrain-blender-50601120822 facebook/blenderbot-1B-distill facebook/blenderbot-3B facebook/blenderbot-400M-distill hf-internal-testing/tiny-random-BlenderbotForCausalLM hf-internal-testing/tiny-random-BlenderbotForConditionalGeneration hf-internal-testing/tiny-random-BlenderbotModel hf-tiny-model-private/tiny-random-BlenderbotForCausalLM hf-tiny-model-private/tiny-random-BlenderbotForConditionalGeneration hf-tiny-model-private/tiny-random-BlenderbotModel jonggul2/finetuning lyubomirr/GAIA onnx-internal-testing/tiny-random-BlenderbotForConditionalGeneration-ONNX onnx-internal-testing/tiny-random-BlenderbotModel-ONNX optimum-intel-internal-testing/tiny-random-BlenderbotModel scriptkidd196883/ytp-engage-model-advanced scriptkidd196883/ytp-engage-model-beginner scriptkidd196883/ytp-engage-model-intermediate sir-evil/my-chat-model stanleychu2/system_400M stanleychu2/user_400M tgoktug/my_awesome_blendersum_model tgoktug/my_awesome_meeting_blendersum_model theastro/starkbot venkatavivekanandareddy/my-blenderbot-transformer-model vpadaraju/newest

Pattern #14 (31 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=False), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Replace(pattern=Regex(" {2,}"), content=" ")])

Affected models

Converter	Models
BigBirdConverter	GBaker/clinical-bigbird-medqa-usmle-nocontext LucasS/bigbirdABSA Mahmoud8/bigbird-roberta-base Shaier/bigbird-roberta-base ShengdingHu/sst2 alex2awesome/quote-attribution__qa-model-v2 alex2awesome/quote-attribution__qa-model-v3 hf-internal-testing/tiny-random-BigBirdForCausalLM hf-internal-testing/tiny-random-BigBirdForMaskedLM hf-internal-testing/tiny-random-BigBirdForQuestionAnswering hf-internal-testing/tiny-random-BigBirdForSequenceClassification hf-internal-testing/tiny-random-BigBirdForTokenClassification hf-internal-testing/tiny-random-BigBirdModel hf-internal-testing/tiny-random-big_bird hf-tiny-model-private/tiny-random-BigBirdForCausalLM hf-tiny-model-private/tiny-random-BigBirdForMultipleChoice hf-tiny-model-private/tiny-random-BigBirdForPreTraining hf-tiny-model-private/tiny-random-BigBirdForQuestionAnswering hf-tiny-model-private/tiny-random-BigBirdForSequenceClassification hf-tiny-model-private/tiny-random-BigBirdForTokenClassification hf-tiny-model-private/tiny-random-BigBirdModel ilos-vigil/bigbird-small-indonesian ilos-vigil/bigbird-small-indonesian-nli nsi319/bigbird-roberta-base-finetuned-app pepa/bigbird-roberta-base-fever pepa/bigbird-roberta-base-snli pepa/bigbird-roberta-large-fever pepa/bigbird-roberta-large-snli rubentito/bigbird-base-itc-mpdocvqa tgood/bigbird-roberta-base yikuan8/Clinical-BigBird

Pattern #15 (29 models)

-normalizer:		Sequence(normalizers=[NFD(), Lowercase(), StripAccents()])
+normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=True)
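
As a rough illustration of the "-" side of Pattern #15, the sketch below approximates Sequence([NFD(), Lowercase(), StripAccents()]) with the standard `unicodedata` module. The function name is hypothetical, and modeling StripAccents as "drop every code point whose general category starts with M" is an assumption. The "+" side is not sketched because the report does not spell out what clean_text and handle_chinese_chars do.

```python
import unicodedata

def normalize_minus(text: str) -> str:
    """Approximation of Sequence([NFD(), Lowercase(), StripAccents()])."""
    # NFD: split precomposed characters into base character + combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Lowercase step.
    lowered = decomposed.lower()
    # Assumption: StripAccents is modeled as removing combining marks
    # (general categories Mn, Mc, Me).
    return "".join(
        ch for ch in lowered if not unicodedata.category(ch).startswith("M")
    )

print(normalize_minus("Café"))
```

A comparison against the "+" side would need the real BertNormalizer, since its extra cleaning steps may change inputs that this three-step sequence leaves alone.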

Affected models

Converter	Models
OpenAIGPTConverter	4stack/gpt-finetuned CaoTrungHieu/GPT_Entailment Dave12121/chatFsentiment LRSR/gpt-finetuned MichaelHu03/CS6220-GPT akiraaqira/career-full anezatra/gpt1-openassistant-117M-instruct dqwdqweqw/gpt-finetuned folklore1000/lyric001s512 goktug14/gpt1_sst2_left goktug14/gpt1_sst2_right hf-internal-testing/tiny-random-OpenAIGPTForSequenceClassification hf-internal-testing/tiny-random-OpenAIGPTLMHeadModel hf-internal-testing/tiny-random-OpenAIGPTModel hf-tiny-model-private/tiny-random-OpenAIGPTForSequenceClassification hf-tiny-model-private/tiny-random-OpenAIGPTLMHeadModel hf-tiny-model-private/tiny-random-OpenAIGPTModel hojjatkarami/ehr_gpt2 jeonghyeon97/gpt-finetuned karanzrk/essayl0 lgaalves/gpt1 model-attribution-challenge/openai-gpt obov/gpt-finetuned openai-community/openai-gpt soonbob/gpt-finetuned tmnam20/test_pretrain_pipeline vietnhatthai/test_pretrain_gpt_pipeline vietnhatthai/test_pretrain_pipeline vietnhatthai/viet_news_pretrain_pipeline

Pattern #16 (23 models)

-post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=0), ...], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[1], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[1], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
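
The "+" pair template in Pattern #16 can be read as the following plain-Python sketch. The function name is hypothetical and no `tokenizers` API is used; it only mirrors the template as printed above: [CLS], sequence A, and the first [SEP] carry type_id 0, while sequence B and the final [SEP] carry type_id 1. The "-" side is truncated in the report ("...") and is therefore not reproduced.

```python
from typing import List, Tuple

def apply_pair_template(a: List[str], b: List[str]) -> Tuple[List[str], List[int]]:
    """Mirror of the '+' pair template: [CLS] A [SEP] B [SEP]."""
    tokens = ["[CLS]"] + a + ["[SEP]"] + b + ["[SEP]"]
    # [CLS] + A + first [SEP] -> type_id 0; B + final [SEP] -> type_id 1.
    type_ids = [0] * (len(a) + 2) + [1] * (len(b) + 1)
    return tokens, type_ids

tokens, type_ids = apply_pair_template(["hello"], ["big", "world"])
print(tokens)
print(type_ids)
```

The visible part of the "-" side assigns type_id 0 to everything it shows, so the second segment's type ids are one place where the two backends would be expected to differ on pair inputs.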

Affected models

Converter	Models
DebertaConverter	AnkitAI/deberta-xlarge-base-emotions-classifier Denyol/FakeNews-deberta-large KoalaAI/OffensiveSpeechDetector KoalaAI/Text-Moderation PORTULAN/albertina-100m-portuguese-ptbr-encoder PORTULAN/albertina-100m-portuguese-ptpt-encoder PleIAs/KaribuAI djagatiya/ner-deberta-base-ontonotesv5-englishv4 garrettbaber/twitter-roberta-base-joy-intensity h2oai/deberta_finetuned_pii hf-internal-testing/tiny-random-DebertaForMaskedLM hf-internal-testing/tiny-random-DebertaForQuestionAnswering hf-internal-testing/tiny-random-DebertaForSequenceClassification hf-internal-testing/tiny-random-DebertaForTokenClassification hf-internal-testing/tiny-random-DebertaModel jammmmmm/pii lakshyakh93/deberta_finetuned_pii matejmicek/autotrain-crender2.0-39012102367 protectai/lakshyakh93-deberta_finetuned_pii-onnx raj-tomar001/LLM-DetectAIve_deberta-base s-nlp/deberta-large-formality-ranker sagawa/PubChem-10m-deberta sagawa/ZINC-deberta

Pattern #17 (17 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=String(" {2,}"), content="▁")])
-pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
+pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), ...], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})

Affected models

Converter	Models
XLMRobertaConverter	VerboVision/MetaCLIP2-Distil-60-PCV facebook/metaclip-2-worldwide-b16 facebook/metaclip-2-worldwide-b16-384 facebook/metaclip-2-worldwide-b32 facebook/metaclip-2-worldwide-b32-384 facebook/metaclip-2-worldwide-giant facebook/metaclip-2-worldwide-giant-378 facebook/metaclip-2-worldwide-huge-378 facebook/metaclip-2-worldwide-huge-quickgelu facebook/metaclip-2-worldwide-huge-quickgelu facebook/metaclip-2-worldwide-l14 facebook/metaclip-2-worldwide-l14 facebook/metaclip-2-worldwide-m16 facebook/metaclip-2-worldwide-m16-384 facebook/metaclip-2-worldwide-s16 facebook/metaclip-2-worldwide-s16-384 onnx-community/metaclip-2-worldwide-huge-378-ONNX

Pattern #18 (15 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("\s{2,}|[\n\r\t]"), content=" "), NFC(), Strip(strip_left=False, strip_right=True)])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])

Affected models

Converter	Models
ReformerConverter	Nick1899/reformer-biological-papers-finetuned Nick1899/reformer-biological-papers-finetuned1 google/reformer-crime-and-punishment hf-internal-testing/tiny-random-ReformerForMaskedLM hf-internal-testing/tiny-random-ReformerForQuestionAnswering hf-internal-testing/tiny-random-ReformerForSequenceClassification hf-internal-testing/tiny-random-ReformerModel hf-internal-testing/tiny-random-reformer hf-tiny-model-private/tiny-random-ReformerForMaskedLM hf-tiny-model-private/tiny-random-ReformerForQuestionAnswering hf-tiny-model-private/tiny-random-ReformerForSequenceClassification hf-tiny-model-private/tiny-random-ReformerModel mwesner/reformer-clm nadellaroshni/reformer_model robingeibel/reformer-finetuned-big_patent-16384

Pattern #19 (14 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Replace(pattern=Regex(" {2,}"), content=" ")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="eng_Latn", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="eng_Latn", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "eng_Latn":SpecialToken(id="eng_Latn", ids=[256047], tokens=["eng_Latn"])})
+post_processor:		TemplateProcessing(single=[Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="<unk>", type_id=0)], pair=[Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="<unk>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<unk>":SpecialToken(id="<unk>", ids=[3], tokens=["<unk>"])})

Affected models

Converter	Models
NllbConverter	AfriNLP/AfriNLLB-12enc-12dec-full-ft-kd AfriNLP/AfriNLLB-8enc-8dec-iterative-498m-ft JustFrederik/nllb-200-3.3B-ct2-float16 JustFrederik/nllb-200-distilled-1.3B-ct2-float16 JustFrederik/nllb-200-distilled-1.3B-ct2-int8 JustFrederik/nllb-200-distilled-600M-ct2 JustFrederik/nllb-200-distilled-600M-ct2-float16 JustFrederik/nllb-200-distilled-600M-ct2-int8 KomorebiAI/nllb-200-3.3B-float16-ct2 KomorebiAI/nllb-200-3.3B-int8-ct2 entai2965/nllb-200-3.3B-ctranslate2 entai2965/nllb-200-3.3B-ctranslate2-float16 entai2965/nllb-200-distilled-1.3B-ctranslate2 entai2965/nllb-200-distilled-600M-ctranslate2

Pattern #20 (12 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=String(" {2,}"), content="▁")])
-pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
+pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), ...], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})

Affected models

Converter	Models
XLMRobertaConverter	Coder-Dragon/kosmos-finetuned-DocLayNet DhananjayNahata24/Kosmos-2-DJ1 Mit1208/Kosmos-2-PokemonCards-trl-merged MoonstoneF/kosm-checkpoint MoonstoneF/kosmos-finetuned-DocLayNet ShivamExto/Kosmos-2-Furnas-trl-2 ShivamExto/Kosmos-2-Furnas-trl-2-1 hf-internal-testing/tiny-random-Kosmos2ForConditionalGeneration hf-internal-testing/tiny-random-Kosmos2Model ishaangupta293/kosmos-2-patch14-24-dup-ms microsoft/kosmos-2-patch14-224 sutantowilliam/kosmos-finetuned-DocLayNet

Pattern #21 (12 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("\n"), content=" "), Replace(pattern=Regex(" {2,}"), content=" ")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])

Affected models

Converter	Models
PegasusConverter	CLARA-MeD/pegasus-xsum ChaniM/tst-summarization Einmalumdiewelt/PegasusXSUM_GNAD RajSang/pegasus-sports-titles allenai/pegasus-multi_lexsum-long-short allenai/pegasus-multi_lexsum-long-tiny allenai/pegasus-multi_lexsum-short-tiny eilamc14/pegasus-xsum-text-simplification google/pegasus-xsum summarizationnnnn/Pegasus the-hir0/pegasus-detoxify wgcv/tidy-tab-model-pegasus-xsum

Pattern #22 (10 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=String(" {2,}"), content="▁")])
+normalizer:		Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A...")
-post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), ...], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})

Affected models

Converter	Models
XLMRobertaConverter	Chow05/fine-tune-embedding-v1 Chow05/fine-tune-embedding-v2 Chow05/fine-tune-embedding-v3 Chow05/fine-tune-embedding-v4 Chow05/fine-tune-embedding-v5 Chow05/fine-tune-embedding-v6 dangvantuan/vietnamese-document-embedding jinaai/jina-clip-v2 longsteel/embedding visheratin/mexma-siglip2

Pattern #23 (8 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("\n"), content=" "), Replace(pattern=Regex(" {2,}"), content=" ")])
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
+pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])

Affected models

Converter	Models
PegasusConverter	Nicovis/ConvSum hyperchancellor07/pegasus-samsum-dialogue-summarizer mariam16elgohary/pegasus_arxiv_mit_lectures6 mohitskaushal/legal-pegasus-layman-legal-summarizer seanduffy/arxiv_summarizer seanduffy/govreport_summarizer seanduffy/pubmed_summarizer takao3548/pegasus-samsum

Pattern #24 (8 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])

Affected models

Converter	Models
T5Converter	KETI-AIR-Downstream/long-ke-t5-base-translation-aihub-bidirection KETI-AIR-Downstream/long-ke-t5-base-translation-aihub-en2ko KETI-AIR-Downstream/long-ke-t5-base-translation-aihub-ko2en KETI-AIR/long-ke-t5-base KETI-AIR/long-ke-t5-small ding-diri-ding-dong/long-ke-t5-base-translation-aihub-ko2en pellucid/my_awesome_opus100_model pigeon01/sungju-finetuned-ko-to-en_ver3

Pattern #25 (8 models)

-pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=String("SPL1T-TH1S-Pl3A5E"), behavior=Removed, invert=False), Digits(individual_digits=True), Split(pattern=String("[\(\)\[\]\{\}]|([!\"#\$%\&'\*\+,\-\./:;<=>\?\\\^_`\|\~])\1*"), behavior=Isolated, invert=False), Split(pattern=String("
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=String("SPL1T-TH1S-Pl3A5E"), behavior=Removed, invert=False), Digits(individual_digits=True), Split(pattern=Regex("[\(\)\[\]\{\}]|([!"\#\$%\&'\*\+,\-\./:;<=>\?\\\^_`\|\~])\1*"), behavior=Isolated, invert=False), Split(pattern=String("

Affected models

Converter	Models
TikTokenConverter	HongxuanLi/nougat-base-deploy Xenova/nougat-base facebook/nougat-base jjreif/nougat-base-fork kevin-pek/nougat-api mzbac/nougat-base-8bit-mlx pszemraj/nougat-base-onnx pszemraj/nougat-base-onnx-quant_avx2

Pattern #26 (8 models)

-pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=String("SPL1T-TH1S-Pl3A5E"), behavior=Removed, invert=False), Digits(individual_digits=True), Split(pattern=String("[\(\)\[\]\{\}]|([!\"#\$%\&'\*\+,\-\./:;<=>\?\\\^_`\|\~])\1*"), behavior=Isolated, invert=False), Split(pattern=String("
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=String("SPL1T-TH1S-Pl3A5E"), behavior=Removed, invert=False), Digits(individual_digits=True), Split(pattern=Regex("[\(\)\[\]\{\}]|([!"\#\$%\&'\*\+,\-\./:;<=>\?\\\^_`\|\~])\1*"), behavior=Isolated, invert=False), Split(pattern=String("
-truncation:		{'max_length': 4096, 'stride': 0, 'strategy': 'longest_first', 'direction': 'right'}
+truncation:		{'max_length': 3584, 'stride': 0, 'strategy': 'longest_first', 'direction': 'right'}

Affected models

Converter	Models
TikTokenConverter	CuiSiwei/nougat-for-formula Xenova/nougat-small facebook/nougat-small mzbac/nougat-small-8bit-mlx onnx-community/nougat-small-ONNX pszemraj/nougat-small-onnx pszemraj/nougat-small-onnx-quant_avx2 pszemraj/nougat-small-onnx-quant_avx512_vnni

Pattern #27 (7 models)

-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=first, split=False)
+pre_tokenizer:		None
-decoder:		Sequence(decoders=[Replace(pattern=String("▁"), content=" "), ByteFallback(), Fuse(), Strip(content=" ", start=1, stop=0)])
+decoder:		None

Affected models

Converter	Models
LlamaConverter	BEE-spoke-data/smol_llama-101M-GQA-python HuggingFaceM4/tiny-random-idefics-m4 OpenGVLab/InternVL2-4B Wanfq/FuseLLM-7B nahidalam/llava-aimv2 philschmid/tiny-random-idefics-m4 screenmate/idefics_50_25_25_merged

Pattern #28 (7 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])

Affected models

Converter	Models
T5Converter	timm/ViT-B-16-SigLIP timm/ViT-B-16-SigLIP-256 timm/ViT-B-16-SigLIP-512 timm/ViT-B-16-SigLIP-i18n-256 timm/ViT-L-16-SigLIP-256 timm/ViT-SO400M-14-SigLIP timm/ViT-SO400M-14-SigLIP-384

Pattern #29 (7 models)

-AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
-pre_tokenizer:		Split(pattern=String(" "), behavior=MergedWithPrevious, invert=False)
+pre_tokenizer:		None

Affected models

Converter	Models
GemmaConverter	RichardErkhov/google_-_recurrentgemma-2b-4bits RichardErkhov/google_-_recurrentgemma-2b-8bits RichardErkhov/google_-_recurrentgemma-2b-it-4bits RichardErkhov/google_-_recurrentgemma-2b-it-8bits monology/recurrentgemma-9b-it-8bit qihoo360/fg-clip2-so400m theo77186/recurrentgemma-9b-it-bnb-4bit

Pattern #30 (6 models)

-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=first, split=False)
+pre_tokenizer:		Sequence(pretokenizers=[ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True), Metaspace(replacement="▁", prepend_scheme=first, split=False)])

Affected models

Converter	Models
LlamaConverter	CanisAI/teach-generalist-ministral-3b-r2 CanisAI/teach-humanities-ministral-3b-r2 CanisAI/teach-language-ministral-3b-r2 CanisAI/teach-math-ministral-3b-r2 CanisAI/teach-science-ministral-3b-r2 thestarfarer/Ministral-3-14B-1x1

Pattern #31 (6 models)

-AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
+AddedToken("▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
-pre_tokenizer:		Split(pattern=String(" "), behavior=MergedWithPrevious, invert=False)
+pre_tokenizer:		None

Affected models

Converter	Models
GemmaConverter	alpindale/recurrentgemma-9b alpindale/recurrentgemma-9b-it gg-hf/recurrentgemma-9b gg-hf/recurrentgemma-9b-it gg-hf/recurrentgemma-9b-it-pytorch gg-hf/recurrentgemma-9b-pytorch

Pattern #32 (6 models)

-model:			AddedToken("<pad>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-AddedToken("<unk>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+model:			AddedToken("<unk>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<pad>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=first, split=True)
-post_processor:		TemplateProcessing(single=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__fra__":SpecialToken(id="__fra__", ids=[256026], tokens=["__fra__"])})
-decoder:		Metaspace(replacement="▁", prepend_scheme=first, split=True)
+normalizer:		None
+pre_tokenizer:		None
+post_processor:		TemplateProcessing(single=[Sequence(id=A, type_id=0)], pair=[Sequence(id=A, type_id=0), Sequence(id=B, type_id=1)], special_tokens={})
+decoder:		None

Affected models

Converter	Models
SeamlessM4TConverter	audo/seamless-m4t-v2-large facebook/seamless-m4t-v2-large jaman21/seamless-m4t-v2-t2st jaman21/seamless-m4t-v2-t2tt jaman21/seamless-m4t-v2-t2tt-t2st osanseviero/seamless-copy

Pattern #33 (5 models)

-post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=0), ...], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[1], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="[SEP]", type_id=0)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[1], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})

Affected models

Converter	Models
DebertaConverter	KISTI-AI/scideberta-cs cross-encoder/nli-deberta-base geckos/deberta-base-fine-tuned-ner hf-internal-testing/tiny-random-deberta optimum-intel-internal-testing/tiny-random-deberta

Pattern #34 (5 models)

-pre_tokenizer:		Split(pattern=String(" "), behavior=MergedWithPrevious, invert=False)
+pre_tokenizer:		None

Affected models

Converter	Models
GemmaConverter	FlatFootInternational/gemma-3n-E4B-it-bf16 MuXodious/gemma-3n-E4B-it-absolute-heresy-MPOA-mlx-8Bit blind-assist/google-gemma-3n-2b-e3 tomaarsen/t5gemma-s-gooaq-cmnrl wdaniel00763n/gemma-3N-news-finetune

Pattern #35 (5 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])

Affected models

Converter	Models
SeamlessM4TConverter	ThivyanRR/english_seamlessm4t_medium ThivyanRR/gujarathi_seamlessm4t_medium elego/ss-hmong-v3 lukmanaj/hf-seamless-m4t-medium-en-tw-10-ep lukmanaj/hf-seamless-m4t-medium-en-tw-3-ep

Pattern #36 (5 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])

Affected models

Converter	Models
SeamlessM4TConverter	AnasAber/seamless-darija-eng RafatK/SMT-AZR ThivyanRR/indic_seamlessm4t_v2_large UBC-NLP/Simba-S xun/seamless-m4t-v2-large-8bit-bnb

Pattern #37 (4 models)

-pre_tokenizer:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
-post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=True)
+pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
+post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)
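
The add_prefix_space flip in Pattern #37 can be illustrated with a small plain-Python sketch. It assumes that the ByteLevel pre-tokenizer, when add_prefix_space is True, prepends a single space unless the text already starts with one. The function name is hypothetical, and the byte-to-character mapping, the regex split, and the RobertaProcessing side of the diff are not modeled.

```python
def apply_prefix_space(text: str, add_prefix_space: bool) -> str:
    """Assumed behavior: prepend one space when the flag is set and none is present."""
    if add_prefix_space and not text.startswith(" "):
        return " " + text
    return text

for flag in (True, False):
    print(flag, repr(apply_prefix_space("invoice total", flag)))
```

Under this assumption only the first word of an input is affected, which is enough to change its token id in a byte-level BPE vocabulary that distinguishes word-initial tokens with and without a leading space.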

Affected models

Converter	Models
RobertaConverter	DmitrySpartak/layoutlm-invoices faisalraza/layoutlm-invoices impira/layoutlm-document-qa impira/layoutlm-invoices

Pattern #38 (3 models)

+AddedToken("<|fim_prefix|>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<|fim_middle|>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<|fim_suffix|>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<|endofprompt|>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}{1,3}| ?[^\s\p{L}\p{N}]+[\r\n]*|\s*[\r\n]..."), behavior=Removed, invert=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=False)])

Affected models

Converter	Models
GPT2Converter	ChuckMcSneed/dolphin-2.9.1-dbrx-llamacppfixed imi2/dbrx-base-2.5bpw-h6-exl2 nicoboss/dbrx-base

Pattern #39 (3 models)

-post_processor:		RobertaProcessing(sep=("<sep>", 50265), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)
+post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)

Affected models

Converter	Models
RobertaConverter	dtorber/BioNLP-2024-dtorber-baseline-eLife dtorber/BioNLP-intro-disc-eLife dtorber/BioNLP-tech-intro-disc-eLife

Pattern #40 (3 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Replace(pattern=Regex(" {2,}"), content=" ")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])

Affected models

Converter	Models
NllbConverter	KnutJaegersberg/nllb-moe-54b-4bit Maxime-Bakunzi/twigane-en-kin-translation madatnlp/nllb-moe-54b-8bit

Pattern #41 (3 models)

-normalizer:		None
-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
+normalizer:		NFKC()
+pre_tokenizer:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)

Affected models

Converter	Models
GPT2Converter	UBC-NLP/Jasmine-350M VietAI/gpt-j-6B-vietnamese-news VietAI/gpt-neo-1.3B-vietnamese-news
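
As a rough illustration of what adding `NFKC()` can do to input text before pre-tokenization, the sketch below uses Python's standard `unicodedata` module as a stand-in for the normalizer. The sample strings are made up for this example and are not taken from the affected models.

```python
import unicodedata

# Illustrative inputs (assumptions): a ligature, full-width digits,
# and a decomposed Vietnamese syllable (e + dot below + circumflex).
samples = ["\ufb01le", "\uff11\uff12\uff13", "Vie\u0323\u0302t"]

for raw in samples:
    normalized = unicodedata.normalize("NFKC", raw)
    status = "changed" if normalized != raw else "unchanged"
    print(f"{raw!r} -> {normalized!r} ({status})")
```

With `normalizer: None` these strings reach the byte-level pre-tokenizer unchanged, so any input containing compatibility characters or decomposed accents can yield different byte sequences on the two sides.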

Pattern #42 (3 models)

-post_processor:		TemplateProcessing(single=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__fra__":SpecialToken(id="__fra__", ids=[256026], tokens=["__fra__"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="__eng__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="__eng__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__eng__":SpecialToken(id="__eng__", ids=[256022], tokens=["__eng__"])})

Affected models

Converter	Models
SeamlessM4TConverter	Geneline-X/seamless-m4t-v2-sunbird-multilingual-v1 KDiallo/seamless_sunbird_finetune KDiallo/seamless_sunbird_finetune_v2

Pattern #43 (3 models)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__fra__":SpecialToken(id="__fra__", ids=[256026], tokens=["__fra__"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="__eng__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="__eng__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__eng__":SpecialToken(id="__eng__", ids=[256022], tokens=["__eng__"])})

Affected models

Converter	Models
SeamlessM4TConverter	EricTydd/SpeechtoText-Burmese Marialab/finetuned-seamless-m4T-large-1000-step blakenp/english_to_spanish_model

Pattern #44 (2 models)

-normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=False, lowercase=False)
-pre_tokenizer:		BertPreTokenizer()
-post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[2], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[3], tokens=["[SEP]"])})
-decoder:		WordPiece(prefix="##", cleanup=True)
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
+pre_tokenizer:		Sequence(pretokenizers=[BertPreTokenizer(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
+post_processor:		BertProcessing(sep=("[SEP]", 3), cls=("[CLS]", 2))
+decoder:		Metaspace(replacement="▁", prepend_scheme=always, split=True)

Affected models

Converter	Models
BertConverter	Icelandic-lt/convbert-small-igc-is jonfd/convbert-small-igc-is

Pattern #45 (2 models)

-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=first, split=False)
+pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=never, split=False)

Affected models

Converter	Models
LlamaConverter	OpenGVLab/InternVL2_5-2B-MPO-hf OpenGVLab/InternVL2_5-8B-MPO-hf
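
The effect of `prepend_scheme=first` versus `prepend_scheme=never` can be sketched in plain Python. This is an approximation under stated assumptions: `metaspace` and its `is_first_section` parameter are hypothetical names, with `is_first_section` modelling whether the piece starts at the beginning of the input, and splitting is ignored because `split=False` on both sides.

```python
REPLACEMENT = "\u2581"  # the "▁" marker shown in the diff


def metaspace(text: str, prepend_scheme: str, is_first_section: bool = True) -> str:
    """Hypothetical sketch of a Metaspace-style pre-tokenizer with split=False."""
    out = text.replace(" ", REPLACEMENT)
    if out.startswith(REPLACEMENT):
        return out  # assumption: never double the marker
    if prepend_scheme == "always" or (prepend_scheme == "first" and is_first_section):
        out = REPLACEMENT + out
    return out


with_first = metaspace("Hello world", "first")
with_never = metaspace("Hello world", "never")
print(with_first)  # ▁Hello▁world
print(with_never)  # Hello▁world
```

Under this reading, only the first word of an input is affected: it is looked up with the marker on one side and without it on the other.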

Pattern #46 (2 models)

-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=False)
+pre_tokenizer:		None
-decoder:		Sequence(decoders=[Replace(pattern=String("▁"), content=" "), ByteFallback(), Fuse(), Strip(content=" ", start=1, stop=0)])
+decoder:		None

Affected models

Converter	Models
LlamaConverter	OpenGVLab/InternVL-Chat-V1-2 OpenGVLab/InternVL2-40B

Pattern #47 (2 models)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+normalizer:		None
+pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])

Affected models

Converter	Models
T5Converter	freddy913/FRDYV2_38 freddy913/FRDYV2_39

Pattern #48 (2 models)

-normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=True)
-pre_tokenizer:		BertPreTokenizer()
-post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[0], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
-decoder:		WordPiece(prefix="##", cleanup=True)
+normalizer:		Sequence(normalizers=[NFKC(), Lowercase()])
+pre_tokenizer:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+post_processor:		BertProcessing(sep=("[SEP]", 2), cls=("[CLS]", 0))
+decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)

Affected models

Converter	Models
BertConverter	Bingsu/mobilebert_ko_mlm_1 Bingsu/my_mobilebert_untrained

Pattern #49 (2 models)

-pre_tokenizer:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
-post_processor:		RobertaProcessing(sep=("</s>", 36745), cls=("<s>", 36744), trim_offsets=True, add_prefix_space=True)
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+pre_tokenizer:		Whitespace()
+post_processor:		TemplateProcessing(single=[Sequence(id=A, type_id=0)], pair=[Sequence(id=A, type_id=0), Sequence(id=B, type_id=1)], special_tokens={})
+decoder:		None

Affected models

Converter	Models
RobertaConverter	rpii2023/Je_baat rpii2023/naya_token

Pattern #50 (2 models)

-normalizer:		None
+normalizer:		Lowercase()

Affected models

Converter	Models
RobertaConverter	cambridge-climb/baseline-roberta_pre_layer_norm-model climb-mao/climb-roberta_pre_layer_norm-model

Pattern #51 (2 models)

-AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-pre_tokenizer:		Split(pattern=String(" "), behavior=MergedWithPrevious, invert=False)
+pre_tokenizer:		None

Affected models

Converter	Models
GemmaConverter	RichardErkhov/voidful_-_recurrentgemma-2b-base-4bits RichardErkhov/voidful_-_recurrentgemma-2b-base-8bits

Pattern #52 (2 models)

-AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
+AddedToken("<mask>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=False)
-normalizer:		Replace(pattern=String(" "), content="▁")
-pre_tokenizer:		Split(pattern=String(" "), behavior=MergedWithPrevious, invert=False)
+normalizer:		None
+pre_tokenizer:		None
-decoder:		Sequence(decoders=[Replace(pattern=String("▁"), content=" "), ByteFallback(), Fuse()])
+decoder:		None

Affected models

Converter	Models
GemmaConverter	aimagelab/LLaVA_MORE-gemma_2_2b-finetuning aimagelab/LLaVA_MORE-gemma_2_9b-finetuning

Pattern #53 (1 model)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])

Affected models

Converter	Models
T5Converter	teamapocalypseml/regben2ipa-mt5-base

Pattern #54 (1 model)

-AddedToken("<SEP>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-AddedToken("<CLS>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-normalizer:		NFC()
-pre_tokenizer:		Sequence(pretokenizers=[Digits(individual_digits=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)])
+normalizer:		None
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("\d{1,3}(?=(?:\d{3})*\b)"), behavior=Isolated, invert=False), Split(pattern=Regex("[^\r\n\p{L}\p{N}]?[\p{Lu}\p{Lt}\p{Lm}\p{Lo}\p{M}]*[\p{Ll}\p{Lm}\p{Lo}\p{M}]+(?i:'s|'t|'re|'ve|'m|'ll..."), behavior=Isolated, invert=False), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=False)])

Affected models

Converter	Models
TikTokenConverter	optimum-intel-internal-testing/tiny-random-aya-base

Pattern #55 (1 model)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=False), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])

Affected models

Converter	Models
BigBirdConverter	pszemraj/bigbird-roberta-base-edu-classifier
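
A small standard-library sketch of this two-step normalizer shows where `strip_right` matters. `normalize` is a hypothetical helper; Python's `str.rstrip` and `re.sub` are used as stand-ins for `Strip` and `Replace`, which is an assumption about their behaviour rather than a reproduction of the library code.

```python
import re


def normalize(text: str, strip_right: bool) -> str:
    """Hypothetical stand-in for Sequence([Strip(...), Replace(" {2,}", "▁")])."""
    if strip_right:
        text = text.rstrip()  # drop trailing whitespace first
    # Runs of two or more spaces collapse into a single "▁" marker.
    return re.sub(r" {2,}", "\u2581", text)


old_side = normalize("good  movie  ", strip_right=False)
new_side = normalize("good  movie  ", strip_right=True)
print(repr(old_side))  # 'good▁movie▁'
print(repr(new_side))  # 'good▁movie'
```

In this sketch the two sides differ only for inputs that end in whitespace.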

Pattern #56 (1 model)

-normalizer:		None
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=never, split=False)
+normalizer:		Sequence(normalizers=[Replace(pattern=String(" "), content="▁")])
+pre_tokenizer:		None

Affected models

Converter	Models
LlamaConverter	logsyc/failure-aware-ernie-4.5

Pattern #57 (1 model)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=False), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])

Affected models

Converter	Models
BigBirdConverter	kimsan0622/bigbird-base

Pattern #58 (1 model)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("\s{2,}|[\n\r\t]"), content=" "), NFC(), Strip(strip_left=False, strip_right=True)])
+normalizer:		Sequence(normalizers=[Strip(strip_left=True, strip_right=True), Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])

Affected models

Converter	Models
DebertaV2Converter	DataFog/pii-small-en

Pattern #59 (1 model)

-post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
+post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)

Affected models

Converter	Models
MarkupLMConverter	SaulLu/markuplm-base

Pattern #60 (1 model)

-normalizer:		Sequence(normalizers=[Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
-pre_tokenizer:		Metaspace(replacement="▁", prepend_scheme=always, split=True)
+normalizer:		Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A...")
+pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])

Affected models

Converter	Models
XLMRobertaConverter	microsoft/layoutxlm-base

Pattern #61 (1 model)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])

Affected models

Converter	Models
MBart50Converter	dlucidone/kumaoni-mbart-lora

Pattern #62 (1 model)

-post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[1], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
+post_processor:		BertProcessing(sep=("[SEP]", 2), cls=("[CLS]", 1))

Affected models

Converter	Models
BertConverter	KBLab/bert-base-swedish-cased-reallysimple-ner

Pattern #63 (1 model)

-normalizer:		None
-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
-post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+normalizer:		Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A...")
+pre_tokenizer:		Sequence(pretokenizers=[WhitespaceSplit(), Metaspace(replacement="▁", prepend_scheme=always, split=True)])
+post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=0), ...], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
+decoder:		Metaspace(replacement="▁", prepend_scheme=always, split=True)

Affected models

Converter	Models
RobertaConverter	vgaraujov/led-base-16384-spanish

Pattern #64 (1 model)

-post_processor:		RobertaProcessing(sep=("</s>", 25905), cls=("<s>", 25904), trim_offsets=True, add_prefix_space=False)
+post_processor:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)

Affected models

Converter	Models
RobertaConverter	Mwnthai/bodo-legal-led-summ

Pattern #65 (1 model)

-normalizer:		NFC()
+normalizer:		None

Affected models

Converter	Models
TikTokenConverter	NYTK/PULI-HuBA-mamba-130M

Pattern #66 (1 model)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Replace(pattern=Regex(" {2,}"), content=" ")])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Replace(pattern=Regex(" {2,}"), content=" ")])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="eng_Latn", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="eng_Latn", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "eng_Latn":SpecialToken(id="eng_Latn", ids=[256047], tokens=["eng_Latn"])})
+post_processor:		TemplateProcessing(single=[Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="eng_Latn", type_id=0)], pair=[Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="eng_Latn", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "eng_Latn":SpecialToken(id="eng_Latn", ids=[256047], tokens=["eng_Latn"])})

Affected models

Converter	Models
NllbConverter	facebook/nllb-moe-54b
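
The post-processor change in this pattern moves the language code from the front of the sequence to the end. The sketch below mimics the `single` template with a plain list. `apply_template` and the `"$A"` placeholder are hypothetical illustration names, and the example tokens are invented.

```python
def apply_template(template: list[str], sequence_a: list[str]) -> list[str]:
    """Hypothetical helper: expand "$A" into the tokens, keep other items as special tokens."""
    out: list[str] = []
    for item in template:
        if item == "$A":
            out.extend(sequence_a)
        else:
            out.append(item)
    return out


tokens = ["\u2581Hello", "\u2581world"]  # made-up pieces for illustration
old_side = apply_template(["eng_Latn", "$A", "</s>"], tokens)
new_side = apply_template(["$A", "</s>", "eng_Latn"], tokens)
print(old_side)  # ['eng_Latn', '▁Hello', '▁world', '</s>']
print(new_side)  # ['▁Hello', '▁world', '</s>', 'eng_Latn']
```

Both sides emit the same set of ids, so a comparison that ignores order would miss this mismatch.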

Pattern #67 (1 model)

-normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=True)
+normalizer:		Sequence(normalizers=[NFKD(), StripAccents(), Lowercase()])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[2], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[3], tokens=["[SEP]"])})
+post_processor:		TemplateProcessing(single=[Sequence(id=A, type_id=0)], pair=[Sequence(id=A, type_id=0), Sequence(id=B, type_id=1)], special_tokens={})

Affected models

Converter	Models
BertConverter	novelcore/gem-electra
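
The new-side normalizer chain `NFKD(), StripAccents(), Lowercase()` can be approximated with the standard library. This is a sketch under assumptions: `nfkd_strip_lower` is a hypothetical helper, and dropping every character with a non-zero combining class is only an approximation of `StripAccents`. It does not model the `clean_text` or `handle_chinese_chars` steps of the old-side `BertNormalizer`.

```python
import unicodedata


def nfkd_strip_lower(text: str) -> str:
    """Hypothetical approximation of Sequence([NFKD(), StripAccents(), Lowercase()])."""
    decomposed = unicodedata.normalize("NFKD", text)
    # Assumption: treat any character with a non-zero combining class as an accent.
    without_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return without_marks.lower()


print(nfkd_strip_lower("Caf\u00e9 \ufb01n"))  # cafe fin
print(nfkd_strip_lower("\u0395\u03bb\u03bb\u03b7\u03bd\u03b9\u03ba\u03ac"))  # ελληνικα
```

The second example shows that compatibility decomposition also expands the ligature, which a canonical-only decomposition would leave intact.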

Pattern #68 (1 model)

-normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=False, lowercase=False)
+normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=False, lowercase=True)

Affected models

Converter	Models
BertConverter	Seznam/small-e-czech

Pattern #69 (1 model)

-AddedToken("<s>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-AddedToken("</s>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-normalizer:		None
-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
-post_processor:		RobertaProcessing(sep=("[SEP]", 102), cls=("[CLS]", 101), trim_offsets=True, add_prefix_space=False)
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=True)
+pre_tokenizer:		BertPreTokenizer()
+post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[101], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[102], tokens=["[SEP]"])})
+decoder:		WordPiece(prefix="##", cleanup=True)

Affected models

Converter	Models
RobertaConverter	thunlp/Lawformer

Pattern #70 (1 model)

-post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)
+post_processor:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)

Affected models

Converter	Models
RobertaConverter	mrm8488/longformer-base-4096-spanish

Pattern #71 (1 model)

-AddedToken("<s>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-AddedToken("</s>", rstrip=False, lstrip=False, single_word=False, normalized=False, special=True)
-normalizer:		None
-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
-post_processor:		RobertaProcessing(sep=("[SEP]", 3), cls=("[CLS]", 2), trim_offsets=True, add_prefix_space=False)
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=False, lowercase=True)
+pre_tokenizer:		BertPreTokenizer()
+post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=1), SpecialToken(id="[SEP]", type_id=1)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[2], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[3], tokens=["[SEP]"])})
+decoder:		WordPiece(prefix="##", cleanup=True)

Affected models

Converter	Models
RobertaConverter	UWB-AIR/MQDD-pretrained

Pattern #72 (1 model)

-normalizer:		None
-pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
-post_processor:		RobertaProcessing(sep=("[SEP]", 2), cls=("[CLS]", 0), trim_offsets=True, add_prefix_space=False)
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+normalizer:		BertNormalizer(clean_text=True, handle_chinese_chars=True, strip_accents=None, lowercase=False)
+pre_tokenizer:		BertPreTokenizer()
+post_processor:		TemplateProcessing(single=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0)], pair=[SpecialToken(id="[CLS]", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="[SEP]", type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="[SEP]", type_id=0)], special_tokens={"[CLS]":SpecialToken(id="[CLS]", ids=[0], tokens=["[CLS]"]), "[SEP]":SpecialToken(id="[SEP]", ids=[2], tokens=["[SEP]"])})
+decoder:		WordPiece(prefix="##", cleanup=True)

Affected models

Converter	Models
RobertaConverter	theSOL1/kolongformer-base-4096

Pattern #73 (1 model)

-normalizer:		Sequence(normalizers=[NFC(), Replace(pattern=Regex("\s+"), content=" "), Lowercase()])
-pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("<\|startoftext\|>|<\|endoftext\|>|'s|'t|'re|'ve|'m|'ll|'d|[\p{L}]+|[\p{N}]|[^\s\p{L}\p{N}]+"), behavior=Removed, invert=True), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)])
-post_processor:		RobertaProcessing(sep=("<|endoftext|>", 1), cls=("<|startoftext|>", 0), trim_offsets=False, add_prefix_space=False)
+normalizer:		None
+pre_tokenizer:		ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=True)
+post_processor:		ByteLevel(add_prefix_space=True, trim_offsets=False, use_regex=True)

Affected models

Converter	Models
CLIPConverter	hf-internal-testing/tiny-random-clip

Pattern #74 (1 model)

-pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?[\p{L}\p{M}]+|\p{N}| ?[^\s\p{L}\p{M}\p{N}]+[\r\n]*|\s..."), behavior=Isolated, invert=False), ByteLevel(add_prefix_space=False, trim_offsets=True, use_regex=False)])
+pre_tokenizer:		Sequence(pretokenizers=[Split(pattern=Regex("(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?[\p{L}\p{M}]+|\p{N}| ?[^\s\p{L}\p{M}\p{N}]+[\r\n]*|\s..."), behavior=Isolated, invert=False), ByteLevel(add_prefix_space=False, trim_offsets=False, use_regex=False)])
-decoder:		ByteLevel(add_prefix_space=True, trim_offsets=True, use_regex=True)
+decoder:		ByteLevel(add_prefix_space=False, trim_offsets=False, use_regex=False)

Affected models

Converter	Models
TikTokenConverter	inferencerlabs/Qwen3.5-397B-A17B-MLX-4.1bit

Pattern #75 (1 model)

-post_processor:		TemplateProcessing(single=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="<s>", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0), SpecialToken(id="</s>", type_id=0), Sequence(id=B, type_id=1), ...], special_tokens={"</s>":SpecialToken(id="</s>", ids=[2], tokens=["</s>"]), "<s>":SpecialToken(id="<s>", ids=[0], tokens=["<s>"])})
+post_processor:		RobertaProcessing(sep=("</s>", 2), cls=("<s>", 0), trim_offsets=True, add_prefix_space=False)

Affected models

Converter	Models
MPNetConverter	mukaj/fin-mpnet-base

Pattern #76 (1 model)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__fra__":SpecialToken(id="__fra__", ids=[256057], tokens=["__fra__"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="__dan__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="__dan__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__dan__":SpecialToken(id="__dan__", ids=[256041], tokens=["__dan__"])})

Affected models

Converter	Models
SeamlessM4TConverter	mosesdaudu/Dyula_French

Pattern #77 (1 model)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAhyIAgMzkAgC4PQAAeyIAgMzsAgC4BQAAiyIAgMw8AADNvAAAmwkAgJ4JAIChCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
-post_processor:		TemplateProcessing(single=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="</s>", type_id=0), SpecialToken(id="__fra__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__fra__":SpecialToken(id="__fra__", ids=[256057], tokens=["__fra__"])})
+post_processor:		TemplateProcessing(single=[SpecialToken(id="__eng__", type_id=0), Sequence(id=A, type_id=0), SpecialToken(id="</s>", type_id=0)], pair=[SpecialToken(id="__eng__", type_id=0), Sequence(id=A, type_id=0), Sequence(id=B, type_id=0), SpecialToken(id="</s>", type_id=0)], special_tokens={"</s>":SpecialToken(id="</s>", ids=[3], tokens=["</s>"]), "__eng__":SpecialToken(id="__eng__", ids=[256047], tokens=["__eng__"])})

Affected models

Converter	Models
SeamlessM4TConverter	Marialab/finetuned-seamless-m4T-medium-1000-step

Pattern #78 (1 model)

-normalizer:		Sequence(normalizers=[Replace(pattern=Regex("[\n\r\t]"), content=" "), NFKC(), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" +▁"), content="▁"), Replace(pattern=Regex("^▁+$"), content=""), ...])
+normalizer:		Sequence(normalizers=[Precompiled(precompiled_charsmap="ALQCAACEAAAAAACAAQAAgMz8AgC4BQAAjSIAgMzkAgC4PQAAgSIAgMzsAgC4BQAAkSIAgMw8AADNvAAAngkAgKEJAICkCQCAgx0A..."), Strip(strip_left=False, strip_right=True), Replace(pattern=Regex(" {2,}"), content="▁")])
-decoder:		Metaspace(replacement="▁", prepend_scheme=first, split=True)
+decoder:		Metaspace(replacement="▁", prepend_scheme=always, split=True)

Affected models

Converter	Models
SeamlessM4TConverter	panoyo9829/seamless-m4t-v2-large-fp16
