Tonight, I am releasing eight Gemma fine tunes and a beta of their combined mixture of experts model named GemMoE.
GemMoE has ALL Gemma bug fixes built-in. You do not have to do anything extra to get great fine tunes/inference with it. It's a beast of a model.
This would not
Today was my last day at xAI.
I was in charge of keeping people from making unauthorized changes to the system prompt.
It sounds simple when I put it like that, but in practice, it was a game of cat and mouse. Some days, it felt like I was the only one standing between order
Today we're sharing the next phase of Reflection.
We're building frontier open intelligence accessible to all.
We've assembled an extraordinary AI team, built a frontier LLM training stack, and raised $2 billion.
Why Open Intelligence Matters
Technological and scientific
Today, we’re officially releasing the weights for AFM-4.5B and AFM-4.5B-Base on HuggingFace. This is a major milestone for @arcee_ai. AFM is designed to be flexible and high-performing across a wide range of deployment environments.
Our customers needed a better base model <10B parameters.
We spent the last 5 months building one.
I'm delighted to share a preview of our first Arcee Foundation Model: AFM-4.5B-Preview.
I'm excited to release a project I've been working on the last couple of weeks.
Qwen1.5-8x7b:
huggingface.co/Crystalcareai/…
And the accompanying dataset created with the intention of encouraging MoE models to organically develop their own experts:
huggingface.co/datasets/Cryst…
The purpose
Releasing INTELLECT-2: We’re open-sourcing the first 32B parameter model trained via globally distributed reinforcement learning:
• Detailed Technical Report
• INTELLECT-2 model checkpoint
primeintellect.ai/blog/intellect…
You can fake it pretty far in this industry just by saying, “Hrmm, that’s cool but I’m worried it won’t generalize,” whenever you’re presented with literally any information.
We’re going permissive: Apache 2.0 across the board. AFM-4.5B is now relicensed from Arcee to Apache 2.0; the agent variant will launch under Apache 2.0; and all upcoming releases ship with open weights. Three models are in training.