Australia shouldn't have to rent its AI future.
So we built Matilda.
An Australian-built AI assistant, running on our own fully sovereign cluster in Melbourne.
Public access is rolling out now.
Muon has been a recent obsession of ours, so we dug deeper. It changes the loss curve, but does it change what a model learns?
We trained matched AdamW and Muon GPT-2-class models, held validation loss fixed, and compared their SAE features by firing patterns over the same 1M
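A rough sketch of what comparing features by firing pattern could look like, not the method actually used here: run both SAEs over the same inputs, then pair each feature from one model with the feature from the other whose activations correlate most strongly. Pearson correlation as the similarity measure, the function name `match_features`, and the synthetic activations are all assumptions made for illustration.

```python
import numpy as np

def match_features(acts_a, acts_b, eps=1e-8):
    """For each feature (column) of acts_a, find the most correlated column of acts_b.

    Both arrays are (n_inputs, n_features) and must share the same rows,
    i.e. row k in each is the activation on the same input.
    """
    # Standardize each feature over the shared inputs.
    za = (acts_a - acts_a.mean(axis=0)) / (acts_a.std(axis=0) + eps)
    zb = (acts_b - acts_b.mean(axis=0)) / (acts_b.std(axis=0) + eps)
    # Pearson correlation for every feature pair: shape (n_feat_a, n_feat_b).
    corr = za.T @ zb / acts_a.shape[0]
    best = corr.argmax(axis=1)
    return best, corr[np.arange(corr.shape[0]), best]

# Synthetic stand-in data (hypothetical): sparse, non-negative activations.
rng = np.random.default_rng(0)
n_inputs, n_features = 2000, 16
acts_a = np.maximum(rng.normal(size=(n_inputs, n_features)) - 1.0, 0.0)

# "Model B" holds the same features in shuffled order, plus a little noise.
perm = rng.permutation(n_features)
acts_b = acts_a[:, perm] + 0.05 * rng.normal(size=(n_inputs, n_features))

best, score = match_features(acts_a, acts_b)
print(best)            # index in B of the best match for each feature in A
print(score.round(2))  # correlation of each matched pair
```

In this toy setup every feature in A has a near-copy in B, so the matching recovers the shuffle. On real models the interesting cases are the features whose best match scores low, since those are candidates for something one optimizer's model learned and the other did not.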