Checklist
Motivation
If a smaller model has the exact same architecture as the one a test currently uses, can we switch to the smaller model to speed up CI? Some tests currently take a long time.
A smaller model would also let us increase the batch size for many accuracy-related tests.
E.g., Gemma 27B -> 1B
If we do this, we may also have to adjust the accuracy-related tests.
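
A minimal sketch of how a test could opt into the smaller checkpoint, assuming a per-model mapping and an environment switch. The model IDs, the `USE_SMALL_CI_MODELS` variable, and the `pick_test_model` helper are all hypothetical names for illustration, not anything that exists in the repo today:

```python
import os

# Hypothetical mapping from a full-size checkpoint to a smaller checkpoint
# of the same architecture. The IDs are placeholders, not real model names.
SMALL_MODEL_FOR_CI = {
    "example-org/model-27b": "example-org/model-1b",
}


def pick_test_model(full_model, env=None):
    """Return the smaller CI model if the switch is on and a mapping exists."""
    env = os.environ if env is None else env
    # USE_SMALL_CI_MODELS is an assumed opt-in flag, not an existing setting.
    if env.get("USE_SMALL_CI_MODELS") == "1":
        # Fall back to the full-size model when no smaller variant is listed.
        return SMALL_MODEL_FOR_CI.get(full_model, full_model)
    return full_model
```

A test would call `pick_test_model("example-org/model-27b")` instead of hard-coding the model name. Accuracy tests would still need their own expected scores for the smaller model, since a 1B model will not match the 27B numbers.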
Related resources
No response