lately i explored how cpu-only inference works, various optimization techniques (not implemented), the tinyllama architecture, and brushed up on my c++
thankful to my goblin for helping with learning concepts and doing the plumbing (tests, gguf loading and such)
github.com/Graffioh/magi-…
also