Rysana (@Rysana) / X

Rysana

42 posts

Rysana

@Rysana

We create the world's fastest and most powerful AI.

Joined June 2023

Rysana reposted
John
@jrysana
Jun 2
We've created an exciting (to me at least) new benchmark @Rysana that solves a lot of issues with most LLM benchmarks and is very strong at identifying human-level capability. Much better signal than the analogous popular ones we've tried. More to share on this soon, maybe.
1.9K
Rysana reposted
John
@jrysana
Feb 27, 2025
I feel a great disturbance in the force
312K
Rysana
@Rysana
May 6
and then all at once.
1.5K
Rysana
@Rysana
Mar 17, 2025
6.4K
Rysana
@Rysana
Aug 21, 2024
ix xxiv
00:00
12K
Rysana reposted
John
@jrysana
Mar 18, 2024
Introducing Inversion, our family of structured LLMs. Our first generation models excel in structured tasks, offering unmatched speed, latency, reliability, and efficiency, with the most comprehensive typed JSON output support available anywhere. rysana.com/inversion
00:00
213K
Rysana
@Rysana
Aug 7, 2024
Since this post, we've breached 100,000 t/s (per user) on optimized workloads and are targeting a peak of >1,000,000 t/s in production by EOY.
Rysana
@Rysana
Mar 20, 2024
Over 9000! tokens per second. Ultra-fast, always-valid typed JSON with LLMs. rysana.com/log/over-9000-…
12K
Rysana
@Rysana
Jul 26, 2024
Fastest LLM API in the world, end-to-end function calling in the blink of an eye
25K
Rysana
@Rysana
May 18, 2024
Not too long ago, electricity was mostly confined to lightning here and there. Last century, it became available and malleable for humans, first sparsely and now with great abundance all over. Intelligence is this century's electricity. Let's put it everywhere, in everything.
4.2K
Rysana reposted
John
@jrysana
May 16, 2024
inversion-sm is now, on average, 2.39x faster, 14.4% smarter, and 2.11x cheaper than when we announced the model, and completes the average request in well under 200ms. that's a +475% boost in intelligence flux since late March, and >1000% since the first checkpoint.
5.4K
Rysana reposted
John
@jrysana
Apr 20, 2024
in the past month: ~4.5× faster inversion compiler ~100× faster sampling ~2× faster runtime overhead ~12% faster overall speed, same models ~6× faster query parser ~95% less internal networking
22K
Rysana
@Rysana
Apr 4, 2024
Applied general intelligence flux (IQ*Hz/$) will increase by over 1,000,000,000x by the end of this decade.
2.7K
Rysana reposted
John
@jrysana
Mar 23, 2024
Today, we've optimized the Inversion compiler's runtime overhead again, pushing down below ~9 microseconds on even the most complex structures. This work builds out the support for up to 100,000 tokens/second inference with perfect structured output.
7.5K