BLACKBOX AI is now a @Microsoft Official Partner.
Enterprises can purchase @blackboxai licenses directly through the Microsoft Marketplace.
Built for teams that ship fast and stay secure.
Weβre live at London Tech Week π¬π§
See agentic inference live at the BLACKBOX AI booth.
Donβt miss Richard on Tuesday 9th, 12:30 at the Core Stage:
βThe Secure Orchestration Layer for the Agentic Enterprise.β
Meet us there.
Introducing NVIDIA Nemotron 3 Ultra.
A frontier smart open model built for long-running agents that need to plan, reason, use tools and keep working across complex coding, research and enterprise workflows.
Up to 5x faster inference and up to 30% lower cost for agentic tasks.
We partnered with @nvidia as their single flagship provider and optimized Nemotron 3 Ultra to the highest inference speed served today at 420 output tokens/sec
Nemotron 3 Ultra was launched today, including a focus on low latency agentic performance. We tested it against peers under restricted turn-usage limits on Terminal-Bench v2.1 - @nvidia Nemotron 3 Ultra completes tasks at a much faster pace than peers due to its high inference
Nemotron 3 Ultra was launched today, including a focus on low latency agentic performance. We tested it against peers under restricted turn-usage limits on Terminal-Bench v2.1 - @nvidia Nemotron 3 Ultra completes tasks at a much faster pace than peers due to its high inference
420.2 tok/s on a 550B model. β‘οΈ
Nemotron-3-Ultra-550B-A55B reaches 420.2 tok/s powered by BLACKBOX AI Inference Engine.
Blackbox now delivers the fastest inference in the industry, outperforming every other provider, including on smaller-parameter models.
Check our blog in
Open source needed a comeback.
Agents needed real speed.
The world needed an American answer.
Day 0. @nvidia Nemotron Ultra is live on Blackbox at 420+ tok/s.
$0.37 in. $1.08 out. Per million tokens.
550B parameters. Faster than models a fraction of its size.
The fastest
Today we're shipping Nemotron 3 Ultra.
A 550B MoE frontier-intelligence open model built for long-running agents.
It delivers 5x faster inference and lowers the cost of complex agentic tasks by up to 30% versus other open frontier models.
420.2 tok/s on a 550B model. β‘οΈ
Nemotron-3-Ultra-550B-A55B reaches 420.2 tok/s powered by BLACKBOX AI Inference Engine.
Blackbox now delivers the fastest inference in the industry, outperforming every other provider, including on smaller-parameter models.
Check our blog in