One of the first, if not the first, large-scale LLMs to run on multiple commercial-grade NVIDIA RTX GPUs. Nice work, @Theta_Network and @alibaba_cloud teams.
Qwen3 32B by Alibaba is now live on Theta EdgeCloud as a decentralized, on-demand inference API: a large-scale LLM served across community GPU nodes using pipeline parallelism over the internet. 🧵