You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
run GPT-OSS 20B and Qwen3 30B A3B with intel arrow lake igpu on vulkan and sycl backend ,the best decode speed can be 20 token per second, but the prefill also only 20+ token/s, I have no idea about it
run GPT-OSS 20B and Qwen3 30B A3B with intel arrow lake igpu on vulkan and sycl backend ,the best decode speed can be 20 token per second, but the prefill also only 20+ token/s, I have no idea about it