Sign in to view Brian’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Brian’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Boston, Massachusetts, United States
Sign in to view Brian’s full profile
Brian can introduce you to 10+ people at Red Hat
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
13K followers
500+ connections
Sign in to view Brian’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Brian
Brian can introduce you to 10+ people at Red Hat
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Brian
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Brian’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
About
Welcome back
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
New to LinkedIn? Join now
Activity
13K followers
-
Brian Stevens shared thisOpen source is the foundation for compute from cloud to edge, and at Red Hat we have been leading the charge to make that true for production inference at scale as well. To understand the economic benefits, we commissioned Forrester Consulting to analyze the numbers, and our new TEI study shows a 233% ROI with Red Hat AI. Read the full study to see how standardizing your platform layer can drive real value: https://red.ht/4up4kdK
-
Brian Stevens reposted thisIt was a great pleasure to host CNBC at our Red Hat AI booth during the India AI Impact Summit this week in Delhi, and to share what we have been building together with customers, partners and the open source community. The momentum has been building as we continue to deliver a high performance AI platform that promises to deliver any model, using any accelerator, and running on any cloud. Daniel Aw Fytos Charalambides Navtez Bal Paul Whittard Garry Gray Philip Yeap Abhishek Shukla Mukesh Mehta Ausim Khan Mangesh Surve Misha Joshi Arpita Sengupta Ravi Goyal Joel Jackson Joe Fernandes Brian Stevens Vincent Caldeira James Lovegrove Tushar Katarki Jeff DeMoss Erwan Granger Karl Eklund Jeff Winn Vijay Chebolu Prasad Mukhedkar Tarun Ghai Kiran Challapalli abhishek vijra Elisa NavarroBrian Stevens reposted thisTHREE LEADERS. ONE BOOTH. THE REAL TALK ON AI IN INDIA. From the Red Hat booth at Bharat Mandapam during the India AI Impact Summit 2026, CNBC-TV18's Global AI Lens brings you insights from the team powering enterprise AI across India and Asia-Pacific. Steve Shirkey, Director, APAC AI Platform Misha Joshi, Senior Director, Head of Services (India/South Asia) and Ausim Khan, Director, Partner Ecosystem (India & South Asia) share Red Hat's vision for AI in India. Platform. Services. Ecosystem. The full stack of AI transformation—decoded. Global AI Lens | India AI Impact Summit 2026 HCLTech Red Hat d-Matrix Sid Sheth Dr. Ravi Gupta Pradip Thaker #GlobalAILens #IndiaAI #AIImpactSummit #CNBCTV18 #RedHat #BharatMandapam #EnterpriseAI
-
Brian Stevens reposted thisBrian Stevens reposted thisI’m so happy to share with you that this weekend (November 1st) there will be a great vLLM meetup in Beijing! We (Red Hat) together with our partners and friends in the community, such as the Ant Group, AMD, ByteDance and MetaX, co-host this event and we invite the most seasoned people in the community such as Michael Goin - the lead vLLM maintainer to give us a general update and llm-d’s latest features, and many others to share insightful topics - quite a few around distributed inference solutions and different accelerator support ways. So if you happen to be in Beijing this weekend, welcome to join in person! You can either drop me a message or register through “vLLM community” WeChat account! The meetup also has live-streaming, we will provide the live-streaming link shortly so please stay tuned;) Brian Stevens Brent Holden Vincent Caldeira Michael Goin Saša Zelenović Grant Shipley Tushar Katarki Christopher Nuland Jeff Winn Steve Shirkey Li Ming Tsai Andreas Spanner Chris Butler Vinod Pathangay Ken Komazawa Lisa Li
-
Brian Stevens shared thisThe pace of advancements in AI is stunning, and innovations in open source are driving the industry’s emerging capabilities in inference platforms, agentic architectures, and data integration. While that alone is a reason to celebrate, it's hard to keep up and a day doesn’t go by where I’m not investing time to learn something new. At Red Hat we decided to turn that inside out, and share a day-of-learning with like-minded colleagues. We’ve curated a virtual event of 12 session across 4 tracks: Inference and Optimization – Optimal deployment with vLLM, LLM Compressor, and Speculators. Model Customization – Connecting models to enterprise data Agentic AI – Building open and flexible agents with MCP, LlamaStack, and more. Scaling Across Hybrid Cloud – Learn distributed inference and scaling strategies with OpenShift AI. Register here: https://lnkd.in/e_UZ5qAG, and happy learning! Red Hat AI Day of Learning: Your Path to Enterprise-Ready AI October 16, 2025 | 10:00 a.m. - 11:30 a.m ESTRed Hat AI Day of Learning: Your Path to Enterprise-Ready AIRed Hat AI Day of Learning: Your Path to Enterprise-Ready AI
-
Brian Stevens reposted thisBrian Stevens reposted thisRed Hat AI Day of Learning: Your Path to Enterprise-Ready AI October 16, 2025 | Starts at 10:00 a.m. EST | Virtual Choose your own AI learning journey with 12 breakout sessions across 4 tracks: 1️⃣ Increase Efficiency with Fast, Flexible, and Efficient Inferencing 2️⃣ Simplified and Consistent Experience for Connecting Models to Data 3️⃣ Accelerate Agentic AI Delivery and Stay at the Forefront of Innovation 4️⃣ Flexibility and Consistency When Scaling AI Across the Hybrid Cloud Developers, engineers, and technical practitioners can mix and match sessions to focus on the topics that matter most and leave with practical skills to apply right away. Register here: https://lnkd.in/eCSDvYVyRed Hat AI Day of Learning: Your Path to Enterprise-Ready AIRed Hat AI Day of Learning: Your Path to Enterprise-Ready AI
-
Brian Stevens shared thisEldar Kurtić is amazing. Proud to be on his team.Brian Stevens shared thisWhen a brilliant mind from Bosnia and Herzegovina leads innovation at a global powerhouse like Red Hat, it’s impossible not to take notice. We’re proud to introduce another strong addition to our Engineering Stage this September — Eldar Kurtić, whose work pushes the boundaries of AI efficiency and deployment. ↳ Eldar Kurtić is a Senior Researcher at Red Hat and Institute of Science and Technology Austria, specializing in efficient inference techniques for large language models (LLMs), with a particular focus on sparsity and quantization. His work centers on developing methods to accelerate inference within the vLLM engine, bridging cutting-edge research with practical deployment solutions. At Kiss the Future AI Summit, Eldar will lead a hands-on workshop titled: “Beginner-Friendly Introduction to LLM Quantization: From Zero to Hero” He will cover: ↳ What quantization is — and why it matters ↳ How quantization fits into the architecture of LLMs ↳ Today’s leading quantization techniques for deployment ↳ How to quantize your own models ↳ Accuracy trade-offs and tuning for optimal performance ↳ Real-world inference cost and performance implications Whether you’re an ML engineer, AI researcher, or just getting started, this is your gateway to mastering LLM quantization. 🎟️ Did you know you can get a ticket just for the Engineering Stage? Head to Entrio now and secure your spot! The End of Hype. The Start of Impact. | 25th and 26th September #KissTheFutureAISummit #KisstheFuture #AISummitSarajevo2025 #EldarKurtic #RedHat
-
Brian Stevens reposted thisMore than happy to be part of this story! Thank you Blum Institut for the invitation.Brian Stevens reposted thisWhen a brilliant mind from Bosnia and Herzegovina leads innovation at a global powerhouse like Red Hat, it’s impossible not to take notice. We’re proud to introduce another strong addition to our Engineering Stage this September — Eldar Kurtić, whose work pushes the boundaries of AI efficiency and deployment. ↳ Eldar Kurtić is a Senior Researcher at Red Hat and Institute of Science and Technology Austria, specializing in efficient inference techniques for large language models (LLMs), with a particular focus on sparsity and quantization. His work centers on developing methods to accelerate inference within the vLLM engine, bridging cutting-edge research with practical deployment solutions. At Kiss the Future AI Summit, Eldar will lead a hands-on workshop titled: “Beginner-Friendly Introduction to LLM Quantization: From Zero to Hero” He will cover: ↳ What quantization is — and why it matters ↳ How quantization fits into the architecture of LLMs ↳ Today’s leading quantization techniques for deployment ↳ How to quantize your own models ↳ Accuracy trade-offs and tuning for optimal performance ↳ Real-world inference cost and performance implications Whether you’re an ML engineer, AI researcher, or just getting started, this is your gateway to mastering LLM quantization. 🎟️ Did you know you can get a ticket just for the Engineering Stage? Head to Entrio now and secure your spot! The End of Hype. The Start of Impact. | 25th and 26th September #KissTheFutureAISummit #KisstheFuture #AISummitSarajevo2025 #EldarKurtic #RedHat
-
-
Brian Stevens shared thisPersonally or professionally, every conversation I have w/ Chris Wright is always fun. This one was no different where we dove into vLLM and all things AI+Inference. Cheers.Brian Stevens shared thisHow do we take AI from research labs to robust, scalable enterprise production? Red Hat CTO Chris Wright and Red Hat AI CTO Brian Stevens dive deep into production-quality inference, the role of open source projects like vLLM, and the journey to practical enterprise AI. They discuss parallels with Linux's early days and the community effort needed to build the future AI stack. A must-listen for tech leaders navigating AI: https://red.ht/43ZcQWo.
-
Brian Stevens liked thisLooking forward to catching up with everyone and returning to the Engineering Stage in Sarajevo this October! Thanks Blum Institut for the invitation!Brian Stevens liked thisBosnia and Herzegovina on stage, and we wouldn't have it any other way. Those of you who caught Eldar's session last year know exactly why we made sure to bring him back. Meet Eldar Kurtić, Principal Research Scientist at Red Hat and the Institute of Science and Technology Austria, working at the intersection of AI research and real-world deployment. Eldar's work centers on making large language models faster and leaner, through sparsity and quantization techniques that push the boundaries of what efficient inference actually looks like. He's actively developing methods to accelerate inference within the vLLM engine, which means his research doesn't just live in papers, it ships. This year, he's bringing a topic that's quickly become one of the hottest conversations in the field: Speculative Decoding. Whether you're just getting started or already deep in the weeds, Eldar's talk promises to take you from zero to hero. Join Eldar on Kiss the Future Engineering stage on October 15-16th in Sarajevo. Grab your tickets via Entrio: https://lnkd.in/ddTMUAf4 Eldar, welcome back to Kiss the Future. We're so glad to have you with us again. #KissTheFuture #KTF2026 #AISummit #SpeculativeDecoding #RedHat
-
Brian Stevens liked thisBrian Stevens liked thisI had the pleasure of participating in a few great sessions with my colleagues at the ScotiaTech town hall last week. It was such a pleasure to see everyone come together to celebrate our progress and highlight our plans for the future of this extremely capable team. As you can imagine, a big topic of conversation was around AI – which is especially timely given our launch this week of several new Scotia Intelligence capabilities. Along with my colleagues Joe Martinez and Neha Mudalgikar, we discussed how engineering excellence and a strong security mindset need to coexist for us to continue shaping the safe adoption of this transformative technology across the bank. I then joined Cathy K., Jonathan Echeverria and Sebastian Blandizzi for a very energetic discussion about what’s really driving this success: our people. The team has generated great momentum over the last six months, which can all be attributed to the passion and talent of the amazing team that are all a part of #ScotiaTech. A fantastic kick off to the summer! #ScotiaTech #Scotiapride
-
Brian Stevens liked thisBrian Stevens liked this🎉 llm-d v0.7 is officially live! If our earlier releases proved what llm-d could do, v0.7 is about making sure you can easily deploy and run it in production. Backed by a 3.5x surge in community PR volume, this release focuses entirely on production hardening, eliminating operational friction, and expanding hardware reach. Here is the quick technical breakdown of what’s new: ⚙️ Streamlined Day-1 Ops: Clone to serving in minutes using the new Standalone Mode (Envoy default), plus a complete migration to Kustomize-first deployment pipelines. 🔌 Blackwell & Multi-Hardware: Upgraded to CUDA 13 for native NVIDIA Blackwell support, alongside validated production images for AMD ROCm, Intel XPUs, Google TPUs, and Rebellions ATOM. 🧠 Workload-Aware Routing: Introduced experimental Flow Control to eliminate noisy-neighbor issues, and an OpenAI-compatible Batch Gateway for heavy offline workloads. 💾 Tiered KV Caching: Real-time prefix cache tracking is now enabled by default, paired with seamless cache offloading from GPU HBM to CPU and persistent storage (AWS EFS/NVMe). We’ve also added 10,000+ lines of brand-new documentation and an overhauled, multi-platform CI matrix to ensure what we guide is exactly what you deploy. A massive thank you to our 23 new contributors and hardware ecosystem partners for making this milestone happen. Read the full architectural breakdown on our blog: 👇 https://lnkd.in/eHvUECVQ #AIInfrastructure #LLMInference #OpenSource #Kubernetes #PlatformEngineeringllm-d v0.7: From Feature Introduction to Production Hardening | llm-dllm-d v0.7: From Feature Introduction to Production Hardening | llm-d
-
Brian Stevens liked thisAmazing timing to have the Red Hat global team in Ottawa for the launch of the Government of Canada AI for All strategy! Especially when the global SVP and Head of AI Brian Stevens was in town with Public Sector CTO, John Dvorak . A great opportunity to talk about automation, AI, sovereign datacentres, IP, open source and acceleration hubs! Corey Somers Jason Barton Juan Berlie Roch Cousineau Paul Pinkney Kenneth Canam Christian Roy Louise Girouard Melissa Cable-Cibula Marci Surkes Compass Rose
-
Brian Stevens liked thisBrian Stevens liked thisI recently co-authored a blog with Erwan Gallen and Chris Procter on using #rebellions NPU on Red Hat AI Enterprise based on our recent joint solution announcement. Together, Red Hat AI and Rebellions ATOM NPUs help advance an open AI ecosystem where customers can run the models they want on the accelerators that best fit their needs, while delivering strong performance and energy efficiency. Many thanks to the various teams for making this happen: 1. Red Hat AI leadership (Brian Stevens Joe Fernandes Tushar Katarki) 2. Red Hat AI Inferencing Engineering team (Taneem Ibrahim Selbi Nuryyeva Nicolò Lucchesi Daniele Trifirò Michael Goin) 3. Red Hat Ecosystem Engineering (Nenad Perić Pablo Iranzo Gómez Chris Procter) 4. Our Inference PM (Erwan Gallen) 5. Rebellions' team (Minwook Ahn jinmoo Seok) Link: https://lnkd.in/gXftR5Xy Steve Shirkey Steven Huels Daniel Aw Ameeta Roy Vincent Caldeira Itamar Heim Hong-Seok Kim #redhat #rebellions #vllm #inference #kubernetes #redhatai
-
Brian Stevens reacted on thisBrian Stevens reacted on thisKubernetes didn't win because it was the first container orchestrator. It won because it became the open standard everyone could build on. AI inference needs the same moment. LLM workloads are stateful, latency-sensitive, and wildly variable in cost. Standard service routing wasn't built for this. That gap is exactly what llm-d addresses. By contributing llm-d to the CNCF, Red Hat, alongside CoreWeave, IBM, Google, and NVIDIA, is making a long-term bet: that the future of enterprise AI runs on open standards, not proprietary lock-in. Read more: https://red.ht/4tK5n7d #OpenSource #CNCF #AIInference #CloudNative #RedHat #llmd #EnterpriseAI
-
Brian Stevens liked thisBrian Stevens liked thisWhere open-source innovation meets enterprise scale! Had a great time at Red Hat Summit 2026 in Atlanta. !AMD and Red Hat are advancing the next wave of AI and hybrid cloud—delivering performance, flexibility, and choice for customers. Energizing conversations across the ecosystem—and always a pleasure catching up with my good friend Brian Stevens!
-
Brian Stevens liked thisBrian Stevens liked thisBuilding a durable AI ecosystem through open innovation - SiliconANGLEBuilding a durable AI ecosystem through open innovation - SiliconANGLE
Experience & Education
-
Red Hat
*** *** ** ***
-
*******
***** ********* ****** ** ***** *********
-
*** ******
***** ********
-
********** *********** *********
** ******** ******* undefined
-
********** ** *** *********
** ******** *******
View Brian’s full experience
See their title, tenure and more.
Welcome back
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
New to LinkedIn? Join now
or
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View Brian’s full profile
-
See who you know in common
-
Get introduced
-
Contact Brian directly
Other similar profiles
-
Jocelyn Goldfein
Jocelyn Goldfein
Zetta Venture Partners is named after the zettabyte (a trillion gigabytes!) Founded in 2013, we were the first AI-focused fund and we've been backing AI and infrastructure entrepreneurs since long before it was cool.<br><br>We are keenly interested in cloud and data infrastructure (like MotherDuck or Domo), tools and platforms for developers and data scientists building with AI (like Kaggle, Domino, Weaviate or Fixie) and applications powered by ML (like Tractable, Lilt, Skan, and too many more to name). <br><br>We lead or co-lead $1-5M rounds for pre-product-market-fit startups with B2B business models. We believe in verticals like financial services, insure tech, life sciences, health care, sustainability, cloud infra, devops, cybersecurity, manufacturing, supply chain and logistics. <br><br>More about us: https://zettavp.com<br><br>Before venture, I spent my career as an engineer and engineering leader. I led engineering teams in the high growth early years of VMware and Facebook (as well as a few startups including one of my own). I've worked across systems from OS'es to LAMP and mobile apps, to shrink-wrapped native software to developer tools and of course ML. The one "constant" has been high-growth and industry transformation. I'm passionate about scaling products, teams, and companies, and I care deeply about STEM education.
7K followersLos Altos, CA -
Chris Wright
Chris Wright
Experienced technology and strategy leader with a passion for open source software. Collaboration and continuous improvement are best tools for change.
16K followersGreater Boston -
Steven Sinofsky
Steven Sinofsky
Hardcore Software: Inside the Rise and Fall of the PC Revolution
647K followersSioux Falls, SD
Explore more posts
-
Preeti Somal
4K followers
Maxim Fateev's blog is a must read ... It's direct talk from his experience. "After spending over two decades building orchestration solutions. I’ve seen the patterns, I’ve tried the different approaches, and I’ve watched the industry rediscover the same lessons over and over" So many parallels to why Infrastructure as Code won over the gazillions of visual infra layout tools ... https://lnkd.in/gB_G3ijA Temporal Technologies
49
2 Comments -
Prashant Kelker
I am a global consulting… • 10K followers
Google took a deep dive into the environmental impact of AI inference across its entire system and discovered that a typical Gemini Apps prompt in May 2025 only used 0.24 Wh of energy, 0.03 grams of CO₂e, and 0.26 milliliters of water. Over the past year, efficiency improvements, clean energy sources, and advancements in hardware have significantly reduced energy consumption by 33 times and emissions by 44 times. This is ground breaking. Maybe we were all too early to cry foul on the environmental impact of AI. Kudos, Google! https://lnkd.in/evnvvhxJ
22
1 Comment -
Shlomi Ben Haim
JFrog • 15K followers
According to Gil Luria, head of technology research at investment bank DA Davidson, these companies benefit from the growing demand for #AI #software tools, offering essential services for expanding AI capabilities. https://lnkd.in/gUgiQVHU Snowflake Datadog JFrog
94
-
David Ramel
1105 Media Inc. • 401 followers
AWS just refunded $250K in data egress fees after a high-profile dispute--highlighting growing pressure around cloud repatriation and data mobility costs. This article looks at what happened and how it fits into the broader cloud pricing debate. Read the full story:
-
Shanker Iyer
Plainsight • 195 followers
MCP servers are quickly becoming a practical way to connect LLMs to real production systems. In this demo, software developers Shalin Mehta and Samuel Wilt, walk through how we’re using MCP at Plainsight to automate deployment for real-time video AI pipelines. Agents handle the full lifecycle: detecting new pipelines, creating Kubernetes runs, monitoring readiness, and confirming that inference is actually running in production. The walkthrough by Shalin Mehta and Sam Wilt covers: • MCP as a control plane for video AI • Automated deployment and verification • Kubernetes orchestration with feedback loops • LLM agents interacting with live infrastructure If you’re interested in what MCP looks like beyond toy examples, watch the full walkthrough. #MCP #VideoAI #Kubernetes #LLMs #AIInfrastructure How MCP Powers Agentic Workflows for Production Vision Applications https://lnkd.in/gRrR4_md
14
-
Beth Pariseau
Informa TechTarget • 6K followers
IBM's $11 billion deal for Confluent is mainly focused on #datamanagement for #AI, but also raises possibilities for its Red Hat and HashiCorp portfolios, and for some, uneasy echoes of past worries about #opensource. My writeup on #observability and #ITops implications for the #IBM - #Confluent acquisition, featuring reaction and analysis from Torsten Volk, Rob Strechay, Steven Dickens and more below. #enterprisetech #enterpriseIT #kafka #OSS
19
3 Comments
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top contentOthers named Brian Stevens
-
Brian S.
Greater Chicago Area -
Brian Stevens
Knoxville, TN -
Brian Stevens
Toronto, ON -
Brian Stevens
Austin, TX
841 others named Brian Stevens are on LinkedIn
See others named Brian Stevens