Google Cloud Managed Lustre

High performance parallel file system

Accelerate HPC and AI training and serving with Google's highest performance, POSIX-compliant, parallel file system.

Features

Accelerate AI

To meet the demands of next-generation AI, we are increasing the scale by 10x. By boosting Managed Lustre performance to 10 TB/s, we’ve cleared the path for massive AI/ML workloads, accelerating both training and inference pipelines significantly.

Dynamic tier

To maximize flexibility, our new dynamic option delivers peak performance for your most critical data while reducing total costs across your entire dataset. By consolidating into a single SKU, we provide predictable billing with the exact cost known upfront. Because all data resides within Lustre, you achieve ultra-low latency for hot data, and block-like latency for cold data. This will help eliminate bottlenecks in model loading, training, and high-frequency checkpointing when you bring your entire dataset to Managed Lustre.

Maximize GPU saturation and compute ROI

The true cost of AI isn't just storage—it's idle compute. Managed Lustre delivers the high-throughput, low-latency data delivery required to keep your most expensive assets fully saturated. By optimizing data distribution and accelerating rapid checkpointing, you dramatically improve accelerator use, leading to reduced overhead and superior performance per dollar.

Performance and scale for AI/ML workloads

Training large deep learning models requires massive datasets. Managed Lustre, based on DDN EXAScaler, distributes data access, reducing training times, enabling faster insights, better accuracy, and handling of complex AI projects. Its scalability is designed to performance keeps pace with growing data, preventing storage bottlenecks. Watch Omdia overview on Managed Lustre here.

Discover how Managed Lustre can help your organization store more data to support your AI projects.

Accelerate AI inference with KV cache

Agentic AI continues to drive large context windows and this may pose a storage challenge to give you with a responsive experience as they interact with Large Language Models. Large context windows increase latency sensitivity as the local memory on the accelerators is often exhausted, requiring the model to access external storage.

Read the blog: Reducing TCO for AI inferencing with external KV cache on Managed Lustre.

Powering innovation across industries

Industries
AI and ML ^{Eliminate data starvation and train foundation models at massive scale. By decoupling compute and storage, Managed Lustre boosts LLM inference throughput, with sub-millisecond external KV caching, keeping your most expensive accelerators fully saturated.} ^Explore^AI^{at Google Cloud.}
Healthcare and life sciences ^{Propel groundbreaking innovation toward new cures. Deliver the extreme IOPS required to accelerate drug discovery, analyze complex genomics sequencing, and power AI-driven medical imaging suites, drastically reducing time-to-insight for researchers and clinicians.} ^Explore^{healthcare and life sciences}^{at Google Cloud.}
Machine vision, robotics, and autonomous vehicles _{Accelerate the software-defined vehicle pipeline. Empower innovators to effortlessly ingest petabyte-scale sensor telemetry, while supercharging research and development engineering with zero-latency storage for aerodynamics, safety, and, thermal optimization simulations.} ^Explore^automotive^{at Google Cloud.}
Capital markets _{Execute workloads that demand sub-millisecond precision. Whether running intricate quantitative risk analyses and real-time market simulations for financial services, accelerating high-resolution VFX rendering and, post-production workflows for global media studios.} ^Explore^{capital markets}^and^{financial services}^{at Google Cloud.}
Media and entertainment _{Meet production deadlines with zero-compromise storage performance. Give your studios with the ultra-high throughput necessary for seamless high-resolution video editing, real-time VFX rendering, and, accelerated post-production workflows.} ^Explore^{media and entertainment}^{at Google Cloud.}

AI and ML

^{Eliminate data starvation and train foundation models at massive scale. By decoupling compute and storage, Managed Lustre boosts LLM inference throughput, with sub-millisecond external KV caching, keeping your most expensive accelerators fully saturated.}

^Explore^AI^{at Google Cloud.}

Healthcare and life sciences

^{Propel groundbreaking innovation toward new cures. Deliver the extreme IOPS required to accelerate drug discovery, analyze complex genomics sequencing, and power AI-driven medical imaging suites, drastically reducing time-to-insight for researchers and clinicians.}

^Explore^{healthcare and life sciences}^{at Google Cloud.}

Machine vision, robotics, and autonomous vehicles

_{Accelerate the software-defined vehicle pipeline. Empower innovators to effortlessly ingest petabyte-scale sensor telemetry, while supercharging research and development engineering with zero-latency storage for aerodynamics, safety, and, thermal optimization simulations.}

^Explore^automotive^{at Google Cloud.}

Capital markets

_{Execute workloads that demand sub-millisecond precision. Whether running intricate quantitative risk analyses and real-time market simulations for financial services, accelerating high-resolution VFX rendering and, post-production workflows for global media studios.}

^Explore^{capital markets}^and^{financial services}^{at Google Cloud.}

Media and entertainment

_{Meet production deadlines with zero-compromise storage performance. Give your studios with the ultra-high throughput necessary for seamless high-resolution video editing, real-time VFX rendering, and, accelerated post-production workflows.}

^Explore^{media and entertainment}^{at Google Cloud.}

How It Works

High-performance storage for AI. Instantly provision and scale cloud HPC on demand with Google Cloud Managed Lustre, powered by DDN EXAScaler.

Common Uses

Get started

Create Managed Lustre instance: Deploy your Managed Lustre instance with just a few clicks.
Ingest and Connect: Easily hydrate your Managed Lustre file system with data directly from Google Cloud Storage.
Process at Scale: Feed massive datasets into Vertex Training Clusters (VTC) or GKE with ultra-low latency.
Accelerate: Give high-throughput data directly to next-generation hardware, ensuring GPUs run at maximum use.

Tutorials, quickstarts, & labs

Create Managed Lustre instance: Deploy your Managed Lustre instance with just a few clicks.
Ingest and Connect: Easily hydrate your Managed Lustre file system with data directly from Google Cloud Storage.
Process at Scale: Feed massive datasets into Vertex Training Clusters (VTC) or GKE with ultra-low latency.
Accelerate: Give high-throughput data directly to next-generation hardware, ensuring GPUs run at maximum use.

Pricing

Managed Lustre pricing	Pricing for Managed Lustre is primarily based on location and service level.
Service level	Pricing
1,000 MB/s/TiB _{Best for high-performance workloads want AI/ML training where throughput is critical.}	Starting at $0.60 per GiB per month
500 MB/s/TiB _{Best for high-performance balance: Excellent for demanding AI/ML workloads, complex HPC applications, and data-intensive analytics that require substantial throughput but may benefit from a more balanced price-to-performance ratio.}	Starting at $0.34 per GiB per month
250 MB/s/TiB _{Best for general purpose HPC and throughput-intensive AI: Suitable for a broad range of HPC workloads, AI/ML inference, data preprocessing, and applications that require significantly better performance than traditional NFS, at a cost-effective price point.}	Starting at $0.21 per GiB per month
125 MB/s/TiB _{Best for capacity-focused workloads with parallel access requires: Designed for scenarios where large capacities and parallel file system access are key. Good for less I/O-bound parallel tasks.}	Starting at $0.145 per GiB per month
25 MB/s/TiB - Dynamic _{Dynamic performance with block-level caching for hot data. Best for cacheable workloads, where reads and writes are concentrated on a subset of a larger data corpus. Provides a unified namespace for hot and cold data.}	Starting at $0.06 per GiB per month

Explore Google Cloud pricing. View all pricing details.

Managed Lustre pricing

Pricing for Managed Lustre is primarily based on location and service level.

1,000 MB/s/TiB

_{Best for high-performance workloads want AI/ML training where throughput is critical.}

Pricing

Starting at $0.60 per GiB per month

500 MB/s/TiB

_{Best for high-performance balance: Excellent for demanding AI/ML workloads, complex HPC applications, and data-intensive analytics that require substantial throughput but may benefit from a more balanced price-to-performance ratio.}

Pricing

Starting at $0.34 per GiB per month

250 MB/s/TiB

_{Best for general purpose HPC and throughput-intensive AI: Suitable for a broad range of HPC workloads, AI/ML inference, data preprocessing, and applications that require significantly better performance than traditional NFS, at a cost-effective price point.}

Pricing

Starting at $0.21 per GiB per month

125 MB/s/TiB

_{Best for capacity-focused workloads with parallel access requires: Designed for scenarios where large capacities and parallel file system access are key. Good for less I/O-bound parallel tasks.}

Pricing

Starting at $0.145 per GiB per month

25 MB/s/TiB - Dynamic

_{Dynamic performance with block-level caching for hot data. Best for cacheable workloads, where reads and writes are concentrated on a subset of a larger data corpus. Provides a unified namespace for hot and cold data.}

Pricing

Starting at $0.06 per GiB per month

Explore Google Cloud pricing. View all pricing details.

Pricing calculator

Estimate your monthly costs for Google Cloud products.

Custom quote

Connect with our sales team to get a custom quote for your organization.

Start your proof of concept

Get started with Managed Lustre

Dig into the technical details

Explore Managed Lustre on Google Cloud

Start building your AI application with Vertex AI

Explore AI Hypercomputer, Google's integrated supercomputing architecture

Business Case

Hear from our Managed Lustre customers

“Our ability to help companies identify and block deepfake audio, video, and images is only as good as our models. Managed Lustre is critical to our successful model training with our dynamic datasets. We can fully saturate our GPUs and it's 6x faster than the other storage solutions we evaluated.”

Watch their success story here.

^{—Zohaib Ahmed, CEO Resemble AI}

“Managed Lustre enables us to scale AI model training for AFEELA Intelligent Drive by 3x compared to other Google Cloud solutions.”

^{—Motoi Kataoka, Senior Manager, AI and Data Analytics Platform, Sony Honda Mobility Inc.}

"By integrating Managed Lustre with VTC (vertex training clusters), Salesforce AI Research eliminated the typical onboarding bottlenecks, allowing us to hit the ground running with the inferencing workload. This high-throughput low-latency storage keeps our B200 GPUs fully saturated, driving a substantial performance gain in Large Language Models inference over the H200. For our customers, this translates directly into faster, more responsive AI agents that may handle complex reasoning at a fraction of the previous latency."

^{—Lavanya Karanam, Principal Software Engineer, Salesforce}

“Moving to Google Cloud has fundamentally changed the pace of my research. My work involves large-scale neural network training against massive datasets — including the full Common Corpus, which the cluster team uploaded and made directly cluster-accessible for me. The full dataset simply would not have been achievable with any infrastructure I had practical access to otherwise. With managed Lustre, ingestion previously bottlenecking my pipeline now completes in seconds, my GPUs stay consistently utilized, and far less time is spent waiting on queue. The result is measurably faster time-to-insight across every experiment I run."

^{—Christopher J. Lynch, Ph.D., Research Assistant Professor, Virginia Modeling, Analysis, & Simulation Center (VMASC), Old Dominion University}

"Managed Lustre has eliminated at least 50% of the interruptions we experience when running training experiments for mathematical reasoning models, allowing us to run twice as many experiments. We’ve integrated the service as a regional cache for "hot" checkpoints and achieved faster, more reliable, and more convenient startup and checkpoint persistence. In our workflow, training jobs write checkpoints that subsequent inference or new training jobs consume in an offline system, resulting in up to a 15x increase in data retrieval speed and a 50+% decrease in startup time. The ability to use Lustre as a reliable mounted filesystem with solid performance out of the box has enabled our research team to be more self-sufficient in experimentation with new training technologies, allowing for easily twice as many iteration cycles while maintaining superior performance over fetching the same data from other storage options."

^{—Riley Patterson, Harmonic, Infrastructure Lead}

Scaling GKE workloads with Managed Lustre

A guide on using the Managed Lustre CSI driver with Google Kubernetes Engine (GKE) to seamlessly provision high-performance storage for containerized AI, ML, and HPC workloads. Read blog.

Accelerating AI and HPC with Managed Lustre

Overview of how Managed Lustre simplifies the deployment of parallel file systems for high-performance computing workloads. Read blog.

External KV cache with Managed Lustre

A deep dive into using Lustre to offload KV caches for large language model (LLM) inference, reducing memory overhead on TPUs/GPUs. Read blog.

Google Cloud Managed Lustre

High performance parallel file system

Product highlights

Accelerate AI

Dynamic tier

Maximize GPU saturation and compute ROI

Performance and scale for AI/ML workloads

Accelerate AI inference with KV cache

High-performance storage for AI. Instantly provision and scale cloud HPC on demand with Google Cloud Managed Lustre, powered by DDN EXAScaler.

Get started

Tutorials, quickstarts, & labs

Pricing calculator

Custom quote

Start your proof of concept

Get started with Managed Lustre

Dig into the technical details

Explore Managed Lustre on Google Cloud

Start building your AI application with Vertex AI

Explore AI Hypercomputer, Google's integrated supercomputing architecture

View our latest launches: