What is DC/OS?
DC/OS (the Distributed Cloud Operating System) is an open-source, distributed operating system based on the Apache Mesos distributed systems kernel. It manages multiple machines in the cloud or on-premises from a single interface and deploys containers, distributed services, and legacy applications onto those machines. The system provides networking, service discovery, and resource management to keep services running and communicating with each other.
DC/OS offers infrastructure portability: you can develop locally and deploy to production in the cloud or on-premises without vendor lock-in. It includes a service catalog for installing distributed data services, databases, CI/CD tools, and monitoring with a few clicks or commands. Backed by D2iQ, Inc., it supports containerized workloads, big data pipelines, and traditional applications while handling scheduling, resource allocation, and automatic recovery.
Features
- Distributed Operating System: Provides scheduling, resource allocation, service discovery, workload co-location, automatic recovery, load balancing, and software-defined networking
- Infrastructure Portability: Develop locally and deploy to production on DC/OS in the cloud or on-premises with support for multiple cloud providers
- Service Catalog Included: Install distributed data services, databases, CI/CD tools, and monitoring with just a few clicks or commands
- Containers & Fast Data: Deploy and run stateful or stateless distributed workloads including Docker containers, big data pipeline tools, and traditional apps
- Built with Apache Mesos: Uses Mesos' two-level scheduling to increase resource utilization and lower the barrier to cluster administration
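
The two-level scheduling mentioned in the last feature can be illustrated with a toy model. In Mesos, the first level decides which free resources to offer to which framework, and the second level (each framework's own scheduler) decides which offered resources to use for which tasks. The sketch below is a simplification under stated assumptions, not Mesos code: the class and function names (`Offer`, `Task`, `Framework`, `run_offer_cycle`) are hypothetical, and frameworks are offered resources in plain list order, whereas Mesos uses a fair-sharing allocator.

```python
from dataclasses import dataclass, field


@dataclass
class Offer:
    """Free resources on one agent (hypothetical, simplified to CPU and memory)."""
    agent: str
    cpus: float
    mem_mb: int


@dataclass
class Task:
    name: str
    cpus: float
    mem_mb: int


@dataclass
class Framework:
    """Level two: a framework scheduler that chooses tasks for an offer."""
    name: str
    pending: list = field(default_factory=list)

    def on_offer(self, offer):
        """Return the pending tasks that fit this offer; the rest is declined."""
        accepted = []
        cpus, mem = offer.cpus, offer.mem_mb
        for task in list(self.pending):  # iterate over a copy while removing
            if task.cpus <= cpus and task.mem_mb <= mem:
                accepted.append(task)
                self.pending.remove(task)
                cpus -= task.cpus
                mem -= task.mem_mb
        return accepted


def run_offer_cycle(agents, frameworks):
    """Level one: offer each agent's free resources to every framework in turn.

    Assumption: simple list order stands in for the real allocation policy.
    """
    placements = []
    for agent in agents:
        for fw in frameworks:
            for task in fw.on_offer(agent):
                # Deduct what the framework used before offering the remainder.
                agent.cpus -= task.cpus
                agent.mem_mb -= task.mem_mb
                placements.append((fw.name, task.name, agent.agent))
    return placements


agents = [Offer("agent-1", 4.0, 8192), Offer("agent-2", 2.0, 4096)]
web = Framework("web", [Task("nginx", 1.0, 512), Task("api", 2.0, 2048)])
batch = Framework("batch", [Task("etl", 2.0, 4096)])
print(run_offer_cycle(agents, [web, batch]))
```

In this run the `web` framework places both of its tasks on `agent-1`, the `batch` framework declines the single CPU left there, and its `etl` task lands on `agent-2`. The point of the split is that the allocator never needs to understand a framework's tasks, and a framework never needs a global view of the cluster.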
Use Cases
- Deploying and managing containerized applications in production environments
- Running big data pipelines and distributed data services efficiently
- Managing legacy applications alongside modern containerized workloads
- Implementing CI/CD tools and monitoring systems across distributed infrastructure
- Ensuring high availability and automatic recovery for critical services
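
The containerized-application and high-availability use cases above can be made concrete with an app definition. The following is a minimal sketch, assuming the workload is submitted through Marathon, the container orchestrator that ships with DC/OS. The helper name `build_app_definition`, the app ID, the image, and the resource figures are all hypothetical; only a small subset of the Marathon app definition fields is shown.

```python
import json


def build_app_definition(app_id, image, instances=2, cpus=0.5, mem_mb=128):
    """Build a minimal Marathon-style app definition as a plain dict.

    Assumption: the fields used here (id, cpus, mem, instances, container)
    are a small subset of what an app definition can contain.
    """
    if instances < 0:
        raise ValueError("instances must not be negative")
    return {
        "id": app_id,
        "cpus": cpus,            # CPU shares per instance
        "mem": mem_mb,           # memory per instance, in MB
        "instances": instances,  # desired number of running copies
        "container": {
            "type": "DOCKER",
            "docker": {"image": image},
        },
    }


# Hypothetical example values, chosen only for illustration.
app = build_app_definition("/hello-web", "nginx:1.25", instances=3)
print(json.dumps(app, indent=2))
```

Saved to a file such as `app.json`, a definition like this would typically be submitted with `dcos marathon app add app.json`. Declaring a desired instance count, rather than starting containers by hand, is what lets the platform restart failed instances to keep the service at the requested size.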