Skip to content

feat(eks): scale MCP workloads and node group defaults#2214

Merged
topher-lo merged 3 commits intomainfrom
codex/update-deployment-for-mcp-services
Feb 27, 2026
Merged

feat(eks): scale MCP workloads and node group defaults#2214
topher-lo merged 3 commits intomainfrom
codex/update-deployment-for-mcp-services

Conversation

@topher-lo
Copy link
Contributor

@topher-lo topher-lo commented Feb 27, 2026

Motivation

  • Provide a larger baseline capacity for the Tracecat MCP service by increasing replica count and per-pod CPU/memory so MCP can handle heavier loads.
  • Ensure cluster autoscaling/default capacity is sufficient for the increased MCP footprint by raising on-demand and spot node group size defaults.

Description

  • Increased MCP replica default from 1 to 2 in Terraform variables at deployments/eks/variables.tf and deployments/eks/modules/eks/variables.tf (variable tracecat_mcp_replicas).
  • Raised on-demand node group defaults to desired/min/max = 10/10/16 via deployments/eks/variables.tf and deployments/eks/modules/eks/variables.tf (node_desired_size, node_min_size, node_max_size).
  • Raised spot node group defaults to desired/min/max = 3/3/6 via deployments/eks/variables.tf and deployments/eks/modules/eks/variables.tf (spot_node_desired_size, spot_node_min_size, spot_node_max_size).
  • Added Helm overrides in deployments/eks/modules/eks/helm.tf to set MCP pod resource requests/limits to 4000m CPU and 4096Mi memory, and aligned the Helm chart default deployments/helm/tracecat/values.yaml so MCP also shows replicas: 2 and the updated resource values.

Testing

  • Ran python -c "import yaml, pathlib; yaml.safe_load(pathlib.Path('deployments/helm/tracecat/values.yaml').read_text())" to validate the modified Helm values.yaml, which succeeded.
  • Ran git diff --check to ensure no trailing whitespace/conflicts, which returned clean.
  • Attempted terraform fmt on modified Terraform files but the terraform CLI was not available in this environment, so formatting could not be validated here.

Codex Task


Summary by cubic

Scaled the Tracecat MCP service by setting replicas to 2 and setting pod resources to 1000m CPU and 1024Mi memory. Increased EKS capacity defaults to 10/10/16 (on-demand) and 3/3/6 (spot), and updated the README default profile to match.

Written for commit f1eee89. Summary will update on new commits.

Copy link
Contributor

@cubic-dev-ai cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

1 issue found across 4 files

Prompt for AI agents (unresolved issues)

Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="deployments/eks/modules/eks/variables.tf">

<violation number="1" location="deployments/eks/modules/eks/variables.tf:78">
P3: Update the EKS README default deployment profile values to match the new node group defaults so operators don’t follow outdated sizing guidance.</violation>
</file>

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.

@topher-lo topher-lo merged commit 32bcbe1 into main Feb 27, 2026
9 checks passed
@topher-lo topher-lo deleted the codex/update-deployment-for-mcp-services branch February 27, 2026 20:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant