Skip to main content

Find Your Scale

Use this guide to right-size your DevOpsGenie deployment — from a lean development cluster to a multi-tenant production environment handling billions of events per day.

Cluster Tiers

Starter (Development / Evaluation)

Ideal for: small teams evaluating DevOpsGenie, development environments, internal tooling.

ComponentSpecMonthly AWS Cost
EKS Control PlaneManaged~$73
System Node Group2× m6i.large (2 vCPU / 8 GiB each)~$140
Workload Node Group2× m6i.xlarge (4 vCPU / 16 GiB each)~$280
NAT Gateway~$35
Total~$530/month
# Starter sizing in Terraform
module "eks" {
...
eks_managed_node_groups = {
system = {
instance_types = ["m6i.large"]
min_size = 2
max_size = 3
desired_size = 2
}
workloads = {
instance_types = ["m6i.xlarge"]
min_size = 2
max_size = 10
desired_size = 2
}
}
}

Growth (Small Production)

Ideal for: production workloads with up to 20 microservices, 3–5 engineering teams.

ComponentSpecMonthly AWS Cost
EKS Control PlaneManaged~$73
System Node Group3× m6i.xlarge (HA across 3 AZs)~$420
Workload Nodes (Karpenter)~8× m6i.2xlarge on-demand~$2,400
Spot Nodes (Karpenter)~4× m6i.2xlarge Spot~$300
ALB~$40
NAT Gateway3× (HA)~$105
Total~$3,338/month

Scale (Mid-Size Production)

Ideal for: 50+ microservices, 10–20 teams, regulated industry workloads.

ComponentSpecMonthly AWS Cost
EKS Control PlaneManaged~$73
System Node Group3× m6i.2xlarge~$840
Workload Nodes (Karpenter)~20× m6i.4xlarge mix~$8,000
Spot Nodes (Karpenter)~10× m6i.4xlarge Spot~$1,200
ALB + WAF~$200
NAT Gateway~$105
Total~$10,418/month

Enterprise (Large-Scale Production)

Ideal for: 100+ microservices, 50+ teams, multi-region, high compliance requirements.

Contact sales@devopsgenie.io for enterprise sizing assistance.

Resource Requests Reference

Platform Components (System Node Pool)

ComponentCPU RequestMemory RequestReplicas
ArgoCD Server100m256Mi2
ArgoCD Repo Server250m512Mi2
Prometheus500m2Gi1 (HA: 2)
Alertmanager100m128Mi2
Grafana200m256Mi1
Loki500m1Gi1 (HA: 3)
OPA Gatekeeper200m512Mi3
cert-manager100m64Mi1
External Secrets200m128Mi1
Karpenter200m256Mi2
CoreDNS100m70Mi2–5
Total (minimum)~2.5 vCPU~5 GiB

Workload Sizing Guidelines

Workload TypeCPU RequestMemory RequestNotes
REST API (Go/Rust)100–250m64–256MiEfficient runtimes
REST API (Node.js)250–500m256–512MiEvent-loop model
REST API (Java/Spring)500m–1512Mi–1GiJVM overhead
Background worker250–500m256–512MiDepends on batch size
ML inference (CPU)2–44–8GiConsider GPU nodes
Data pipeline1–42–8GiHighly variable

Autoscaling Recommendations

HPA (Horizontal Pod Autoscaler)

apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
name: payments-api
namespace: team-payments
spec:
scaleTargetRef:
apiVersion: apps/v1
kind: Deployment
name: payments-api
minReplicas: 3
maxReplicas: 50
metrics:
- type: Resource
resource:
name: cpu
target:
type: Utilization
averageUtilization: 60
- type: Resource
resource:
name: memory
target:
type: Utilization
averageUtilization: 70
behavior:
scaleDown:
stabilizationWindowSeconds: 300
policies:
- type: Percent
value: 25
periodSeconds: 60
scaleUp:
stabilizationWindowSeconds: 0
policies:
- type: Pods
value: 4
periodSeconds: 60

VPA (Vertical Pod Autoscaler)

Use VPA in Recommendation mode initially to find the right requests:

apiVersion: autoscaling.k8s.io/v1
kind: VerticalPodAutoscaler
metadata:
name: payments-api
namespace: team-payments
spec:
targetRef:
apiVersion: apps/v1
kind: Deployment
name: payments-api
updatePolicy:
updateMode: "Off" # Recommendation only — don't restart pods
# View VPA recommendations after 24–48h of production traffic
kubectl get vpa payments-api -n team-payments -o yaml | \
yq '.status.recommendation'

Cost Optimization Tips

  1. Use Spot/Preemptible instances for stateless workloads — 70–80% savings over on-demand
  2. Enable Karpenter consolidation to right-size nodes automatically
  3. Set CPU limits to prevent noisy-neighbour issues, but don't set memory limits (use requests only to avoid OOMKill under throttling)
  4. Use Graviton/ARM nodes (AWS m7g) for compatible workloads — 20% better price-performance
  5. Schedule batch workloads off-peak using CronJob startingDeadlineSeconds
# Get a cost estimate for your current configuration
devopsgenie sizing estimate \
--provider aws \
--region us-east-1 \
--tier growth