Your SLOs are burning.
Reflex fixes them.

SLO-based automation for Kubernetes. Audit your alerts. Automate your responses. Start safe with observe mode, graduate to auto.

Perfect for Platform Teams • Kubernetes-native • Start safe, graduate to auto • Self-hosted, privacy-first

reflex-demo
$ kubectl apply -f slo.yaml
SLO api-availability created
$ reflex runtime status
🔥 api-availability
Burn rate: 3.2x (target: 99.9%, current: 97.1%)
📊 Recommendation: Scale deployment/api +2 replicas
🤖 Mode: observe → Would create ReflexRecommendation
$ reflex runtime set api-scale --automation-level auto
Promoted api-scale reflex to AUTO mode
🎯 Next SLO violation → Automatic scaling

The SLO Response Crisis

🔥

SLOs Burn While You Sleep

Your availability drops to 97% at 2am. By morning, you've burned through a week's error budget.

Manual Response Too Slow

Page the on-call, investigate, decide, execute. 20 minutes to scale up while customers suffer.

😴

You Can't Automate Everything

HPA scales on CPU. But what about error rates? Latency spikes? Database connection issues?

78% of SLO violations happen outside business hours when response time is slowest

Two Products, Complete Solution

Fix problems in your alerts, then automate responses to SLO violations. Use separately or together.

One-time
🔍

Reflex Audit

$99

Find problems in your Prometheus alerts with expert SRE knowledge and AI-powered explanations.

3-layer linting: structural + curated + AI
Expert SRE knowledge for PostgreSQL, K8s, HTTP, Redis
AI-powered explanations and context
Auto-fix suggestions with confidence scores
Buy Audit →

How It Works

Install. Define SLOs. Start in observe mode. Graduate to auto as trust builds.

1

Install

Helm chart or wheel download after purchase. Works with any Prometheus.

2

Define SLOs

Set targets: 99.9% availability, P95 latency < 500ms.

3

Start in observe mode

See what Reflex would do. Build confidence.

4

Graduate to auto as trust builds

Fix issues while you sleep. SLOs stay healthy.

1. Define Your SLOs

apiVersion: slo.reflex.io/v1
kind: SLO
metadata:
  name: api-availability
spec:
  service: "checkout-api"
  target: 99.9  # 99.9% availability
  indicators:
  - name: success_rate
    query: "rate(http_requests_total{code!~'5..'}[5m])"

• Availability target: 99.9%

• Latency target: P95 < 500ms

• K8s health checks

2. Create Reflexes

apiVersion: reflex.io/v1
kind: Reflex
metadata:
  name: api-scale
spec:
  automationLevel: observe  # Start safe
  trigger:
    burnRate: "> 2.0"  # 2x burn rate
  action:
    type: scale
    target: "deployment/checkout-api"
    parameters:
      replicas: "+2"

✓ Scale when SLO burns too fast

✓ Restart on error rate spikes

✓ Notify team on violations

✓ Rollback bad deployments

You Control The Automation Level

Build confidence gradually. Start safe, graduate to full automation when ready.

👁️

Observe Mode

See recommendations without any action. Perfect for understanding what Reflex would do and building confidence.

No actions Recommendations only Learn & validate
🤖

Auto Mode

Full automation with safety guardrails. Fix issues automatically while you sleep. Your SLOs stay healthy.

Automatic actions Safety preconditions Instant response

Perfect For

Platform Teams

Automate remediation across 50+ microservices. SLOs stay healthy while you focus on feature work.

Result: 80% reduction in manual incident response

SRE Teams

Sleep through the night. Reflex handles the routine scaling, restarts, and alerts automatically.

Result: 90% fewer 3am pages for routine issues

Kubernetes Operators

Go beyond CPU-based HPA. Scale on error rates, latency, and custom SLO metrics automatically.

Result: 10x faster response to SLO violations

Simple, Transparent Pricing

Choose your products. Start free with alert linting and runtime observe mode.

Free Audit Runtime Starter Runtime Pro
Price $0 $99
one-time
$149/mo $299/mo
Alert linting (limited)
Full Audit + AI
Runtime observe
Runtime actions
Multiple clusters
Start Free Buy Now Start Trial Contact

Free

Perfect for getting started and learning

$0 forever
Basic alert linting (static checks)
Runtime observe mode
1 Kubernetes cluster
Community support
Start Free

Audit

Deep alert analysis with AI insights

$99 one-time
3-layer linting: structural + curated + AI
Expert SRE knowledge built-in
AI-powered explanations
Auto-fix suggestions
Buy Audit

Runtime Pro

Multiple clusters and priority support

$299 per month
Everything in Starter
Up to 5 Kubernetes clusters
Advanced integrations
Priority support
Contact Sales

Frequently Asked Questions

What's included in the free tier?

Observe mode lets you see all recommendations Reflex would make. Perfect for learning and building confidence.

Do I need internet access?

No. Reflex runtime works completely offline after installation. Self-hosted and privacy-first.

What happens if I cancel?

Your reflexes switch back to observe mode. You keep all the SLO monitoring and recommendations.

Is there a money-back guarantee?

Yes! 30 days, no questions asked. If Reflex doesn't improve your SLO reliability, get a full refund.

Get Started in 5 Minutes

1

Install Reflex Runtime

python3 scripts/install.py --prometheus-url http://your-prometheus:9090
2

Define Your SLOs

kubectl apply -f - <
3

Create Reflexes

kubectl apply -f examples/reflexes/traffic-spike-scale.yaml
4

Graduate Through Trust Ladder

reflex runtime status  # See recommendations
reflex runtime set api-scale --automation-level auto

What You'll See

👁️ Observe mode: See what Reflex would do, build confidence
🧪 Dry-run mode: Simulate actions, validate safety
🤖 Auto mode: Fix issues automatically while you sleep
📊 SLO health: Real-time burn rate tracking and alerts

Trusted by Platform Teams

"Reflex prevented 23 pages last month. Our SLOs stay healthy while we sleep. Best investment we've made for reliability."
Sarah Chen Principal SRE, TechScale
"Started with observe mode, graduated to auto after 2 weeks. Now our APIs self-heal during traffic spikes. Amazing."
Marcus Rodriguez DevOps Lead, CloudNative Co
"The trust ladder is genius. We went from skeptical to fully automated in a month. SLO violations drop from 20 minutes to 30 seconds."
Emily Zhang Platform Engineer, DataFlow
1,000+
SLO Violations Fixed
90%
Fewer 3AM Pages
30s
Average Response Time
24/7
Self-Healing Uptime

Stop Losing Sleep Over SLO Violations

Join the platform teams using Reflex to automatically fix issues while they sleep. Start safe with observe mode, graduate to full automation when ready.

👁️ Free observe mode forever
🔒 Self-hosted, privacy-first
💸 30-day money-back guarantee