Module 7 takes the agent you built in Module 6 and turns it into a production cloud service. You'll containerize the stack, orchestrate it on Kubernetes, automate delivery, and operate it with observability, security, and cost controls. The goal: a reliable Digital FTE that runs 24/7 for real users.
Prerequisites: Modules 4-6. You need a working agent service to deploy.
You have learned Gateway API fundamentals, configured Envoy Gateway, written HTTPRoutes, applied rate limiting, set up TLS, implemented traffic splitting, configured autoscaling with KEDA, and explored Envoy AI Gateway for LLM traffic. Each lesson added a pattern to your traffic-engineer skill. Now you will compose all these patterns into a production-ready traffic stack for your Task API.
This capstone follows the specification-first approach. You will write a specification defining what "production-ready" means for your deployment, then use your traffic-engineer skill to generate configurations that meet those criteria. The goal is not just working configuration—it is configuration you can validate, defend, and evolve.
By the end, you will have a complete traffic engineering stack protecting your Task API: external access through Gateway, rate limiting per user, TLS termination, and autoscaling based on demand. You will also finalize your traffic-engineer skill as a reusable component for any future AI agent deployment.
Before generating any YAML, define what you are building. Create a file named traffic-spec.md in your project:
Each user (identified by x-user-id header) is limited to 100 requests/minute. Anonymous users are limited to 10 requests/minute.
Validation:
HTTPS access works with a valid (self-signed for local) certificate.
Validation:
KEDA ScaledObject scales pods from 1-10 based on request rate. Scale-to-zero disabled (minimum 1 replica for availability).
Validation:
Traffic routes to correct backends based on path prefix:
Validation:
Backend protected from overload with max 50 concurrent connections.
Validation:
Traffic-engineer skill generates all required resources from this spec.
Validation:
traffic-spec.md created (2847 bytes)
Using my traffic-engineer skill, generate Gateway and HTTPRoute for Task API:
From traffic-spec.md:
Output:
Request BackendTrafficPolicy matching SC-002:
Review AI's output. Check these specifics:
Expected structure:
Output:
Request protection against backend overload:
Review AI's output. Ensure circuit breaker is added to existing policy, not a separate resource:
Output:
Request self-signed certificate setup:
Review AI's output. For local development, you need:
Certificate generation:
Output:
Request autoscaling configuration matching SC-004:
Review AI's output. Check these specifics:
Expected structure:
Output:
With all configurations generated and reviewed, apply them to your cluster in the correct order.
Output:
Output:
Output:
Output:
Output:
Run each validation from your specification.
Output:
Result: PASS
Output:
Result: PASS (100 succeeded, 10 rate limited)
Output:
Result: PASS (10 succeeded, 5 rate limited)
Output:
Result: PASS
Output:
Result: PASS (READY=True, MIN=1, MAX=10)
Output:
Result: PASS
Output:
Result: PASS
Output:
Result: PASS
All success criteria met. Your Task API has production-ready traffic engineering.
Your traffic-engineer skill has accumulated patterns from all 12 lessons. Review and finalize it.
Your skill should include this decision logic:
Your skill should include templates for:
1. Gateway with HTTP and HTTPS:
2. HTTPRoute with versioned paths:
3. BackendTrafficPolicy with rate limiting and circuit breaker:
4. KEDA ScaledObject:
Add this to your skill:
Run this test to verify your skill handles the complete workflow:
Your traffic-engineer skill is now complete. It should handle:
Consider what edge cases need documentation:
This skill becomes part of your Digital FTE portfolio. When deploying future AI agents, invoke this skill to generate production-ready traffic configurations in minutes rather than hours.
Your traffic-engineer skill is ready for your skills portfolio:
Commit your finalized skill:
You now own a production-tested traffic engineering skill that transforms any AI agent deployment from "pod running in cluster" to "production service ready for users."