Updated 17 April 2026
API Gateway vs Load Balancer: Cost, Features, and When to Use Which (2026)
One of the most common architecture decisions for AWS engineers: should you put API Gateway or an Application Load Balancer in front of your services? The cost crossover point is approximately 500k requests per day. Here is the full cost breakdown, feature comparison, and a decision framework that actually helps you choose.
Quick Verdict
Under 500k requests/day, API Gateway is cheaper because it scales to zero. Above 500k/day with steady, predictable traffic, ALB wins on per-request cost. For public APIs needing auth, rate limiting, and caching, API Gateway is almost always the better architecture regardless of cost. For internal high-volume service-to-service traffic where auth is handled downstream, ALB wins decisively.
The Fundamental Pricing Difference
API Gateway (HTTP API)
$1.00/million requests
Pure consumption pricing. No baseline charge. If you have zero traffic, you pay zero. Scales instantly from 0 to millions of requests. No instance capacity to manage. Includes: JWT auth, rate limiting, routing. Data transfer charged separately ($0.09/GB outbound).
Minimum monthly cost: $0.00
At 500k req/day (15M/month): ~$15/month
Application Load Balancer (ALB)
$0.0225/hour + LCU
Hybrid pricing: hourly baseline plus Load Capacity Unit (LCU) consumption charges. LCUs measure the max of: new connections per second, active connections, processed bytes per hour, or rule evaluations per second. Even at zero traffic, you pay the hourly charge. At volume, per-request cost drops significantly below API Gateway.
Minimum monthly cost: $16.43/month (720 hours)
At 500k req/day (15M/month): ~$25/month
Cost Comparison by Traffic Volume
The table below models AWS HTTP API vs ALB for REST traffic. ALB LCU costs are estimated using the processed-bytes dimension (dominant at API traffic patterns with 4 KB avg payload). Real-world costs vary based on your LCU profile.
| Daily/Monthly traffic | API Gateway (HTTP) | ALB | Winner | Notes |
|---|---|---|---|---|
| 100k req/day (3M/month) | ~$3/month | ~$20/month | API Gateway | ALB minimum charge dominates at low volume |
| 300k req/day (9M/month) | ~$9/month | ~$22/month | API Gateway | ALB LCU costs still dominated by hourly baseline |
| 500k req/day (15M/month) | ~$15/month | ~$25/month | Roughly tied | Crossover zone - feature needs should decide |
| 1M req/day (30M/month) | ~$30/month | ~$28/month | ALB | ALB per-request cost now below API Gateway |
| 5M req/day (150M/month) | ~$150/month | ~$40/month | ALB | ALB is 73% cheaper at this volume |
| 20M req/day (600M/month) | ~$564/month | ~$70/month | ALB | ALB advantage widens significantly at high volume |
ALB pricing: $0.0225/hour ($16.43/month base) + $0.008/LCU-hour. API Gateway HTTP pricing: $1.00/million for first 300M. Data transfer excluded from both for comparability. Source: aws.amazon.com pricing pages, April 2026.
Feature Comparison: Where API Gateway Wins on Capability
Cost is only part of the equation. API Gateway and ALB have genuinely different feature sets. An ALB that requires $500/month in custom engineering to replace what API Gateway provides natively is not actually cheaper.
| Feature | API Gateway (HTTP) | ALB |
|---|---|---|
| JWT / OAuth authentication | Native, built-in | Not supported natively (add Cognito) |
| API key management | Yes (REST API only, not HTTP) | No |
| Per-client rate limiting | Yes (usage plans, REST API) | No |
| Request throttling | Yes (10k req/sec default) | No (capacity scales, no throttle) |
| Response caching | Yes (REST API, $0.028/GB/hr) | No |
| Request transformation | Yes (REST API VTL templates) | No |
| URL path routing | Yes | Yes (path-based routing rules) |
| Host-based routing | Yes (custom domains) | Yes (SNI + host headers) |
| Health checks | Via Lambda/backend | Yes (native, TCP + HTTP) |
| WebSocket support | Yes (REST API, $0.25/M conn-min) | Yes (sticky sessions) |
| gRPC | No native support | Yes (HTTP/2 native) |
| AWS WAF integration | Yes (add-on) | Yes (add-on, same pricing) |
| Lambda integration | Native (no VPC required) | Yes (Lambda targets, requires VPC) |
| ECS / EC2 targets | Via VPC link (REST) or HTTP integration | Native target groups |
| Minimum monthly cost | $0 (pay per request) | $16.43 (hourly baseline) |
Decision Framework: Which to Choose
Use API Gateway when
- Traffic is bursty or low-volume (scales to zero, no baseline cost)
- You need JWT/OAuth auth at the edge without writing auth code
- You are exposing APIs to external developers with API keys and rate limits
- Backend is Lambda - API Gateway is the native, lowest-latency pair
- You want managed rate limiting without custom code or WAF rules
- You need response caching to reduce backend Lambda invocations
Use ALB when
- High-volume steady traffic (1M+ requests/day) with predictable patterns
- Internal service-to-service routing where auth happens downstream
- gRPC workloads (ALB has native HTTP/2 + gRPC support)
- Backend is ECS, EC2, or container fleet needing native health checks
- Low-latency priority (ALB adds 1-5ms vs API Gateway 10-100ms)
- Cost is primary constraint and you can handle auth/rate-limiting in code
Using Both Together: The Hybrid Architecture
Many production systems use API Gateway and ALB together. The pattern is: API Gateway at the public edge, ALB inside the VPC for routing to a compute fleet. This gives you the best of both worlds: API Gateway handles auth, rate limiting, and external-facing API management; ALB handles efficient internal routing, health checks, and gRPC.
Layer 1: Cloudflare or CDN (optional)
DDoS protection, edge caching, WAF. Handles 60-80% of traffic at the edge for read-heavy APIs.
Layer 2: API Gateway (HTTP API)
JWT validation, rate limiting per client, routing rules. Forwards only authenticated, non-rate-limited requests inward.
Layer 3: ALB (inside VPC)
Routes to ECS services or EC2 instances. Target group health checks. gRPC support for internal service calls.
Layer 4: Your services (ECS, Lambda, EC2)
Business logic. Auth already handled at layer 2; services focus on computation.
This pattern is particularly common in regulated industries (finance, healthcare) where auth must occur before compute is invoked, and where internal service-to-service traffic volume justifies ALB's efficiency. The additional API Gateway cost is offset by the security and compliance value.
Startup Cost Journey: A Worked Timeline
Month 1-6: Early stage (50k req/day, 1.5M/month)
~$2/monthDecision: API Gateway HTTP API
Scales to zero. No baseline cost. Lambda integration is native. $2/month for JWT auth and rate limiting is a good deal.
Month 7-12: Growth (300k req/day, 9M/month)
~$9/monthDecision: API Gateway HTTP API
Still cheaper than ALB ($22/month). No reason to switch. Focus on product, not infrastructure.
Month 13-18: Scale (1M req/day, 30M/month)
API GW: ~$30/month + ALB: ~$28/month = ~$58/monthDecision: Evaluate ALB behind API Gateway
ALB now cheaper per-request for the compute-routing layer. Add ALB behind API Gateway for ECS fleet. API Gateway stays at the auth/rate-limit edge.
Month 19-24: High traffic (10M req/day, 300M/month)
~$200-400/month totalDecision: Split architecture: Cloudflare edge + ALB internal
At this scale, replace API Gateway with Cloudflare Workers at the edge (zero egress, WAF included) and ALB for internal routing. Major cost reduction vs API Gateway alone ($300+/month).
Non-AWS Equivalents
Azure
Azure API Management (APIM) is the API Gateway equivalent. Azure Application Gateway is the ALB equivalent (L7). Azure Load Balancer is the NLB equivalent (L4). APIM Consumption tier is pure pay-per-call (similar to AWS HTTP API). Standard APIM is hourly (similar to ALB). See Azure APIM pricing.
Google Cloud
Apigee X is the API Gateway equivalent. Google Cloud Load Balancing (HTTP(S) LB) is the ALB equivalent. Cloud Load Balancing has no per-request fee for global HTTP LB - you pay per-rule per-month. At high volume, GCP Load Balancing can be cheaper than both AWS API GW and AWS ALB.
Cloudflare (replaces both)
Cloudflare Workers effectively replaces both roles. Workers handles auth, routing, rate limiting, and transformation at the edge (API Gateway role) with zero-egress data transfer and no per-instance cost (load balancer role). For many teams, moving to Cloudflare Workers eliminates the need to choose between API GW and ALB. See Cloudflare pricing.
Kong + ALB pattern
Kong Gateway OSS or Konnect deployed on EC2/ECS, with an ALB in front. Kong handles all API management features; ALB provides health checks, SSL termination, and Kong instance routing. Common in multi-cloud environments where Kong's portability is valued. See Kong pricing and self-hosted cost analysis.