Cloud computing model where the provider manages infrastructure automatically, allowing code to run without provisioning or managing servers, with payment only for actual usage.
Serverless is a cloud execution model where the provider allocates resources dynamically and charges only for compute consumed. It doesn't mean "no servers" — it means the developer doesn't manage them. The provider handles provisioning, scaling, patching, and availability.
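The execution model is easiest to see as code: a single function that the platform invokes on demand. A minimal sketch of a Lambda-style handler in Python (the `name` field in the event is an illustrative assumption, not a fixed AWS shape):

```python
import json

def handler(event, context):
    """Entry point the platform invokes on each event.

    There is no server to provision: the provider creates an execution
    environment, calls this function, and scales the number of
    environments with incoming traffic.
    """
    name = event.get("name", "world")  # illustrative payload field
    return {
        "statusCode": 200,
        "body": json.dumps({"message": f"Hello, {name}"}),
    }
```

The developer ships only this function; provisioning, scaling, and patching stay on the provider's side.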
| Service | Function |
|---|---|
| Lambda | Functions as a Service (FaaS) |
| API Gateway | HTTP/REST/WebSocket APIs |
| DynamoDB | NoSQL database |
| S3 | Object storage |
| Step Functions | Workflow orchestration |
| EventBridge | Event bus |
| SQS / SNS | Messaging |
A cold start occurs when Lambda creates a new execution environment. Latency varies significantly by runtime and package size:
| Runtime | Typical cold start | With mitigation |
|---|---|---|
| Node.js | 100–300 ms | Near zero with provisioned concurrency |
| Python | 150–400 ms | Near zero with provisioned concurrency |
| Java | 1–3 s | 200–400 ms with SnapStart |
| .NET | 400–800 ms | 100–200 ms with Native AOT |
| Rust/Go | 10–30 ms | Rarely needed |
Mitigation strategies:

- Keep deployment packages small and dependencies minimal.
- Initialize SDK clients, connections, and configuration outside the handler so warm invocations reuse them.
- Use provisioned concurrency for latency-sensitive endpoints.
- Enable SnapStart for Java; consider Native AOT for .NET.
- Prefer lightweight runtimes (Rust, Go) where cold-start latency matters most.
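A common mitigation is to move expensive setup out of the handler so warm invocations reuse it. A minimal sketch (the `TABLE_NAME` variable is a placeholder, not a required name):

```python
import os
import time

# Module-level code runs once per execution environment (during the cold
# start), not on every invocation. Expensive setup belongs here.
START = time.monotonic()
CONFIG = {"table": os.environ.get("TABLE_NAME", "example-table")}  # placeholder
# An SDK client, e.g. boto3.client("dynamodb"), would also be created here.

def handler(event, context):
    # Warm invocations reuse CONFIG (and any clients) initialized above,
    # so per-request work stays minimal.
    return {"table": CONFIG["table"], "warm_for_s": time.monotonic() - START}
```

The same environment serves many requests, so anything created at module level is paid for once per cold start rather than once per invocation.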
| Criteria | Serverless (Lambda) | Containers (Fargate) |
|---|---|---|
| Maximum duration | 15 minutes | No limit |
| Scaling | Automatic, per invocation | Automatic, by metrics (slower) |
| Cold start | 100 ms – 3 s | 30–60 s (task provisioning) |
| Idle cost | $0 | Cost per vCPU/memory while running |
| High traffic cost | Can be high (per invocation) | More predictable (per hour) |
| State | Stateless | Can maintain in-memory state |
| Networking | Optional VPC, slow ENI | Native VPC, full networking |
Use serverless when: variable or unpredictable traffic, short executions (under 15 min), small teams wanting zero ops, event-driven architectures.
Use containers when: long-running processes, need for in-memory state, constant and predictable traffic, complex networking requirements.
Lambda charges $0.20 per million invocations plus $0.0000166667 per GB-second. For an API with 1 million requests/month, 256 MB memory, and 200 ms average duration:

- Compute: 1M × 0.2 s × 0.25 GB = 50,000 GB-seconds ≈ $0.83
- Invocations: $0.20
- Total: ≈ $1.03/month (ignoring the free tier)
The same load on Fargate (one task with 0.25 vCPU and 0.5 GB, running 24/7) costs ~$9.10/month. Serverless wins for low or variable loads. But at 100M requests/month, Lambda costs ~$103 while Fargate stays at ~$9.10 (assuming a single task can absorb the load); the crossover point depends on traffic volume and pattern.
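The arithmetic above can be checked with a short script (prices taken from the text; the AWS free tier is ignored):

```python
# Monthly Lambda cost for the example above.
PER_MILLION_REQUESTS = 0.20   # USD per 1M invocations
PER_GB_SECOND = 0.0000166667  # USD per GB-second of compute

def lambda_monthly_cost(requests, memory_gb, avg_seconds):
    """Compute + invocation charges for one month of traffic."""
    compute = requests * avg_seconds * memory_gb * PER_GB_SECOND
    invocations = requests / 1_000_000 * PER_MILLION_REQUESTS
    return compute + invocations

print(round(lambda_monthly_cost(1_000_000, 0.25, 0.2), 2))    # → 1.03
print(round(lambda_monthly_cost(100_000_000, 0.25, 0.2), 2))  # → 103.33
```

Because cost scales linearly with invocations while a fixed-size Fargate task costs the same whether idle or busy, the break-even sits wherever this line crosses the flat container cost.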
Serverless eliminates server management and payment for idle capacity. For workloads with variable traffic — APIs, event processing, scheduled tasks — the pay-per-execution model can dramatically reduce costs while scaling automatically.
AWS serverless compute service that runs code in response to events without provisioning or managing servers, automatically scaling from zero to thousands of concurrent executions.
Practice of defining and managing infrastructure through versioned configuration files instead of manual processes. Foundation of modern operations automation.
Architectural pattern where components communicate through asynchronous events, enabling decoupled, scalable, and reactive systems.
Serverless compute engine for containers that eliminates server management, running Docker containers with payment only for the resources consumed.
AWS serverless service providing access to foundation models from multiple providers (Anthropic, Meta, Mistral, Amazon) via unified API, without managing ML infrastructure.
Development approach leveraging cloud advantages: containers, microservices, immutable infrastructure, and declarative automation for scalable and resilient systems.
Practices and strategies to minimize cloud spending without sacrificing performance, including right-sizing, reservations, spot instances, and eliminating idle resources.
Architecture design for scaling a personal second brain to a production system with AWS serverless — from the current prototype to specialized use cases in legal, research, and community building.
Chronicle of building a second brain with a knowledge graph, bilingual pipeline, and agent endpoints — in days, not weeks, and what that teaches about the gap between theory and working systems.
Collection of 13 Terraform modules published on the Terraform Registry for deploying serverless architectures on AWS, with 12 examples covering basic ECS to full-stack CRUD with DynamoDB and AgentCore with MCP.
Slidev presentation on 10 reasons to adopt a serverless-first architecture. Deployed on GitHub Pages.
Production-ready serverless backend for a personal knowledge graph — DynamoDB, Lambda, Bedrock, MCP, Step Functions. The implementation of the architecture described in the 'From Prototype to Production' essay.
Personal lab for serverless architecture experiments: prototypes, patterns, and learnings about event-driven applications on AWS.
Serverless GitHub App that auto-approves pull requests after CI passes, with optional AI code review via Amazon Bedrock. Five repositories: TypeScript/Probot app, AWS Terraform module (Lambda + API Gateway + Secrets Manager + SQS DLQ), GitHub Terraform module (webhooks), deployment infra, and test repo.
Devcontainer template for serverless fullstack development with Python backend, React frontend, and local AWS services.
Web applications using modern technologies to deliver native app-like experiences: installable, offline-capable, and with push notifications.
React framework for full-stack web applications with Server Components, file-based routing, SSR/SSG, and built-in performance optimizations.
Practices for implementing effective logging in distributed systems: structured logging, levels, correlation, and centralized aggregation.
Techniques to reduce cost, latency, and resources needed to run language models in production, from quantization to distributed serving.
AWS framework with six pillars of best practices for designing and operating reliable, secure, efficient, and cost-effective cloud systems.
AWS serverless orchestration service that coordinates multiple services into visual workflows using Amazon States Language (ASL), with built-in error handling, retries, and parallel execution.
AWS fully managed message queue service that decouples distributed application components, with at-least-once delivery (exactly-once processing on FIFO queues) and near-unlimited throughput.
AWS pub/sub messaging service that distributes messages to multiple subscribers simultaneously, enabling fan-out patterns and notifications at scale.
AWS open-source framework for building serverless applications with simplified CloudFormation syntax, CLI for local development, and integrated deployment.
AWS object storage service with 99.999999999% durability, unlimited scalability, and multiple storage classes for cost optimization.
AWS serverless event bus connecting applications using events, enabling decoupled event-driven architectures with rule-based routing.
AWS serverless NoSQL database with single-digit millisecond latency at any scale, ideal for applications requiring high performance and automatic scalability.
AWS managed service for creating, publishing, and managing REST, HTTP, and WebSocket APIs that act as entry points to Lambda functions and other backend services.
Pattern providing a single entry point for multiple microservices, handling routing, authentication, rate limiting, and response aggregation.