Lambda is usually very cost-effective: you only pay when your functions run (with per-ms billing), and they scale to zero.
But a combination of misconfiguration and a high-throughput use case can give you a nasty surprise when you get your AWS bill, especially if you don't have billing alerts set up.
A few of my clients have run into this type of problem before. For example, when a function that is invoked millions of times a day is allocated way too much memory. Or when provisioned concurrency is enabled on a function with a lot of memory.
Cloud cost control is a complex topic and these types of mistakes are easy to make. I have seen many folks get caught out by the cost of their ECS clusters (bad scaling configuration) or VPCs (VPC endpoints, NAT gateways, etc.), or even API Gateway and CloudWatch!
As far as Lambda is concerned, the most effective way to cut costs is to make sure you're not over-allocating memory.
@alex_casalboni's Lambda power tuning project is the go-to solution for right-sizing Lambda functions.
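If you haven't used it, kicking off a tuning run is just a Step Functions execution. Here's a minimal sketch (assuming you've already deployed the power tuning state machine; the ARNs are placeholders):

```python
import json
import boto3

sfn = boto3.client("stepfunctions")

# Placeholder ARNs - substitute your deployed state machine and target function
response = sfn.start_execution(
    stateMachineArn="arn:aws:states:us-east-1:123456789012:stateMachine:powerTuningStateMachine",
    input=json.dumps({
        "lambdaARN": "arn:aws:lambda:us-east-1:123456789012:function:my-function",
        "powerValues": [128, 256, 512, 1024, 2048],  # memory sizes to test
        "num": 50,           # invocations per memory size
        "payload": {},       # a representative test payload for your function
        "strategy": "cost",  # optimize for cost (other options: "speed", "balanced")
    }),
)
print(response["executionArn"])
```

The state machine invokes your function at each memory size and shows you the cost/performance trade-off, so you can pick the sweet spot.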
@alex_casalboni But, most of the time, there's no point in optimizing because there are no meaningful cost savings to be had - saving 30% on $0.01/month is a wasted afternoon...
The trick is finding the functions that are worth optimizing for cost.
@alex_casalboni You should always right-size functions that use provisioned concurrency because the uptime cost is tied to memory allocation.
For 1 unit of provisioned concurrency:
128MB function = $1.40 per month
10GB function = $111.61 per month
scary when you can be 2 orders of magnitude wrong!
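The math checks out if you apply the provisioned concurrency rate. A quick sanity check (the $0.0000041667/GB-second us-east-1 rate and the 31-day month are my assumptions, current at the time of writing):

```python
# Provisioned concurrency uptime cost (us-east-1 rate at the time of writing)
PC_RATE = 0.0000041667               # $ per GB-second of provisioned concurrency
SECONDS_PER_MONTH = 31 * 24 * 3600   # a 31-day month

def pc_uptime_cost(memory_mb: float, concurrency: int = 1) -> float:
    """Monthly cost of keeping `concurrency` instances provisioned."""
    return (memory_mb / 1024) * concurrency * PC_RATE * SECONDS_PER_MONTH

print(f"${pc_uptime_cost(128):.2f}")    # ~$1.40
print(f"${pc_uptime_cost(10240):.2f}")  # ~$111.60
```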
@alex_casalboni You should also optimize functions that are executed often (e.g. millions of times a month) and: 1. have a long avg execution time, and/or 2. have a high memory allocation
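To see why these are the functions that matter, here's a back-of-envelope example (assuming us-east-1 on-demand rates of $0.0000166667 per GB-second and $0.20 per 1M requests; the function itself is hypothetical):

```python
# On-demand Lambda cost (us-east-1 rates at the time of writing)
DURATION_RATE = 0.0000166667  # $ per GB-second
REQUEST_RATE = 0.20 / 1e6     # $ per request

# Hypothetical function: 10M invocations/month, 500ms avg duration, 1GB allocated
invocations, avg_duration_sec, memory_gb = 10_000_000, 0.5, 1.0
duration_cost = invocations * avg_duration_sec * memory_gb * DURATION_RATE
request_cost = invocations * REQUEST_RATE
print(f"${duration_cost + request_cost:.2f}/month")  # ~$85.33
```

Halve the memory here and you save ~$40/month on this one function (assuming the duration doesn't grow as a result); do the same on a $0.01/month function and the saving isn't worth mentioning.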
If you use cost allocation tags, you can find the functions with a high individual cost in AWS billing.
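For example, you can pull last month's Lambda spend grouped by a tag with the Cost Explorer API. A rough sketch (the `team` tag is hypothetical - use whatever cost allocation tags you have activated):

```python
import boto3

ce = boto3.client("ce")

# Last month's Lambda spend, grouped by a (hypothetical) "team" cost allocation tag
response = ce.get_cost_and_usage(
    TimePeriod={"Start": "2022-05-01", "End": "2022-06-01"},
    Granularity="MONTHLY",
    Metrics=["UnblendedCost"],
    Filter={"Dimensions": {"Key": "SERVICE", "Values": ["AWS Lambda"]}},
    GroupBy=[{"Type": "TAG", "Key": "team"}],
)

for group in response["ResultsByTime"][0]["Groups"]:
    tag = group["Keys"][0]
    cost = float(group["Metrics"]["UnblendedCost"]["Amount"])
    print(f"{tag}: ${cost:.2f}")
```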
@alex_casalboni If you're a @Lumigo customer, you can do this easily by sorting your functions by cost (descending) and looking at the top of the list.
Look for functions that: 1. have a high cost; and 2. have a low avg. memory usage (a telltale sign of over-allocation).
Another easy win is switching your functions to ARM (Graviton2). Per ms, it's about 20% cheaper compared to x86 functions.
But, you need to test your workload on ARM vs x86 and make sure there's no significant perf difference.
@alex_casalboni @Lumigo In the launch post, AWS mentioned 19% better performance in their tests.
Since the launch, there have been a lot of reports from others who found their workloads perform significantly worse on ARM. In the worst case, I've seen 60% longer exec time running on ARM.
@alex_casalboni @Lumigo IO-heavy functions are usually good candidates for ARM. These functions spend most of their time idle, waiting for API responses, so CPU cycles are wasted anyway, and CPU perf likely doesn't vary much in these cases either.
Take that 20% saving and say thank you :-)
@alex_casalboni @Lumigo Lastly, a counter-intuitive one... you can save money by using provisioned concurrency if you have a busy function and you know what you're doing!
PC has an uptime cost, but its duration cost is ~40% cheaper than on-demand ($0.0000097222 vs $0.0000166667 per GB-second in us-east-1).
@alex_casalboni @Lumigo If the provisioned concurrency is kept busy ~60% of the time, then you reach break-even, and beyond that, cost savings.
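Here's where the ~60% comes from, using the us-east-1 rates (my numbers, current at the time of writing):

```python
# All rates in $ per GB-second (us-east-1 at the time of writing)
ON_DEMAND = 0.0000166667    # on-demand duration
PC_UPTIME = 0.0000041667    # provisioned concurrency uptime
PC_DURATION = 0.0000097222  # duration when running on provisioned concurrency

# u = utilization, i.e. the fraction of time a PC instance is busy
# on-demand cost per GB-s of wall clock: ON_DEMAND * u
# PC cost per GB-s of wall clock:        PC_UPTIME + PC_DURATION * u
# break-even is where the two are equal
u = PC_UPTIME / (ON_DEMAND - PC_DURATION)
print(f"break-even utilization: {u:.0%}")  # ~60%
```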
Risky approach - there's a steep penalty if you get it wrong because of the uptime cost. I wouldn't recommend this; use PC for eliminating cold starts instead (as intended).
@alex_casalboni @Lumigo If you prefer reading this in a long-form blog post instead, you can find it here:
Great question from my current cohort of students, paraphrased:
"Should you always use Step Functions to chain together a few Lambda functions? Are there patterns to simplify this? How about using SQS between the functions?"
Here are my thoughts 🧵
Firstly, on the broader topic of orchestration vs choreography, I've written my thoughts before. TL;DR is that I prefer orchestration for intra-service workflows, and use events for inter-service communication.
As the Lambda service becomes more mature and fully featured, there's also more confusion around when to use these new features. Function URL came up in a conversation today, so let's talk about that!
🧵
Let me start by saying that I'm quite excited by its release and I think it's great that it's now an option.
But I also think it shouldn't be the default for most of the people who are using Lambda today.
The no. 1 question I get about #serverless is around testing - how should I test these cloud-hosted functions? Should I use local simulators? How do I run these in my CI/CD pipeline?
Here are my thoughts on this 🧵
There's value in testing YOUR code locally, but don't bother with simulating AWS locally - it's too much effort to set up and too brittle to maintain. I've seen many teams spend weeks trying to get localstack running and then waste even more time whenever it breaks in mysterious ways 😠
Much better to use temporary environments (e.g. one for each feature, or even each commit). Remember, with serverless components you only pay for what you use, so these environments are essentially free 🤘
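This way, your tests exercise the real thing instead of a simulator. A minimal sketch of what such a test can look like (the function name and stage are hypothetical):

```python
import json
import boto3

lambda_client = boto3.client("lambda")

def test_add_item_to_cart():
    # Invoke the real function deployed to this branch's temporary stage
    response = lambda_client.invoke(
        FunctionName="my-service-feature-x-add-to-cart",  # hypothetical name
        Payload=json.dumps({"cartId": "123", "itemId": "456"}),
    )
    result = json.loads(response["Payload"].read())
    assert response["StatusCode"] == 200
    assert result["cartId"] == "123"
```

Tear the stage down when the branch is merged and you've paid pennies, if anything at all.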
If you want to learn about the internal details of Lambda, then check out @MarcJBrooker's session "Deep dive into AWS Lambda security: Function isolation"
"For amazon.com we found the "above the fold" latency is what customers are the most sensitive to"
This is an interesting insight: not all service latencies are equal, and improving the overall page latency might actually end up hurting the user experience if it negatively impacts the "above the fold" latency as a result. 💡
This is far more complex than the most complex CD pipeline I have ever had! But just because it's complex doesn't mean it's over-engineered. Given the blast radius, I'm glad they do releases carefully and safely.
If you look closely, beyond all the alpha, beta, and gamma environments, it's one box in a region first, then the rest of the region - starting, I assume, with the least risky regions first.