Gwen (Chen) Shapira Profile picture
Co-founder of @niledatabase. Making SaaS global, elastic and chill. Find me at: https://t.co/uyuHg400cp
Michael Hood Profile picture 1 subscribed
Mar 19 13 tweets 4 min read
If you are using Postgres for embeddings, JSON or large text columns, or all of the above... then you are using TOAST, whether you know it or no.
TOAST can have massive impact on performance. So lets talk about it a bit: 🧵 Postgres has 8K pages, and you can't spread a row across pages. In fact, Postgres prefers to leave a bit of room for additional versions of rows or additional rows.

So if a row exceeds 2K (modifiable TOAST_TUPLE_THRESHOLD), Postgres is going to try and do something about it. Image
Jun 19, 2023 4 tweets 2 min read
I've ran into quite a few social posts that conflate relational databases, SQL, transactions and ACID. This is wrong in the sense that these are different concepts and you can mix and match them in different ways.

But turns out that historically, RDBMS predates SQL and ACID by… twitter.com/i/web/status/1… The intersection between transactions and ACID is especially interesting.

You can have ACID without transactions (BerkleyDB had ACID guarantees on single statements).

But once you had transactions, they interact with all letters.

Atomic: fully committed or fully rolled back… twitter.com/i/web/status/1…
Sep 27, 2022 8 tweets 3 min read
I didn't expect to attend in-person conferences this year. It is hard to justify taking time away from a very new company and product.

But @QConSF is convincing enough that I'll make an exception.

The talks I'm looking forward to the most: 🧵 Microservices: The whole track looks great, but Orchestration vs Choreography by @anatomic and Enabling Data Mesh by @zhamakd are topics that I've been interested in for years and would love to discover new perspectives.
Jan 23, 2022 11 tweets 2 min read
Future of Data prediction from @sriramsubram: "One DB will never happen. One platform will".
My thoughts: When I started my career, the DB world was pretty clear - You had OLTP database for running transactional workloads, you had a DWH for analytical workloads and your had ETL/ELT in between.
Nov 7, 2021 9 tweets 2 min read
One of the ways developers get in trouble is when they try to outsmart the framework they are using.

The framework has some limitations, and they work hard to get around these limitations without understanding what they are trading off when they do this. Example! You have a Kafka producer, writing events about a large set of entities to a topic. You want events about each entity to be processed in order. And you also want some degree of parallelism between the entities.
Oct 10, 2021 6 tweets 1 min read
Some advice on the enterprise SKU where you need to contact sales. But only very little, because I know very little about enterprise sales. First, if your target persona is sales-averse (all developers), you'll win by minimizing the situations they have to talk to a salesperson. Do you really need them to contact sales, or maybe just provide an option? "For discounts, contact sales" may be better.
Oct 9, 2021 9 tweets 2 min read
I am not sure SaaS companies understand how challenging and important SKUs are. A lot of what I see on pricing pages seems completely random. A thread. 1/ Why are they important? First, they are critical for landing customers. Each target user needs to look at your pricing page and go, "Yes. This SKU has the features I need, at a price that I am willing to pay".
Oct 9, 2021 4 tweets 1 min read
I noticed that my communication style tends to drift toward the style of the leader I work most closely with. Is it a weird trait, or someone everyone does? When I worked with someone who went "mmm.... not exactly, this is a bit subtle", I also softened my disagreements. Now I work with someone comfortable saying "No" as a complete sentence, and I'm doing this more. A year ago, I'd have told you that this isn't my style.
May 4, 2020 4 tweets 1 min read
Trying to square the whole "developers are kingmakers" and developer-marketing/evangelism trends with the observed reality in which developers are terrible at getting budget for things that make their own lives better. It is surprisingly common to hear "I'm stuck managing my own Kafka, even though it is a waste of my time, and even though I'm pretty bad at it because there is no budget for a managed service".
Mar 17, 2020 6 tweets 2 min read
I'm thinking a lot recently about building kickass Kafka applications. I don't think we've spent enough time as a community discussing this and sharing best practices. Running @confluentcloud means that we are seeing lots of clients. So, what makes Kafka apps awesome? 1. Resilience: Kafka can be slow to respond, it can return errors, requests can time out, requests can be retried infinite times, responses can be garbage, etc. You need to think through the desired behavior under error conditions and write apps that will survive.
Feb 21, 2020 23 tweets 5 min read
Live tweeting @AnnaPovzner talk on Quotas in @apachekafka Proud to have her in my team ❤️
Nov 12, 2019 13 tweets 3 min read
Not getting results you want from data lake? You may think of trying data lake in the cloud - but it's the same paradigm with same issues. @zhamakd at #QConSF The problem is that the layers in data lake design map to technical components. This doesn't work - change is orthogonal.
Nov 12, 2019 18 tweets 5 min read
#QConSF - @criccomini shares that WePay data infrastructure is based on Airflow, Kafka and BigQuery. The roadmap to data maturity
Dec 11, 2018 14 tweets 2 min read
At @apachekafka meetup in Singapore and Agoda, an online travel agency is talking about their Kafka architecture. Should be good! 5bn messages a day, 190TB a day. 9000+ services. 5 data centers. 230 Kafka brokers.
Sep 27, 2018 12 tweets 3 min read
LinkedIn sharing their streaming architecture at #strangeloop Explaining the importance of my favorite pattern: stream-table join.