Neelesh Salian 💻 Profile picture
Dec 3, 2022 7 tweets 2 min read Read on X
How is #DuckDb going to be used inside a company? My head can't think beyond it being used for local development. Can it really replace a DWH without being distributed?

If it does go beyond local, I do have a few things to ponder upon, 🧵
1. Search and Discovery: Where are users going to find artifacts inside the DuckDB cluster (eventually)? We'll need a catalog interface to surface this portion. The metadata of the artifacts in the data model (database, table) needs to be consistent across the cluster.
2. How is privacy handled? What if someone downloads sensitive data onto their laptop and it is stolen thus opening up a data breach? My paranoia comes from working on Data infra for GDPR, and CCPA. Access control should be implemented and enforced.
3. Chargeback: How are metrics tracked and thus the tracking of costs done in this environment? Assuming a cloud vendor for the provisioning, strong metadata is key here to know that costs are in check. What instance type works best for deployment is going to matter here.
4. Failover: How do we recover from instances falling over? Replication with a single leader assuming a single DC or multi-leader with multiple DCs. These need to be done carefully to avoid consistency issues. What do we sacrifice here in CAP?
5. Authz and Authn: How are these integrated into the cluster? Checking authorization and identification is important in a distributed environment.
I'm sure there are more things to think about than what I have here. Do add anything here in this thread. #datawarehouse

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Neelesh Salian 💻

Neelesh Salian 💻 Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @neelesh_salian

Apr 28, 2023
What if you could blow up your data infrastructure and start greenfield?

For the past few weeks, I've been writing this series that gives a high-level view of what it takes to build out a full Data Platform for your team.

Here's your guide for building that Data Platform. 🧵 Image
Read 12 tweets
Apr 27, 2023
The last and often neglected piece of the Data Platform is the Business Intelligence layer. It's what makes the business make sense.

Continuing in the Data Platform building series, let's talk about Business intelligence.

💫 Part 10: Business Intelligence: Image
To be transparent, my experience is very minimal in this area so I'm going to mostly refer to what I've seen and inferred from others. If you have insights and things to add, please do comment below.
✴ The purpose of this:
Why have a BI layer in the first place?
- Give the end user, an analyst, an executive, or a non-team stakeholder, a view into data.
- Visualization helps to see the data in front of you and powers business decisions.
Read 10 tweets
Dec 15, 2022
#NormConf is happening.
- First session, 7:30 AM EST - 12:00 AM EST / 12:30PM - 5:00 PM UTC:
- Second session, 12:00 PM EST - 5:00 PM EST / 5:00 PM - 10:00 PM UTC:
- Third session, 5:00 PM EST - 10:30 PM EST / 10:00 PM - 3:30 AM UTC:
Read 4 tweets
Oct 28, 2022
I did Netflix () earlier this week. Let's do @Twitter's tech and Data
This company might be popular for its platform at large but there were a lot of data industry pieces that are worth calling out. Here is a 🧵
Streaming Processing: Storm! This was streaming before streaming. It paved the way for a lot of streaming processing systems. Event processing at the earliest. blog.twitter.com/engineering/en…
That gave rise to Heron (blog.twitter.com/engineering/en…)
Read 12 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(