Tweet

Peter Liu ⚡️

@NewOrleansVC

Mar 28 • 10 tweets • 2 min read Twitter logo

@RevelryVC

Open sourcing @RevelryVC's AI Data Strategy DD Checklist.

1/ Data is the fuel for building AI so we're sharing our "AI Data Strategy" DD Checklist to gather feedback and insights to improve our evaluation process and provide value to other investors and founders. #AllInAI

2/ We've identified 7 key areas to focus on when evaluating a startup's data strategy. These areas are critical to ensure that the AI system is effective and that the company's approach aligns with its overall goals.

Here are the 7 areas of our DD process:

3/ Data Acquisition: The process of collecting, sourcing, and obtaining relevant data. A strong data acquisition strategy ensures a diverse and reliable dataset that accurately represents the target use case and can be a critical moat over time.

4/ Data Preparation & Preprocessing: Involves cleaning, transforming, and organizing raw data into a format that can be easily used by AI models. A robust preprocessing pipeline enhances data quality, leading to better model performance.

5/ Data Labeling: The process of annotating data with relevant labels or tags. High-quality labeling ensures that AI models can learn effectively from the input data, resulting in more accurate predictions.

6/ Data Storage & Management: The infrastructure and practices related to securely storing, managing, and maintaining data. Effective data management helps ensure compliance with regulations, data protection, and traceability.

7/ Data Augmentation: Techniques that increase the size and diversity of datasets by creating new, modified instances of existing data. Proper augmentation can lead to improved model performance and generalization.

8/ Data Privacy: Measures taken to protect sensitive information and ensure compliance with relevant privacy regulations. Strong privacy practices help build user trust and mitigate potential legal risks.

9/ Tradeoffs & Overall Strategy: The balance between various data strategy components and their alignment with the startup's business goals and market needs. A flexible and adaptable strategy can accommodate changes in the market or technology landscape.

10/ Here's a link to our checklist. Looking forward to getting feedback and refining this over time.

docs.google.com/spreadsheets/d…

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @NewOrleansVC

Peter Liu ⚡️

@NewOrleansVC

Mar 30

twitter.com/i/web/status/1…

1/ Day 7 of #AllInAI: Generative Adversarial Networks (#GANs)

Yesterday, we explored synthetic data, which led us to GANs - the tech behind creating realistic synthetic data. After learning more, there are several other real world applications that have come from GANs research… twitter.com/i/web/status/1…

twitter.com/i/web/status/1…

2/ 👊GANs, or Generative Adversarial Networks, are a type of AI model architecture/technique. They consist of two separate neural networks, a Generator and a Discriminator, that are trained together in a process called Adversarial Training. The goal is to generate new, realistic… twitter.com/i/web/status/1…

twitter.com/i/web/status/1…

3/ 🤖✍️ Imagine a GAN model generating realistic human signatures. The Generator creates a random signature, while the Discriminator classifies it as "real" or "fake." They improve over time, and eventually, the Generator creates signatures that look authentic & are hard to… twitter.com/i/web/status/1…

Read 8 tweets

Peter Liu ⚡️

@NewOrleansVC

Mar 29

1/ Day 6 of #AllinAI 🧪 Introducing #SyntheticData - an incredibly powerful tool that's quickly becoming a must-have for AI teams. Synethic data can help companies overcome some of the key issues of AI & data collection, such as privacy, biases, and data scarcity.

2/ 💡 What is Synthetic Data? It's artificially generated data that mimics the characteristics of real-world data. It's created using algorithms, simulations, and generative models like GANs (which pit AI vs. AI to create and authenticate "fake" data).

3/ 📈 Why companies should consider Synthetic Data in their AI stack:

1. Reduces data privacy concerns
2. Helps overcome data scarcity
3. Enhances dataset diversity
4. Reduces biases in data
5. Facilitates edge cases testing
6. Speeds up model development

Read 9 tweets

Peter Liu ⚡️

@NewOrleansVC

Mar 28

@DeepMind

Day 5: Google's @DeepMind.

🧠 While @OpenAI grabs headlines, we wanted to dig into @DeepMind, one of the OG AI research institutions acquired for ~$500M by @Google (a steal!). Its breakthroughs have made a huge impact on the AI field. Let's take a look! 🚀 #AllInAI (1/9)

@DeepMind

🎮 @DeepMind's AlphaGo made history in 2016 by defeating Lee Sedol in Go. Combining deep learning & reinforcement learning with Monte Carlo Tree Search, AlphaGo soon learned to play complex games entirely through self-play, without human guidance. (2/9)

@DeepMind

🎼 WaveNet, another @DeepMind invention, uses a convolutional neural network (CNN) to generate realistic human-like speech. It powers text-to-speech applications like @Google Assistant and has pushed the boundaries of speech synthesis, music generation, and voice cloning. (3/9)

Read 9 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Peter Liu ⚡️

People who liked this thread also liked...

Try unrolling a thread yourself!

More from @NewOrleansVC

Peter Liu ⚡️

Peter Liu ⚡️

Peter Liu ⚡️

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!