Python 🐍 + Data Science 🚀 trainer @__metasnake__
🦜 Speaker
✍ Author
👨🏫 Instructor (@Stanford)
📣 DM for Sponsorship
4 subscribers
Apr 19 • 6 tweets • 2 min read
XGBoost is a popular machine learning algorithm that provides excellent performance out of the box for various types of machine learning problems, including regression and classification.
However, one common issue with XGBoost is overfitting, leading to poor generalization and inaccurate predictions on new data.
Overfitting occurs when the model learns the training data too well, including the noise and randomness, and cannot generalize to new data.
Mar 15 • 10 tweets • 3 min read
Big O notation (pronounced Big-Oh) is a way to discuss how long an algorithm takes to run. It is also called "runtime complexity".
Understanding this is key for technologists.
Let's look at it.
1/
If we want to calculate the mean of a sequence, a naive implementation will calculate the sum and divide it by the length:
2/
Mar 6 • 7 tweets • 2 min read
A thread about pandas... (The library not the animal)
🐼🧵
I like to describe pandas:
⋅ Like Excel
⋅ Like SQL
Things that you would do with Excel or SQL, you can program in pandas
Feb 8 • 13 tweets • 4 min read
I often teach about Decorators in Python.
Many know how to use them, but few can write them.
These are tricky because nested functions make our brains hurt.
Here are some hints for grokking them.
1/
In short, decorators allow you to inject orthogonal behavior before or after a function is executed.
But my favorite decorator definition is related to the construction and will help you easily create them: A callable that takes a callable and returns a callable.
2/
Oct 13, 2023 • 13 tweets • 4 min read
I often teach about Decorators in Python.
Many know how to use them, but few can write them.
These are tricky because nested functions make our brains hurt.
Here are some hints for grokking them.
1/
In short, decorators allow you to inject orthogonal behavior before or after a function is executed.
But my favorite decorator definition is related to the construction and will help you easily create them: A callable that takes a callable and returns a callable.
2/
Sep 21, 2023 • 13 tweets • 4 min read
I often teach about Decorators in Python.
Many know how to use them, but few can write them.
These are tricky because nested functions make our brains hurt.
Here are some hints for grokking them.
1/
In short, decorators allow you to inject orthogonal behavior before or after a function is executed.
But my favorite decorator definition is related to the construction and will help you easily create them: A callable that takes a callable and returns a callable.
2/
Sep 7, 2023 • 15 tweets • 4 min read
Here are is a common Pandas operation with a twist. 🐼
I'll present frequent suggestions, and then better suggestions.
🧵
Here's our data, the 2020 Python developers survey from
@JetBrains
.
I'll just use a subset so it will fit in tweets better.
Aug 16, 2023 • 6 tweets • 2 min read
A reader asked how to split a single (tuple) column into multiple columns in a chain...
Let's go. 🐼💪
Let's recreate their data:
Aug 2, 2023 • 9 tweets • 2 min read
Recently, I've been working on a new course for data analytics. This Python code stands out for its ability to turn raw sales data into insightful visualizations. I want to share why this code is crucial and how it could help transform your data analysis process.
The code helps visualize sales data, focusing on different countries, by using Pandas and Matplotlib. It transforms sales data into weekly aggregates, filtering out UK data, and highlights specific countries of interest (in this case, Finland).
Jul 18, 2023 • 7 tweets • 2 min read
A thread about pandas... (The library not the animal)
🐼🧵
I like to describe pandas:
⋅ Like Excel
⋅ Like SQL
Things that you would do with Excel or SQL, you can program in pandas
Apr 27, 2023 • 13 tweets • 5 min read
I often teach about Decorators in Python.
Many know how to use them, but few can write them.
These are tricky because nested functions make our brains hurt.
Here are some hints for grokking them.
1/
In short, decorators allow you to inject orthogonal behavior before or after a function is executed.
But my favorite decorator definition is related to the construction and will help you easily create them: A callable that takes a callable and returns a callable.
2/
Apr 27, 2023 • 4 tweets • 2 min read
Here's what I'm running on my @elgato Stream Deck.
Media controls, pomodoro timer, local time, Zoom mute, and (two buttons) for Zoom Exit (need to hit Return after hitting the Run button 🤷♀️)
World times (useful for client work and meetings).
Application launchers. (Some of these should be launched at startup on Windows... 🤔Looking at you TPFanControl and PowerToys.)
Apr 20, 2023 • 6 tweets • 2 min read
Inspired by @svpino and @dvassallo, I want to be more deliberate with my tweeting. To start that off, I wanted to understand (quantitatively) how I tweet.
And that is the essence of "Applied Pandas: Twitter Analytics". I show how to access your Twitter data, set up an environment, and perform exploratory data analysis on it. 📉
Apr 20, 2023 • 8 tweets • 3 min read
Suggestions for writing better Python code:
📝Use an Editor that gives feedback
✅Format consistently
🧪Test your code
🦆Use type hints
📚Learn Python idioms
Editors like @pycharm, @code can give you quick insight into coding issues.
One of the most important choices you have in life is your work. 👷
How do you decide where to work?
What to do?
How do you get your first job?
When to start looking for a new job?
🧵
Most people want a job that has both satisfying work and good perks...
Apr 4, 2023 • 7 tweets • 3 min read
A thread about pandas... (The library not the animal)
🐼🧵
I like to describe pandas:
⋅ Like Excel
⋅ Like SQL
Things that you would do with Excel or SQL, you can program in pandas
Apr 3, 2023 • 6 tweets • 2 min read
Inspired by @svpino and @dvassallo, I want to be more deliberate with my tweeting. To start that off, I wanted to understand (quantitatively) how I tweet.
And that is the essence of "Applied Pandas: Twitter Analytics". I show how to access your Twitter data, set up an environment, and perform exploratory data analysis on it. 📉
Dec 16, 2022 • 10 tweets • 3 min read
Big O notation (pronounced Big-Oh) is a way to discuss how long an algorithm takes to run. It is also called "runtime complexity".
Understanding this is key for technologists.
Let's look at it.
1/
If we want to calculate the mean of a sequence, a naive implementation will calculate the sum and divide it by the length:
2/
Dec 14, 2022 • 6 tweets • 2 min read
A reader asked how to split a single (tuple) column into multiple columns in a chain...
Let's go. 🐼💪
Let's recreate their data:
Oct 18, 2022 • 5 tweets • 2 min read
Real world Pandas is a little more complicated than just one-off operations to change one thing with your data.
If you write a recipe for your data, it will ease understanding the process you go through.
Here's a recipe to clean up the Ames housing dataset.
Oct 17, 2022 • 8 tweets • 3 min read
Upcoming training:
Oct:
🏭Python in Production Course
Nov:
🐼Intermediate Pandas Course
🐻Intro to Polars Workshop
📈Seaborn Objects Workshop
📊Fundamentals of Data Analysis with Pandas Course
🧪Beginning Pytest Workshop
👩🔬Advanced Pytest Workshop
Best practices for production Python code: