Tweet

How to get URL link on X (Twitter) App

On the Twitter thread, click on or icon on the bottom
Click again on or Share Via icon
Click on Copy Link to Tweet
Paste it above and click "Unroll Thread"!
More info at Twitter Help

David Andrés 🤖📈🐍

@daansan_ml

Sep 18 • 12 tweets • 3 min read Twitter logo

Read on Twitter

𝗨𝗻𝗱𝗲𝗿𝘀𝘁𝗮𝗻𝗱𝗶𝗻𝗴 𝘁𝗵𝗲 𝗜𝗺𝗽𝗼𝗿𝘁𝗮𝗻𝗰𝗲 𝗼𝗳 𝗟𝗼𝗴 𝗥𝗲𝘁𝘂𝗿𝗻𝘀 𝗶𝗻 𝗙𝗶𝗻𝗮𝗻𝗰𝗲 💰

Why Log Returns and not a simple price difference?

Let's develop this a bit more.

🧵 👇

In finance, one of the most crucial types of data is price information.

However, when it comes to Time Series analysis, using raw prices can be problematic.

Let's see why log returns are preferred over simple price differences and their significance in financial modeling.

𝗧𝗵𝗲 𝗣𝗿𝗼𝗯𝗹𝗲𝗺 𝘄𝗶𝘁𝗵 𝗡𝗼𝗻-𝗦𝘁𝗮𝘁𝗶𝗼𝗻𝗮𝗿𝘆 𝗗𝗮𝘁𝗮

In Time Series analysis, non-stationary data can introduce a lot of noise and make it difficult to identify underlying patterns.

👉 This is why raw prices are generally not used in financial analyses.

𝗔𝗯𝘀𝗼𝗹𝘂𝘁𝗲 𝘃𝘀. 𝗥𝗲𝗹𝗮𝘁𝗶𝘃𝗲 𝗖𝗵𝗮𝗻𝗴𝗲

You might think that using the difference between prices could solve this issue.

This approach only provides the absolute change and lacks information on the relative or percentage change, which is often more insightful.

𝗟𝗼𝗴 𝗥𝗲𝘁𝘂𝗿𝗻𝘀

This is where log returns come into the picture.

Log returns not only capture the relative change but also offer two additional advantages:

1️⃣ Time Additivity
2️⃣ Statistical Properties

1️⃣ 𝗧𝗶𝗺𝗲 𝗔𝗱𝗱𝗶𝘁𝗶𝘃𝗶𝘁𝘆

Log returns are time-additive. You could simply sum the log returns for two consecutive periods to get the log return for the combined period.

This property is incredibly useful for simplifying analyses and computations in time series modeling.

2️⃣ 𝗦𝘁𝗮𝘁𝗶𝘀𝘁𝗶𝗰𝗮𝗹 𝗣𝗿𝗼𝗽𝗲𝗿𝘁𝗶𝗲𝘀

Log returns tend to be more normally distributed than simple returns, especially when returns are high.

The assumption of normality is foundational to many financial theories and models.

The formula for calculating log returns, using the natural logarithm ( 𝑙𝑛 ), is:

𝑅ₜ = 𝑙𝑛( 𝑃ₜ / 𝑃ₜ₋₁ )

Where:
- 𝑅ₜ is the log return at time ( 𝑡 )
- 𝑃ₜ is the price at time ( 𝑡 )
- 𝑃ₜ₋₁ is the price at the previous time period

✦ The log return is the natural logarithm of the ratio of the price at time ( 𝑡 ) to the price at the previous time period ( 𝑡-1 ).

Using log returns:

• simplifies mathematical modeling

• makes time series analysis more straightforward

• aligns well with the statistical assumptions made in financial theories

You can read more about this in my article:

mlpills.dev/time-series/xg…

You should also join our newsletter, DSBoost🚀

Every week we share:
🔹Interviews
🔹Podcast notes
🔹Learning resources
🔹Interesting collections of content

Subscribe for free👇👇
dsboost.dev

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @daansan_ml

David Andrés 🤖📈🐍

@daansan_ml

Sep 17

Feature encoding is key, discover 𝗢𝗻𝗲 𝗛𝗼𝘁 𝗘𝗻𝗰𝗼𝗱𝗶𝗻𝗴.

A very useful technique when you don't have many distinct values in a column.

Find out more about it 🧵 👇

It converts each unique category into a new binary column of 1 or 0.

🔧 When should you use it?
For nominal categories where no ordinal relationship exists.

🟢 Pros:

• Easy to use and interpret.

• No ordinal relationships are introduced.

Read 5 tweets

David Andrés 🤖📈🐍

@daansan_ml

Sep 15

Feature encoding is key for many models.

The most basic technique is 𝗟𝗮𝗯𝗲𝗹 𝗘𝗻𝗰𝗼𝗱𝗶𝗻𝗴.

Find out more about it 🧵 👇

In this technique, each unique category is mapped to an integer starting from 0.

It does not assume any relationship of order or magnitude between the categories → categories are numbered arbitrarily.

🔧 When should you use it?

It is best suited for ordinal data where the order matters but can be used for nominal data when the algorithm can handle it correctly (e.g., decision trees).

Read 6 tweets

David Andrés 🤖📈🐍

@daansan_ml

Sep 10

You want to forecast the price of the EUR/USD pair.

You could definitely use the price during the previous days, but what if you could improve that? 🤔

Discover what else you can use here 🧵 👇

0️⃣ As mentioned, we could and should use the previous prices of this currency pair. Of course, we should convert it to Log Returns first (see my article in the last tweet) to make them stationary.

But that's not sufficient, many other variables influence EUR/USD price.

The EUR/USD price is your endogenous variable.

These additional variables, also called exogenous, can help you make more accurate forecasts 👇

Read 11 tweets

David Andrés 🤖📈🐍

@daansan_ml

Sep 5

Find out more about another feature scaling technique:

✨Standard Scaling or Z-score Normalization✨

🧵 👇

In this case, features are scaled so that they have the properties of a standard normal distribution with mean μ=0 and standard deviation σ=1.

🔧Use it when the algorithm assumes that the distribution of your features is Gaussian.

This method is also useful as a general technique when you don't know the distribution of your feature and you're not particularly concerned about robustness to outliers.

Read 7 tweets

David Andrés 🤖📈🐍

@daansan_ml

Sep 3

Discover one of the most used feature scaling techniques:

✨Min-Max Scaling✨

🧵 👇

This is the simplest form of normalization.

👉 The idea is to scale the range of each feature (like age, salary, etc.) so that they all fit within a specific range, usually between 0 and 1. This can make it easier for machine learning algorithms to learn from the data.

🔧 Use it when the distribution of the feature is not Gaussian and you need values in a bounded interval. However, this method is sensitive to outliers.

Read 7 tweets

David Andrés 🤖📈🐍

@daansan_ml

Aug 9

ARIMA is one of the most popular traditional statistical methods used for time series forecasting.

THREAD 🧵 👇

ARIMA stands for Auto-Regressive Integrated Moving Average.

It is composed of 3 components:

🔹 Auto-Regressive (AR)
🔹 Integrated (I)
🔹 Moving Average (MA)

1️⃣ Auto-Regressive (AR) models use a linear combination of past values of the variable of interest.

They are described by the parameter "p", which refers to the number of previous values to consider for the forecast.

Read 7 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Enter URL or ID to Unroll

David Andrés 🤖📈🐍

Try unrolling a thread yourself!

More from @daansan_ml

David Andrés 🤖📈🐍

David Andrés 🤖📈🐍

David Andrés 🤖📈🐍

David Andrés 🤖📈🐍

David Andrés 🤖📈🐍

David Andrés 🤖📈🐍

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!