Have you had troubles or having troubles arranging your machine learning projects? This thread should give you some idea on how to arrange machine learning / deep learning projects. See the folder structure: 1/6 🔽
input/: This folder consists of all the input files and data for your machine learning project. If you are working on NLP projects, you can keep your embeddings here. If you are working on image projects, all images go to a subfolder inside this folder. 2/6
src/: We will keep all the python scripts associated with the project here. If I talk about a python script, i.e. any *.py file, it is stored in the src folder. 3/6
models/: This folder keeps all the trained models.
notebooks/: All jupyter notebooks (i.e. any *.ipynb file) are stored in the notebooks folder. 4/6
README..md: This is a markdown file where you can describe your project and write instructions on how to train the model or to serve this in a production environment. 5/6
LICENSE: This is a simple text file that consists of a license for the project, such as MIT, Apache, etc. 6/6
• • •
Missing some Tweet in this thread? You can try to
force a refresh
- Divide the training data into folds.
- Train a bunch of models: M1, M2.....Mn.
- Create full training predictions (using out of fold training) and test predictions using all these models. 2/4
- Till here it is Level – 1 (L1).
- Use the fold predictions from these models as features to another model.
This is now a Level – 2 (L2) model.
- Use the same folds as before to train this L2 model. 3/4
In this thread, I will show you how to train a deep learning based sentiment classification model using BERT. First, you need the IMDb dataset, you can grab it from kaggle.com/lakshmi25npath…. Save this as imdb.csv. 👇
1/5