2️⃣ After I finished it I started playing around with Keras and scikit-learn to build a model of my guitar amplifier.
It took 72 hours of recording, 250+ hours of training, and 6 months of experimenting to conclude - ML algorithms like NN won't produce anything useful.
3️⃣ I've learned a lot about ML from these sources:
A. L. Samuel - Some studies in machine learning using game of checkers
K. Gurney - An introduction to neural networks
J. N. Nilsson - Introduction To Machine Learning
J. Patterson in A. Gibson - Deep Learning
4️⃣ And from these ones:
T. Kam Ho - Random decision forests
T. Hastie, R. Tibshirani in J. Friedman - The Elements of Statistical Learning.
Springer
Keras documentation
scikit-learn documentation
5️⃣ We started our startup in 2017 with a project for some industrial optimization.
I've learned GA (genetic algorithms) on the fly to solve the problem.
After that, we started @typlessAPI - I used Keras to build initial data extraction models for invoices.
6️⃣ Results weren't very promising. I experimented with algorithms.
Using scikit-learn results were much better.
But we wanted more.
7️⃣ During that time I used these resources a lot:
* https://machinelearningmastery. com
* https://www.pyimagesearch. com
* https://towardsdatascience. com
8️⃣ Soon after that @galgiacomelli and me - build the first custom ML library for metadata extraction from invoices (invoice number, total amount, ...).
It was changed and redesigned numerous times.
Still not good enough.
9️⃣ About 1 year ago we built the initial release of a library for data extraction from any document - including line items/tables.
It becomes stable after 6 months of hard work.
Now I mostly deploy and monitor models.
1️⃣0️⃣ During that time we used mostly:
* scientific papers
* whiteboard & pen
1️⃣1️⃣ Takeaways:
* just start (Udemy, FreeCodeCamp, ...)
* find a problem to solve (classify feelings of your girly/boy, a recommendation system for your friend's online store, classify your comments on Reddit, ...)
* be creative
* NNs are not a good fit for all problems
• • •
Missing some Tweet in this thread? You can try to
force a refresh
It reads and writes almost like the English language. You need a small amount of code to get a job done. You'll probably rewrite it at least 3 times.
2️⃣ 1st class citizen on cloud platforms
You don't want to deal with physical servers in a startup. You just want to run your app. Python has cloud SDK libraries, it runs on serverless platforms, it's used in ML services, a lot of examples in cloud docs are using Python.