AI (and Robotics) From Scratch in 2026
Table of Contents
How quickly this moves
Start by reading AI 2027 written by a former OpenAI researcher on the kinds of agents we should expect to see every few months. This has been accurate up to q2 2026 since it was written.
Curriculum
Machine Learning is the overall field and inside it exists Neural Networks, inside that LLMs, Computer Vision, etc.
Applied AI:
- 10-202 Intro to Modern AI so you can modify any open source LLM as this will take off soon with new hardware. You can easily get paid insane amounts of money doing this right now.
- 10-714 DL Algorithms & Implementation teaches how to build your own PyTorch stack from scratch and trust me you want to use types and toss out Python into the dustbin of history because right now types are making agentic coding very simple to use and for AI-assisted "lightweight formal methods" to verify the code.
To understand these courses:
- Ziko Kolter's LA crash course
- Combine these with 3Blue1Brown Linear Algebra lectures on YouTube or NJ Wildberger's excellent LinAlg course which walks through everything at the geometric level.
- This single video Matrix Calculus from MIT's 2020 18.06 course on linear algebra which we will take here too as our first math lecture. Derivatives are covered and explained simply. There is a full Matrix calculus course but it's not needed at this applied level but we take it below.
- The notes from MIT's 6.390 Intro to ML
This is all you really need for those 2 courses as they're both mostly self-contained anyway. These are the best applied ML resources I've found anywhere in 2026 then just ask ChatGPTx/Grok/DeepSeek/Fable or whatever new agent to help you understand the rest and find more YouTube tutorials. If you want to go out into the world and sign up companies to "AI" (and get paid) or you want to hack around some LLM open source model project this is all you need.
The rest of this workshop is to understand the modern theory of AI because much of it is not available to the public anymore so I asked everyone I work with what's going on and everything below is what they came up with.
Theory of modern AI
Does there exist a unified theory of modern AI yet or something close to it?
- These 30 Lectures were recently put together by MIT's leading Neuroscience researcher by that I mean he's the world's most cited Neuroscience researcher. He used the assistance of a research paper Agentic Investigator. Give it a research topic and it gives you back an entire paper on that topic.
These lectures show current AI provides a generic mean or 'smoothing' effect on everything they generate. The human user has to provide the sparse logical jumps to escape the annoying politeness and homogenizing of scientific thought that AI defaults to. They also show compositionality which means a complex thing is constructed of many small reusable basic things and these are (as described in the prologue) mathematically implied to be sparse for the simple reason that we can compute them efficiently. Genericity is another trait of modern AI and means a learning function being defined by any set of coordinates is invariant to transformations and defining it in different spaces where you 'shift' the function doesn't really matter. There will be many lectures about this in detail and we will take them. AI also can't be sentient (yet) no matter what AGI marketing these companies claim. This is another theme of these lectures.
Things we have to take to understand these lectures:
- The mathematical model of Machine Learning which hasn't changed.
- Some optional short lectures from the perspective of theoretical neuroscience.
- An optional book Understanding Deep Learning
- All the Calculus/Algebra content below
- Maybe the Zhang book on statistical learning theory but depends what I can find, what I want is Poggio's 9.520 'draft book' that isn't available anywhere but we can always watch old lectures and take our own notes then go on a hunt for new papers. I may just skip all this and go on a tangent with the deep learning lectures because that's what matters right now in research.
Math for ML/Robotics
As per Tomaso Poggio in his 30 lectures on Deep Learning the vast majority of Real Numbers are uncomputable and undecidable so numerical approximations in the discrete world have to be done. The inputs are simply too gigantic, a tiny 3D model is 100 x 100 x 100 with 106 unknowns, so almost all linear algebra learned in undergrad is not used as now you live in the world of factoring (SVD), matrices have to be randomized (jack polynomials), eigenvalues are approximated, and preconditioning is needed as gradient descent is too slow because it 'zig-zags'.
The following is very similar to CMU's Math Foundations for Robotics course as well:
- Matrix calculus
- MIT Numerical Methods and large scale linear algebra.
- Modern Regression and Data Analysis because in applied ML you often find yourself trying to interpret data. We came this far why not be "advanced" students in the art of data analysis too (think quant finance).
Sensorimoto Learning (Robotics)
- 6.8200 Computational Sensorimoto Learning this guy is the leading researcher for "control learning" so we will learn the math model. There is many other math models for robotics but an AI will eventually discard them for it's own interpretation hence I chose this course just to get a taste of robotic learning. We are interested in that link to the full lecture notes.
Research you may want to do (Optional)
You can use the Agentic Investigator to help you research these fields by yourself.
Reverse engineering LLMS
Make sure agents are doing what they claim they are doing:
- Mechanistic Interpretability (Neel Nanda@Google DeepMind)
Many companies will be very interested if you can do this. This competition is still running as Apr 9 2026 but there will be more. Not everyone wants to use the expensive Anthropic or OpenAI agents sometimes a simple open source agent learning your codebase is good enough. Of course next year's agents will be so advanced we will have to use this year's agents to reverse engineer them but interpretability is still going as of June 2026.
Causality
Any future medical AI is going to need Causal AI models. If I do X what will happen to Y? To paraphrase Glenn Shafer the basic idea is to bring back the probability tree to represent a step-by-step evolution of an observer's knowledge. If that observer is nature, then the steps in the tree are causes, and the probabilities in the tree express nature's limited ability to predict the causes.
- Elements of Causal Inference Foundations and learning algorithms.
This is one of my research fields so there will be much more here as causality is the future of AI and will be very hard to train that's why you go on all those AI training sites and they are desperate for you to find causesand will pay you (for now) a lot of money to do so.
Conformal Prediction
Conformal Prediction or confidence intervals are also wide open to research for example you want to know how much money some junky API that Anthropic peddles like Claude Code is going to charge you to generate some slop. You can learn this using conformal prediction and write a tool.
- Algorithmic Learning in a Random World using classic techniques like support vector machines.
Game Theory AI
It's possible to completely throw out stochastic math and do statistics and probability purely in the field of game theory. Most human activities involve someone else and none of it is really random so if you want to make a pokerbot this is how you do it. This is my primary research area so I'll be doing lots of this here and causality.
- Game-Theoretic Foundations for Probability and Finance
Calculus
Look up the 3Blue1Brown calculus series on YouTube it's enough to get started on the applied LLM courses.
The symbolic calculus (scalar calculus of one variable) taught in first year colleges and universities is designed to bring up the students to a base level they think you need in order to pass the rest of your courses. They do this because the faculty can't trust your high school education and everyone arrives with poor algebra skills. Even if you take so-called 'Advanced Placement' Calculus in Grade 12 they have broken up the calculus sequence into multiple courses now so you will only place out of basic differentiation but are forced to take their integration and approximation courses or whatever they are called now.
Concretely what this means is the book Calculus: Early Transcendentals 9th ed by James Stewart though all the editions look roughly the same. You can search for this on github and get numerous full copies and instructor versions too with all the solutions to problems.
The chapters out of Stewart's book:
- Chapters 1-6
This is every school's first year introductory Calc I of basic differentiation and basic integration that runs for a single semester or 4 months. As mentioned earlier you can take AP Calc in high school and place out of this.
- Chapters 7 - 12.3
This is Calc II often called techniques of integration or integration and approximation and is the second semester course. There's some things that shouldn't exist here like partial fractions should be in another course and are stupid to learn here. Also 'improper integrals' don't actually exist it's a failing of the Riemann integral which was replaced over 100+ years ago.
- Chapters 13 - end
Calc III often called vector calculus or multivariable calculus. This is the same calculus you already know just generalized to higher dimensions. In scalar calculus the derivative is a zooming function on a curve and you look at the points around the point you zoomed which appear to be a straight line and it's the tangent. In vector calculus the tangent is now an entire plane or flat 2D surface.
We won't be doing this book.
Banach spaces
Modern machine learning is theoretically done with Banach spaces where you get a vector space with measurement. This is to promote sparsity (weights that are exact zeros) while classical machine learning relies on Hilbert spaces (inner product norms). Then you have to convert all this to the real world of computer hardware using numerical methods.
We should just skip all of vector calculus and proceed in a direct path to Banach spaces. There's a free book for this "TBB-Dripped" or Elementary Real Analysis by Thomson-Bruckner the dripped version or 'Dump the Riemann Integral Project' version. Many concepts here will come up in the Tomaso Poggio Deep Learning lectures and for any probability material. It looks like a large book but half way through any undergrad math text you pick up speed as the problems become too easy to solve.
Linear Algebra
There's a concept of groups and inside those are an abelian group with the ability to perform 'actions' over a ring structure that has a special case of being a field. This is the space of vectors and linear maps. A book exists to teach all this from scratch by Paolo Aluffi called Algebra: Chapter 0. There are many other great undergrad linear algebra texts but what is the point in taking them if we are only using numerical analysis (SVD) and matrix calculus thus throwing out most of their content. We may as well learn how it all works at the big picture level plus group theory structures and symmetry in ML is another emerging field right now.
I will create my own Linear Algebra course
- NJ Wilberger geometrical demonstrations of transformations
- The SVD 'compact version' to avoid block matrices and Edelman's 18.06 MIT course
- The entire field described in category theory so you understand what's going on at the overview level now everything makes sense