AI (and Robotics) From Scratch in 2026
Table of Contents
How quickly this moves
Start by reading AI 2027 written by a former OpenAI researcher on the kinds of agents we should expect to see every few months. This has been accurate up to q2 2026 since it was written.
Curriculum
Applied AI:
- 10-202 Intro to Modern AI so you can modify any open source LLM as this will take off soon with new hardware. You can easily get paid insane amounts of money doing this right now.
- 10-714 DL Algorithms & Implementation teaches how to build your own PyTorch stack from scratch and trust me you want to use types and toss out Python into the dustbin of history because right now types are making agentic coding very simple to use and for AI-assisted "lightweight formal methods" to verify the code.
Almost everything in those courses is self-contained or you can ask Gemini/Grok/ChatGPTx to help you. The only math you really need is understanding the beginning of this single video Matrix Calculus from MIT's 2020 18.06 course on linear algebra which we will take here too as our first math lecture.
Towards a Unified theory of AI:
Does there exist a unified theory of modern AI yet or something close to it?
- These 30 Lectures were recently put together by MIT's leading Neuroscience researcher by that I mean he's the world's most cited Neuroscience researcher. He used the assistance of a research paper Agentic Investigator. Give it a research topic and it gives you back an entire paper on that topic.
These lectures show current AI provides a generic mean or 'smoothing' effect on everything it generates. The human user has to provide the sparse logical jumps to escape the annoying politeness and homogenizing of scientific thought that AI defaults to. They also show compositionality which means a complex thing is constructed of many small reusable basic things and these are (as described in the prologue) mathematically implied to be sparse for the simple reason that we can compute them efficiently. Genericity is another trait of modern AI and means a learning function being defined by any set of coordinates is invariant to transformations and defining it in different spaces where you 'shift' the function doesn't really matter. There will be many lectures about this in detail and we will take them.
AI also can't be sentient (yet) no matter what AGI marketing these companies claim. This is another theme of these lectures.
Things we have to take to understand these lectures:
- The mathematical model of Machine Learning which hasn't changed.
We will also have to incrementally take various lectures from 15-251 CMU's Great Ideas in Computer Science such as boolean functions and some complexity theory because deep learning is a deep topic touching many theoretical fields.
Math for ML/Robotics
Throw out almost all math learned in undergrad. As per Tomaso Poggio in his 30 lectures on Deep Learning the Real Numbers don't exist. Everything must be done by numerical analysis/approximations which means no limits, no Riemann integral, no pivots or echelon forms, no inverting matrices, limited use of eigenwhatever, and exclusively using SVD factorization for linear algebra. Even gradient descent isn't really used anymore as it 'zig-zags'.
The following is very similar to CMU's Math Foundations for Robotics course as well:
- Matrix calculus
- MIT Numerical Methods and large scale linear algebra.
- Infinitesimal calculus book
- We will only ever use dy/dx
- Infinitesimal calculus book
Research you may want to do (Optional)
You can use the Agentic Investigator to help you research these fields by yourself.
Reverse engineering LLMS
Make sure agents are doing what they claim they are doing:
- Mechanistic Interpretability (Neel Nanda@Google DeepMind)
Many companies will be very interested if you can do this. This competition is still running as Apr 9 2026 but there will be more. Not everyone wants to use the expensive Anthropic or OpenAI agents sometimes a simple open source agent learning your codebase is good enough. Of course next year's agents will be so advanced we will have to use this year's agents to reverse engineer them but interpretability is still going as of June 2026. Anthropic is the least trustworthy AI company in my opinion for a dozen reasons. I would trust the Chinese pirates who stole DeepSeek over Anthropic anyday. Say what you want about Sam Altman but at least we know something about Sam Altman from Hacker News I do not trust Anthropic whatsoever they are like the dystopia in the game Half-Life but worse. I would trust Bill Epstein Gates over Anthropic.
Causality
Any future medical AI is going to need Causal AI models. If I do X what will happen to Y? To paraphrase Glenn Shafer the basic idea is to bring back the probability tree to represent a step-by-step evolution of an observer's knowledge. If that observer is nature, then the steps in the tree are causes, and the probabilities in the tree express nature's limited ability to predict the causes.
- Elements of Causal Inference Foundations and learning algorithms.
Conformal Prediction
Conformal Prediction or confidence intervals are also wide open to research for example you want to know how much money some junky API that Anthropic peddles like Claude Code is going to charge you to generate some slop. You can learn this using conformal prediction and write a tool.
- Algorithmic Learning in a Random World using classic techniques like support vector machines.
Game Theory AI
It's possible to completely throw out stochastic math and do statistics and probability purely in the field of game theory. No measure theory needed. Most human activities involve someone else and none of it is really random so if you want to make a pokerbot this is how you do it. This is my primary research area so I'll be doing lots of this here and causality.
- Game-Theoretic Foundations for Probability and Finance
Calculus
Why do they still make us learn calculus in university despite AI being able to do this for us easily? Or even symbolic calculators or applications like Sage/Wolfram for the last 10+ years?
The symbolic calculus (scalar calculus of one variable) taught in first year colleges and universities is designed to bring up the students to a base level they think you need in order to pass the rest of your courses. They do this because the faculty can't trust your high school education and everyone arrives with poor algebra skills. Even if you take so-called 'Advanced Placement' Calculus in Grade 12 they have broken up the calculus sequence into multiple courses now so you will only place out of basic differentiation but are forced to take their integration and approximation courses or whatever they are called now.
Concretely what this means is the book Calculus: Early Transcendentals 9th ed by James Stewart though honestly all the editions are roughly the same. You can search for this on github and get numerous full copies and instructor versions too with all the solutions to problems.
The chapters out of Stewart's book:
- Chapters 1-6
This is every school's first year introductory Calc I of basic differentiation and basic integration that runs for a single semester or 4 months. As mentioned earlier you can take AP Calc in high school and place out of this.
- Chapters 7 - 12.3
This is Calc II often called techniques of integration or integration and approximation and is the second semester course. It always ends with a brief intro to linear algebra. There's some things that shouldn't exist here like partial fractions should be in another course and are stupid to learn here. Also 'improper integrals' don't actually exist it's a failing of the Riemann integral which is a toy integral discarded when you get into grad school. Riemann literally wrote his integral on a napkin to solve some urgent problem and never expected it would become the standard curriculum for 100+ years.
- Chapters 13 - end
Calc III often called vector calculus or multivariable calculus. This simply means the same calculus you already know just generalized to higher dimensions. In scalar calculus the derivative is a zooming function on a curve and you look at the points around the point you zoomed which appears to be a straight line and it's the tangent. In vector calculus the tangent is now an entire plane or flat 2D surface. You also get an intro to differential equations probably the most amazing thing about calculus where you can time step back and forth some entire complex system like the state of the universe. SciML/AI is going crazy right now approximating solutions to these ODEs/PDEs which were just a nonsense bag of tricks when you took these courses only a decade ago.
The above is considered 12 months of work doing this 3x a week if at a university pace. However from personal experience you will find that mid way or less through the book you will have taught yourself excellent problem solving skills and now the rest of the book is 'too easy' and you start flying through chapters. You will finish it in 1/4 of the time. Even analysis textbooks like Apostol which seem difficult you traverse the peak early if you stick with it and the rest is downhill skiing on cruise control. That and Apostol often drills trig identities in the exercises when he doesn't have anything new to show the reader.
What should we do
Real numbers are mostly undecidable and even worse the way they are taught is over 100+ years old and obsolete. There exists a 'Natural Integral' on the real (imaginary continuum) line invented in 1960s or so that is sometimes called the gauge integral, or the Newton integral, or the Henstock-Kurzweil integral and it includes the Lebesgue integral as a special case. It is vastly more simple than Riemann limiting sum constructions. There are a few undergrad level books by analysis experts written for beginners about this such as Brian S Thomson and Andrew Bruckner written books I suggest you search for them if you want 21st century real number theory. They are called 'DRIPPED' or 'Drop the Riemann Integral'. Bruckner is still doing derivatives research in his 90s and uploading papers such as what actually characterizes the derivative? Turns out the Darboux property is still the only characteristic and there is functions that are discontinous everywhere yet still have derivatives. They also blow out Rolles' theorem and other junk you have to take in a modern calculus course in undergrad. The integral ends up being basically the mean value theorem. AI is killing this entire obsolete abstraction right now.
Real numbers have this (fake) idealized construction where compactness exists however we will learn in the deep learning lectures that basically almost every real number is undecidable so there's no actual compactness. You can approximate e, pi, phi, a very few other real numbers using a stream algorithm but the rest of them (the vast majority in fact) are not computable at all. Ergo the entire theory is just trash in 2026 and we shouldn't even bother entertaining it here.
Instead we will learn the infintesimal calculus with modern definitions though I am now denying you the satisfaction of working through an elementary text and realizing it's too easy for you which provides motivation and encouragement. I plan on using linear algebra for that. For example the David Poole book used by CMU where you will do enough applications that math undergrad problems become trivial. You need this win and I will provide it just not silly calculus that even Newton himself would object to.
TODO
__ Home