Cursor Team: Future of Programming with AI | Lex Fridman Podcast #447
Cursor founders Michael Truell, Sualeh Asif, Arvid Lunnemark, and Aman Sanger explain why Claude Sonnet beats GPT-4o for real-world coding and where AI programming is headed.
- Claude Sonnet is their consensus best model for coding; o1 excels at hard algorithmic problems but misses rough human intent.
- Cursor Tab uses a sparse mixture-of-experts model with KV-cache reuse and speculative edits to hit sub-100ms autocomplete latency.
- SWE-bench is heavily contaminated: frontier models hallucinate correct file paths and function names from training data, making scores unreliable.
- Frontier models fail ~40% of the time at the seemingly trivial step of applying a code sketch as a diff; Cursor trains a dedicated apply model to fix this.
- Arvid bet in 2022 that AI would win an IMO gold medal by 2024; DeepMind came within one point, making it the most prescient prediction in the group.
- The team believes programming’s future keeps humans in the driver seat with variable abstraction levels, not a Slack-style delegation model.
- Synthetic data taxonomy: distillation, easier-inverse-direction data (e.g., insert bugs to train bug detectors), and verifier-grounded rollouts (math proofs, test suites).
Guests: Michael Truell (Anysphere/Cursor), Sualeh Asif (Anysphere/Cursor), Arvid Lunnemark (Anysphere/Cursor), Aman Sanger (Anysphere/Cursor) · 2024-10-06 · Watch on YouTube