Machine learning at scale
Stateful agents with Letta.ai
Introduction
Jun 29
•
Ludovico Bessi
How Block Diffusion Bridges AR and Diffusion Models
interpolating between autoregressive and diffusion language models
Jun 22
•
Ludovico Bessi
Tackling the LLM Cold Start Problem with Smarter Storage
aka a lesson on how to do serverless inference for LLMs
Jun 15
•
Ludovico Bessi
OpenPipe: RL for multi turn agents
aka train your own agents!
Jun 8
•
Ludovico Bessi
Text-to-SQL just got a lot better with RL
aka multi turn is all you need?
Jun 1
•
Ludovico Bessi
May 2025
AI Site reliability engineer?
aka deep dive on AI startup that wants to automate SREs :)
May 25
•
Ludovico Bessi
KV-Runahead: Scalable causal LLM inference with parallel KV cache generation
aka cache everything!
May 18
•
Ludovico Bessi
Beyond Basic RAG towards Agentic RAG
Scaling basic RAG into a robust, production-grade system is hard: multimodality, latency, query rewriting, edge cases, and more.
May 11
•
Ludovico Bessi
LLM Serving (Bonus!): takeaways from industry
Introduction
May 4
•
Ludovico Bessi
April 2025
LLM Serving (4): Disaggregated serving
Introduction
Apr 27
•
Ludovico Bessi
LLM serving (3): Speculative decoding
Introduction
Apr 20
•
Ludovico Bessi
LLM Serving (2): Paged attention
Introduction
Apr 13
•
Ludovico Bessi