Archive - Machine learning at scale

R4ec: Teaching Your Recommender LLMs to Think Twice

TLDR; The Problem: Using Large Language Models (LLMs) for recommendations with simple, one-shot prompts is like using System-1 thinking—fast but…

Nov 2 •

October 2025

Spiking Brain-Inspired LMs

Hybrid linear MoE LLM

Oct 26 •

Beyond RLHF with Rubrics as Rewards

TL;DR: The paper "Rubrics as Rewards (RaR)" introduces a framework for LLM alignment that replaces opaque preference-based reward models with…

Oct 22 •

Agent Learning from Human Feedback (ALHF)

case study directly from databricks!

Oct 19 •

Deep Dive into Claude Code post mortem

aka reading post mortem is my guilty pleasure!

Oct 15 •

Learning Facts At Scale With Active Reading

How to reliably embed knowledge into a model's parameters?

Oct 12 •

IntentRec: Predict user intent with multi task learning

hierarchical models for big business gains!

Oct 5 •

September 2025

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

2,146 tokens/s Code Model with Discrete Diffusion

Sep 28 •

Become a Research Scientist for a day?

aka you too can become a researcher for a day (or more!)

Sep 24 •

More Than Just a Few Tokens Deep

safety alignment that's safe also on the next tokens

Sep 21 •

AmazonQAC: Large scale query autocomplete dataset

Datasets are important!

Sep 14 •

[Bonus] The most overloaded role: "Machine learning engineer"

let's demistify what it actually is (spoiler: no single definition)

Sep 10 •

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts