Devan Shah

I am a senior in Computer Science at Princeton University, where I am grateful to be advised by Prof. Elad Hazan. Through the Hazan Lab, I work on language modeling and dynamical systems. Separately, I have worked on projects with Jane Street and TikTok.

Broadly, I am interested in better understanding and improving language models: how models reason, how to design post-transformer architectures, and how to build better optimizers. At Princeton, I help host NLP & CV reading groups, chair our ACM chapter, and am a member of Phi Beta Kappa, Sigma Xi, and Tau Beta Pi.

Some recent work I am especially proud of is [1] and [3].

Email / GitHub / Google Scholar / LinkedIn

Work Experience

Princeton Hazan Lab

September 2024 — Present

I work on language modeling and dynamical systems in the Hazan Lab at Princeton, led by Prof. Elad Hazan. I've worked on projects that aim to accelerate the Spectral Transformer and better model linear dynamical systems, such as SpectraLDS, FutureFill, and Google Deluca.

Quantitative Research at Jane Street

May 2025 — August 2025

I was a Quantitative Research intern at Jane Street.

Recommendation Systems at TikTok

May 2024 — August 2024

As a machine learning engineering intern, I improved the recommendation system for ecommerce videos through projects on multi-interest modeling.

Research

SpectraLDS: Provable Distillation for Linear Dynamical Systems

Devan Shah, Shlomo Fortgang, Sofiia Druchyna, Elad Hazan
NeurIPS 2025
website / paper / code / poster / slides

UpSkill: Mutual Information Skill Learning for Structured Response Diversity in LLMs

Devan Shah*, Owen Yang*, Daniel Yang, Chongyi Zheng, Benjamin Eysenbach
NeurIPS 2025 Workshop on Scaling Environments for Agents
website / paper / code / poster

FutureFill: Fast Generation from Convolutional Sequence Models

Naman Agarwal, Xinyi Chen, Evan Dogariu, Devan Shah, Hubert Strauss, Vladimir Feinberg, Daniel Suo, Peter Bartlett, Elad Hazan
ICLR 2026
website / paper / code / slides

ScooterId: Posture-Based Continuous User Identification From Mobility Scooter Rides

Devan Shah, Ruoqi Huang, Nisha Vinayaga-Sureshkanth, Tingting Chen, Murtuza Jadliwala
IEEE TMC 2025
paper

Projects

Scaling Mutual Information Skill Learning to Larger Skill Spaces

Devan Shah*, David Shustin*
(* indicates equal contribution)

paper

Camelax: A Lightweight Machine Learning Library in OCaml

Devan Shah*, Daniel Yang*

paper / code

Augmenting Large Reasoning Models with Contrastive Goal-Conditioned Reinforcement Learning

Devan Shah*, Kevin Wang*, David Yan*

paper / code / poster

Parallel Scaling with Entropic Reasoners

Devan Shah*, Owen Yang*, Daniel Yang*

paper / code / poster

Wave Filtering for General Linear Dynamical Systems

Devan Shah*, Brandon Cho*

paper / code

Truly Adaptive Bloom Filters

Devan Shah*, David Yan*

paper / poster

Rider Posture-Based Continuous Authentication with Few-Shot Learning for Mobility Scooters (Student Abstract)

Devan Shah, Ruoqi Huang, Tingting Chen, Murtuza Jadliwala
AAAI 2025
paper / poster / slides

DreamScape: Denoising CLIP Embeddings with User Images for Improved Visualization Reconstruction

Devan Shah

paper / code

Surveys

Discussion on "Why Does Deep and Cheap Learning Work So Well?"

Devan Shah

paper

A Survey of State Space Models: From Linear Systems to Language

Devan Shah*, Brandon Cho*

paper

Understanding Dynamic Algorithms for Packing-Covering LPs via Multiplicative Weight Updates

Devan Shah*, Owen Yang*, Sunil Vittal*

paper

Coursework

Computer Science

COS217	Introduction to Programming Systems
COS226	Algorithms and Data Structures
COS326	Functional Programming
COS418	Distributed Systems
ECE435	Machine Learning and Pattern Recognition
COS435	Introduction to Reinforcement Learning
COS484	Natural Language Processing
COS597	Long Term Memory in AI - Vector Search and Databases
COS597	Systems and Machine Learning
COS597	Inference in Action: Probabilistic Topics in Reinforcement Learning

Math

MAT216	Multivariable Analysis and Linear Algebra I
MAT218	Multivariable Analysis and Linear Algebra II
ECO310	Microeconomic Theory: A Mathematical Approach
MAT377	Combinatorial Mathematics
MAT385	Probability Theory
MAT478	Topics in Combinatorics: The Probabilistic Method

Theory

ECE434	Theoretical Machine Learning
COS445	Economics and Computing
COS487	Theory of Computation
COS521	Advanced Algorithm Design
COS522	Computational Complexity
COS585	Information Theory and Applications
ORF543	Deep Learning Theory
COS598	Theory of Natural Algorithms

Creative Writing

FRS116	Evolution of Human Language
CWR202	Creative Writing (Poetry)
POL316	Civil Liberties
ATL494	Creating Comedy for Television
ATL497	How to Write a Monologue

Contests

2025	Agent Builders Hackathon: #3/46 (and 2 track wins), Solo Entry
2025	Anthropic Alignment Hack #2/13
2025	Stanford Treehacks Codegen: Best Code Generation Application #1/24
2024	Columbia DevFest Overall Winners #1/54
2023	MIT Energy and Climate Hackathon #3/80
2023	ICPC Greater NY Regional Contest #4/92
2023	HackHarvard CareYaya Track #1/24
2023	Princeton HackaTron Web3 Hack #2
2022	HackPrinceton Best AI Hack

Work Experience

EEG Vision Embeddings at CareYaya

November 2023 — May 2024

Scooter Authentication at Cal Poly Pomona

June 2023 - April 2024

Price Prediction at Ticket Wallet (YC X25)

June 2023 — August 2023
