Devan Shah
I am a senior in Computer Science at Princeton University where I am
grateful to be advised by Prof. Elad Hazan. Through the Hazan Lab, I work on language modeling and
dynamical systems. Separately, I have also worked on projects with Jane Street and TikTok.
Broadly, I am interested in better understanding language models, such as improving how models reason,
designing post-transformer architectures, and improving optimizers. At Princeton, I help
host NLP & CV reading groups, chair our ACM chapter, and am a member of Phi Beta Kappa, Sigma Xi, and
Tau
Beta Pi.
Some recent work I am especially proud of is [1] and [3].
Email /
GitHub /
Google Scholar /
LinkedIn
|
|
|
|
Princeton Hazan Lab
September 2024 — Present
I work on language modeling and dynamical systems in the Hazan Lab at Princeton, led by Prof. Elad Hazan. I've worked on projects aiming to accelerate the Spectral Transformer and better model linear dynamical systems, such as such as SpectraLDS, FutureFill, and Google Deluca.
|
|
|
Quantitative Research at Jane Street
May 2025 — August 2025
I interned as a Quantitative Research intern at Jane Street.
|
|
|
Recommendation Systems at TikTok
May 2024 — August 2024
As a machine learning engineering intern, I improved the recommendation system for ecommerce videos through projects focusing on improving multi-interest modeling.
|
Show 3 more experiences
|
|
EEG Vision Embeddings at CareYaya
November 2023 — May 2024
I studied the reconstruction of viewed images from brain activity, improving performance by conditioning on a user's camera roll. We aimed to better align EEG embeddings with CLIP space.
|
|
|
Scooter Authentication at Cal Poly Pomona
June 2023 - April 2024
I worked on continuously authentication of mobility scooter riders based on their posture patterns. We worked on developing embeddings based on motion that could be suitable for identification or auth.
|
|
|
Price Prediction at Ticket Wallet (YC X25)
June 2023 — August 2023
I worked at an early-stage startup to advise and design data pipelines, pricing algorithms, and refine product pitches. I also worked on algorithms to determine if a ticket was likely to be sold.
|
|
SpectraLDS: Provable Distillation for Linear Dynamical Systems
Devan Shah, Shlomo Fortgang, Sofiia Druchyna, Elad Hazan
NeurIPS 2025
website /
paper
/ code
/ poster
/ slides
|
|
UpSkill: Mutual Information Skill Learning for Structured Response Diversity in LLMs
Devan Shah*, Owen Yang*, Daniel Yang, Chongyi Zheng, Benjamin Eysenbach
NeurIPS 2025 Workshop on Scaling Environment for Agents
website /
paper
/ code
/ poster
|
|
FutureFill: Fast Generation from Convolutional Sequence Models
Naman Agarwal, Xinyi Chen, Evan Dogariu, Devan Shah, Hubert Strauss, Vladimir Feinberg, Daniel Suo, Peter Bartlett, Elad Hazan
ICLR 2026
website /
paper
/ code
/ slides
|
|
ScooterId: Posture-Based Continuous User Identification From Mobility Scooter Rides
Devan Shah , Ruoqi Huang, Nisha Vinayaga-Sureshkanth, Tingting Chen, Murtuza Jadliwala
IEEE TMC 2025
paper
|
|
Scaling Mutual Information Skill Learning to Larger Skill Spaces
Devan Shah*, David Shustin* (* indicates equal contribution)
paper
|
|
Camelax: A Lightweight Machine Learning Library in OCaml
Devan Shah*, Daniel Yang*
paper
/ code
|
|
Augmenting Large Reasoning Models with Contrastive Goal-Conditioned Reinforcement Learning
Devan Shah*, Kevin Wang*, David Yan*
paper
/ code
/ poster
|
|
Parallel Scaling with Entropic Reasoners
Devan Shah*, Owen Yang*, Daniel Yang*
paper
/ code
/ poster
|
|
Wave Filtering for General Linear Dynamical Systems
Devan Shah* , Brandon Cho*
paper
/ code
|
|
Truly Adaptive Bloom Filters
Devan Shah* , David Yan*
paper
/ poster
|
|
Rider Posture-Based Continuous Authentication with Few-Shot Learning for Mobility Scooters (Student Abstract)
Devan Shah , Ruoqi Huang, Tingting Chen, Murtuza Jadliwala
AAAI 2025
paper
/ poster
/ slides
|
|
DreamScape: Denoising CLIP Embeddings with User Images for Improved Visualization Reconstruction
Devan Shah
paper
/ code
|
|
Discussion on "Why Does Deep and Cheap Learning Work So Well?"
Devan Shah
paper
|
|
A Survey of State Space Models: From Linear Systems to Language
Devan Shah* , Brandon Cho*
paper
|
|
Understanding Dynamic Algorithms for Packing-Covering LPs via Multiplicative Weight Updates
Devan Shah* , Owen Yang*, Sunil Vittal*
paper
|
Computer Science
|
| COS217 |
Introduction to
Programming Systems |
| COS226 |
Algorithms and Data
Structures |
| COS326 |
Functional Programming
|
| COS418 |
Distributed Systems |
| ECE435 |
Machine Learning and
Pattern Recognition |
| COS435 |
Introduction to
Reinforcement Learning |
| COS484 |
Natural Language
Processing |
| COS597 |
Long Term Memory in AI -
Vector Search and Databases |
| COS597 |
Systems and Machine
Learning |
| COS597 |
Inference in Action:
Probabilistic Topics in Reinforcement Learning |
|
Math
|
| MAT216 |
Multivariable Analysis and
Linear Algebra I |
| MAT218 |
Multivariable Analysis and
Linear Algebra II |
| ECO310 |
Microeconomic Theory: A
Mathematical Approach |
| MAT377 |
Combinatorial Mathematics
|
| MAT385 |
Probability Theory |
| MAT478 |
Topics in Combinatorics:
The Probabilistic Method |
|
Theory
|
| ECE434 |
Theoretical Machine
Learning |
| COS445 |
Economics and Computing
|
| COS487 |
Theory of Computation
|
| COS521 |
Advanced Algorithm Design
|
| COS522 |
Computational Complexity
|
| COS585 |
Information Theory and
Applications |
| ORF543 |
Deep Learning Theory |
| COS598 |
Theory of Natural
Algorithms |
|
Creative Writing
|
| FRS116 |
Evolution of Human
Language |
| CWR202 |
Creative Writing (Poetry)
|
| POL316 |
Civil Liberties |
| ATL494 |
Creating Comedy for
Television |
| ATL497 |
How to Write a Monologue
|
|
| 2025 |
Agent Builders Hackathon:
#3/46 (and 2 track wins), Solo Entry |
| 2025 |
Anthropic Alignment Hack #2/13
|
| 2025 |
Stanford Treehacks Codegen:
Best Code Generation Application #1/24 |
| 2024 |
Columbia DevFest Overall
Winners #1/54 |
| 2023 |
MIT Energy and Climate
Hackathon #3/80 |
| 2023 |
ICPC Greater NY Regional
Contest #4/92 |
| 2023 |
HackHarvard CareYaya Track
#1/24 |
| 2023 |
Princeton HackaTron Web3 Hack
#2 |
| 2022 |
HackPrinceton Best AI Hack
|
|
|