I am a researcher & engineer
focused on
reasoning, interpretability,
and evaluation

My research focuses on natural language processing, machine learning interpretability, and evaluation methodologies, with work published at ACL, EMNLP, and NeurIPS. I see engineering as the art of solving constraints, research as the art of discovery, and life as the ultimate optimization problem.
Experience
Amazon
SDE Intern
ML pipelines for intent classification, sentiment analysis, and routing prediction on customer transcripts. Integrated RAG and LLM-based task classification.
Stanford NLP Group
Researcher
Multi-trigger classification in mechanistic interpretability under Dr. Jan-Philipp Fränken. Achieved ~98% AUROC on single-classification accuracy.
Princeton PLI
Lead Author & Researcher
First-author EMNLP 2025 on LLM benchmarking, first-author ACL 2023 on symbolic math reasoning. Built evaluation protocols for symbolic reasoning datasets.
Wharton Analytics Fellows
Senior Analyst
Led LLM development for IKEA dashboard navigation. Modeled marketing spend for Zillow. Developed clustering algorithms for Fox viewer personas.
Education
University of Pennsylvania
Jerome Fisher M&T Program
B.S.E. Computer Science (Engineering) & B.S. Economics, Finance (Wharton)
University of Pennsylvania
M.S.E. Computer Science
Concurrent master's degree
Research
Circuit Distillation for Math Reasoning
Vedant Gaur, Eshan Singhal, Audhav Durai, Praneel Varshney
ESE 5460 Final CapstoneThe Progress Illusion: Revisiting meta-evaluation standards of LLM evaluators
Tianruo Rose Xu, Vedant Gaur, Liu Leqi, Tanya Goyal
EMNLP 2025Weak-to-Strong In-Context Optimization
Alok Shah, Khush Gupta, Keshav Ramji, Vedant Gaur
NeurIPS ATTRIB 2024Learned Meta Token Reasoning
Alok Shah, Khush Gupta, Keshav Ramji, Vedant Gaur
NeurIPS ATTRIB 2024Probes and Cons: Multi-Trigger Classification Reveals Mixed Functional Mappings
Vedant Gaur, Sriram Tolety
arXiv 2024Reasoning in Large Language Models Through Symbolic Math Word Problems
Vedant Gaur, Nikunj Saunshi
ACL 2023Symbolic Math Reasoning with Language Models
Vedant Gaur, Nikunj Saunshi
IEEE URTC 2022Projects
BOLD AI
Transformer-based tool for detecting stutters in speech
Veritas
Real-time truth verification system built at xAI hackathon
Rabbit Hole
Endless generative Wikipedia - HackMIT winner
Socrates
Generative knowledge exploration tool - AGI House hackathon
Mesh
Generative Engine Optimization dashboard with real-time visibility insights
Sora to 3D
Sora video to Gaussian splat + point-cloud estimation pipeline
NumPy Transformer
Transformer architecture implemented from scratch in NumPy
Collodge
Airbnb for college dorms - peer-to-peer housing marketplace