I'm a researcher at Skild AI, where I focus on building scalable multi-modal AI for robotics. Previously, I completed my master's degree in Electrical and Computer Engineering from Carnegie Mellon University, where I was advised by Prof. Deepak Pathak, and worked closely with Prof. Fernando De la Torre.
My research interests lie in multi-modal models, generative modeling, and reasoning.
Under Review, 2026
* Project co-leads · † Core contributors
Sim2Reason trains LLMs inside virtual worlds governed by real physics laws for stronger, transferable physical reasoning, with zero-shot gains on the International Physics Olympiad.
arXiv, 2026
A test-time iterative refinement strategy that uses a VLM feedback critic and image editor in loop to improve compositional text-to-image generation, significantly outperforming compute-matched parallel sampling across various models.
ICCV, 2025
AGenDA bridges the domain gap in aerial vehicle detection by synthesizing target-domain training data with Stable Diffusion and auto-labeling via cross-attention maps.
In Submission, 2024
VADER is a framework for efficiently fine-tuning video diffusion models using reward gradients, enabling alignment with task-specific objectives.