MIT Class of 2028 - AI + Mathematics - Generative Models

Qiao Sun

I am an undergraduate at MIT, double majoring in Artificial Intelligence and Decision Making (Course 6-4) and Mathematics (Course 18). I work on generative modeling in He Vision Group, led by Kaiming He, as a UROP student, with a current focus on fast, simple, and principled image generation: diffusion and flow matching, normalizing flows, one-step generation, and multimodal understanding/generation.

Diffusion Models Flow Matching Normalizing Flows Text-to-Image JAX/TPU PyTorch/GPU

Research

Fast generation

One-step and low-NFE image generation in pixel space and latent-free settings.

Principled models

Normalizing flows, diffusion, and flow matching with a focus on what structure is actually necessary.

Unified systems

Vision-language understanding and text-to-image generation under a shared modeling view.

Publications

ICML 2025 poster - first author

Is Noise Conditioning Necessary for Denoising Generative Models?

Q. Sun, Z. Jiang, H. Zhao, and K. He

We revisit a common assumption in diffusion and flow-matching models: whether denoisers need explicit noise conditioning. Across eight reimplemented denoising generative models, uEDM shows that noise-unconditional diffusion can remain competitive, with theory matching the empirical behavior.

CVPR Spotlight - project lead - first author

Bidirectional Normalizing Flow: From Data to Noise and Back

Y. Lu, Q. Sun, X. Wang, Z. Jiang, H. Zhao, and K. He

BiFlow revisits normalizing flows with a learned reverse map guided by hidden-state alignment. It removes the need for explicit inverse-flow computation, avoids slow autoregressive inference, and enables single-evaluation NF-based generation with strong fidelity.

ICML accepted - first author - one-step pixel-space generation

One-step Latent-free Image Generation with Pixel Mean Flows

Y. Lu, S. Lu, Q. Sun, H. Zhao, Z. Jiang, X. Wang, T. Li, Z. Geng, and K. He

pMF builds a strong baseline for one-step, latent-free generation by using MeanFlow with x-prediction directly in pixel space. The project reports 2.22 FID on ImageNet 256 and 2.48 FID on ImageNet 512.

Experience

Jun 20 - Aug 31, 2025 - quantitative research

Quant Strategy Analyst Intern, Ubiquant Investment (Jiukun Quant)

Worked on quantitative strategy analysis, backtesting workflows, factor diagnostics, and reproducible Python research utilities for strategy evaluation.

Projects

MIT 6.4210 course project - robotics manipulation

Fast Humanoid Loco-Manipulation via Flow Matching

Reimplemented a diffusion-based humanoid loco-manipulation pipeline inspired by BeyondMimic, then replaced DDPM sampling with flow matching. The project used simulation data, motion tracking, post-hoc control guidance, and lower-latency sampling with 5 FM steps.

Education & Honors

2024 - present

Massachusetts Institute of Technology, undergraduate in AI and Mathematics, GPA 5.00/5.00.

2025

Top 17 in the 2025 Putnam Mathematical Competition.

2024

2nd place in the 2024 Putnam Mathematical Competition.

2023

Gold Medal and 11th place at the International Mathematical Olympiad.

2023 - 2024

Pre-college student at Tsinghua University's Institute for Interdisciplinary Information Sciences, GPA 4.00/4.00.

2022

Gold Medal and 1st place with perfect score in the Chinese Mathematical Olympiad.

2022 - 2024

Excellent Award in Alibaba Global Mathematics Competition, top 70 among 50,000+ participants.

Recent posts