Xindi(Cindy) Wu

I am Xindi Wu, a 4th year Ph.D. student in Computer Science at Princeton University, advised by Prof. Olga Russakovsky. Previously I was in Master of Science in Computer Vision at Carnegie Mellon University. I was very fortunate to be advised by Prof. Deva Ramanan and work with Prof. Min Xu and Prof. Haohan Wang and Prof. Kris Kitani. I was admitted to the Honors Youth Program at Xi'an Jiaotong University.

My research focuses on data for efficient multimodal machine learning, developing data-centric approaches that trace model performance back to training data composition and introduce data-driven capabilities to build more scalable and capable vision-language systems.

My work sits at the intersection of three core pillars:

I am happy to collaborate and answer questions about my research, feel free to send me an email. I especially encourage students from underrepresented groups to reach out.

Email /  CV /  Google Scholar /  LinkedIn /  GitHub  /  X

profile photo

News
2025
10/2025: Gave a poster presentation on Where is Motion From? Scalable Motion Attribution for Video Generation Models at ICCV Workshop on Reliable and Interactable World Models 2025 (Poster).
06/2025: Gave a talk at Datology AI Summer of Data Seminar on From Data to Capability: Data for efficient multimodal machine learning (Slides).
06/2025: Co-organizing the ICCV 2025 Workshop on Curated Data for Efficient Learning. We welcome submissions on data-centric approaches including data pruning, synthetic data, dataset distillation and more!
05/2025: Started my internship at Nvidia Spatial Intelligence Lab this summer in Santa Clara! ☀️
04/2025: Led a discussion on Compositionality in Vision-Language Discussion Group (Slides).
02/2025: Attended NYC Vision Day and gave a poster presentation (Poster) with William Yang.
01/2025: Led a discussion on Data Influence in VisualAI Lab Reading Group (Slides).
2024
12/2024: Attended NeurIPS 2024 and gave a poster presentation on ConceptMix (Poster).
10/2024: Our paper Corgi: Cached Memory Guided Video Generation is accepted to WACV 2025.
09/2024: Attended ECCV 2024 and gave a spotlight talk on ConceptMix (Slides) at Knowledge in Generative Models Workshop; and our Vision-Language Dataset Distillation received best paper award at Dataset Distillation Workshop.
07/2024: Our paper Vision-Language Dataset Distillation is accepted to TMLR 2024.
07/2024: Gave a talk on Compositional Generation Evaluation (Slides) at Google Research.
06/2024: Attended CVPR 2024 and gave a talk on Scaling Down before Scaling Up: Recent Progress on Dataset Distillation (Slides | Video) at CVPR 2024 Dataset Distillation Workshop.
02/2024: Led a discussion on Scaling Law in the PLI Vision-Language Reading Group (Slides).
02/2024: I am TAing for COS 429 Computer Vision in Spring 2024.
01/2024: Passed my general exam (quals)! (Video | Slides | Reading List). Huge thanks to my committee members Olga Russakovsky, Szymon Rusinkiewicz and Adji Bousso Dieng for their support and feedback!
2023
09/2023: I am TAing for COS 597O Advanced Topics in Computer Science: Deep Generative Models, with focus on Methods, Applications & Societal Considerations in Fall 2023.
06/2023: Attended CVPR 2023 and presented Pix2Map (Poster).
05/2023: Started my internship at Meta Reality Lab in Redmond, WA on multimodal generative models.
2022
08/2022: Started my PhD journey at Princeton University! 💪
Selected Publications & Preprints
Where is Motion From? Scalable Motion Attribution for Video Generation Models.
ICCV Workshop on Reliable and Interactable World Models 2025
Preprint 2025
Preprint 2025
Preprint 2025
Preprint 2024
NeurIPS Datasets & Benchmarks 2024, ECCV Knowledge in Generative Models Workshop (Spotlight)
Paper | Supp | IEEE Paper Site | Poster | Bibtex
WACV 2025
TMLR 2024, ECCV Dataset Distillation Workshop (Best Paper)
CVPR 2023
KDD 2022
Kristen Grauman, et al .
CVPR 2022
CVPR 2020 (Oral)
CVPR Workshop, 2020
BIBM 2019 (Oral)
Computational Biology, Codon Publications, Brisbane, Australia, 2019
ICIP 2019
DSP 2018
Education
Princeton University, USA
Ph.D. in Computer Science • Aug. 2022 to Now
Advisor: Prof. Olga Russakovsky
Carnegie Mellon University, USA
M.S. in Computer Vision • Sept. 2020 to Dec. 2021
Advisor: Prof. Deva Ramanan
Xi'an Jiaotong University, China
B.Sc. in Computer Science • Sep. 2016 to July 2020
Advisor: Prof. Jinjun Wang, Prof. Pengju Ren
National University of Singapore, Singapore
Summer session • Human Computer Interaction •July 2017 to Aug. 2017

Experiences
Spatial Intelligence Lab, NVIDIA
Research Scientist Intern • May 2025 - Now
Mentors & Advisors: Dr. Jonathan Lorraine, Dr. Despoina Paschalidou, Dr. Jun Gao, Prof. Laura Leal-Taixé, Prof. Antonio Torralba, Prof. Sanja Fidler
Meta Reality Labs, Meta.
Research Scientist Intern • May. 2023 to Aug. 2023
Mentors: Dr. Shane Moon, Dr. Uriel Singer Dr. Luna Dong, Dr. Zhaojiang Lin, Dr. Paul Crook, Dr. Andrea Madotto Dr. Ethan Xu
Perception ML Team, Snap Inc.
Machine Learning Engineer • Feb. 2022 to July 2022
Research Intern • May. 2021 to Aug. 2021
Mentors: Dr. Alireza Zareian and Dr. Chen Wang
CMU Argo AI Center for Autonomous Vehicle Research, ArgoAI
Capstone Project • Feb. 2021 to Dec 2021
Advisors: Prof. Deva Ramanan. Mentors: Dr. Aljoša Ošep and Dr. Francesco Ferroni
Megvii (Face ++) Research, Megvii
Computer Vision Research Intern • June 2020 to Sept. 2020
Mentor: Banghuai Li
Computational Biology Department, Carnegie Mellon University, USA
Research Intern • March 2019 to Feb. 2020
Advisor: Prof. Min Xu
Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University, China
Research Intern • Dec. 2017 to Feb. 2019
Advisors: Prof. Pengju Ren and Prof. Jinjun Wang

Teaching
TA: COS 429 Computer Vision, Spring 2024, Princeton, with instructor: Vikram V. Ramaswamy and Felix Heide
TA: COS 597O Advanced Topics in Computer Science: Deep Generative Models: Methods, Applications & Societal Considerations. Fall 2023, Princeton, with instructor: Adji Bousso Dieng.
Professional Activities & Outreach

Reviewer: NeurIPS 25'/24'/23', ICLR 25'/24', ICML 25'/24', CVPR 25'/24'/23'/22', ICCV 25'/23', ECCV 24'/22', TMLR, ICRA 24', ACCV 24', ICLR 23' Workshop ME-FoMo, NeurIPS Interpolate Workshop 22', BMVC 20', IJCAI 20'.
Workshop Co-organizer: ICCV 2025 Workshop on Curated Data for Efficient Learning.
Committee Member: Diversity, Equity and Inclusion Committee at Robotics Institute, CMU.


Website source from Jon Barron