PhD Candidate Youhui Ye becomes a PhD Graduate, Spring 2026
Congratulations to Our Newest PhD Graduate, Dr. Youhui Ye!
The Department of Statistics at Virginia Tech is proud to announce and celebrate the successful completion of Dr. Youhui Ye's doctoral degree requirements.
Dr. Youhui Ye successfully defended his dissertation titled "Statistical Inference for Image Classification, Language Reasoning, and Counterfactual Estimation" on May 4, 2026, a significant milestone representing years of rigorous work, dedication, and impactful research.
Title: Statistical Inference for Image Classification, Language Reasoning, and Counterfactual Estimation.
Abstract: His work addresses three interconnected challenges in statistical inference and machine learning: uncertainty quantification under distribution shift, robust causal estimation in panel data, and reliable confidence estimation for large language models. The first contribution introduces DPI-RG, a distribution-free framework for constructing predictive sets for high-dimensional data under distribution shifts. Using a conditional Wasserstein round-trip generative model, DPI-RG maps data to a lower-dimensional latent space where test statistics asymptotically follow a Chi-square distribution, yielding theoretically guaranteed p-values. The framework exhibits strong robustness to outliers and target shifts without requiring outlier labels during training. The second contribution presents CONCORD, a confidence estimation framework for large language models that represents responses as semantic distributions over constituent clauses rather than single embedding vectors. By combining Sentence Mover's Distance with Symbolic Answer Match with Inception-style CNN, CONCORD captures both structural divergence and discrete answer agreement, producing calibrated confidence scores that substantially outperform logit-based and embedding-based baselines on mathematical reasoning benchmarks. The third contribution proposes an L-infinity-regularized Synthetic Control method that bridges the sparse weighting of traditional synthetic control and the denser, more robust weighting of Difference-in-Differences. By distributing weights more evenly across control units while remaining data-driven, the method improves robustness and interpretability while increasing the likelihood of satisfying the parallel trends assumption.
Dr. Youhui Ye will be joining the Virginia Tech Transportation Institute (VTTI) as a Postdoctoral Associate.