WEIWEI PAN — Research

I am generally interested in unfolding consequences of design choices in machine learning pipelines (e.g. model assumptions, inductive biases, inference, optimization and model explanations) for social-technical decision making systems. I am also broadly interested in issues of AI safety, ethics and regulation. My interests include uncertainty quantification, deep generative models, deep Bayesian models, approximate inference, user modeling in RL, explanable AI and HCI.

Since 2024, I’ve been co-leading an interdisciplinary research initiative to understand the impact of AI on frontline humanitarian negotiation. This work focuses both on machine learning and HCI.

Recent Papers

More publications can be found at my Google Scholar.

Ian M. Moore, Eura Nofshin, Siddharth Swaroop, Susan Murphy, Finale Doshi-Velez, Weiwei Pan, When and Why Hyperbolic Discounting Matters for Reinforcement Learning Interventions, RLC, 2025
Hiwot Belay Tadesse, Alihan Hüyük, Yaniv Yacoby, Weiwei Pan, Finale Doshi-Velez, Directly Optimizing Explanations for Desired Properties, UAI, 2025.
Kirsten N. Morehouse, Siddharth Swaroop, Weiwei Pan, Rethinking LLM Bias Probing Using Lessons from the Social Sciences, ICML (Position Track), 2025.
Philipp Arens, D. Adam Quirk, Weiwei Pan, Yaniv Yacoby, Finale Doshi-Velez, Connor J. Walsh, Preference-based assistance optimization for lifting and lowering with a soft back exosuit, Science Advances, 2025.
Salma Abdel Magid, Weiwei Pan, Simon Warchol, Grace Guo, Junsik Kim, Mahia Rahman, Hanspeter Pfister, Is What You Ask For What You Get? Investigating Concept Associations in Text-to-Image Models, TMLR, 2025.
Zilin Ma, Yiyang Mei, Claude Bruderlein, Krzysztof Z. Gajos, Weiwei Pan, “ChatGPT, Don’t Tell Me What to Do”: Designing AI for Context Analysis in Humanitarian Frontline Negotiations, CHI Work, 2025.
Jonas B Raedler, Siddharth Swaroop, Weiwei Pan, AI Companions Are Not The Solution To Loneliness: Design Choices And Their Drawbacks, ICLR Workshop on Building Trust in Language Models, 2025.
Sudhan Chitgopkar, Noah Dohrmann, Stephanie Monson, Jimmy Mendez, Finale Doshi-Velez, Weiwei Pan, Accuracy Isn’t Everything: Understanding the Desiderata of AI Tools in Legal-Financial Settings, Neurips Workshop on Behavioral Machine Learning, 2024.
Soline Boussard, Susannah Cheng Su, Helen Zhao, Siddharth Swaroop, Weiwei Pan, Understanding Model Bias Requires Systematic Probing Across Tasks, Neurips Workshop on Socially Responsible Language Modelling Research, 2024.
Julia Smakman, Lisa Soder, Connor Dunlop, Siddharth Swaroop, Weiwei Pan, AI Agents & Liability – Mapping Insights from ML and HCI Research to Policy, Neurips Workshop on Socially Responsible Language Modelling Research, 2024.
Julia Smakman, Lisa Soder, Connor Dunlop, Siddharth Swaroop, Weiwei Pan, An Autonomy-Based Classification: Liability in the Age of AI Agents, Neurips Workshop on Regulatable ML, 2024.
S.B. Wang, R.D.I. Van Genugten, Y Yacoby, W Pan, K.H. Bentley, S.A. Bird, R.J. Buonopane, A. Christie, M. Daniel, A. Haim, L. Follet, R.G. Fortgang, F. Kelly-Brunyak, E.M. Kleiman, A.J. Millner, O. Obi-Obasi, J.P. Onnela, N. Ramlal, J.R. Ricard, J.W. Smoller, T. Tambedou, K.L. Zuromski, M.K. Nock, Idiographic prediction of suicidal thoughts: Building personalized machine learning models with real-time monitoring data, Nature Mental Health, 2024.
Esther Brown, Shivam Raval, Alex Rojas, Jiayu Yao, Sonali Parbhoo, Leo A Celi, Siddharth Swaroop, Weiwei Pan, Finale Doshi-Velez, Where do doctors disagree? Characterizing Decision Points for Safe Reinforcement Learning in Choosing Vasopressor Treatment, American Medical Informatics Association (AMIA), 2024.
Yaniv Yacoby, Weiwei Pan, Finale Doshi-Velez, Towards Model-Agnostic Posterior Approximation for Fast and Accurate Variational Autoencoders, Advances in Approximate Bayesian Inference (non-Archival), 2024.
Zilin Ma, Susannah Cheng Su, Nathan Zhao, Linn Bieske, Blake Bullwinkel, Jinglun Gao, Gekai Liao, Siyao Li, Ziqing Luo, Boxiang Wang, Zihan Wen, Yanrui Yang, Yanyi Zhang, Claude Bruderlein, Weiwei Pan, Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations, ICML Workshop on NextGenAISafety, 2024.
Eura Nofshin, Esther Brown, Brian Lim, Weiwei Pan, Finale Doshi-Velez, A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning, ICML Workshop on NextGenAISafety, 2024.
Hiwot Belay Tadesse, Weiwei Pan, Finale Doshi-Velez, Optimizing Machine Learning Explanations for Properties, ICML Workshop on Humans, Algorithmic Decision-Making and Society: Modeling Interactions and Impact, 2024.
Kirsten Morehouse, Weiwei Pan, Juan Manuel Contreras, Mahzarin R. Banaji, Bias Transmission in Large Language Models: Evidence from Gender-Occupation Bias in GPT-4, ICML Workshop on NextGenAISafety, 2024.
David Berthiaume, Yuan Tang, Chau Nguyen, Siyu Gai, Emilia Mazzolenis, Weiwei Pan, Synthetic Data-driven Prediction of Height for Childhood Malnutrition, ICML Workshop on AI4Science, 2024.
Paul Nitschke, Lars Lien Ankile, Eura Shin, Siddharth Swaroop, Finale Doshi-Velez, Weiwei Pan, AMBER: An Entropy Maximizing Environment Design Algorithm for Inverse Reinforcement Learning, ICML Workshop on Models of Human Feedback for AI Alignment, 2024.
Eura Nofshin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez, Reinforcement Learning Interventions on Boundedly Rational Human Agents in Frictionful Tasks, International Conference on Autonomous Agents and Multiagent Systems, 2024.
Jiayu Yao, Weiwei Pan, Finale Doshi-Velez, Barbara E Engelhardt, Inverse Reinforcement Learning with Multiple Planning Horizons, Reinforcement Learning Conference, 2024.