I am generally interested in unfolding consequences of design choices in machine learning pipelines (e.g. model assumptions, inductive biases, inference, optimization and model explanations) for social-technical decision making systems. I am also broadly interested in issues of AI safety, ethics and regulation. My interests include uncertainty quantification, deep generative models, deep Bayesian models, approximate inference, user modeling in RL, explanable AI and HCI.

Recent Papers

More publications can be found at my Google Scholar.

  1. Yaniv Yacoby, Weiwei Pan, Finale Doshi-Velez, Towards Model-Agnostic Posterior Approximation for Fast and Accurate Variational Autoencoders, Advances in Approximate Bayesian Inference (non-Archival), 2024.
  2. Zilin Ma, Susannah Cheng Su, Nathan Zhao, Linn Bieske, Blake Bullwinkel, Jinglun Gao, Gekai Liao, Siyao Li, Ziqing Luo, Boxiang Wang, Zihan Wen, Yanrui Yang, Yanyi Zhang, Claude Bruderlein, Weiwei Pan, Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations
  3. Eura Nofshin, Esther Brown, Brian Lim, Weiwei Pan, Finale Doshi-Velez, A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning, ICML Workshop on NextGenAISafety, 2024.
  4. Hiwot Belay Tadesse, Weiwei Pan, Finale Doshi-Velez, Optimizing Machine Learning Explanations for Properties, ICML Workshop on Humans, Algorithmic Decision-Making and Society: Modeling Interactions and Impact, 2024.
  5. Kirsten Morehouse, Weiwei Pan, Juan Manuel Contreras, Mahzarin R. Banaji, Bias Transmission in Large Language Models: Evidence from Gender-Occupation Bias in GPT-4, ICML Workshop on NextGenAISafety, 2024.
  6. David Berthiaume, Yuan Tang, Chau Nguyen, Siyu Gai, Emilia Mazzolenis, Weiwei Pan, Synthetic Data-driven Prediction of Height for Childhood Malnutrition, ICML Workshop on AI4Science, 2024.
  7. Paul Nitschke, Lars Lien Ankile, Eura Shin, Siddharth Swaroop, Finale Doshi-Velez, Weiwei Pan, AMBER: An Entropy Maximizing Environment Design Algorithm for Inverse Reinforcement Learning, ICML Workshop on Models of Human Feedback for AI Alignment, 2024.
  8. Eura Nofshin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez, Reinforcement Learning Interventions on Boundedly Rational Human Agents in Frictionful Tasks, International Conference on Autonomous Agents and Multiagent Systems, 2024.
  9. Jiayu Yao, Weiwei Pan, Finale Doshi-Velez, Barbara E Engelhardt, Inverse Reinforcement Learning with Multiple Planning Horizons, Reinforcement Learning Conference, 2024.