publications

2024

  1. Under Review
    CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs
    Yu Ying Chiu, Liwei Jiang, Bill Yuchen Lin, Chan Young Park, Shuyue Stella Li, Sahithya Ravi, Mehar Bhatia, Maria Antoniak, Yulia Tsvetkov, Vered Shwartz, and Yejin Choi
    2024
  2. ICLR 2025 (Spotlight)
    DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life
    Yu Ying Chiu, Liwei Jiang, and Yejin Choi
    In International Conference on Learning Representations 2025 (Spotlight), 2024
  3. Under Review
    A Computational Framework for Behavioral Assessment of LLM Therapists
    Yu Ying Chiu*, Ashish Sharma*, Inna Wanyin Lin, and Tim Althoff
    2024
  4. TACL 2024
    Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence
    Abhinav Patil, Jaap Jumelet, Yu Ying Chiu, Andy Lapastora, Peter Shen, Lexie Wang, Clevis Willrich, and Shane Steinert-Threlkeld
    In Transactions of the Association for Computational Linguistics, 2024
  5. Under Review
    WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries
    Wenting Zhao, Tanya Goyal, Yu Ying Chiu, Liwei Jiang, Benjamin Newman, Abhilasha Ravichander, Khyathi Chandu, Ronan Le Bras, Claire Cardie, Yuntian Deng, and Yejin Choi
    2024
  6. Preprint
    CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs’ (Lack of) Multicultural Knowledge
    Yu Ying Chiu, Liwei Jiang, Maria Antoniak, Chan Young Park, Shuyue Stella Li, Mehar Bhatia, Sahithya Ravi, Yulia Tsvetkov, Vered Shwartz, and Yejin Choi
    2024

2023

  1. EMNLP 2023 System Demo
    Humanoid Agents: Platform for Simulating Human-like Generative Agents
    Zhilin Wang*, Yu Ying Chiu*, and Yu Cheung Chiu
    In Empirical Methods in Natural Language Processing 2023 (System Demonstrations), Dec 2023