Best Papers

Best Paper Awards

  • An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance
    Simran Khanuja, Sathyanarayanan Ramamoorthy, Yueqi Song, Graham Neubig

  • Towards Robust Speech Representation Learning for Thousands of Languages
    William Chen, Wangyou Zhang, Yifan Peng, Xinjian Li, Jinchuan Tian, Jiatong Shi, Xuankai Chang, Soumi Maiti, Karen Livescu, Shinji Watanabe

  • Backward Lens: Projecting Language Model Gradients into the Vocabulary Space
    Shahar Katz, Yonatan Belinkov, Mor Geva, Lior Wolf

  • Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method
    Weichao Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng

  • CoGen: Learning from Feedback with Coupled Comprehension and Generation
    Mustafa Omer Gul, Yoav Artzi

Outstanding Papers

  • Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models
    Sander Land, Max Bartolo

  • Learning to Retrieve Iteratively for In-Context Learning
    Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme

  • Measuring Psychological Depth in Language Models
    Fabrice Y Harel-Canada, Hanyu Zhou, Sreya Muppalla, Zeynep Senahan Yildiz, Miryung Kim, Amit Sahai, Nanyun Peng

  • Do LLMs Plan Like Human Writers? Comparing Journalist Coverage of Press Releases with LLMs
    Alexander Spangher, Nanyun Peng, Sebastian Gehrmann, Mark Dredze

  • Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation
    Raphael Tang, Crystina Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture

  • Finding Blind Spots in Evaluator LLMs with Interpretable Checklists
    Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Sshubam Verma, Mitesh M Khapra

  • GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory
    Wei Fan, Haoran Li, Zheye Deng, Weiqi Wang, Yangqiu Song

  • Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving
    Xin Quan, Marco Valentino, Louise A. Dennis, Andre Freitas

  • The Zeno’s Paradox of ‘Low-Resource’ Languages
    Hellina Hailu Nigatu, Atnafu Lambebo Tonja, Benjamin Rosman, Thamar Solorio, Monojit Choudhury

  • When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages
    Tyler A. Chang, Catherine Arnett, Zhuowen Tu, Ben Bergen

  • Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs
    Kanishka Misra, Kyle Mahowald

  • Fool Me Once? Contrasting Textual and Visual Explanations in a Clinical Decision-Support Setting
    Maxime Guillaume Kayser, Bayar Menzat, Cornelius Emde, Bogdan Alexandru Bercean, Alex Novak, Abdalá Trinidad Espinosa Morgado, Bartlomiej Papiez, Susanne Gaube, Thomas Lukasiewicz, Oana-Maria Camburu

  • Threshold-driven Pruning with Segmented Maximum Term Weights for Approximate Cluster-based Sparse Retrieval
    Yifan Qiao, Parker Carlson, Shanxiu He, Yingrui Yang, Tao Yang

  • Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing
    Fangkai Jiao, Chengwei Qin, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty

  • Are Large Language Models Capable of Generating Human-Level Narratives?
    Yufei Tian, Tenghao Huang, Miri Liu, Derek Jiang, Alexander Spangher, Muhao Chen, Jonathan May, Nanyun Peng

  • Formality is Favored: Unraveling the Learning Preferences of Large Language Models on Data with Conflicting Knowledge
    Jiahuan Li, Yiqing Cao, Shujian Huang, Jiajun Chen

  • OATH-Frames: Characterizing Online Attitudes Towards Homelessness with LLM Assistants
    Jaspreet Ranjit, Brihi Joshi, Rebecca Dorn, Laura Petry, Olga Koumoundouros, Jayne Bottarini, Peichen Liu, Eric Rice, Swabha Swayamdipta

  • SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories
    Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot

  • Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs
    Simone Conia, Daniel Lee, Min Li, Umar Farooq Minhas, Saloni Potdar, Yunyao Li

  • Which questions should I answer? Salience Prediction of Inquisitive Questions
    Yating Wu, Ritika Rajesh Mangla, Alex Dimakis, Greg Durrett, Junyi Jessy Li

Best Demo Paper Award

  • OpenOmni: A Collaborative Open Source Tool for Building Future-Ready Multimodal Conversational Agents
    Qiang Sun, Yuanyi Luo, Sirui Li, Wenxiao Zhang, Wei Liu

Outstanding Demo Paper

  • sign.mt: Real-Time Multilingual Sign Language Translation Application
    Amit Moryossef

Resource Paper Awards

  • KidLM: Advancing Language Models for Children – Early Insights and Future Directions
    Mir Tafseer Nayeem, Davood Rafiei

  • A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models
    Jiayin Wang, Fengran Mo, Weizhi Ma, Peijie Sun, Min Zhang, Jian-Yun Nie

Social Impact Paper Awards

  • What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study
    Beatrice Savoldi, Sara Papi, Matteo Negri, Ana Guerberof-Arenas, Luisa Bentivogli

  • STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions
    Robert Morabito, Sangmitra Madhusudan, Tyler McDonald, Ali Emami

  • Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps
    Giuseppe Attanasio, Beatrice Savoldi, Dennis Fucci, Dirk Hovy

Special Theme Paper Award

  • DEM: Distribution Edited Model for Training with Mixed Data Distributions
    Dhananjay Ram, Aditya Rawal, Momchil Hardalov, Nikolaos Pappas, Sheng Zha