Findings
Transferability of Syntax-Aware Graph Neural Networks in Zero-Shot Cross-Lingual Semantic Role Labeling
Rachel Sidney Devianti, Yusuke Miyao
Reformatted Alignment
Run-Ze Fan, Xuefeng Li, Haoyang Zou, Junlong Li, Shwai He, Ethan Chern, Jiewen Hu, Pengfei Liu
Adversarial Math Word Problem Generation
Roy Xie, Chengxuan Huang, Junlin Wang, Bhuwan Dhingra
Defending Large Language Models Against Jailbreak Attacks via Layer-specific Editing
Wei Zhao, Zhe Li, Yige Li, YE ZHANG, Jun Sun
Promoting Constructive Deliberation: Reframing for Receptiveness
Gauri Kambhatla, Matthew Lease, Ashwin Rajadesingan
A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction
Yinghao Li, Rampi Ramprasad, Chao Zhang
Rater Cohesion and Quality from a Vicarious Perspective
Deepak Pandita, Tharindu Cyril Weerasooriya, Sujan Dutta, Sarah K. K. Luger, Tharindu Ranasinghe, Ashiqur R. KhudaBukhsh, Marcos Zampieri, Christopher M Homan
Shall We Team Up: Exploring Spontaneous Cooperation of Competing LLM Agents
Zengqing Wu, Run Peng, Shuyuan Zheng, Qianying Liu, Xu Han, Brian I. Kwon, Makoto Onizuka, Shaojie Tang, Chuan Xiao
Normalized Narrow Jump To Conclusions: Normalized Narrow Shortcuts for Parameter Efficient Early Exit Transformer Prediction
Amrit Diggavi Seshadri
From Test-Taking to Test-Making: Examining LLM Authoring of Commonsense Assessment Items
Melissa Roemmele, Andrew Gordon
”I Never Said That”: A dataset, taxonomy and baselines on response clarity classification
Konstantinos Thomas, Giorgos Filandrianos, Maria Lymperaiou, Chrysoula Zerva, Giorgos Stamou
Immunization against harmful fine-tuning attacks
Domenic Rosati, Jan Wehner, Kai Williams, Lukasz Bartoszcze, Hassan Sajjad, Frank Rudzicz
UniMEEC: Towards Unified Multimodal Emotion Recognition and Emotion Cause
Guimin Hu, Zhihong Zhu, Daniel Hershcovich, Lijie Hu, Hasti Seifi, Jiayuan Xie
CodeFort: Robust Training for Code Generation Models
Yuhao Zhang, Shiqi Wang, Haifeng Qian, Zijian Wang, Mingyue Shang, Linbo Liu, Sanjay Krishna Gouda, Baishakhi Ray, Murali Krishna Ramanathan, Xiaofei Ma, Anoop Deoras
MP-RNA: Unleashing Multi-species RNA Foundation Model via Calibrated Secondary Structure Prediction
Heng Yang, Ke Li
“Any Other Thoughts, Hedgehog?” Linking Deliberation Chains in Collaborative Dialogues
Abhijnan Nath, Videep Venkatesha, Mariah Bradford, Avyakta Chelle, Austin Collin Youngren, Carlos Mabrey, Nathaniel Blanchard, Nikhil Krishnaswamy
Evaluation of Question Answer Generation for Portuguese: Insights and Datasets
Felipe Paula, CASSIANA ROBERTA LIZZONI MICHELIN, Viviane Moreira
Evolutionary Contrastive Distillation for Language Model Alignment
Julian Katz-Samuels, Zheng Li, Hyokun Yun, Priyanka Nigam, Yi Xu, Vaclav Petricek, Bing Yin, Trishul Chilimbi
A Fairness-Driven Method for Learning Human-Compatible Negotiation Strategies
Ryan Shea, Zhou Yu
Using RL to Identify Divisive Perspectives Improves LLMs Abilities to Identify Communities on Social Media
Nikhil Mehta, Dan Goldwasser
Are LLMs Effective Negotiators? Systematic Evaluation of the Multifaceted Capabilities of LLMs in Negotiation Dialogues
Deuksin Kwon, Emily Weiss, Tara Kulshrestha, Kushal Chawla, Gale Lucas, Jonathan Gratch
When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications
Yanjun Gao, Skatje Myers, Shan Chen, Dmitriy Dligach, Timothy A Miller, Danielle Bitterman, Matthew Churpek, Majid Afshar
Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts
Aditya Sharma, Michael Saxon, William Yang Wang
Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring
Jiazheng Li, Hainiu Xu, ZHAOYUE SUN, Yuxiang Zhou, David West, Cesare Aloisi, Yulan He
LOCR: Location-Guided Transformer for Optical Character Recognition
Yu Sun, Dongzhan Zhou, Chen Lin, Conghui He, Wanli Ouyang, Han-Sen Zhong
Unsupervised Domain Adaptation for Keyphrase Generation using Citation Contexts
Florian Boudin, Akiko Aizawa
Sing it, Narrate it: Quality Musical Lyrics Translation
Zhuorui Ye, Jinhan Li, Rongwu Xu
Exploring Automated Keyword Mnemonics Generation with Large Language Models via Overgenerate-and-Rank
Jaewook Lee, Hunter McNichols, Andrew Lan
SMILE: Single-turn to Multi-turn Inclusive Language Expansion via ChatGPT for Mental Health Support
Huachuan Qiu, Hongliang He, Shuai Zhang, Anqi Li, Zhenzhong Lan
Dual-teacher Knowledge Distillation for Low-frequency Word Translation
yifan guo, Hongying ZAN, Hongfei Xu
A Simple Angle-based Approach for Contrastive Learning of Unsupervised Sentence Representation
Yoo Hyun Jeong, Myeongsoo Han, Dong-Kyu Chae
Developing a Pragmatic Benchmark for Assessing Korean Legal Language Understanding in Large Language Models
Kimyeeun, Choi Youngrok, Eunkyung Choi, JinHwan Choi, Hai Jin Park, Wonseok Hwang
Visual Pivoting Unsupervised Multimodal Machine Translation in Low-Resource Distant Language Pairs
Turghun Tayir, Lin Li, Xiaohui Tao, Mieradilijiang Maimaiti, Ming Li, Jianquan Liu
Scalable Fine-tuning from Multiple Data Sources: A First-Order Approximation Approach
Dongyue Li, Ziniu Zhang, Lu Wang, Hongyang R. Zhang
DocEE-zh: A Fine-grained Benchmark for Chinese Document-level Event Extraction
Minghui Liu, MeiHan Tong, Yangda Peng, Lei Hou, Juanzi Li, Bin Xu
In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language Models
Pengrui Han, Peiyang Song, Haofei Yu, Jiaxuan You
Evaluating Language Model Math Reasoning via Grounding in Educational Curricula
Li Lucy, Tal August, Rose E Wang, Luca Soldaini, Courtney Allison, Kyle Lo
Enhancing Multi-Label Text Classification under Label-Dependent Noise: A Label-Specific Denoising Framework
Pengyu Xu, Liping Jing, Jian Yu
Automatic Reconstruction of Ancient Chinese Pronunciations
Zhige Huang, Haoan Jin, Mengyue Wu, Kenny Q. Zhu
Instance-Level Dynamic LoRAs Composition for Cross-Task Generalization
WangZhiqi, Shizhu He, Kang Liu, Jun Zhao
LongWanjuan: Towards Systematic Measurement for Long Text Quality
Xiaoran Liu, Kai Lv, Qipeng Guo, Hang Yan, Conghui He, Xipeng Qiu, Dahua Lin
Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning
Tianxiang Hu, Pei Zhang, Baosong Yang, Jun Xie, Derek F. Wong, Rui Wang
MalayMMLU: A Multitask Benchmark for the Low-Resource Malay Language
Soon Chang Poh, Sze Jue Yang, Jeraelyn Ming Li Tan, Lawrence Leroy Tze Yao Chieng, Jia Xuan Tan, Zhenyu Yu, Foong Chee Mun, Chee Seng Chan
TriageAgent: Towards Better Multi-Agents Collaborations for Large Language Model-Based Clinical Triage
Meng Lu, Brandon Ho, Dennis Ren, Xuan Wang
Generative Deduplication For Socia Media Data Selection
Xianming LI, Jing Li
Gender Bias in Decision-Making with Large Language Models
Sharon Levy, William Adler, Tahilin Sanchez Karver, Mark Dredze, Michelle R Kaufman
Evaluating Biases in Context-Dependent Health Questions
Sharon Levy, Tahilin Sanchez Karver, William Adler, Michelle R Kaufman, Mark Dredze
Self-Evaluation of Large Language Model based on Glass-box Features
Hui Huang, Yingqi Qu, Jing Liu, Muyun Yang, Bing Xu, Tiejun Zhao, Wenpeng Lu
FASTTRACK: Reliable Fact Tracing via Clustering and LLM-Powered Evidence Validation
Si Chen, Feiyang Kang, Ning Yu, Ruoxi Jia
PKAD: Pretrained Knowledge is All You Need to Detect and Mitigate Textual Backdoor Attacks
Yu Chen, Qi Cao, Kaike Zhang, Xuchao Liu, Huawei Shen
Merely Judging Metaphor is Not Enough: Research on Reasonable Metaphor Detection
Puli Chen, Cheng Yang, Qingbao Huang
Can we teach language models to gloss endangered languages?
Michael Ginn, Mans Hulden, Alexis Palmer
On the token distance modeling ability of higher RoPE attention dimension
Xiangyu Hong, Che Jiang, Biqing Qi, Fandong Meng, Mo Yu, Bowen Zhou, Jie Zhou
Enhancing Byzantine-Resistant Aggregations with Client Embedding
Zhiyuan Zhang, Hao Zhou, Fandong Meng, Jie Zhou, Xu Sun
Exploiting Careful Design of SVM Solution for Aspect-term Sentiment Analysis
Hanfeng Liu, Minping Chen, Zhenya Zheng, Zeyi Wen
Learning to Generate Rules for Realistic Few-Shot Relation Classification: An Encoder-Decoder Approach
Mayank Singh, Eduardo Blanco
Plot Twist: Multimodal Models Don’t Comprehend Simple Chart Details
Yasaman Razeghi, Ishita Dasgupta, Fangyu Liu, Vinay Venkatesh Ramasesh, Sameer Singh
HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models
Huy Nghiem, Hal Daumé III
Giving Control Back to Models: Enabling Offensive Language Detection Models to Autonomously Identify and Mitigate Biases
Jiapeng Liu, Weijie Li, Wenjun Deng, Xiaochao Fan, Liang Yang
Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization
Tobias Schnabel, Jennifer Neville
Toolken+: Improving LLM Tool Usage with Reranking and a Reject Option
Konstantin Yakovlev, Sergey Nikolenko, Andrey Bout
Learning to Route for Dynamic Adapter Composition in Lifelong Language Learning
Vladimir Araujo, Marie-Francine Moens, Tinne Tuytelaars
SecureSQL: Evaluating Data Leakage of Large Language Models as Natural Language Interfaces to Databases
Yanqi Song, Ruiheng Liu, Shu Chen, Qianhao Ren, Yu Zhang, Yongqi Yu
Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection
Tianxiang Chen, Zhentao Tan, Tao Gong, Yue Wu, Qi Chu, Bin Liu, Jieping Ye, Nenghai Yu
Entity or Relation Embeddings? An Analysis of Encoding Strategies for Relation Extraction
Frank Martin Mtumbuka, Steven Schockaert
LLM-supertagger: Categorial Grammar Supertagging via Large Language Models
Jinman Zhao, Gerald Penn
Self-Consistency Boosts Calibration for Math Reasoning
Ante Wang, Linfeng Song, Ye Tian, Baolin Peng, Lifeng Jin, Haitao Mi, Jinsong Su, Dong Yu
Distilling Instruction-following Abilities of Large Language Models with Task-aware Curriculum Planning
Yuanhao Yue, Chengyu Wang, Jun Huang, Peng Wang
On Creating an English-Thai Code-switched Machine Translation in Medical Domain
Parinthapat Pengpun, Krittamate Tiankanon, Amrest Chinkamol, Jiramet Kinchagawat, Pitchaya Chairuengjitjaras, Pasit Supholkhan, Pubordee Aussavavirojekul, Chiraphat Boonnag, Kanyakorn Veerakanjana, Hirunkul Phimsiri, Boonthicha Sae-jia, Nattawach Sataudom, Piyalitt Ittichaiwong, Peerat Limkonchotiwat
CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language Models
Yaojia Lv, Haojie Pan, Zekun Wang, Jiafeng Liang, Yuanxing Liu, Ruiji Fu, Ming Liu, Zhongyuan Wang, Bing Qin
Can LLMs Recognize Toxicity? A Structured Investigation Framework and Toxicity Metric
Hyukhun Koh, Dohyung Kim, Minwoo Lee, Kyomin Jung
Toeing the party line: election manifestos as a key to understand political discourse on Twitter
Maximilian Maurer, Tanise Ceron, Sebastian Padó, Gabriella Lapesa
UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition
Zhenrong Zhang, Shuhang Liu, Pengfei Hu, Jiefeng Ma, Jun Du, Jianshu Zhang, Yu Hu
PolyWER: A Holistic Evaluation Framework for Code-Switched Speech Recognition
Karima Kadaoui, Maryam Al Ali, Hawau Olamide Toyin, Ibrahim Mohammed, Hanan Aldarmaki
A Deep Analysis of the Impact of Multiword Expressions and Named Entities on Chinese-English Machine Translations
Huacheng Song, Hongzhi Xu
SCA: Selective Compression Attention for Efficiently Extending the Context Window of Large Language Models
Huanran Zheng, Wei Zhu, Xiaoling Wang
FANTAstic SEquences and Where to Find Them: Faithful and Efficient API Call Generation through State-tracked Constrained Decoding and Reranking
Zhuoer Wang, Leonardo F. R. Ribeiro, Alexandros Papangelis, Rohan Mukherjee, Tzu-Yen Wang, Xinyan Zhao, Arijit Biswas, James Caverlee, Angeliki Metallinou
Beyond Lines and Circles: Unveiling the Geometric Reasoning Gap in Large Language Models
Spyridon Mouselinos, Henryk Michalewski, Mateusz Malinowski
AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models
Zihao Zeng, Yibo Miao, Hongcheng Gao, Hao Zhang, Zhijie Deng
Learning from Relevant Subgoals in Successful Dialogs using Iterative Training for Task-oriented Dialog Systems
Magdalena Kaiser, Patrick Ernst, György Szarvas
CLEAR: Can Language Models Really Understand Causal Graphs?
Sirui Chen, Mengying Xu, Kun Wang, Xingyu Zeng, Rui Zhao, Shengjie Zhao, Chaochao Lu
PromptKD: Distilling Student-Friendly Knowledge for Generative Language Models via Prompt Tuning
Gyeongman Kim, Doohyuk Jang, Eunho Yang
M2QA: Multi-domain Multilingual Question Answering
Leon Engländer, Hannah Sterz, Clifton A Poth, Jonas Pfeiffer, Ilia Kuznetsov, Iryna Gurevych
Unveiling the Invisible: Captioning Videos with Metaphors
Abisek Rajakumar Kalarani, Pushpak Bhattacharyya, Sumit Shekhar
How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?
Ehsan Doostmohammadi, Oskar Holmström, Marco Kuhlmann
RippleCOT: Amplifying Ripple Effect of Knowledge Editing in Language Models via Chain-of-Thought In-Context Learning
Zihao Zhao, Yuchen Yang, Yijiang Li, Yinzhi Cao
Authorship Obfuscation in Multilingual Machine-Generated Text Detection
Dominik Macko, Robert Moro, Adaku Uchendu, Ivan Srba, Jason S Lucas, Michiharu Yamashita, Nafis Irtiza Tripto, Dongwon Lee, Jakub Simko, Maria Bielikova
https://openreview.net/forum?id=S3qR5O1yioH
Peter Vickers, Kenneth Church
DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs
Divya Jyoti Bajpai, Manjesh Kumar Hanawal
LaCo: Large Language Model Pruning via Layer Collapse
Yifei Yang, zouying cao, hai zhao
LLaMIPa: An Incremental Discourse Parser
Kate Thompson, Akshay Chaturvedi, Julie Hunter, Nicholas Asher
NeBuLa: A discourse aware Minecraft Builder
Akshay Chaturvedi, Kate Thompson, Nicholas Asher
Improving Referring Ability for Biomedical Language Models
Junfeng Jiang, Fei Cheng, Akiko Aizawa
CapEEN: Image Captioning with Early Exits and Knowledge Distillation
Divya Jyoti Bajpai, Manjesh Kumar Hanawal
LumberChunker: Long-Form Narrative Document Segmentation
André V. Duarte, João DS Marques, Miguel Graça, Miguel Freire, Lei Li, Arlindo L. Oliveira
Exploring the Limits of Fine-grained LLM-based Physics Inference via Premise Removal Interventions
Jordan Meadows, Tamsin Emily James, Andre Freitas
Unlocking Continual Learning Abilities in Language Models
Wenyu Du, Shuang Cheng, Tongxu Luo, Zihan Qiu, Zeyu Huang, Ka Chun Cheung, Reynold Cheng, Jie Fu
On the Rigour of Scientific Writing: Criteria, Analysis, and Insights
Joseph James, Chenghao Xiao, YUCHENG LI, Chenghua Lin
MMUTF: Multimodal Multimedia Event Argument Extraction with Unified Template Filling
Philipp Seeberger, Dominik Wagner, Korbinian Riedhammer
Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning
Sen Yang, Leyang Cui, Deng Cai, Xinting Huang, Shuming Shi, Wai Lam
Cross-lingual Contextualized Phrase Retrieval
Huayang Li, Deng Cai, Zhi Qu, Qu Cui, Hidetaka Kamigaito, Lemao Liu, Taro Watanabe
VideoINSTA: Zero-shot Long-Form Video Understanding via Informative Spatial-Temporal Reasoning
Ruotong Liao, Max Erler, Huiyu Wang, Guangyao Zhai, Gengyuan Zhang, Yunpu Ma, Volker Tresp
Self-Constructed Context Decompilation with Fined-grained Alignment Enhancement
Yunlong Feng, Dechuan Teng, Yang Xu, Xiao Xu, Honglin Mu, Libo Qin, Qingfu Zhu, Wanxiang Che
Measuring Susceptibility to Irrelevant Context in Language Models
Tianyu Liu, Kevin Du, Mrinmaya Sachan, Ryan Cotterell
ESG-Kor: A Korean Dataset for ESG-related Information Extraction and Practical Use Cases
Jaeyoung Lee, Geonyeong Son, Misuk Kim
Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information
Yongheng Zhang, Qiguang Chen, Jingxuan Zhou, Peng Wang, Jiasheng Si, Jin Wang, Wenpeng Lu, Libo Qin
Hope `The Paragraph Guy’ explains the rest : Introducing MeSum, the Meme Summarizer
Anas Anwarul haq Khan, Tanik Saikh, Arpan Phukan, Asif Ekbal
Learning Semantic Structure through First-Order-Logic Translation
Akshay Chaturvedi, Nicholas Asher
A Training Data Recipe to Accelerate A* Search with Language Models
Devaansh Gupta, Boyang Li
From Generation to Selection Findings of Converting Analogical Problem-Solving into Multiple-Choice Questions
Donghyeon Shin, Seungpil Lee, Klea Lena Kovacec, Sundong Kim
What’s under the hood: Investigating Automatic Metrics on Meeting Summarization
Frederic Kirstein, Jan Philip Wahle, Terry Ruas, Bela Gipp
Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages
Fabian David Schmidt, Philipp Borchert, Ivan Vulić, Goran Glavaš
CERD: A Comprehensive Chinese Rhetoric Dataset for Rhetorical Understanding and Generation in Essays
Nuowei Liu, Xinhao Chen, Hongyi Wu, Changzhi Sun, Man Lan, Yuanbin Wu, Xiaopeng Bai, Shaoguang Mao, Yan Xia
An Empirical Study on Cross-lingual Vocabulary Adaptation for Efficient Language Model Inference
Atsuki Yamaguchi, Aline Villavicencio, Nikolaos Aletras
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models
Jiale Cheng, Yida Lu, Xiaotao Gu, Pei Ke, Xiao Liu, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang
BAPO: Base-Anchored Preference Optimization for Personalized Alignment in LLMs
Gihun Lee, Minchan Jeong, Yujin Kim, Hojung Jung, Jaehoon Oh, SangMook Kim, Se-Young Yun
Beyond Common Words: Enhancing ASR Cross-Lingual Proper Noun Recognition Using Large Language Models
Rishabh Kumar, Sabyasachi Ghosh, Ganesh Ramakrishnan
Few-shot clinical entity recognition in English, French and Spanish: masked language models outperform generative model prompting
Marco Naguib, Xavier Tannier, Aurélie Névéol
STTATTS: Unified Speech-To-Text And Text-To-Speech Model
Hawau Olamide Toyin, Hao Li, Hanan Aldarmaki
From Text Segmentation to Enhanced Representation Learning: A Novel Approach to Multi-Label Classification for Long Texts
Wang Zhang, Xin Wang, Qian Wang, Tao Deng, Xiaoru Wu
Editing Conceptual Knowledge for Large Language Models
Xiaohan Wang, Shengyu Mao, Shumin Deng, Yunzhi Yao, YUE SHEN, Lei Liang, Jinjie GU, Huajun Chen, Ningyu Zhang
Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL
Qihuang Zhong, Kunfeng Chen, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao
ConU: Conformal Uncertainty in Large Language Models with Correctness Coverage Guarantees
Zhiyuan Wang, Jinhao Duan, Lu Cheng, Yue Zhang, Qingni Wang, Xiaoshuang Shi, Kaidi Xu, Heng Tao Shen, Xiaofeng Zhu
Irrelevant Alternatives Bias Large Language Model Hiring Decisions
Kremena Valkanova, Pencho Yordanov
PclGPT: A Large Language Model for Patronizing and Condescending Language Detection
Hongbo Wang, LiMingDa, Junyu Lu, Hebin Xia, Liang Yang, Bo Xu, Ruizhu Liu, Hongfei Lin
MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate
Alfonso Amayuelas, Xianjun Yang, Antonis Antoniades, Wenyue Hua, Liangming Pan, William Yang Wang
CEAMC: Corpus and Empirical Study of Argument Analysis in Education via LLMs
Yupei Ren, Hongyi Wu, Zhaoguang Long, Shangqing Zhao, Xinyi Zhou, Zheqin Yin, Xinlin Zhuang, Xiaopeng Bai, Man Lan
Ada-Instruct: Adapting Instruction Generators for Complex Reasoning
Wanyun Cui, Qianle Wang
LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMs
Sihui Yang, Keping Bi, Wanqing Cui, Jiafeng Guo, Xueqi Cheng
Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations
Nuo Chen, Zinan Zheng, Ning Wu, MING GONG, Dongmei Zhang, Jia Li
SynthEval: Hybrid Behavioral Testing of NLP Models with Synthetic Evaluation
Raoyuan Zhao, Abdullatif Köksal, Yihong Liu, Leonie Weissweiler, Anna Korhonen, Hinrich Schuetze
TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish
Arda Yüksel, Abdullatif Köksal, Lütfi Kerem Senel, Anna Korhonen, Hinrich Schuetze
LongForm: Effective Instruction Tuning with Reverse Instructions
Abdullatif Köksal, Timo Schick, Anna Korhonen, Hinrich Schuetze
Explaining Graph Neural Networks with Large Language Models: A Counterfactual Perspective on Molecule Graphs
Yinhan He, Zaiyi Zheng, Patrick Soga, Yaochen Zhu, Yushun Dong, Jundong Li
Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Mengru Wang, Yunzhi Yao, Ziwen Xu, Shuofei Qiao, Shumin Deng, Peng Wang, Xiang Chen, Jia-Chen Gu, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen, Ningyu Zhang
LongHeads: Multi-Head Attention is Secretly a Long Context Processor
Yi Lu, Xin Zhou, Wei He, Jun Zhao, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang
Crisis counselor language and perceived genuine concern in crisis conversations
Greg Buda, Ignacio J. Tripodi, Margaret Meagher, Elizabeth A. Olson
Edit-Constrained Decoding for Sentence Simplification
Tatsuya Zetsu, Yuki Arase, Tomoyuki Kajiwara
Explicit and Implicit Large Language Model Personas Generate Opinions but Fail to Replicate Deeper Perceptions and Biases
Salvatore Giorgi, Tingting Liu, Ankit Aich, Kelsey Jane Isman, Garrick Sherman, Zachary Fried, João Sedoc, Lyle Ungar, Brenda Curtis
Multi-Loss Fusion: Angular and Contrastive Integration for Machine-Generated Text Detection
Iqra Zahid, Yue Chang, Youcheng Sun, Riza Batista-Navarro
Intermediate Layer Distillation with the Reused Teacher Classifier: A Study on the Importance of the Classifier of Attention-based Models
Hang Zhang, Seyyed Hasan Mozafari, James J. Clark, Brett H. Meyer, Warren J. Gross
Enhancing Large Language Model Based Sequential Recommender Systems with Pseudo Labels Reconstruction
Hyunsoo Na, Minseok Gang, Youngrok Ko, Jinseok Seol, Sang-goo Lee
On the Generalization of Training-based ChatGPT Detection Methods
Han Xu, Jie Ren, Pengfei He, Shenglai Zeng, Yingqian Cui, Amy Liu, Hui Liu, Jiliang Tang
Private prediction for large-scale synthetic text generation
Kareem Amin, Alex Bie, Weiwei Kong, Alexey Kurakin, Natalia Ponomareva, Umar Syed, Andreas Terzis, Sergei Vassilvitskii
RAG-Studio: Towards In-Domain Adaptation Of Retrieval Augmented Generation Through Self-Alignment
Kelong Mao, Zheng Liu, Hongjin Qian, Fengran Mo, Chenlong Deng, Zhicheng Dou
Generalists vs. Specialists: Evaluating Large Language Models for Urdu
Samee Arif, Abdul Hameed Azeemi, Agha Ali Raza, Awais Athar
Improving Multi-Agent Debate with Sparse Communication Topology
Yunxuan Li, Yibing Du, Jiageng Zhang, Le Hou, Peter Grabowski, Yeqing Li, Eugene Ie
Evidence Retrieval for Fact Verification using Multi-stage Reranking
Shrikant Malviya, Stamos Katsigiannis
Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision
Zihan Wang, Yunxuan Li, Yuexin Wu, Liangchen Luo, Le Hou, Hongkun Yu, Jingbo Shang
MUSCLE: A Model Update Strategy for Compatible LLM Evolution
Jessica Maria Echterhoff, Fartash Faghri, Raviteja Vemulapalli, Ting-Yao Hu, Chun-Liang Li, Oncel Tuzel, Hadi Pouransari
Event-Keyed Summarization
William Gantt, Alexander Martin, Pavlo Kuchmiichuk, Aaron Steven White
The Effect of Sampling Temperature on Problem Solving in Large Language Models
Matthew Renze
HiCuLR: Hierarchical Curriculum Learning for Rhetorical Role Labeling of Legal Documents
Santosh T.Y.S.S, Apolline Isaia, Shiyu Hong, Matthias Grabmair
MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems
Kaixin Li, Yuchen Tian, Qisheng Hu, Ziyang Luo, Jing Ma
Semi-Supervised Reward Modeling via Iterative Self-Training
Yifei He, Haoxiang Wang, Ziyan Jiang, Alexandros Papangelis, Han Zhao
Few-shot Selections for Numerical Time Series Data-to-Text
Masayuki Kawarada, Tatsuya Ishigaki, Goran Topić, Hiroya Takamura
Enabling Discriminative Reasoning in LLMs for Legal Judgment Prediction
Chenlong Deng, Kelong Mao, Yuyao Zhang, Zhicheng Dou
ALIGN-SIM: A Task-Free Test Bed for Evaluating and Interpreting Sentence Embeddings through Semantic Similarity Alignment
Yash mahajan, Naman Bansal, Eduardo Blanco, Santu Karmaker
BIPEFT: Budget-Guided Iterative Search for Parameter Efficient Fine-Tuning of Large Pretrained Language Models
Aofei Chang, Jiaqi Wang, Han Liu, Parminder Bhatia, Cao Xiao, Ting Wang, Fenglong Ma
In-Context Learning with Iterative Demonstration Selection
Chengwei Qin, Aston Zhang, Chen Chen, Anirudh Dagar, Wenming Ye
Preserving Pre-trained Representation Space: On Effectiveness of Prefix-tuning for Large Multi-modal Models
Donghoon Kim, Gusang Lee, Kyuhong Shim, Byonghyo Shim
On Evaluating Explanation Utility for Human-AI Decision Making in NLP
Fateme Hashemi Chaleshtori, Atreya Ghosal, Alexander Gill, Purbid bambroo, Ana Marasovic
Unsupervised Hierarchical Topic Modeling via Anchor Word Clustering and Path Guidance
Jiyuan Liu, Hegang Chen, Chunjiang Zhu, Yanghui Rao
GuardEmb: Dynamic Watermark for Safeguarding Large Language Model Embedding Service Against Model Stealing Attack
Liaoyaqi Wang, Minhao Cheng
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs
Sihang Zhao, Youliang Yuan, Xiaoying Tang, Pinjia He
Pseudo-Label Enhanced Prototypical Contrastive Learning for Uniformed Intent Discovery
Yimin Deng, Yuxia Wu, Li Zhu, Guoshuai Zhao, Xueming Qian
RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
Xijie Huang, Zechun Liu, Shih-Yang Liu, Kwang-Ting Cheng
Can Large Language Models Grasp Legal Theories? Enhance Legal Reasoning with Insights from Multi-Agent Collaboration
Weikang Yuan, Junjie Cao, Zhuoren Jiang, Yangyang Kang, Jun Lin, Kaisong Song, tianqianjin lin, Pengwei Yan, Changlong Sun, Xiaozhong Liu
Retrieval and Reasoning on KGs: Integrate Knowledge Graphs into Large Language Models for Complex Question Answering
Yixin Ji, Kaixin Wu, Juntao Li, Wei Chen, mingjie zhong, Xu Jia, Min Zhang
Insights into LLM Long-Context Failures: When Transformers Know but Don’t Tell
Muhan Gao, TaiMing Lu, Kuai Yu, Adam Byerly, Daniel Khashabi
Exploration-based Error Correction Learning in Embodied Language Models
Hanlin Wang, Chak Tou Leong, Jian Wang, Wenjie Li
BERGEN: A Benchmarking Library for Retrieval-Augmented Generation
David Rau, Hervé Déjean, Nadezhda Chirkova, Thibault Formal, shuai wang, Stéphane CLINCHANT, Vassilina Nikoulina
Should Cross-Lingual AMR Parsing go Meta? An Empirical Assessment of Meta-Learning and Joint Learning AMR Parsing
Jeongwoo Kang, Maximin Coavoux, Cédric Lopez, Didier Schwab
Contextualized Graph Representations for Generating Counter-Narrative against Hate Speech
Selene Baez Santamaria, Helena Gomez Adorno, Ilia Markov
Modeling Historical Relevant and Local Frequency Context for Representation-Based Temporal Knowledge Graph Forecasting
Shengzhe Zhang, Wei Wei, Rikui Huang, Wenfeng xie, Dangyang Chen
Representation Alignment and Adversarial Networks for Cross-lingual Dependency Parsing
Ying Li, Jianjian Liu, Zhengtao Yu, Shengxiang Gao, Yuxin Huang, Cunli Mao
What Would Happen Next? Predicting Consequences from An Event Causality Graph
Chuanhong Zhan, Wei Xiang, 梁超, Bang Wang
An Instruction Tuning-Based Contrastive Learning Framework for Aspect Sentiment Quad Prediction with Implicit Aspects and Opinions
Hao Zhang, Yu-N Cheah, Congqing He, Feifan YI
MACAROON: Training Vision-Language Models To Be Your Engaged Partners
Shujin Wu, Yi Fung, Sha Li, Yixin Wan, Kai-Wei Chang, Heng Ji
ICL: Iterative Continual Learning for Multi-domain Neural Machine Translation
Zhibo Man, Kaiyu Huang, Yujie Zhang, Yuanmeng Chen, Yufeng Chen, Jinan Xu
Mitigating Hallucinations of Large Language Models in Medical Domain via Contrastive Decoding
Derong Xu, Ziheng Zhang, Zhihong Zhu, Zhenxi Lin, Qidong Liu, Xian Wu, Tong Xu, Xiangyu Zhao, Yefeng Zheng, Enhong Chen
NeuroMax: Enhancing Neural Topic Modeling via Maximizing Mutual Information and Group Topic Regularization
Duy-Tung Pham, Thien Trang Nguyen Vu, Tung Nguyen, Linh Van Ngo, Duc Anh Nguyen, Thien Huu Nguyen
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
Thomas Palmeira Ferraz, Kartik Mehta, Yu-Hsiang Lin, Haw-Shiuan Chang, Shereen Oraby, Sijia Liu, Vivek Subramanian, Tagyoung Chung, Mohit Bansal, Nanyun Peng
Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs
Junjie Wang, Mingyang Chen, Binbin Hu, Dan Yang, Ziqi Liu, YUE SHEN, Peng Wei, Zhiqiang Zhang, Jinjie GU, JUN ZHOU, Jeff Z. Pan, Wen Zhang, Huajun Chen
Is Compound Aspect-Based Sentiment Analysis Addressed by ChatGPT?
Yinhao Bai, Zhixin Han, Yuhua Zhao, Hang Gao, Zhuowei Zhang, Xunzhi Wang, Mengting Hu
Multilingual Fine-Grained News Headline Hallucination Detection
Jiaming Shen, Tianqi Liu, Jialu Liu, Zhen Qin, Jay Pavagadhi, Simon Baumgartner, Michael Bendersky
PE: A Poincare Explanation Method for Fast Text Hierarchy Generation
Qian Chen, Dongyang Li, Xiaofeng He, Hongzhao Li, Hongyu Yi
Step-level Value Preference Optimization for Mathematical Reasoning
Guoxin Chen, Minpeng Liao, Chengxi Li, Kai Fan
Towards Benchmarking Situational Awareness of Large Language Models:Comprehensive Benchmark, Evaluation and Analysis
Guo Tang, Zheng Chu, Wenxiang Zheng, Ming Liu, Bing Qin
Balancing Visual Context Understanding in Dialogue for Image Retrieval
zhaohui Wei, Lizi Liao, Xiaoyu Du, Xinguang Xiang
Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations
Lei Yu, Meng Cao, Jackie CK Cheung, Yue Dong
A Study of Implicit Ranking Unfairness in Large Language Models
Chen Xu, Wenjie Wang, Yuxin Li, Liang Pang, Jun Xu, Tat-Seng Chua
Compression Parity: Measuring and Predicting the Multilingual Capabilities of Language Models
Alexander Tsvetkov, Alon Kipnis
Better Call SAUL: Fluent and Consistent Language Model Editing with Generation Regularization
Mingyang Wang, Lukas Lange, Heike Adel, Jannik Strötgen, Hinrich Schuetze
Can LLMs Learn From Mistakes? An Empirical Study on Reasoning Tasks
Shengnan An, Zexiong Ma, Siqi Cai, Zeqi Lin, Nanning Zheng, Jian-Guang Lou, Weizhu Chen
A Semantic Search Engine for Mathlib4
Guoxiong Gao, Haocheng Ju, Jiedong Jiang, Zihan Qin, Bin Dong
DyKnow: Dynamically Verifying Time-Sensitive Factual Knowledge in LLMs
Seyed Mahed Mousavi, Simone Alghisi, giuseppe riccardi
Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
Huifang Du, Shuqin Li, Minghao Wu, Xuejing Feng, Yuan-Fang Li, Haofen Wang
Assistive Large Language Model Agents for Socially-Aware Negotiation Dialogues
YUNCHENG HUA, Lizhen Qu, Reza Haf
HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing
Jing Chen, Xinyu Zhu, Cheng Yang, Chufan Shi, Yadong Xi, Yuxiang Zhang, Junjie Wang, Jiashu Pu, Rongsheng Zhang, Yujiu Yang, Tian Feng
General Collaborative Framework between Large Language Model and Experts for Universal Information Extraction
K Bao, Ning Wang
Causal Discovery Inspired Unsupervised Domain Adaptation for Emotion-Cause Pair Extraction
YUNCHENG HUA, Yujin Huang, Shuo Huang, Tao Feng, Lizhen Qu, Christopher Bain, Richard Bassed, Reza Haf
Large Language Models are Students at Various Levels: Zero-shot Question Difficulty Estimation
Jae-Woo Park, Seong-Jin Park, Hyun-Sik Won, Mingyu Lee, Kang-Min Kim
Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data
Han Xia, Songyang Gao, Qiming Ge, Zhiheng Xi, Qi Zhang, Xuanjing Huang
Temporal Cognitive Tree: A Hierarchical Modeling Approach for Event Temporal Relation Extraction
Wanting Ning, Lishuang Li, Xueyang Qin, Yubo Feng, Jingyao Tang
Activation Scaling for Attribution and Intervention in Language Models
Niklas Stoehr, Kevin Du, Vésteinn Snæbjarnarson, Robert West, Ryan Cotterell, Aaron Schein
LaRA: Large Rank Adaptation for Speech and Text Cross-Modal Learning in Large Language Models
Zuhair hasan shaik, Pradyoth Hegde, Prashant Bannulmath, Deepak K T
DTS-SQL: Decomposed Text-to-SQL with Small Large Language Models
Mohammadreza Pourreza, Davood Rafiei
MedINST: Meta Dataset of Biomedical Instructions
Wenhan Han, Meng Fang, Zihan Zhang, Yu Yin, Zirui Song, Ling Chen, Mykola Pechenizkiy, Qingyu Chen
PropTest: Automatic Property Testing for Improved Visual Programming
Jaywon Koo, Ziyan Yang, Paola Cascante-Bonilla, Baishakhi Ray, Vicente Ordonez
BaFair: Backdoored Fairness Attacks with Group-conditioned Triggers
Jiaqi Xue, Qian Lou, Mengxin Zheng
Is GPT-4V (ision) All You Need for Automating Academic Data Visualization? Exploring Vision-Language Models’ Capability in Reproducing Academic Charts
Zhehao Zhang, Weicheng Ma, Soroush Vosoughi
Financial Forecasting from Textual and Tabular Time Series
Ross Koval, Nicholas Andrews, Xifeng Yan
Learning to Ask Denotative and Connotative Questions for Knowledge-based VQA
Xiaoying Xing, Peixi Xiong, Lei Fan, Yunxuan Li, Ying Wu
CONTOR: Benchmarking Strategies for Completing Ontologies with Plausible Missing Rules
Na Li, Thomas Bailleux, Zied Bouraoui, Steven Schockaert
Towards Pareto-Efficient RLHF: Paying Attention to a Few High-Reward Samples
Changhun Lee, Chiehyeon Lim
Weak-to-Strong Reasoning
Yuqing Yang, Yan Ma, Pengfei Liu
Fine-Tuning Language Models with Differential Privacy through Adaptive Noise Allocation
Xianzhi Li, Ran Zmigrod, Xiaodan Zhu, Zhiqiang Ma, Xiaomo Liu
The Mystery of Compositional Generalization in Graph-based Generative Commonsense Reasoning
Xiyan Fu, Anette Frank
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Xiyang Wu, Tianrui Guan, Dianqi Li, Shuaiyi Huang, Xiaoyu Liu, Xijun Wang, Ruiqi Xian, Abhinav Shrivastava, Furong Huang, Jordan Lee Boyd-Graber, Tianyi Zhou, Dinesh Manocha
MetaKP: On-Demand Keyphrase Generation
Di Wu, Xiaoxian Shen, Kai-Wei Chang
PSST: A Benchmark for Evaluation-driven Text Public-Speaking Style Transfer
Huashan Sun, Yixiao Wu, Yizhe Yang, Yinghao Li, Jiawei Li, Yuhao Ye, Yang Gao
LongGenBench: Long-context Generation Benchmark
Xiang Liu, Peijie Dong, Xuming Hu, Xiaowen Chu
TRACE the Evidence: Constructing Knowledge-Grounded Reasoning Chains for Retrieval-Augmented Generation
Jinyuan Fang, Zaiqiao Meng, Craig MacDonald
Enable Fast Sampling for Seq2Seq Text Diffusion
Pan Liu, Xiaohua Tian, Zhouhan Lin
AlignSum: Data Pyramid Hierarchical Fine-tuning for Aligning with Human Summarization Preference
Yang Han, Yiming Wang, Rui Wang, Lu Chen, Kai Yu
CHIRON: Rich Character Representations in Long-Form Narratives
Alexander Gurung, Mirella Lapata
$\textit{Refiner}$: Restructure Retrieved Content Efficiently to Advance Question-Answering Capabilities
Zhonghao Li, Xuming Hu, Aiwei Liu, Kening Zheng, Sirui Huang, Hui Xiong
SEAVER: Attention Reallocation for Mitigating Distractions in Language Models for Conditional Semantic Textual Similarity Measurement
Baixuan Li, Yunlong Fan, Zhiqiang Gao
Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models
Shixin Jiang, Zerui Chen, Jiafeng Liang, Yanyan Zhao, Ming Liu, Bing Qin
LPZero: Language Model Zero-cost Proxy Search from Zero
Peijie Dong, Lujun Li, Xiang Liu, Zhenheng Tang, Xuebo Liu, Qiang Wang, Xiaowen Chu
Traffic Light or Light Traffic? Investigating Phrasal Semantics in Large Language Models
Rui Meng, Ye Liu, Lifu Tu, Daqing He, Yingbo Zhou, Semih Yavuz
How Far Can In-Context Alignment Go? Exploring the State of In-Context Alignment
Heyan Huang, Yinghao Li, Huashan Sun, Yu Bai, Yang Gao
Variational Language Concepts for Interpreting Pretrained Language Models
Hengyi Wang, Zhiqing Hong, Shiwei Tan, Desheng Zhang, Hao Wang
Exploring the Capability of Multimodal LLMs with Yonkoma Manga: The YManga Dataset and Its Challenging Tasks
Qi Yang, Liang Yang, Jingjie Zeng, Zhihao Yang, Hongfei Lin
TWBias: A Benchmark for Assessing Social Bias in Traditional Chinese Large Language Models within the Taiwan Cultural Context
Hsin-Yi Hsieh, Shih-Cheng Huang, Richard Tzong-Han Tsai
Unlocking the Potential of Model Merging for Low-Resource Languages
Mingxu Tao, Chen Zhang, Quzhe Huang, Tianyao Ma, Songfang Huang, Dongyan Zhao, Yansong Feng
PURE: Aligning LLM via Pluggable Query Reformulation for Enhanced Helpfulness
Wenjin Yao, Yidong Wang, Zhuohao Yu, Rui Xie, Shikun Zhang, Wei Ye
MMedAgent: Learning to Use Medical Tools with Multi-modal Agent
Binxu Li, Tiankai Yan, Yuanting Pan, Jie Luo, Ruiyang Ji, Jiayuan Ding, Zhe Xu, Shilong Liu, Haoyu Dong, Zihao Lin, Yixin Wang
SALMON: A Structure-Aware Language Model with logicality and densification strategy for Temporal Knowledge Graph Reasoning
Fu Zhang, Jinghao Lin, Jingwei Cheng
RaFe: Ranking Feedback Improves Query Rewriting for RAG
Shengyu Mao, Yong Jiang, Boli Chen, Xiao Li, Peng Wang, Xinyu Wang, Pengjun Xie, Fei Huang, Huajun Chen, Ningyu Zhang
Amateur-free Contrastive Decoding via Cognitive Layers Skipping
Wenhao Zhu, Sizhe Liu, Shujian Huang, Shuaijie She, Chris Wendler, Jiajun Chen
The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models
Bolei Ma, Xinpeng Wang, Tiancheng Hu, Anna-Carolina Haensch, Michael A. Hedderich, Barbara Plank, Frauke Kreuter
Low-Resource Machine Translation through the Lens of Personalized Federated Learning
Viktor Moskvoretskii, Nazarii Tupitsa, Chris Biemann, Samuel Horváth, Eduard Gorbunov, Irina Nikishina
Can Language Models Recognize Convincing Arguments?
Paula Rescala, Manoel Horta Ribeiro, Tiancheng Hu, Robert West
Knowledge Navigator: Hierarchical Subtopic Organization for Exploratory Search in Scientific Literature
Uri Katz, Mosh Levy, Yoav Goldberg
Scalable and Domain-General Abstractive Proposition Segmentation
Mohammad Javad Hosseini, Yang Gao, Tim Baumgärtner, Alex Fabrikant, Reinald Kim Amplayo
Hit the Nail on the Head: Parameter-Efficient Multi-task Tuning via Human Language Intervention
wenxuan lu, Songhao Jiang, WangYijing, Tianning Zang
BASES: Large-scale Web Search User Simulation with Large Language Model based Agents
Ruiyang Ren, Peng Qiu, Yingqi Qu, Jing Liu, Xin Zhao, Hua Wu, Ji-Rong Wen, Haifeng Wang
LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning
Jiachun Li, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Jun Zhao
Beyond Agreement: Diagnosing the Rationale Alignment of Automated Essay Scoring Methods based on Linguistically-informed Counterfactuals
Yupei Wang, Renfen Hu, Zhe Zhao
TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models
Chen Zhang, chengguang tang, Dading Chong, Ke Shi, Guohua Tang, Feng Jiang, Haizhou Li
Datasets for Multilingual Answer Sentence Selection
Matteo Gabburo, Stefano Campese, Federico Agostini, Alessandro Moschitti
Active Learning for Abstractive Text Summarization via LLM-Determined Curriculum and Certainty Gain Maximization
Dongyuan Li, Ying Zhang, Zhen Wang, Shiyin Tan, Satoshi Kosugi, Manabu Okumura
Question-guided Knowledge Graph Re-scoring and Injection for Knowledge Graph Question Answering
Yu Zhang, Kehai Chen, Xuefeng Bai, zhao kang, Quanjiang Guo, Min Zhang
Achieving Stronger Generation via Simple Contrastive Tuning
Zhimeng Wang, Pinzheng Wang, Juntao Li, Yibin Chen, Min Zhang
Make Large Language Model a Better Ranker
Wenshuo Chao, Zhi Zheng, Hengshu Zhu, Hao Liu
Forecasting Future International Events: A Reliable Dataset for Text-Based Event Modeling
Daehoon Gwak, Junwoo Park, Minho Park, ChaeHun Park, Hyunchan Lee, Edward Choi, Jaegul Choo
QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMs
Minsang Kim, Seung Jun Baek
ICON: Improving Inter-Report Consistency of Radiology Report Generation via Lesion-aware Mix-up Augmentation
Wenjun Hou, Yi Cheng, Kaishuai Xu, Yan Hu, Wenjie Li, Jiang Liu
DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models
KediChen, Qin Chen, Jie Zhou, He Yishen, Liang He
ExpertEase: A Multi-Agent Framework for Grade-Specific Document Simplification with Large Language Models
Kaijie Mo, Renfen Hu
Class Name Guided Out-of-Scope Intent Classification
Chandan Gautam, Sethupathy Parameswaran, Aditya Kane, Yuan Fang, Savitha Ramasamy, Suresh Sundaram, Sunil Kumar Sahu, Xiaoli Li
Search if you don’t know! Knowledge-Augmented Korean Grammatical Error Correction with Large Language Models
Seonmin Koo, Jinsung Kim, Chanjun Park, Heuiseok Lim
Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
Qin Zhu, Qinyuan Cheng, Runyu Peng, Xiaonan Li, Ru Peng, Tengxiao Liu, Xipeng Qiu, Xuanjing Huang
MultiVerse: Efficient and Expressive Zero-Shot Multi-Task Text-to-Speech
Taejun Bak, Youngsik Eom, SeungJae Choi, Young-Sun Joo
RoBERT2VecTM: A Novel Approach for Topic Extraction in Islamic Studies
Sania Aftar, Amina El Ganadi, Luca Gagliardelli, Sonia Bergamaschi
Are ELECTRA’s Sentence Embeddings Beyond Repair? The Case of Semantic Textual Similarity
Ivan Rep, David Dukić, Jan Snajder
DetectiveNN: Imitating Human Emotional Reasoning with a Recall-Detect-Predict Framework for Emotion Recognition in Conversations
Simin Hong, Jun Sun, Taihao Li
HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node Classification on Text-Attributed Hypergraphs
Adrián Bazaga, Pietro Lio, Gos Micklem
On Diversified Preferences of Large Language Model Alignment
Dun Zeng, Yong Dai, Pengyu Cheng, Longyue Wang, Tianhao Hu, Wanshun CHEN, nan du, Zenglin Xu
LoRAExit: Empowering Dynamic Modulation of LLMs in Resource-limited Settings using Low-rank Adapters
Jiacheng Liu, Peng Tang, Xiaofeng Hou, Chao Li, Pheng-Ann Heng
Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning
Tianhui Zhang, Bei Peng, Danushka Bollegala
CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code
Batu Guan, Yao Wan, Zhangqian Bi, Zheng Wang, Hongyu Zhang, Yulei Sui, Pan Zhou, Lichao Sun
SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning
Joseph Marvin Imperial, Harish Tayyar Madabushi
StablePT : Towards Stable Prompting for Few-shot Learning via Input Separation
Xiaoming Liu, Chen Liu, Zhaohan Zhang, Chengzhengxu Li, Longtian Wang, Yu Lan, Chao Shen
Natural Evolution-based Dual-Level Aggregation for Temporal Knowledge Graph Reasoning
Bin Chen, Chunjing Xiao, Fan Zhou
Creative and Context-Aware Translation of East Asian Idioms with GPT-4
Kenan Tang, Peiyang Song, Yao Qin, Xifeng Yan
Towards Implicit Bias Detection and Mitigation in Multi-Agent LLM Interactions
Angana Borah, Rada Mihalcea
Devil’s Advocate: Anticipatory Reflection for LLM Agents
Haoyu Wang, Tao Li, Zhiwei Deng, Dan Roth, Yang Li
HiGenQA: Exploring Hint Generation Approaches for Open Domain Question Answering
Jamshid Mozafari, Abdelrahman Abdallah, Bhawna Piryani, Adam Jatowt
On the Causal Nature of Sentiment Analysis
Zhiheng Lyu, Zhijing Jin, Fernando Gonzalez Adauto, Rada Mihalcea, Bernhard Schölkopf, Mrinmaya Sachan
PEDANTS (Precise Evaluations of Diverse Answer Nominee Text for Skinflints): Use Evaluation Metrics Wisely–Efficient Evaluation Analysis and Benchmarking for Open-Domain Question Answering
Zongxia Li, Ishani Mondal, Huy Nghiem, Yijun Liang, Jordan Lee Boyd-Graber
AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation
Zhitao He, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Jiexin Xu, Huaijun Li, Kang Liu, Jun Zhao
Editing the Mind of Giants: An In-Depth Exploration of Pitfalls of Knowledge Editing in Large Language Models
Cheng-Hsun Hsueh, Paul Kuo-Ming Huang, Tzu-Han Lin, CHE WEI LIAO, Hung-Chieh Fang, Chao-Wei Huang, Yun-Nung Chen
Explaining Language Models via Randomized Path-Integration
Oren Barkan, Yehonatan Elisha, Yonatan toib, Jonathan Weill, Noam Koenigstein
VeriScore: Evaluating the factuality of verifiable claims in long-form text generation
Yixiao Song, Yekyung Kim, Mohit Iyyer
Instruct, Not Assist: LLM-based Multi-Turn Planning and Hierarchical Questioning for Socratic Code Debugging
Priyanka Kargupta, Ishika Agarwal, Dilek Hakkani Tur, Jiawei Han
Tutor-ICL: Guiding Large Language Models for Improved In-Context Learning Performance
Ikhyun Cho, Gaeul Kwon, Julia Hockenmaier
Conversation Redirection in Mental Health Therapy
Vivian Nguyen, Sang Min Jung, Lillian Lee, Thomas D. Hull, Cristian Danescu-Niculescu-Mizil
Explainability via Attributive Masking Learning
Oren Barkan, Yonatan toib, Yehonatan Elisha, Jonathan Weill, Noam Koenigstein
How Entangled is Factuality and Deception in German?
Aswathy Velutharambath, Amelie Wuehrl, Roman Klinger
Train Once, Use Flexibly: A Modular Framework for Multi-Aspect Neural News Recommendation
Andreea Iana, Goran Glavaš, Heiko Paulheim
A LLM-based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation
Irune Zubiaga, Aitor Soroa, Rodrigo Agerri
A Survey on Open Information Extraction from Rule-based Model to Large Language Model
Liu Pai, Wenyang Gao, Wenjie Dong, Lin Ai, Ziwei Gong, Songfang Huang, Li Zongsheng, Ehsan Hoque, Julia Hirschberg, Yue Zhang
Enhancing Tool Retrieval with Iterative Feedback from Large Language Models
Xu Qiancheng, Yongqi Li, Heming Xia, Wenjie Li
Detecting Temporal Ambiguity in Questions
Bhawna Piryani, Abdelrahman Abdallah, Jamshid Mozafari, Adam Jatowt
LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation
Seyedarmin Azizi, Souvik Kundu, Massoud Pedram
Measuring the Robustness of NLP Models to Domain Shifts
Nitay Calderon, Naveh Porat, Eyal Ben-David, Alexander Chapanin, Zorik Gekhman, Nadav Oved, Vitaly Shalumov, Roi Reichart
Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models
Kenza Benkirane, Laura Gongas, Shahar Pelles, Naomi Fuchs, Joshua Darmon, Pontus Stenetorp, David Ifeoluwa Adelani, Eduardo Sánchez
Navigating Hallucinations for Reasoning of Unintentional Activities
Shresth Grover, Vibhav Vineet, Yogesh S Rawat
Pruning Foundation Models for High Accuracy without Retraining
Pu Zhao, Fei Sun, Xuan Shen, Pinrui Yu, Zhenglun Kong, Xue Lin, Yanzhi Wang
From Pixels to Personas: Investigating and Modeling Self-Anthropomorphism in Human-Robot Dialogues
Yu Li, Devamanyu Hazarika, Di Jin, Julia Hirschberg, Yang Liu
DisGeM: Distractor Generation for Multiple Choice Questions with Span Masking
Devrim Çavuşoğlu, Seçil Şen, Ulaş Sert
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline
Yifan Xu, Xiao Liu, Xinghan Liu, Zhenyu Hou, Yueyan Li, Xiaohan Zhang, Zihan Wang, Aohan Zeng, Zhengxiao Du, Zhao wenyi, Jie Tang, Yuxiao Dong
MobileQuant: Mobile-friendly Quantization for On-device Language Models
Fuwen Tan, Royson Lee, Łukasz Dudziak, Shell Xu Hu, Sourav Bhattacharya, Timothy Hospedales, Georgios Tzimiropoulos, Brais Martinez
Do they mean ‘us’? Interpreting Referring Expressions in Intergroup Bias
Venkata Subrahmanyan Govindarajan, Matianyu Zang, Kyle Mahowald, David Beaver, Junyi Jessy Li
A Survey on Detection of LLMs-Generated Content
Xianjun Yang, Liangming Pan, Xuandong Zhao, Haifeng Chen, Linda Ruth Petzold, William Yang Wang, Wei Cheng
Can LLMs Reason in the Wild with Programs?
Yuan Yang, Siheng Xiong, Ali Payani, Ehsan Shareghi, Faramarz Fekri
Can Textual Unlearning Solve Cross-Modality Safety Alignment?
Trishna Chakraborty, Erfan Shayegani, Zikui Cai, Nael B. Abu-Ghazaleh, M. Salman Asif, Yue Dong, Amit Roy-Chowdhury, Chengyu Song
VDebugger: Harnessing Execution Feedback for Debugging Visual Programs
Xueqing Wu, Zongyu Lin, Songyan Zhao, Te-Lin Wu, Pan Lu, Nanyun Peng, Kai-Wei Chang
Monotonic Paraphrasing Improves Generalization of Language Model Prompting
Qin Liu, Fei Wang, Nan Xu, Tianyi Lorena Yan, Tao Meng, Muhao Chen
MORL-Prompt: An Empirical Analysis of Multi-Objective Reinforcement Learning for Discrete Prompt Optimization
Yasaman Jafari, Dheeraj Mekala, Rose Yu, Taylor Berg-Kirkpatrick
Understanding Faithfulness and Reasoning of Large Language Models on Plain Biomedical Summaries
Biaoyan Fang, Xiang Dai, Sarvnaz Karimi
Change Is the Only Constant: Dynamic LLM Slicing based on Layer Redundancy
Razvan-Gabriel Dumitru, Paul Ioan Clotan, Vikas Yadav, Darius Peteleaza, Mihai Surdeanu
API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access
Jiayuan Su, Jing Luo, Hongwei Wang, Lu Cheng
Pruning Multilingual Large Language Models for Multilingual Inference
Hwichan Kim, Jun Suzuki, Tosho Hirasawa, Mamoru Komachi
Video Discourse Parsing and Its Application to Multimodal Summarization: A Dataset and Baseline Approaches
Tsutomu Hirao, Naoki Kobayashi, Hidetaka Kamigaito, Manabu Okumura, Akisato Kimura
Length Extrapolation of Transformers: A Survey from the Perspective of Positional Encoding
Liang Zhao, Xiachong Feng, Xiaocheng Feng, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin, Ting Liu
VPL: Visual Proxy Learning Framework for Zero-Shot Medical Image Diagnosis
Jiaxiang Liu, Tianxiang Hu, Huimin Xiong, Jiawei Du, YANG FENG, Jian Wu, Joey Tianyi Zhou, Zuozhu Liu
Word-Conditioned 3D American Sign Language Motion Generation
Lu Dong, Xiao Wang, Ifeoma Nwogu
TrustAgent: Towards Safe and Trustworthy LLM-based Agents through Agent Constitution
Wenyue Hua, Xianjun Yang, Mingyu Jin, Zelong Li, Wei Cheng, Ruixiang Tang, Yongfeng Zhang
Enabling Cross-Platform Comparison of Online Communities Using Content and Opinion Similarity
Prasanna Lakkur Subramanyam, Jeng-Yu Chou, Kevin K. Nam, Brian Levine
CNEQ: Incorporating numbers into Knowledge Graph Reasoning
Xianshu Peng, Wei Wei, Kaihe xu, Dangyang Chen
StraGo: Harnessing Strategic Guidance for Prompt Optimization
Yurong Wu, Yan Gao, Bin Benjamin Zhu, Zineng Zhou, Xiaodi Sun, Sheng Yang, Jian-Guang Lou, Zhiming Ding, Linjun Yang
Learning to Plan by Updating Natural Language
Yiduo Guo, Yaobo Liang, Chenfei Wu, Wenshan Wu, Dongyan Zhao, Nan Duan
Introducing Compiler Semantics into Large Language Models as Programming Language Translators: A Case Study of C to x86 Assembly
Shuoming Zhang, Jiacheng Zhao, Chunwei Xia, Zheng Wang, Yunji Chen, Huimin Cui
C-ICL: Contrastive In-context Learning for Information Extraction
Ying Mo, Jiahao Liu, Jian Yang, Qifan Wang, Shun Zhang, Jingang Wang, Zhoujun Li
On the Similarity of Circuits across Languages: a Case Study on the Subject-verb Agreement Task
Javier Ferrando, Marta R. Costa-jussà
Can LLM be a Personalized Judge?
Yijiang River Dong, Tiancheng Hu, Nigel Collier
Who’s Who: Large Language Models Meet Knowledge Conflicts in Practice
Quang Hieu Pham, Hoang Ngo, Anh Tuan Luu, Dat Quoc Nguyen
Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models
Shitian Zhao, Renrui Zhang, Xu Luo, Yan Wang, Shanghang Zhang, Peng Gao
Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis
Jianxiang Yu, Zichen Ding, Jiaqi Tan, Kangyang Luo, Zhenmin Weng, Chenghua Gong, Long Zeng, RenJing Cui, Chengcheng Han, Qiushi Sun, Zhiyong Wu, Yunshi Lan, Xiang Li
Knowledge-based Consistency Testing of Large Language Models
Sai Sathiesh Rajan, Ezekiel Soremekun, Sudipta Chattopadhyay
PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes
He CAO, Yanjun Shao, Zhiyuan Liu, Zijing Liu, Xiangru Tang, Yuan Yao, Yu Li
Adaptive Selection for Homogeneous Tools: An Instantiation in the RAG Scenario
Feiteng Mu, Yong Jiang, Liwen Zhang, Liuchu, Wenjie Li, Pengjun Xie, Fei Huang
MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding
Qinzhuo Wu, Weikai Xu, Wei Liu, Tao Tan, Liujianfeng, Ang Li, Jian Luan, Bin Wang, Shuo Shang
Schema-Driven Information Extraction from Heterogeneous Tables
Fan Bai, Junmo Kang, Gabriel Stanovsky, Dayne Freitag, Mark Dredze, Alan Ritter
Is There a One-Model-Fits-All Approach to Information Extraction? Revisiting Task Definition Biases
Wenhao Huang, Qianyu He, Zhixu Li, Jiaqing Liang, Yanghua Xiao
PromptIntern: Saving Inference Costs by Internalizing Recurrent Prompt during Large Language Model Fine-tuning
Jiaru Zou, Mengyu Zhou, Tao Li, Shi Han, Dongmei Zhang
TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning
Yuan Sui, Jiaru Zou, Mengyu Zhou, Xinyi He, Lun Du, Shi Han, Dongmei Zhang
In2Core: Leveraging Influence Functions for Coreset Selection in Instruction Finetuning of Large Language Models
Ayrton San Joaquin, Bin Wang, Zhengyuan Liu, Philippe Muller, Nicholas Asher, Brian Lim, Nancy F. Chen
How Personality Traits Influence Negotiation Outcomes? A Simulation based on Large Language Models
Yin Jou Huang, Rafik Hadfi
Introducing Spatial Information and a Novel Evaluation Scheme for Open-Domain Live Commentary Generation
Erica Kido Shimomoto, Edison Marrese-Taylor, Ichiro Kobayashi, Hiroya Takamura, Yusuke Miyao
Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation
Bolei He, CHENNUO, Xinran He, Lingyong Yan, zhenkai wei, Jinchang Luo, Zhen-Hua Ling
Detecting Machine-Generated Long-Form Content with Latent-Space Variables
Yufei Tian, Zeyu Pan, Nanyun Peng
Learning to Match Representations is Better for End-to-End Task-Oriented Dialog System
Wanshi Xu
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
Zhexin Zhang, Yida Lu, Jingyuan Ma, Di Zhang, Rui Li, Pei Ke, Hao Sun, Lei Sha, Zhifang Sui, Hongning Wang, Minlie Huang
BiasDora: Exploring Hidden Biased Associations in Vision-Language Models
Chahat Raj, Anjishnu Mukherjee, Aylin Caliskan, Antonios Anastasopoulos, Ziwei Zhu
MoE-I$^2$: Compressing Mixture of Experts Models through Inter-Expert Pruning and Intra-Expert Low-Rank Decomposition
Cheng Yang, Yang Sui, Jinqi Xiao, Lingyi Huang, Yu Gong, Yuanlin Duan, Wenqi Jia, Miao Yin, Yu Cheng, Bo Yuan
Multimodal Misinformation Detection by Learning from Synthetic Data with Multimodal LLMs
Fengzhu ZENG, Wenqian Li, Wei Gao, Yan Pang
Exploring Design Choices for Building Language-Specific LLMs
Atula Tejaswi, Nilesh Gupta, Eunsol Choi
Promoting Data and Model Privacy in Federated Learning through Quantized LoRA
Zhu JianHao, Changze Lv, Xiaohua Wang, Muling Wu, Wenhao Liu, Tianlong Li, Zixuan Ling, Cenyuan Zhang, Xiaoqing Zheng, Xuanjing Huang
Intended Target Identification for Anomia Patients with Gradient-based Selective Augmentation
Jongho Kim, Romain Storaï, seung-won hwang
Fine-tuning Smaller Language Models for Question Answering over Financial Documents
Karmvir Singh Phogat, Sai Akhil Puranam, Sridhar Dasaratha, Chetan Harsha, Shashishekar Ramakrishna
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs.
Clement Christophe, Tathagata Raha, Svetlana Maslenkova, Muhammad Umar Salman, Praveenkumar Kanithi, Marco AF Pimentel, Shadab Khan
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation
Yusheng Liao, Shuyang Jiang, Zhe Chen, Yu Wang, Yanfeng Wang
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts
Haoxiang Wang, Wei Xiong, Tengyang Xie, Han Zhao, Tong Zhang
Code Membership Inference for Detecting Unauthorized Data Use in Code Pre-trained Language Models
Sheng Zhang, Hui Li, Rongrong Ji
Learning When to Retrieve, What to Rewrite, and How to Respond in Conversational QA
Nirmal Roy, Leonardo F. R. Ribeiro, Rexhina Blloshmi, Kevin Small
Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication
Weize Chen, Chenfei Yuan, Jiarui Yuan, Yusheng Su, Chen Qian, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun
Learning to Use Tools via Cooperative and Interactive Agents
Zhengliang Shi, Shen Gao, Xiuyi Chen, Yue Feng, Lingyong Yan, Haibo Shi, Dawei Yin, Pengjie Ren, Suzan Verberne, Zhaochun Ren
STARD: A Chinese Statute Retrieval Dataset Derived from Real-life Queries by Non-professionals
Weihang Su, Yiran HU, Anzhe Xie, Qingyao Ai, quezibing, Ning Zheng, Yun Liu, Weixing Shen, Yiqun LIU
What if…?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-modal Models
Junho Kim, KIM YEONJU, Yong Man Ro
MELT: Materials-aware Continued Pre-training for Language Model Adaptation to Materials Science
Junho Kim, Yeachan Kim, Jun-Hyung Park, Yerim Oh, Suho Kim, SangKeun Lee
PDF-to-Tree: Parsing PDF Text Blocks into a Tree
Yue Zhang, Zhihao Zhang, Wenbin Lai, Chong Zhang, Tao Gui, Qi Zhang, Xuanjing Huang
Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes
Dibyanayan Bandyopadhyay, Mohammed Hasanuzzaman, Asif Ekbal
Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language Models
Minseok Choi, Kyunghyun Min, Jaegul Choo
XLLaMA2: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages
Yinquan Lu, Wenhao Zhu, Lei Li, Yu Qiao, Fei Yuan
Enhancing Emotion-Cause Pair Extraction in Conversations via Center Event Detection and Reasoning
Botao Wang, Keke Tang, Peican Zhu
Light-weight Fine-tuning Method for Defending Adversarial Noise in Pre-trained Medical Vision-Language Models
Xu Han, Linghao Jin, Xuezhe Ma, Xiaofeng Liu
Together We Can: Mulitlingual Automatic Post-Editing for Low-Resource Languages
Sourabh Dattatray Deoghare, Diptesh Kanojia, Pushpak Bhattacharyya
CERT-ED: Certifiably Robust Text Classification for Edit Distance
Zhuoqun Huang, Neil G Marchant, Olga Ohrimenko, Benjamin I. P. Rubinstein
Ask-before-Plan: Proactive Language Agents for Real-World Planning
Xuan Zhang, Yang Deng, Zifeng Ren, See-Kiong Ng, Tat-Seng Chua
From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models
Qianyu He, Jie Zeng, Qianxi He, Jiaqing Liang, Yanghua Xiao
FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Ruixuan Xiao, Wentao Ma, Ke Wang, Yuchuan Wu, Junbo Zhao, Haobo Wang, Fei Huang, Yongbin Li
Mental Disorder Classification via Temporal Representation of Text
Raja Kumar, Kishan Maharaj, Ashita Saxena, Pushpak Bhattacharyya
Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Yiming Chen, Xianghu Yue, Xiaoxue Gao, Chen Zhang, Luis Fernando D’Haro, Robby T. Tan, Haizhou Li
Multimodal Procedural Planning via Dual Text-Image Prompting
Yujie Lu, Pan Lu, Zhiyu Chen, Wanrong Zhu, Xin Eric Wang, William Yang Wang
Functionality learning through specification instructions
Pedro Henrique Luz de Araujo, Benjamin Roth
DictDis: Dictionary Constrained Disambiguation for Improved NMT
Ayush Maheshwari, Preethi Jyothi, Ganesh Ramakrishnan
Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation
Branislav Pecher, Jan Cegin, Robert Belanec, Jakub Simko, Ivan Srba, Maria Bielikova
Rethinking Code Refinement: Learning to Judge Code Efficiency
Minju Seo, Jinheon Baek, Sung Ju Hwang
Negating Negatives: Alignment with Human Negative Samples via Distributional Dispreference Optimization
Shitong Duan, Xiaoyuan Yi, Peng Zhang, Yan Liu, Zheng Liu, Tun Lu, Xing Xie, Ning Gu
Selection-p: Self-Supervised Task-Agnostic Prompt Compression for Faithfulness and Transferability
Tsz Ting Chung, Leyang Cui, Lemao Liu, Xinting Huang, Shuming Shi, Dit-Yan Yeung
Adaptive Token Biaser: Knowledge Editing via Biasing Key Entities
Baolong Bi, Shenghua Liu, Yiwei Wang, Lingrui Mei, Hongcheng Gao, Yilong Xu, Xueqi Cheng
Improving Factual Consistency of News Summarization by Contrastive Preference Optimization
Huawen Feng, Yan Fan, Xiong Liu, Ting-En Lin, ZekunYao, Yuchuan Wu, Fei Huang, Yongbin Li, Qianli Ma
AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding
Alessandro Suglia, Claudio Greco, Katie Baker, Jose L. Part, Ioannis Papaioannou, Arash Eshghi, Ioannis Konstas, Oliver Lemon
Platform-Invariant Topic Modeling via Contrastive Learning to Mitigate Platform-Induced Bias
Minseo Koo, DoeunKim, Sungwon Han, Sungkyu Shaun Park
MAVEN-FACT: A Large-scale Event Factuality Detection Dataset
Chunyang Li, Hao Peng, Xiaozhi Wang, Yunjia Qi, Lei Hou, Bin Xu, Juanzi Li
Retrieval-Augmented Code Generation for Situated Action Generation: A Case Study on Minecraft
Kranti CH, Sherzod Hakimov, David Schlangen
Make Compound Sentences Simple to Analyze: Learning to Split Sentences for Aspect-based Sentiment Analysis
Yongsik Seo, Sungwon Song, Ryang Heo, Jieyong Kim, Dongha Lee
LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement
Jiahao Ying, Mingbao Lin, Yixin Cao, Wei Tang, Bo Wang, Qianru Sun, Xuanjing Huang, Shuicheng YAN
ITER: Iterative Transformer-based Entity Recognition and Relation Extraction
Moritz Hennen, Florian Babl, Michaela Geierhos
Zero-shot Persuasive Chatbots with LLM-Generated Strategies and Information Retrieval
Kazuaki Furumai, Roberto Legaspi, Julio Cesar Vizcarra Romero, Yudai Yamazaki, Yasutaka Nishimura, Sina Semnani, Kazushi Ikeda, Weiyan Shi, Monica Lam
Logits Reranking via Semantic Labels for Hard Samples in Text Classification
Peijie Huang, Junbao Huang, Yuhong Xu, Weizhen li, Xisheng Xiao
Scaling Laws for Fact Memorization of Large Language Models
Xingyu Lu, Xiaonan Li, Qinyuan Cheng, Kai Ding, Xipeng Qiu
Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment
Orgest Xhelili, Yihong Liu, Hinrich Schuetze
Leveraging Web-Crawled Data for High-Quality Fine-Tuning
Jing Zhou, Chenglin Jiang, Wei Shen, Xiao Zhou, Xiaonan He
Designing Logic Pattern Templates for Counter-Argument Logical Structure Analysis
Shoichi Naito, Wenzhi Wang, Paul Reisert, Naoya Inoue, Camélia Guerraoui, Kenshi Yamaguchi, Jungmin Choi, Irfan Robbani, Surawat Pothong, Kentaro Inui
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs
Wenhua Cheng, Weiwei Zhang, Haihao Shen, Yiyang Cai, Xin He, Lv Kaokao, Yi Liu
Using LLMs to simulate students’ responses to exam questions
Luca Benedetto, Giovanni Aradelli, Antonia Donvito, Alberto Lucchetti, Andrea Cappelli, Paula Buttery
HSDreport: Heart Sound Diagnosis with Echocardiography Reports
Zihan Zhao, Pingjie Wang, Liudan Zhao, Yuchen Yang, Ya Zhang, Kun Sun, Xin Sun, Xin Zhou, Yu Wang, Yanfeng Wang
Repairing Catastrophic-Neglect in Text-to-Image Diffusion Models via Attention-Guided Feature Enhancement
Zhiyuan Chang, Mingyang Li, Junjie Wang, Yi Liu, Qing Wang, Yang Liu
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing
Jeonghun Yeo, Seunghee Han, Minsu Kim, Yong Man Ro
MDCR: A Dataset for Multi-Document Conditional Reasoning
Peter Baile Chen, Yi Zhang, Chunwei Liu, Sejal Gupta, Yoon Kim, Mike Cafarella
Will LLMs Sink or Swim? Exploring Decision-Making Under Pressure
Kyusik Kim, Hyeonseok Jeon, Jeongwoo Ryu, Bongwon Suh
Zero-shot Commonsense Reasoning over Machine Imagination
Hyuntae Park, Yeachan Kim, Jun-Hyung Park, SangKeun Lee
OffsetBias: Leveraging Debiased Data for Tuning Evaluators
Junsoo Park, Seungyeon Jwa, REN MEIYING, Daeyoung Kim, Sanghyuk Choi
A Framework of Knowledge Graph-Enhanced Large Language Model Based on Question Decomposition and Atomic Retrieval
Yading Li, Dandan Song, Changzhi Zhou, Yuhang Tian, Hao Wang, Ziyi Yang, Shuhao Zhang
Vanessa: Visual Connotation and Aesthetic Attributes Understanding Network for Multimodal Aspect-based Sentiment Analysis
Luwei Xiao, Rui Mao, Xulang Zhang, Liang He, Erik Cambria
Consistent Document-level Relation Extraction via Counterfactuals
Ali Modarressi, Abdullatif Köksal, Hinrich Schuetze
Enhancing Learning-Based Binary Code Similarity Detection Model through Adversarial Training with Multiple Function Variants
Lichen Jia, Chenggang Wu, Bowen Tang, Peihua Zhang, Zihan Jiang, Ning Liu, Jingfeng Zhang, Zhe Wang
Ask the experts: sourcing a high-quality nutrition counseling dataset through Human-AI collaboration
Simone Balloccu, Ehud Reiter, Karen Jia-Hui Li, Rafael Sargsyan, Vivek Kumar, Diego Reforgiato, Daniele Riboni, Ondrej Dusek
HealthAlignSumm : Utilizing Alignment for Multimodal Summarization of Code-Mixed Healthcare Dialogues
Akash Ghosh, Arkadeep Acharya, Sriparna Saha, Gaurav Pandey, Dinesh Raghu, Setu Sinha
Revisiting the Impact of Pursuing Modularity for Code Generation
Deokyeong Kang, KiJung Seo, Taeuk Kim
A Decoding Algorithm Based on Directed Acyclic Transformers for Length-Control Summarization
Chenyang Huang, Hao Zhou, Cameron Jen, Kangjie Zheng, Osmar Zaiane, Lili Mou
R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation
Fuda Ye, Shuangyin Li, Yongqi Zhang, Lei Chen
Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition
Aditya Kaushik Surikuchi, Raquel Fernández, Sandro Pezzelle
Gender Identity in Pretrained Language Models: An Inclusive Approach to Data Creation and Probing
Urban Knupleš, Agnieszka Falenska, Filip Miletić
“Vorbești Românește?” A Recipe to Train Powerful Romanian LLMs with English Instructions
Mihai Masala, Denis Ilie-Ablachim, Alexandru Dima, Dragos Georgian Corlatescu, Miruna-Andreea Zavelca, Ovio Olaru, Simina-Maria Terian, Andrei Terian, Marius Leordeanu, Horia Velicu, Marius Popescu, Mihai Dascalu, Traian Rebedea
Generalized Measures of Anticipation and Responsivity in Online Language Processing
Mario Giulianelli, Andreas Opedal, Ryan Cotterell
Towards Effective Counter-Responses: Aligning Human Preferences with Strategies to Combat Online Trolling
Huije Lee, Hoyun Song, Jisu Shin, Sukmin Cho, SeungYoon Han, Jong C. Park
Soda-Eval: Open-Domain Dialogue Evaluation in the age of LLMs
John Mendonça, Isabel Trancoso, Alon Lavie
Unveiling Hallucination in Text, Image, Video, and Audio Foundation Models: A Comprehensive Review
Pranab Sahoo, Prabhash Meharia, Akash Ghosh, Sriparna Saha, Vinija Jain, Aman Chadha
Employing Glyphic Information for Chinese Event Extraction with Vision-Language Model
Xiaoyi Bao, Jinghang Gu, Zhongqing Wang, Minjie Qiang, Chu-Ren Huang
Predicting generalization performance with correctness discriminators
Yuekun Yao, Alexander Koller
FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models
Junyi Zhu, Shuochen Liu, Yu Yu, Bo Tang, Yibo Yan, Zhiyu li, Feiyu Xiong, Tong Xu, Matthew B. Blaschko
Towards More Robust NLP System Evaluation: Handling Missing Scores in Benchmarks
Anas Himmi, Ekhine Irurozki, Nathan Noiry, Stephan Clémençon, Pierre Colombo
Can CLIP Count Stars? An Empirical Study on Quantity Bias in CLIP
Zeliang Zhang, Zhuo Liu, Mingqian Feng, Chenliang Xu
LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning
Silin Meng, Yiwei Wang, Cheng-Fu Yang, Nanyun Peng, Kai-Wei Chang
Mixed-Session Conversation with Egocentric Memory
Jihyoung Jang, Taeyoung Kim, Hyounghun Kim
CSLM: A Framework for Question Answering Dataset Generation through Collaborative Small Language Models
Yiming Wang, Yang Liu, Lingchen Wang, An Xiao
Large Language Models Can Not Perform Well in Understanding and Manipulating Natural Language at Both Character and Word Levels?
Yidan Zhang, Zhenan He
Virtual Context Enhancing Jailbreak Attacks with Special Token Injection
YuqiZhou, Lin Lu, Hanchi Sun, Lichao Sun, Pan Zhou
Think Twice Before Trusting: Self-Detection for Large Language Models through Comprehensive Answer Reflection
Moxin Li, Wenjie Wang, Fuli Feng, Fengbin ZHU, Qifan Wang, Tat-Seng Chua
Automating Easy Read Text Segmentation
Jesus Javier Calleja Perez, Thierry Etchegoyhen, Antonio David Ponce Martínez
Position Paper: Data-Centric AI in the Age of Large Language Models
Xinyi Xu, Zhaoxuan Wu, Rui Qiao, Arun Verma, Yao Shu, Jingtan Wang, Xinyuan Niu, Zhenfeng He, Jiangwei Chen, Zijian Zhou, Gregory Kang Ruey Lau, Hieu Dao, Lucas Agussurja, Rachael Hwee Ling Sim, Xiaoqiang Lin, Wenyang Hu, Zhongxiang Dai, Pang Wei Koh, Bryan Kian Hsiang Low
MATHWELL: Generating Educational Math Word Problems
Bryan R Christ, Jonathan Kropko, Thomas Hartvigsen
Resilience of Large Language Models for Noisy Instructions
Bin Wang, Chengwei Wei, Zhengyuan Liu, Geyu Lin, Nancy F. Chen
LLM-TOPLA: Efficient LLM Ensemble by Maximising Diversity
Selim Furkan Tekin, Fatih Ilhan, Tiansheng Huang, Sihao Hu, Ling Liu
Guided Knowledge Generation with Language Models for Commonsense Reasoning
Xiao Wei, Haoran Chen, Hang Yu, Hao Fei, Qian Liu
Augmenting Reasoning Capabilities of LLMs with Graph Structures in Knowledge Base Question Answering
Yuhang Tian, Dandan Song, Zhijing Wu, Changzhi Zhou, Hao Wang, Jun Yang, Jing Xu, Ruanmin Cao, HaoYu Wang
Position Paper: Creative Problem Solving in Large Language and Vision Models – What Would it Take?
Lakshmi Nair, Evana Gizzi, Jivko Sinapov
Cross-Lingual Multi-Hop Knowledge Editing – Benchmarks, Analysis and a Simple Contrastive Learning based Approach
Aditi Khandelwal, Harman Singh, Hengrui Gu, Tianlong Chen, Kaixiong Zhou
Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Jiwen Zhang, Jihao Wu, Teng Yihua, Minghui Liao, Nuo Xu, Xiao Xiao, zhongyu wei, Duyu Tang
Self-Recognition in Language Models
Tim Ruben Davidson, Viacheslav Surkov, Veniamin Veselovsky, Giuseppe Russo, Robert West, Caglar Gulcehre
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning
Daniele Rege Cambrin, Giuseppe Gallipoli, Irene Benedetto, Luca Cagliero, Paolo Garza
The Shape of Word Embeddings: Quantifying Non-Isometry with Topological Data Analysis
Ondřej Draganov, Steven Skiena
Towards Robust Evaluation of Unlearning in LLMs via Data Transformations
Abhinav Joshi, Shaswati Saha, Divyaksh Shukla, Sriram Vema, Harsh Jhamtani, Manas Gaur, Ashutosh Modi
Numbers Matter! Bringing Quantity-awareness to Retrieval Systems
Satya Almasian, Milena Bruseva, Michael Gertz
Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge
Young-Jun Lee, Dokyong Lee, junyoung youn, Kyeong-Jin Oh, Byungsoo Ko, Jonghwan Hyeon, Ho-Jin Choi
Dual-Phase Accelerated Prompt Optimization
Muchen Yang, Moxin Li, Yongle Li, Zijun Chen, Chongming Gao, Junqi Zhang, Yangyang Li, Fuli Feng
BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain
Kaisi Guan, Qian Cao, Yuchong Sun, Xiting Wang, Ruihua Song
ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering
Yifan Wu, Lutao Yan, Leixian Shen, Yunhai Wang, Nan Tang, Yuyu Luo
Communicate to Play: Pragmatic Reasoning for Efficient Cross-Cultural Communication
Isadora White, Sashrika Pandey, Michelle Pan
SAFARI: Cross-lingual Bias and Factuality Detection in News Media and News Articles
Dilshod Azizov, Zain Muhammad Mujahid, Hilal AlQuabeh, Preslav Nakov, Shangsong Liang
CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues
Makesh Narsimhan Sreedhar, Traian Rebedea, Shaona Ghosh, Jiaqi Zeng, Christopher Parisien
An LLM-Enabled Knowledge Elicitation and Retrieval Framework for Zero-Shot Cross-Lingual Stance Identification
Ruike Zhang, Yuan Tian, Penghui Wei, Daniel Dajun Zeng, Wenji Mao
TuringQ: Benchmarking AI Comprehension in Theory of Computation
Pardis Sadat Zahraei, Ehsaneddin Asgari
Learning to Refine with Fine-Grained Natural Language Feedback
Manya Wadhwa, Xinyu Zhao, Junyi Jessy Li, Greg Durrett
Implicit Personalization in Language Models: A Systematic Study
Zhijing Jin, Nils Heil, Jiarui Liu, Shehzaad Dhuliawala, Yahang Qi, Bernhard Schölkopf, Rada Mihalcea, Mrinmaya Sachan
When the Misidentified Adverbial Phrase Functions as a Complement
Yige Chen, Kyuwon Kim, KyungTae Lim, Jungyeul Park, Chulwoo Park
Unveiling Implicit Table Knowledge with Question-Then-Pinpoint Reasoner for Insightful Table Summarization
Kwangwook Seo, Jinyoung Yeo, Dongha Lee
Few-shot Pairwise Ranking Prompting: An Effective Non-Parametric Retrieval Model
Nilanjan Sinhababu, Andrew Parry, Debasis Ganguly, Debasis Samanta, Pabitra Mitra
Self-training Language Models in Arithmetic Reasoning
Marek Kadlčík, Michal Štefánik
NCPrompt: NSP-Based Prompt Learning and Contrastive Learning for Implicit Discourse Relation Recognition
Yuetong Rong, Yijun Mo
Efficient Pointwise-Pairwise Learning-to-Rank for News Recommendation
Nithish Kannen, Yao Ma, Gerrit J.J. Van den Burg, Jean Baptiste Faddoul
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
Han Guo, William Brandon, Radostin Cholakov, Jonathan Ragan-Kelley, Eric P. Xing, Yoon Kim
Distance-aware Calibration for Pre-trained Language Models Download PDF
Alberto Gasparin, Gianluca Detommaso
Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks
Jack Gallifant, Shan Chen, Pedro José Ferreira Moreira, Nikolaj Munch, Mingye Gao, Jackson Pond, Leo Anthony Celi, Hugo Aerts, Thomas Hartvigsen, Danielle Bitterman
To Err Is Human, but Llamas Can Learn It Too
Agnes Luhtaru, Taido Purason, Martin Vainikko, Maksym Del, Mark Fishel
PizzaCommonSense: A Dataset for Commonsense Reasoning about Intermediate Steps in Cooking Recipes
Aissatou Diallo, Antonis Bikakis, Luke Dickens, Anthony Hunter, Rob Miller
Enhancing Discourse Dependency Parsing with Sentence Dependency Parsing: A Unified Generative Method Based on Code Representation
Zizhuo Shen, Yanqiu Shao
SAFETY-J: Evaluating Safety with Critique
Yixiu Liu, Yuxiang Zheng, Shijie Xia, Jiajun Li, Yi Tu, Chaoling Song, Pengfei Liu
“Knowing When You Don’t Know”: A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation
Nandan Thakur, Luiz Bonifacio, Crystina Zhang, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Boxing Chen, Mehdi Rezagholizadeh, Jimmy Lin
Diverse and Effective Synthetic Data Generation for Adaptable Zero-Shot Dialogue State Tracking
James D. Finch, Jinho D. Choi
Can We Instruct LLMs to Compensate for Position Bias?
Meiru Zhang, Zaiqiao Meng, Nigel Collier
Textual Dataset Distillation via Language Model Embedding
Yefan Tao, Luyang Kong, Andrey Kan, Laurent Callot
TARA: Token-level Attribute Relation Adaptation for Multi-Attribute Controllable Text Generation
Yilin Cao, Jiahao Zhao, Ruike Zhang, Hanyi Zou, Wenji Mao
Guess You Will Think So: Adversarial User Intention Learning in Sequential Recommendation
Junjie Zhang, Ruobing Xie, Wenqi Sun, Leyu Lin, Xin Zhao, Ji-Rong Wen
Denoising Rationalization for Multi-hop Fact Verification via Multi-granular Explainer
Jiasheng Si, Yingjie Zhu, Wenpeng Lu, Deyu Zhou
README: Bridging Medical Jargon and Lay Understanding for Patient Education through Data-Centric NLP
Zonghai Yao, Nandyala Siddharth Kantu, Guanghao Wei, Hieu Tran, Zhangqi Duan, SUNJAE KWON, Zhichao Yang, hong yu
Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts
Taehun Cha, Donghun Lee
Cognitive Bias in Decision-Making with LLMs
Jessica Maria Echterhoff, Yao Liu, Abeer Alessa, Julian McAuley, Zexue He
Problem-Oriented Segmentation and Retrieval: Case Study on Tutoring Conversations
Rose E Wang, Pawan Wirawarn, Kenny Lam, Omar Khattab, Dorottya Demszky
Prompt-Based Bias Calibration for Better Zero/Few-Shot Learning of Language Models
Kang He, Yinghan Long, Kaushik Roy
Can’t Remember Details in Long Documents? You Need Some R&R
Devanshu Agrawal, Shang Gao, Martin Gajek
DAVINCI: Dataset for Detection of Violent Incidents
Hemank Lamba, Anton Abilov, Ke Zhang, Elizabeth M Olson, Henry Kudzanai Dambanemuya, João Cordovil Bárcia, David S. Batista, Christina Wille, Aoife Cahill, Joel R. Tetreault, Alejandro Jaimes
Improving Quotation Attribution with Fictional Character Embeddings
Gaspard Michel, Elena V. Epure, Romain Hennequin, Christophe Cerisara
Robust Text Classification: Analyzing Prototype-Based Networks
Zhivar Sourati, Darshan Girish Deshpande, Filip Ilievski, Kiril Gashteovski, Sascha Saralajew
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models
Shilong Li, Yancheng He, Hangyu Guo, Xingyuan Bu, Ge Bai, Jie Liu, Jiaheng Liu, Xingwei Qu, Yangguang Li, Wanli Ouyang, Wenbo Su, Bo Zheng
Improving Demonstration Diversity by Human-Free Fusing for Text-to-SQL
Dingzirui Wang, Longxu Dou, Xuanliang Zhang, Qingfu Zhu, Wanxiang Che
Compare without Despair: Reliable Preference Evaluation with Generation Separability
Sayan Ghosh, Tejas Srinivasan, Swabha Swayamdipta
Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning
Siwei Li, Yifan Yang, Yifei Shen, Fangyun Wei, Zongqing Lu, Lili Qiu, Yuqing Yang
SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models
Juan Pablo Munoz, Jinjie Yuan, Nilesh Jain
Securing Multi-turn Conversational Language Models from Distributed Backdoor Attacks
Terry Tong, Qin Liu, Jiashu Xu, Muhao Chen
InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
Mohammad Beigi, Ying Shen, Runing Yang, Zihao Lin, Qifan Wang, Ankith Mohan, Jianfeng He, Ming Jin, Chang-Tien Lu, Lifu Huang
All You Need is Attention: Lightweight Attention-based Data Augmentation for Text Classification
Junehyung Kim, Sungjae Hwang
A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models
Ashutosh Sathe, Prachi Jain, Sunayana Sitaram
Adversarial Attacks on Parts of Speech: An Empirical Study in Text-to-Image Generation
G M Shahariar, Jia Chen, Jiachen Li, Yue Dong
Enhancing Alignment using Curriculum Learning & Ranked Preferences
Pulkit Pattnaik, Rishabh Maheshwary, Kelechi Ogueji, Vikas Yadav, Sathwik Tejaswi Madhusudhan
Multi-Target Cross-Lingual Summarization: a novel task and a language-neutral approach
Diogo Pernes, Gonçalo M. Correia, Afonso Mendes
Tab2Text - A framework for deep learning with tabular data
Tong Lin, Jason Yan, David Jurgens, Sabina J Tomkins
More Bang for your Context: Virtual Documents for Question Answering over Long Documents
Yosi Mass, Boaz Carmeli, Asaf Yehudai, Assaf Toledo, Nathaniel Mills
Out-of-Distribution Detection through Soft Clustering with Non-Negative Kernel Regression
Aryan Gulati, Xingjian Dong, Carlos Hurtado, Sarath Shekkizhar, Swabha Swayamdipta, Antonio Ortega
Synthetic Multimodal Question Generation
Ian Wu, Sravan Jayanthi, Vijay Viswanathan, Simon Rosenberg, Sina Khoshfetrat Pakazad, Tongshuang Wu, Graham Neubig
Lost in Translation: Chemical Language Models and the Misunderstanding of Molecule Structures
Veronika Ganeeva, Andrey Sakhovskiy, Kuzma Khrabrov, Andrey Savchenko, Artur Kadurin, Elena Tutubalina
Breaking the Boundaries: A Unified Framework for Chinese Named Entity Recognition Across Text and Speech
Jinzhong Ning, Yuanyuan Sun, Bo Xu, Zhihao Yang, Ling Luo, Hongfei Lin
HyQE: Ranking Contexts with Hypothetical Query Embeddings
Weichao Zhou, Jiaxin Zhang, Hilaf Hasson, Anu Singh, Wenchao Li
Model Merging and Safety Alignment: One Bad Model Spoils the Bunch
Hasan Abed Al Kader Hammoud, Umberto Michieli, Fabio Pizzati, Philip Torr, Adel Bibi, Bernard Ghanem, Mete Ozay
Large Language Models Are Challenged by Habitat-Centered Reasoning
Sadaf Ghaffari, Nikhil Krishnaswamy
How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models
Jaeyoung Lee, Ximing Lu, Jack Hessel, Faeze Brahman, Youngjae Yu, Yonatan Bisk, Yejin Choi, Saadia Gabriel
Benchmarking Machine Translation with Cultural Awareness
Binwei Yao, Ming Jiang, Tara Bobinac, Diyi Yang, Junjie Hu
Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?
Tannon Kew, Florian Schottmann, Rico Sennrich
Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation
Siru Ouyang, Shuohang Wang, Minhao Jiang, Ming Zhong, Donghan Yu, Jiawei Han, yelong shen
Generate then Refine: Data Augmentation for Zero-shot Intent Detection
I-Fan Lin, Faegheh Hasibi, Suzan Verberne
Unleashing the Power of Large Language Models in Zero-shot Relation Extraction via Self-Prompting
Siyi Liu, Yang Li, Jiang Li, Shan Yang, Yunshi Lan
VGA: Vision GUI Assistant - Minimizing Hallucinations through Image-Centric Fine-Tuning
Meng ziyang, Yu Dai, Zezheng Gong, ShaoxiongGuo, Minglong Tang, Tongquan Wei
“What is the value of {templates}?” Rethinking Document Information Extraction Datasets for LLMs
Ran Zmigrod, Pranav Shetty, Mathieu Sibue, Zhiqiang Ma, Armineh Nourbakhsh, Xiaomo Liu, Manuela Veloso
What Matters in Learning Facts in Language Models? Multifaceted Knowledge Probing with Diverse Multi-Prompt Datasets
Xin Zhao, Naoki Yoshinaga, Daisuke Oba
On Leakage of Code Generation Evaluation Datasets
Alexandre Matton, Tom Sherborne, Dennis Aumiller, Elena Tommasone, Milad Alizadeh, Jingyi He, Raymond Ma, Maxime Voisin, Ellen Gilsenan-McMahon, Matthias Gallé
Understanding the Therapeutic Relationship between Counselors and Clients in Online Text-based Counseling using LLMs
Anqi Li, Yu Lu, Nirui Song, Shuai Zhang, Lizhi Ma, Zhenzhong Lan
The Language of Trauma: Modeling Traumatic Event Descriptions Across Domains with Explainable AI
Miriam Schirmer, Tobias Leemann, Gjergji Kasneci, Jürgen Pfeffer, David Jurgens
Auto-Evolve: Enhancing Large Language Model’s Performance via Self-Reasoning Framework
Krishna Aswani, Huilin Lu, Pranav Patankar, Priya Dhalwani, Xue Tan, Jayant Ganeshmohan, Simon Lacasse
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization
Yuxi Xie, Guanzhen Li, Xiao Xu, Min-Yen Kan
Exploring the Potential of Multimodal LLM with Knowledge-Intensive Multimodal ASR
Minghan Wang, Yuxia Wang, Thuy-Trang Vu, Ehsan Shareghi, Reza Haf
Better Alignment with Instruction Back-and-Forth Translation
Thao Nguyen, Jeffrey Li, Sewoong Oh, Ludwig Schmidt, Jason E Weston, Luke Zettlemoyer, Xian Li
AliGATr: Graph-based layout generation for form understanding
Armineh Nourbakhsh, Zhao Jin, Siddharth Parekh, Sameena Shah, Carolyn Rose
Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Tao Meng, Ninareh Mehrabi, Palash Goyal, Anil Ramakrishna, Aram Galstyan, Richard Zemel, Kai-Wei Chang, Rahul Gupta, Charith Peris
SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement
Ishani Mondal, Zongxia Li, Yufang Hou, Anandhavelu Natarajan, Aparna Garimella, Jordan Lee Boyd-Graber
TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings
Zachary Horvitz, Ajay Patel, Kanishk Singh, Chris Callison-Burch, Kathleen McKeown, Zhou Yu
Can LLMs Understand the Implication of Emphasized Sentences in Dialogue?
Guan-Ting Lin, Hung-yi Lee
Why do LLaVA Vision-Language Models Reply to Images in English?
Musashi Hinck, Carolin Holtermann, Matthew Lyle Olson, Florian Schneider, Sungduk Yu, Anahita Bhiwandiwalla, Anne Lauscher, Shao-Yen Tseng, Vasudev Lal
Preference Tuning For Toxicity Mitigation Generalizes Across Languages
Xiaochen Li, Zheng Xin Yong, Stephen Bach
Calibrating Long-form Generations From Large Language Models
Yukun Huang, Yixin Liu, Raghuveer Thirukovalluru, Arman Cohan, Bhuwan Dhingra
Train Once, Deploy Anywhere: Matryoshka Representation Learning for Multimodal Recommendation
Yueqi Wang, Zhenrui Yue, Huimin Zeng, Dong Wang, Julian McAuley
Exploring Quantization for Efficient Pre-Training of Transformer Language Models
Kamran Chitsaz, Quentin Fournier, Goncalo Mordido, Sarath Chandar
Multilingual Synopses of Movie Narratives: A Dataset for Story Understanding
Yidan Sun, Jianfei Yu, Boyang Li
MVP-Bench: Can Large Vision-Language Models Conduct Multi-level Visual Perception Like Humans?
Guanzhen Li, Yuxi Xie, Min-Yen Kan
Topic Modeling: Contextual Token Embeddings Are All You Need
Dimo Angelov, Diana Inkpen
Dense Passage Retrieval: Is it Retrieving?
Benjamin Reichman, Larry Heck
Dynamic Planning for LLM-based Graphical User Interface Automation
Shaoqing Zhang, Zhuosheng Zhang, Kehai Chen, Xinbei Ma, Muyun Yang, Tiejun Zhao, Min Zhang
Margin Matching Preference Optimization: Enhanced Model Alignment with Granular Feedback
Kyuyoung Kim, Ah Jeong Seo, Hao Liu, Jinwoo Shin, Kimin Lee
AfriInstruct: Instruction Tuning of African Languages for Diverse Tasks
Kosei Uemura, Alex Pejovic, Mahe Chen, Chika Maduabuchi, Yifei Sun, En-Shiun Annie Lee
LLMs as Collaborator: Demands-Guided Collaborative Retrieval-Augmented Generation for Commonsense Knowledge-Grounded Open-Domain Dialogue Systems
Jiong Yu, Sixing Wu, Jiahao Chen, Wei Zhou
ClaimVer: Explainable Claim-Level Verification and Evidence Attribution of Text Through Knowledge Graphs
Preetam Prabhu Srikar Dammu, Himanshu Naidu, Mouly Dewan, YoungMin Kim, Tanya Roosta, Aman Chadha, Chirag Shah
Empirical Prior for Text Autoencoders
Yongjing Yin, Wenyang Gao, Haodong Wu, Jianhao Yan, Yue Zhang
Enhancing Biomedical Knowledge Retrieval-Augmented Generation with Self-Rewarding Tree Search and Proximal Policy Optimization
Minda Hu, Licheng Zong, Hongru WANG, Jingyan Zhou, Jingjing Li, Yichen Gao, Kam-Fai Wong, Yu Li, Irwin King
Pedagogical Alignment of Large Language Models
Shashank Sonkar, Kangqi Ni, Sapana Chaudhary, Richard Baraniuk
Reference-based Metrics Disprove Themselves in Question Generation
Bang Nguyen, Mengxia Yu, Yun Huang, Meng Jiang
Regression (and Scoring) Aware Inference with LLMs
Michal Lukasik, Harikrishna Narasimhan, Aditya Krishna Menon, Felix Yu, Sanjiv Kumar
Large Language Model-based Human-Agent Collaboration for Complex Task Solving
Xueyang Feng, Zhi-Yuan Chen, Yujia Qin, Yankai Lin, Xu Chen, Zhiyuan Liu, Ji-Rong Wen
$R^3$-NL2GQL: A Model Coordination and Knowledge Graph Alignment Approach for NL2GQL
Yuhang Zhou, Yu He, Siyu Tian, Yuchen Ni, Zhangyue Yin, Xiang Liu, Chuanjun Ji, Sen Liu, Xipeng Qiu, Guangnan Ye, Hongfeng Chai
Updating Large Language Models’ Memories with Time Constraints Download PDF
Xin Wu, Yuqi Bu, Yi Cai, Tao Wang
DLoRA: Distributed Parameter-Efficient Fine-Tuning Solution for Large Language Model
Chao Gao, Sai Qian Zhang
Defending Jailbreak Attack in VLMs via Cross-modality Information Detector
Yue Xu, XiuyuanQi, Zhan Qin, Wenjie Wang
Attacks against Abstractive Text Summarization Models through Lead Bias and Influence Functions
Poojitha Thota, Shirin Nilizadeh
One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks
Sebastian Nehrdich, Oliver Hellwig, Kurt Keutzer
NALA: an Effective and Interpretable Entity Alignment Method
Chuanhao Xu, Jingwei Cheng, Fu Zhang
ConTReGen: Context-driven Tree-structured Retrieval for Open-domain Long-form Text Generation
Kashob Kumar Roy, Pritom Saha Akash, Lucian Popa, Kevin Chen-Chuan Chang
Aligners: Decoupling LLMs and Alignment
Lilian Ngweta, Mayank Agarwal, Subha Maity, Alex Gittens, Yuekai Sun, Mikhail Yurochkin
TOWER: Tree Organized Weighting for Evaluating Complex Instructions
Noah Ziems, Zhihan Zhang, Meng Jiang
Extractive Medical Entity Disambiguation with Memory Mechanism and Memorized Entity Information
Guobiao Zhang, Xueping Peng, Tao Shen, Guodong Long, Jiasheng Si, Libo Qin, Wenpeng Lu
QEFT: Quantization for Efficient Fine-Tuning of LLMs
Changhun Lee, Jun-gyu Jin, YoungHyun Cho, Eunhyeok Park
Skills-in-Context: Unlocking Compositionality in Large Language Models
Jiaao Chen, Xiaoman Pan, Dian Yu, Kaiqiang Song, Xiaoyang Wang, Dong Yu, Jianshu Chen
DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLMs Jailbreakers
Xirui Li, Ruochen Wang, Minhao Cheng, Tianyi Zhou, Cho-Jui Hsieh
Can LLMs Replace Clinical Doctors? Exploring Bias in Disease Diagnosis by Large Language Models
Yutian Zhao, Huimin WANG, Xian Wu, Yefeng Zheng
BLADE: Benchmarking Language Model Agents for Data-Driven Science
Ken Gu, Ruoxi Shang, Ruien Jiang, Keying Kuang, Richard-John Lin, Donghe Lyu, Yue Mao, Youran Pan, Teng Wu, Jiaqian Yu, Yikun Zhang, Tianmai M. Zhang, Lanyi Zhu, Mike A Merrill, Jeffrey Heer, Tim Althoff
Phonetic and Lexical Discovery of Canine Vocalization
Sinong Wang, Xingyuan Li, Chunhao Zhang, Mengyue Wu, Kenny Q. Zhu
Audio-Based Linguistic Feature Extraction for Enhancing Multi-lingual and Low-Resource Text-to-Speech
Youngjae Kim, Yejin Jeon, Gary Lee
LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons
Zheng Xin Yong, Cristina Menghini, Stephen Bach
Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks
Yun-Shiuan Chuang, Zach Studdiford, Krirk Nirunwiroj, Agam Goyal, Vincent V. Frigo, Sijia Yang, Dhavan V. Shah, Junjie Hu, Timothy T. Rogers
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding
Trang Le, Daniel Lazar, Suyoun Kim, Shan Jiang, Duc Le, Adithya Sagar, Aleksandr Livshits, Ahmed A Aly, Akshat Shrivastava
Performance Trade-offs of a Family of Text Watermarks
Anirudh Ajith, Sameer Singh, Danish Pruthi
Knowledge-Aware Reasoning over Multimodal Semi-structured Tables
Suyash Vardhan Mathur, Jainit Sushil Bafna, Kunal Kartik, Harshita Khandelwal, Manish Shrivastava, Vivek Gupta, Mohit Bansal, Dan Roth
MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained Classification
Kai Sun, Yushi Bai, Ji Qi, Lei Hou, Juanzi Li
Representational Isomorphism and Alignment of Multilingual Large Language Models
Di Wu, Yibin Lei, Andrew Yates, Christof Monz
SWAG: Storytelling With Action Guidance
Jonathan Pei, Zeeshan Patel, Karim El-Refai, Tianle Li
Random Label Forests: An Ensemble Method with Label Subsampling For Extreme Multi-Label Problems
Sheng-Wei Chen, Chih-Jen Lin
Active Listening: Personalized Question Generation in Open-Domain Social Conversation with User Model Based Prompting
Kevin Bowden, Yue Fan, Winson Chen, Wen Cui, Davan Harrison, Marilyn Walker, Xin Eric Wang
Query-based Cross-Modal Projector Bolstering Mamba Multimodal LLM
SooHwan Eom, Jay Shim, Gwanhyeong Koo, Haebin Na, Mark A. Hasegawa-Johnson, Sungwoong Kim, Chang D. Yoo
LLM as a metric critic for low resource relation identification
ZHE YANG, Yi Huang, Yaqin Chen, XiaotingWu, Junlan Feng, Chao Deng
Experience as Source for Anticipation and Planning: Experiential Policy Learning for Target-driven Recommendation Dialogues
Huy Quang Dao, Yang Deng, Khanh-Huyen Bui, Dung D. Le, Lizi Liao
Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers
Yuxia Wang, Revanth Gangi Reddy, Zain Muhammad Mujahid, Arnav Arora, Aleksandr Rubashevskii, Jiahui Geng, OSAMA MOHAMMED AFZAL, Liangming Pan, Nadav Borenstein, Aditya Pillai, Isabelle Augenstein, Iryna Gurevych, Preslav Nakov
Open-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models
Shayekh Bin Islam, Md Asib Rahman, K S M Tozammel Hossain, Enamul Hoque, Shafiq Joty, Md Rizwan Parvez
Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory
Suyeon Lee, Sunghwan Kim, Minju Kim, Dongjin Kang, Dongil Yang, Harim Kim, Minseok Kang, Dayi jung, Min Hee Kim, Seungbeen Lee, Kyong-Mee Chung, Youngjae Yu, Dongha Lee, Jinyoung Yeo
Customizing Language Models for Text-to-Layout Planning
Jian Chen, Ruiyi Zhang, Yufan Zhou, Jennifer Healey, Jiuxiang Gu, Changyou Chen
LongAlign: A Recipe for Long Context Alignment of Large Language Models
Yushi Bai, Xin Lv, Jiajie Zhang, Yuze He, Ji Qi, Lei Hou, Jie Tang, Yuxiao Dong, Juanzi Li
Data-driven Coreference-based Ontology Building
Shir Ashury Tahan, Amir David Nissan Cohen, Nadav Cohen, Yoram Louzoun, Yoav Goldberg
Retrieving Contextual Information for Long-Form Question Answering using Weak Supervision
Philipp Christmann, Svitlana Vakulenko, Ionut Teodor Sorodoc, Adrià de Gispert, Bill Byrne
Persuasiveness of Generated Free-Text Rationales in Subjective Decisions: A Case Study on Pairwise Argument Ranking
Mohamed Elaraby, Diane Litman, Xiang Lorraine Li, Ahmed Magooda
Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP
Eunji Kim, Kyuhong Shim, Simyung Chang, Sungroh Yoon
From Internal Conflict to Contextual Adaptation of Language Models
Sara Vera Marjanovic, Haeun Yu, Pepa Atanasova, Maria Maistro, Christina Lioma, Isabelle Augenstein
LLMs to Replace Crowdsourcing For Parallel Data Creation: The Case of Text Detoxification
Daniil Moskovskiy, Sergey Pletenev, Alexander Panchenko
Efficient Active Learning with Adapters
Daria Galimzianova, Leonid Sanochkin
How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection
Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki
Let’s Ask GNN: Empowering Large Language Model for Graph In-Context Learning
Yichuan Li, Zhengyu Hu, Zhengyu Chen, Jingang Wang, Han Liu, Kyumin Lee, Kaize Ding
“Seeing the Big through the Small”: Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?
Beiduo Chen, Xinpeng Wang, Siyao Peng, Robert Litschko, Anna Korhonen, Barbara Plank
Language Models in Dialogue: Conversational Maxims for Human-AI Interactions
Erik Miehling, Manish Nagireddy, Prasanna Sattigeri, Elizabeth M. Daly, David Piorkowski, John T. Richards
LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments
Ruirui Chen, Weifeng Jiang, Chengwei Qin, Ishaan Singh Rawal, Cheston Tan, Dongkyu Choi, Bo Xiong, Bo Ai
Self-supervised Preference Optimization: Enhance Your Language Model with Preference Degree Awareness
Jian Li, Haojing Huang, Yujia Zhang, Pengfei Xu, Xi Chen, Rui Song, Lida Shi, Jingwen Wang, Hao Xu
Mitigating Hallucination in Fictional Character Role-Play
Nafis Sadeq, Zhouhang Xie, Byungkyu Kang, Prarit Lamba, Xiang Gao, Julian McAuley
I’m sure you’re a real scholar yourself: Exploring Ironic Content Generation by Large Language Models
Pier Felice Balestrucci, Silvia Casola, Soda Marem Lo, Valerio Basile, Alessandro Mazzei
Enhancing Temporal Sensitivity and Reasoning for Time-Sensitive Question Answering
Wanqi Yang, Yanda Li, Meng Fang, Ling Chen
Minimal Yet Big Impact: How AI Agent Back-channeling Enhances Conversational Engagement through Conversation Persistence and Context Richness
Jin Yea Jang, Saim Shin, gahgene gweon
Large Language Models for Propaganda Span Annotation
Maram Hasanain, Fatema Ahmad, Firoj Alam
Style-Compress: An LLM-Based Prompt Compression Framework Considering Task-Specific Styles
Xiao Pu, Tianxing He, Xiaojun Wan
POSIX: A Prompt Sensitivity Index For Large Language Models
Anwoy Chatterjee, H S V N S Kowndinya Renduchintala, Sumit Bhatia, Tanmoy Chakraborty
Capturing Minds, Not Just Words: Enhancing Role-Playing Language Models with Personality-Indicative Data
Yiting Ran
Local and Global Decoding in Text Generation
Daniel Gareev, Thomas Hofmann, ezhilmathi krishnasamy, Tiago Pimentel
LEGOBench: Scientific Leaderboard Generation Benchmark
Shruti Singh, Shoaib Alam, Husain Malwat, Mayank Singh
H-LegalKI: A Hierarchical Legal Knowledge Integration Framework for Legal Community Question Answering
Yue Jiang, Ziyu Guan, Jie Zhao, Wei Zhao, Jiaqi Yang
Identifying Factual Inconsistencies in Summaries: Grounding Model Inference via Task Taxonomy
Liyan Xu, Zhenlin Su, Mo Yu, Jin Xu, Jinho D. Choi, Jie Zhou, Fei Liu
Long Sequence Modeling with Attention Tensorization: From Sequence to Tensor Learning
Aosong Feng, Rex Ying, Leandros Tassiulas
CoXQL: A Dataset for Parsing Explanation Requests in Conversational XAI Systems
Qianli Wang, Tatiana Anikina, Nils Feldhus, Simon Ostermann, Sebastian Möller
BanglaTLit: A Benchmark Dataset for Back-Transliteration of Romanized Bangla
Md Fahim, Fariha Tanjim Shifat, Md Farhan Ishmam, Deeparghya Dutta Barua, Fabiha Haider, MD SAKIB UL RAHMAN SOUROVE, Md Farhad Alam Bhuiyan
Finding the Optimal Byte-Pair Encoding Merge Operations for Neural Machine Translation in a Low-Resource Setting
Kristine Mae M. Adlaon
Can Machines Resonate with Humans? Evaluating the Emotional and Empathic Comprehension of LMs
Muhammad Arslan Manzoor, Yuxia Wang, Minghan Wang, Preslav Nakov
EU DisinfoTest: a Benchmark for Evaluating Language Models’ Ability to Detect Disinformation Narratives
Witold Sosnowski, Arkadiusz Modzelewski, Kinga Skorupska, Jahna Otterbacher, Adam Wierzbicki
Adaptive BPE Tokenization for Enhanced Vocabulary Adaptation in Finetuning Pretrained Language Models
Gunjan Balde, Soumyadeep Roy, Mainack Mondal, Niloy Ganguly
From Reading to Compressing: Exploring the Multi-document Reader for Prompt Compression
Eunseong Choi, Sunkyung Lee, Minjin Choi, June Park, Jongwuk Lee
Knowledge-Guided Dynamic Modality Attention Fusion Framework for Multimodal Sentiment Analysis
Xinyu Feng, Yuming Lin, Lihua He, You Li, Liang Chang, Ya Zhou
LexMatcher: Dictionary-centric Data Curation for LLM-based Machine Translation
Yongjing Yin, Jiali Zeng, Yafu Li, Fandong Meng, Yue Zhang
SARCAT: Generative Span-Act Guided Response Generation using Copy-enhanced Target Augmentation
Jeong-Doo Lee, Hyeongjun Choi, Beomseok Hong, Youngsub Han, Byoung-Ki Jeon, Seung-Hoon Na
Does Context Help Mitigate Gender Bias in Neural Machine Translation?
Harritxu Gete, Thierry Etchegoyhen
A Critical Look at Meta-evaluating Summarization Evaluation Metrics
Xiang Dai, Sarvnaz Karimi, Biaoyan Fang
LLMs for Generating and Evaluating Counterfactuals: A Comprehensive Study
Van Bach Nguyen, Paul Youssef, Jörg Schlötterer, Christin Seifert
Unlocking Black-Box Prompt Tuning Efficiency via Zeroth-Order Optimization
Heshen Zhan, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun
Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie Synopses
Hung-Ting Su, Ya-Ching Hsu, Xudong Lin, Xiang-Qian Shi, Yulei Niu, Han-Yuan Hsu, Hung-yi Lee, Winston H. Hsu
Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models
Jie Chen, Yupeng Zhang, Bingning Wang, Xin Zhao, Ji-Rong Wen
CED: Comparing Embedding Differences for Detecting Out-of-Distribution and Hallucinated Text
Hakyung Lee, Keon-Hee Park, Hoyoon Byun, Jeyoon Yeom, Jihee Kim, Gyeong-Moon Park, Kyungwoo Song
CHAmbi: A New Benchmark on Chinese Ambiguity Challenges for Large Language Models
Qin Zhang, Sihan Cai, Jiaxu Zhao, Mykola Pechenizkiy, Meng Fang
Analyzing Context Contributions in LLM-based Machine Translation
Emmanouil Zaranis, Nuno M Guerreiro, Andre Martins
Evaluating Language Model Character Traits
Francis Rhys Ward, Zejia Yang, Alex Jackson, Randy Brown, Chandler Smith, Grace Beaney Colverd, Louis Alexander Thomson, Raymond Douglas, Patrik Bartak, Andrew Rowan
ARTS: Assessing Readability & Text Simplicity 🎨
Björn Engelmann, Christin Katharina Kreutz, Fabian Haak, Philipp Schaer
AXCEL: Automated eXplainable Consistency Evaluation using LLMs
P Aditya Sreekar, Sahil Verma, Suransh Chopra, Abhishek Persad, Sarik Ghazarian, Narayanan Sadagopan
Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking
Byoungjip Kim, Youngsoo Jang, Lajanugen Logeswaran, Geon-Hyeong Kim, Yu Jin Kim, Honglak Lee, Moontae Lee
Characterizing Text Datasets with Psycholinguistic Features
Marcio Monteiro, Charu Karakkaparambil James, Marius Kloft, Sophie Fellenz
Talking the Talk Does Not Entail Walking the Walk: On the Limits of Large Language Models in Lexical Entailment Recognition
Candida Maria Greco, Lucio La Cava, Andrea Tagarelli
Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning
Debjit Paul, Robert West, Antoine Bosselut, Boi Faltings
Self-training Large Language Models through Knowledge Detection
Yeo Wei Jie, Teddy Ferdinan, Przemyslaw Kazienko, Ranjan Satapathy, Erik Cambria
VE-KD: Vocabulary-Expansion Knowledge-Distillation for Training Smaller Domain-Specific Language Models
Pengju Gao, Tomohiro Yamasaki, Kazunori Imoto
Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation
Esteban Garces Arias, Julian Rodemann, Meimingwei Li, Christian Heumann, Matthias Aßenmacher
Self-Explore: Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards
Hyeonbin Hwang, Doyoung Kim, Seungone Kim, Seonghyeon Ye, Minjoon Seo
SSP: Self-Supervised Prompting for Cross-Lingual Transfer to Low-Resource Languages using Large Language Models
Vipul Kumar Rathore, Aniruddha Deb, Ankish Kumar Chandresh, Parag Singla, Mausam .
Re-examining Sexism and Misogyny Classification with Annotator Attitudes
Aiqi Jiang, Nikolas Vitsakis, Tanvi Dinkar, Gavin Abercrombie, Ioannis Konstas
When ‘‘A Helpful Assistant’’ Is Not Really Helpful: Personas in System Prompts Do Not Improve Performances of Large Language Models
Mingqian Zheng, Jiaxin Pei, Lajanugen Logeswaran, Moontae Lee, David Jurgens
Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks
Sungkyung Kim, Adam Lee, Junyoung Park, Andrew Chung, Jusang Oh, Jay-Yoon Lee
Text2Model: Text-based Model Induction for Zero-shot Image Classification
Ohad Amosy, Tomer Volk, Eilam Shapira, Eyal Ben-David, Roi Reichart, Gal Chechik
Modeling Gender and Dialect Bias in Automatic Speech Recognition
Camille Harris, Chijioke Mgbahurike, Neha Kumar, Diyi Yang
Are Large Language Models Consistent over Value-laden Questions?
Jared Moore, Tanvi Deshpande, Diyi Yang
xTower: A Multilingual LLM for Explaining and Correcting Translation Errors
Marcos V Treviso, Nuno M Guerreiro, Sweta Agrawal, Ricardo Rei, José Pombal, Tania Vaz, Helena Wu, Beatriz Silva, Daan van Stigt, Andre Martins
LAMBDA: Large Language Model-Based Data Augmentation for Multi-Modal Machine Translation
Yusong Wang, Dongyuan Li, Jialun Shen, Yicheng Xu, Mingkun Xu, Kotaro Funakoshi, Manabu Okumura
Generating and Evaluating Synthetic Data for Privacy Preservation in High-Stakes Domains
Krithika Ramesh, Nupoor Gandhi, Pulkit Madaan, Lisa Bauer, Charith Peris, Anjalie Field
Dual Process Masking for Dialogue Act Recognition
Yeo Jin Kim, Halim Acosta, Wookhee Min, Jonathan Rowe, Bradford Mott, Snigdha Chaturvedi, James Lester
XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference
Joao Monteiro, Étienne Marcotte, Pierre-Andre Noel, Valentina Zantedeschi, David Vazquez, Nicolas Chapados, Christopher Pal, Perouz Taslakian
Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion
Hengrui Gu, Kaixiong Zhou, Yili Wang, Ruobing Wang, Xin Wang
DEFT: Distribution-guided Efficient Fine-Tuning for Human Alignment
Liang Zhu, Feiteng Fang, yuelin bai, Longze Chen, Zhexiang Zhang, Minghuan Tan, Min Yang
Eigen Attention: Attention in Low-Rank Space for KV Cache Compression
Utkarsh Saxena, Gobinda Saha, Sakshi Choudhary, Kaushik Roy
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Yu-Chen Lin, Wei-Hua Li, Jun-cheng Chen, Chu-Song Chen
Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM Compression
Zhichao Xu, Ashim Gupta, Tao Li, Oliver Bentham, Vivek Srikumar
One-to-Many Testing for Code Generation from (Just) Natural Language
Mansi Uniyal, Mukul Singh, Gust Verbruggen, Sumit Gulwani, Vu Le
A Unified Framework for Model Editing
Akshat Gupta, Dev Sajnani, Gopala Anumanchipalli
M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models
Chuhan Li, Ziyao Shangguan, Yilun Zhao, Deyuan Li, Yixin Liu, Arman Cohan
Probing the Capacity of Language Model Agents to Operationalize Disparate Experiential Context Despite Distraction
Sonny George, Chris Sypherd, Dylan Cashman
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents
Tongxin Yuan, Zhiwei He, Lingzhong Dong, Yiming Wang, Ruijie Zhao, Tian Xia, Lizhen Xu, Binglin Zhou, Fangqi Li, Zhuosheng Zhang, Rui Wang, Gongshen Liu
Knowledge-Centric Templatic Views of Documents
Isabel Alyssa Cachola, Silviu Cucerzan, Allen herring, Vuksan Mijovic, Erik Oveson, Sujay Kumar Jauhar
EAVE: Efficient Product Attribute Value Extraction via Lightweight Sparse-layer Interaction
Li Yang, Qifan Wang, Jianfeng Chi, Jiahao Liu, Jingang Wang, Fuli Feng, Zenglin Xu, Yi Fang, Lifu Huang, Dongfang Liu
Shoes-ACOSI: A Dataset for Aspect-Based Sentiment Analysis with Implicit Opinion Extraction
Joseph J Peper, Wenzhao Qiu, Ryan Bruggeman, Yi Han, Estefania Ciliotta Chehade, Lu Wang
Socratic Human Feedback (SoHF): Understanding Socratic Feedback Based Steering Strategies Used by Expert Programmers for Code-generation with LLMs
Subramanian Chidambaram, Li Erran Li, Min Bai, Xiaopeng Li, Kaixiang Lin, Xiong Zhou, Alex C. Williams
Large Language Models Know What To Say But Not When To Speak
Muhammad Umair, Vasanth Sarathy, Jan Ruiter
Towards Explainable Chinese Native Learner Essay Fluency Assessment: Dataset, Tasks, and Method
Xinshu Shen, Hongyi Wu, Yadong Zhang, Man Lan, Xiaopeng Bai, Shaoguang Mao, Yuanbin Wu, Xinlin Zhuang, Li Cai
CoCoHD: Congress Committee Hearing Dataset
Arnav Hiray, Yunsong Liu, Mingxiao Song, Agam Shah, Sudheer Chava
The Student Data Paradox: Examining the Regressive Side Effects of Training LLMs for Personalized Learning
Shashank Sonkar, Naiming Liu, Richard Baraniuk
MalAlgoQA: A Pedagogical Approach for Evaluating Counterfactual Reasoning Abilities of Large Language Models
Shashank Sonkar, Naiming Liu, MyCo Le, Richard Baraniuk
Sonnet or Not, Bot? Poetry Evaluation for Large Models and Datasets
Melanie Walsh, Maria Antoniak, Anna Preus
Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging
Jacob Morrison, Noah A. Smith, Hannaneh Hajishirzi, Pang Wei Koh, Jesse Dodge, Pradeep Dasigi
To Ask LLMs about English Grammaticality, Prompt Them in a Different Language
Shabnam Behzad, Amir Zeldes, Nathan Schneider
Prefix-VAE: Efficient and Consistent Short-Text Topic Modeling with LLMs
Pritom Saha Akash, Kevin Chen-Chuan Chang
Targeted Multilingual Adaptation for Low-resource Language Families
C. M. Downey, Terra Blevins, Dhwani Serai, Dwija Parikh, Shane Steinert-Threlkeld
A Pointer Network based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents
Ankan Mullick, Sombit Bose, Abhilash Nandy, Gajula Sai Chaitanya, Pawan Goyal
Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs
Arijit Nag, Animesh Mukherjee, Niloy Ganguly, Soumen Chakrabarti
Advancing Vision-Language Models with Adapter Ensemble Strategies
Yue Bai, Handong Zhao, Zhe Lin, Ajinkya Kale, Jiuxiang Gu, Tong Yu, Sungchul Kim, Yun Fu
Who Wrote When? Author Diarization in Social Media Discussions
Benedikt Boenninghoff, Henry Hosseini, Robert M. Nickel, Dorothea Kolossa
Controlled Transformation of Text-Attributed Graphs
Nidhi Vakil, Hadi Amiri
Misinformation with Legal Consequences (MisLC): A New Task Towards Harnessing Societal Harm of Misinformation
Chu Fei Luo, Radin Shayanfar, Rohan V Bhambhoria, Samuel Dahan, Xiaodan Zhu
CASE: Efficient Curricular Data Pre-training for Building Assistive Psychology Expert Models
Sarthak Harne, Monjoy Narayan Choudhury, Madhav Rao, T K Srikanth, Seema Mehrotra, Apoorva Vashisht, Aarushi Basu, Manjit singh sodhi
Explicit Inductive Inference using Large Language Models
Tianyang Liu, Tianyi Li, Liang Cheng, Mark Steedman
MultiSkill: Evaluating Large Multimodal Models for Fine-grained Alignment Skills
Zhenran Xu, Senbao Shi, Baotian Hu, Longyue Wang, Min Zhang
Less is More: Making Smaller Language Models Competent Subgraph Retrievers for Multi-hop KGQA
Wenyu Huang, Guancheng Zhou, Hongru WANG, Pavlos Vougiouklis, Mirella Lapata, Jeff Z. Pan
Evaluating Gender Bias of LLMs in Making Morality Judgements
Divij Bajaj, Yuanyuan Lei, Jonathan Tong, Ruihong Huang
A Study of Parameter Efficient Fine-tuning by Learning to Efficiently Fine-Tune
Taha Ceritli, Savas Ozkan, Jeongwon Min, Eunchung Noh, Cho Jung Min, Mete Ozay
Explaining Mixtures of Sources in News Articles
Alexander Spangher, James Youn, Matt DeButts, Nanyun Peng, Jonathan May
LLM generated responses to mitigate the impact of hate speech
Jakub Podolak, Szymon Łukasik, Paweł Balawender, Jan Ossowski, Jan Piotrowski, Katarzyna Bąkowicz, Piotr Sankowski
Locally Measuring Cross-lingual Lexical Alignment: A Domain and Word Level Perspective
Taelin Karidi, Eitan Grossman, Omri Abend
SaSR-Net: Source-Aware Semantic Representation Network for Enhancing Audio-Visual Question Answering
Tianyu Yang, Yiyang Nan, Lisen Dai, Zhenwen Liang, Yapeng Tian, Xiangliang Zhang
To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models
Bozhong Tian, Xiaozhuan Liang, Siyuan Cheng, Qingbin Liu, Mengru Wang, Dianbo Sui, Xi Chen, Huajun Chen, Ningyu Zhang
Grounding Complex Events in Multimodal Data
Kate Sanders, Reno Kriz, David Etter, Hannah Recknor, Alexander Martin, Cameron Carpenter, Jingyang Lin, Benjamin Van Durme
How Does Quantization Affect Multilingual LLMs?
Kelly Marchisio, Saurabh Dash, Hongyu Chen, Dennis Aumiller, Ahmet Üstün, Sara Hooker, Sebastian Ruder
Presentations are not always linear! GNN meets LLM for Document-to-Presentation Transformation with Attribution
Himanshu Maheshwari, Sambaran Bandyopadhyay, Aparna Garimella, Anandhavelu Natarajan
Domain Adaptation via Prompt Learning for Alzheimer’s Detection
Shahla Farzana
SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions
Shicheng Liu, Sina Semnani, Harold Triedman, Jialiang Xu, Isaac Dan Zhao, Monica Lam
Navigating Noisy Feedback: Enhancing Reinforcement Learning with Error-Prone Language Models
Muhan Lin, Shuyang Shi, Yue Guo, Behdad Chalaki, Vaishnav Tadiparthi, Ehsan Moradi Pari, Simon Stepputtis, Joseph Campbell, Katia P. Sycara
On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization
Yong Lin, Skyler Seto, Maartje Ter Hoeve, Katherine Metcalf, Barry-John Theobald, Xuan Wang, Yizhe Zhang, Chen Huang, Tong Zhang
EchoSight: Advancing Visual-Language Models with Wiki Knowledge
Yibin Yan, Weidi Xie
Gazelle: An Instruction Dataset for Arabic Writing Assistance
Samar Mohamed Magdy, Fakhraddin Alwajih, Sang Yun Kwon, Reem Abdel-Salam, Muhammad Abdul-Mageed
Extrinsic Evaluation of Cultural Competence in Large Language Models
Shaily Bhatt, Fernando Diaz
BLASER 2.0: a metric for evaluation and quality estimation of massively multilingual speech and text translation
David Dale, Marta R. Costa-jussà
Multi-label Sequential Sentence Classification via Large Language Model
Mengfei Lan, Lecheng Zheng, Shufan Ming, Halil Kilicoglu
InsertGNN: A Hierarchical Graph Neural Network for the TOEFL Sentence Insertion Problem
Fang Wu, Stan Z. Li
Multi-trait User Simulation with Adaptive Decoding for Conversational Task Assistants
Rafael Ferreira, David Semedo, Joao Magalhaes
VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation
Kun Qian, Shunji Wan, Claudia Tang, Youzhi Wang, Xuanming Zhang, Maximillian Chen, Zhou Yu
Gloss2Text: Sign Language Gloss translation using LLMs and Semantically Aware Label Smoothing
Pooya Fayyazsanavi, Antonios Anastasopoulos, Jana Kosecka
Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations
Md Arafat Sultan, Jatin Ganhotra, Ramón Fernandez Astudillo
Gradient Localization Improves Lifelong Pretraining of Language Models
Jared Fernandez, Yonatan Bisk, Emma Strubell
PFA-ERC Psuedo-Future Augmented Dynamic Emotion Recognition in Conversations
Tanmay Khule, Rishabh Agrawal, Apurva Narayan
Textless Speech-to-Speech Translation With Limited Parallel Data
Anuj Diwan, Anirudh Srinivasan, David Harwath, Eunsol Choi
The Overlooked Repetitive Lengthening Form in Sentiment Analysis
Lei Wang, Eduard Dragut
Remember This Event That Year? Assessing Temporal Information and Understanding in Large Language Models
Himanshu Beniwal, Dishant Patel, Kowsik Nandagopan D, Hritik Ladia, Ankit Yadav, Mayank Singh
Hop, skip, jump to Convergence: Dynamics of Learning Rate Transitions for Improved Training of Large Language Models
Vignesh Ganapathiraman, Shreyas Subramanian, Corey D Barrett
FactAlign: Long-form Factuality Alignment of Large Language Models
Chao-Wei Huang, Yun-Nung Chen
HyperLoRA: Efficient Cross-task Generalization via Constrained Low-Rank Adapters Generation
Chuancheng Lv, Lei Li, shitou zhang, Gang Chen, Fanchao Qi, Ningyu Zhang, Hai-Tao Zheng
Infer-then-Verbalize: How do LMs Map true/false to cat/dog During In-Context Learning?
Junyi Tao, Xiaoyin Chen, Nelson F. Liu
Debate as Optimization: Adaptive Conformal Prediction and Diverse Retrieval for Event Extraction
Sijia Wang, Lifu Huang
Rationale-based Ensemble of Multiple QA Strategies for Zero-shot Knowledge-based VQA
Miaoyu Li, Haoxin Li, Zilin Du, Boyang Li
MiRAGeNews: Multimodal Realistic AI-Generated News Detection
Runsheng Huang, Liam Dugan, Chris Callison-Burch
MORE: Evaluating and Quantifying Unimodal Biases in Multimodal Large Language Models through a Causal Lens
Meiqi Chen, Yixin Cao, Yan Zhang, Chaochao Lu
Large Language Models are In-context Teachers for Knowledge Reasoning
Jiachen Zhao, Zonghai Yao, Zhichao Yang, hong yu
SocialGaze: Improving the Integration of Human Social Norms in Large Language Models
Anvesh Rao Vijjini, Rakesh R Menon, Shashank Srivastava, Snigdha Chaturvedi
Improving Temporal Reasoning of Language Models via Recounted Narratives
Xinliang Frederick Zhang, Nicholas Beauchamp, Lu Wang
Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Agents
Jaekyeom Kim, Dong-Ki Kim, Lajanugen Logeswaran, Sungryull Sohn, Honglak Lee
See Detail Say Clear: Towards Brain CT Report Generation via Pathological Clue-driven Representation Learning
Chengxin Zheng, Junzhong Ji, Yanzhao Shi, Xiaodan Zhang, Liangqiong Qu
P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains
SIMENG HAN, Aaron Yu, Rui Shen, Zhenting Qi, Martin Riddell, Wenfei Zhou, Yujie Qiao, Yilun Zhao, Semih Yavuz, Ye Liu, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Rex Ying, Arman Cohan, Dragomir Radev
TRIP NEGOTIATOR: A Travel Persona-aware Reinforced Dialogue Generation Model for Personalized Integrative Negotiation in Tourism
Priyanshu Priya, Desai Vishesh Yasheshbhai, Ratnesh Kumar Joshi, Roshni Ramnani, ANUTOSH MAITRA, Shubhashis Sengupta, Asif Ekbal
Chain of Condition: Construct, Verify and Solve Conditions for Conditional Question Answering
Jiuheng Lin, Yuxuan Lai, Yansong Feng
Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization
Yu-Min Tseng, Yu-Chao Huang, Teng-Yun Hsiao, Wei-Lin Chen, Chao-Wei Huang, Yu Meng, Yun-Nung Chen
ToxiCraft: A Novel Framework for Synthetic Generation of Harmful Information
Zheng Hui, Zhaoxiao Guo, Hang Zhao, Juanyong Duan, Congrui Huang
Look Who’s Talking Now: Covert Channels From Biased LLMs
Daniel Silva, Frederic Sala, Ryan Gabrys
ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions
Chan Young Park, Shuyue Stella Li, Hayoung Jung, Svitlana Volkova, Tanu Mitra, David Jurgens, Yulia Tsvetkov
Unraveling the Truth: Do LLMs really Understand Charts? A Deep Dive into Consistency and Robustness
Srija Mukhopadhyay, Adnan Qidwai, Aparna Garimella, Pritika Ramu, Vivek Gupta, Dan Roth
Fine-Tuning Language Models on Multiple Datasets for Citation Intention Classification
Zeren Shui, Petros Karypis, Daniel S. Karls, Mingjian Wen, Saurav Manchanda, Ellad B. Tadmor, George Karypis
TransferCVLM: Transferring Cross-Modal Knowledge for Vision-Language Modeling
Dongha Choi, Jung-jae Kim, Hyunju Lee
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper
Iuliia Thorbecke, Juan Pablo Zuluaga Gomez, Esaú VILLATORO-TELLO, Shashi Kumar, Pradeep Rangappa, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S, Aravind Ganapathiraju
Reasoning Paths Optimization: A Framework For Exploring And Learning From Diverse Reasoning Paths
Yew Ken Chia, Guizhen Chen, Weiwen Xu, Anh Tuan Luu, Soujanya Poria, Lidong Bing
Uncertainty Calibration for Tool-Using Language Agents
Hao Liu, Zi-Yi Dou, Yixin Wang, Nanyun Peng, Yisong Yue
Personalized Video Comment Generation
Xudong Lin, Ali Zare, Shiyuan Huang, Ming-Hsuan Yang, Shih-Fu Chang, Li Zhang
Solving for X and Beyond: Can Large Language Models Solve Complex Math Problems with More-Than-Two Unknowns?
Kuei-Chun Kao, Ruochen Wang, Cho-Jui Hsieh
MedLogic-AQA: Enhancing Medicare Question Answering with Abstractive Models Focusing on Logical Structures
Aizan Zafar, Kshitij Mishra, Asif Ekbal
EmbodiedBERT: Cognitively Informed Metaphor Detection Incorporating Sensorimotor Information
Yu Xi Li, Bo Peng, Yu-Yin Hsu, Chu-Ren Huang
PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness
Noah Wang, Feiyu Duan, Yibo Zhang, Wangchunshu Zhou, Ke Xu, Wenhao Huang, Jie Fu
SedarEval: Automated Evaluation using Self-Adaptive Rubrics
Zhiyuan Fan, Weinong Wang, Xing W, Debing Zhang
Towards One-to-Many Visual Question Answering
Huishan Ji, Qingyi Si, Zheng Lin, Yanan Cao, Weiping Wang
Document-level Causal Relation Extraction with Knowledge-guided Binary Question Answering
Zimu Wang, Lei Xia, Wei Wang, Xinya Du
Block-Diagonal Orthogonal Relation and Matrix Entity for Knowledge Graph Embedding
Yihua Zhu, Hidetoshi Shimodaira
When Compression Meets Model Compression: Memory-Efficient Double Compression for Large Language Models
Weilan Wang, Yu Mao, TANG DONGDONG, Du Hongchao, Nan Guan, Chun Jason Xue
BiMediX: Bilingual Medical Mixture of Experts LLM
Sara Pieri, Sahal Shaji Mullappilly, Fahad Shahbaz Khan, Rao Muhammad Anwer, Salman Khan, Timothy Baldwin, Hisham Cholakkal
Improving Adversarial Robustness in Vision-Language Models with Architecture and Prompt Design
Rishika Bhagwatkar, Shravan Nayak, Pouya Bashivan, Irina Rish
Zero-Shot Fact Verification via Natural Logic and Large Language Models
Marek Strong, Rami Aly, Andreas Vlachos
Robust AI-Generated Text Detection by Restricted Embeddings
Kristian Kuznetsov, Eduard Tulchinskii, Laida Kushnareva, German Magai, Serguei Barannikov, Sergey Nikolenko, Irina Piontkovskaya
CROWD: Certified Robustness via Weight Distribution for Smoothed Classifiers against Backdoor Attack
Siqi Sun, Procheta Sen, Wenjie Ruan
Reconfidencing LLMs from the Grouping Loss Perspective
Lihu Chen, Alexandre Perez-Lebel, Fabian M. Suchanek, Gael Varoquaux
EM-LoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning
Wei Zhu, Huanran Zheng, Yi Zhao, Xing Tian, Jingfan Zhang, Yi Ge, Jiawen Lyn
Revealing Fine-Grained Values and Opinions in Large Language Models
Dustin Wright, Arnav Arora, Nadav Borenstein, Srishti Yadav, Serge Belongie, Isabelle Augenstein
PythonSaga: Redefining the Benchmark to Evaluate Code Generating LLMs
Ankit Yadav, Mayank Singh, Himanshu Beniwal
Efficient and Interpretable Grammatical Error Correction with Mixture of Experts
Muhammad Reza Qorib, Alham Fikri Aji, Hwee Tou Ng
Dial BeInfo for Faithfulness: Improving Factuality of Information-Seeking Dialogue via Behavioural Fine-Tuning
Evgeniia Razumovskaia, Ivan Vulić, Pavle Marković, Tomasz Cichy, Qian Zheng, Tsung-Hsien Wen, Paweł Budzianowski
Unified Active Retrieval for Retrieval Augmented Generation
Qinyuan Cheng, Xiaonan Li, Shimin Li, Qin Zhu, Zhangyue Yin, Yunfan Shao, Linyang Li, Tianxiang Sun, Hang Yan, Xipeng Qiu
Unleashing Large Language Models’ Proficiency in Zero-shot Essay Scoring
Sanwoo Lee, Yida Cai, Desong Meng, Ziyang Wang, Yunfang Wu
Mitigating Catastrophic Forgetting in Language Transfer via Model Merging
Anton Alexandrov, Veselin Raychev, Mark Niklas Mueller, Ce Zhang, Martin Vechev, Kristina Toutanova
ATQ: Activation Transformation forWeight-Activation Quantization of Large Language Models
Yundong Gai, Ping Li
Stochastic Fine-Tuning of Language Models Using Masked Gradients
Mohammad Akbar-Tajari, Mohammad Taher Pilehvar
To Know or Not To Know? Analyzing Self-Consistency of Large Language Models under Ambiguity
Anastasiia Sedova, Robert Litschko, Diego Frassinelli, Benjamin Roth, Barbara Plank
Tokenization Falling Short: The Curse of Tokenization
Yekun Chai, Yewei Fang, Qiwei Peng, Xuhong Li
AC-EVAL: Evaluating Ancient Chinese Language Understanding in Large Language Models
Yuting Wei, Yuanxing Xu, Xinru Wei, yangsimin, Yangfu Zhu, Yuqing Li, Di Liu, Bin Wu
MMAR: Multilingual and Multimodal Anaphora Resolution in Instructional Videos
Cennet Oguz, Pascal Denis, Simon Ostermann, Emmanuel Vincent, Natalia Skachkova, Josef van Genabith
DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Zhouhong Gu, Lin Zhang, Xiaoxuan Zhu, Jiangjie Chen, Wenhao Huang, Yikai Zhang, Shusen Wang, Zheyu Ye, Yan Gao, Hongwei Feng, Yanghua Xiao
Coping with Emotion Coping: A Corpus to Model Emotions in Text Based on Role Playing
Enrica Troiano, Sofie Labat, Marco Antonio Stranisci, Rossana Damiano, Viviana Patti, Roman Klinger
MATE: Meet At The Embedding - Connecting Images with Long Texts
Young Kyun Jang, Junmo Kang, Yong Jae Lee, Donghyun Kim
Mixed Distillation Helps Smaller Language Models Reason Better
Li Chenglin, Qianglong Chen, Liangyue Li, Caiyu Wang, FengTao, Yicheng Li, Zulong Chen, Yin Zhang
The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models
Xinyi Chen, Baohao Liao, Jirui Qi, Panagiotis Eustratiadis, Christof Monz, Arianna Bisazza, Maarten de Rijke
Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search
Li Chenglin, Qianglong Chen, Zhi Li, FengTao, Yicheng Li, Hao Chen, Fei Yu, Yin Zhang
Suri: Multi-constraint Instruction Following in Long-form Text Generation
Chau Minh Pham, Simeng Sun, Mohit Iyyer
Augmenting Black-box LLMs with Medical Textbooks for Biomedical Question Answering
Yubo Wang, Xueguang Ma, Wenhu Chen
Exploring Multilingual Concepts of Human Values in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?
Shaoyang Xu, Weilong Dong, Zishan Guo, Xinwei Wu, Deyi Xiong
PaCoST: Paired Confidence Significance Testing for Benchmark Contamination Detection in Large Language Models
Huixuan Zhang, Yun Lin, Xiaojun Wan
UrbanLLM: Autonomous Urban Activity Planning and Management with Large Language Models
YUE JIANG, Qin Chao, Yile Chen, Xiucheng Li, SHUAI LIU, Gao Cong
Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling
Yao-Ching Yu, Chun Chih Kuo, Ye Ziqi, CHANG YUCHENG, Yueh-Se Li
Eliciting Instruction-tuned Code Language Models’ Capabilities to Utilize Auxiliary Function for Code Generation
Seonghyeon Lee, Suyeon Kim, Joonwon Jang, HeeJae Chon, Dongha Lee, Hwanjo Yu
AHP-Powered LLM Reasoning for Multi-Criteria Evaluation of Open-Ended Responses
Xiaotian Lu, Jiyi Li, Koh Takeuchi, Hisashi Kashima
Enhancing Fine-Grained Image Classifications via Cascaded Vision Language Models
Canshi Wei
Exploring the Best Practices of Query Expansion with Large Language Models
Le Zhang, Yihong Wu, Qian Yang, Jian-Yun Nie
Chain-of-Rewrite: Aligning Question and Documents for Open-Domain Question Answering
Chunlei Xin, Yaojie Lu, Hongyu Lin, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han, Le Sun
MGCL: Multi-Granularity Clue Learning for Emotion-Cause Pair Extraction via Cross-Grained Knowledge Distillation
Yang Yu, Xin Alex Lin, Changqun Li, Shizhou Huang, Liang He
Improve Meta-learning for Few-Shot Text Classification with All You Can Acquire from the Tasks
Xinyue Liu, Yunlong Gao, Linlin Zong, Bo Xu
Efficient Data Generation for Source-grounded Information-seeking Dialogs: A Use Case for Meeting Transcripts
Lotem Golany, Filippo Galgani, Maya Mamo, Nimrod Parasol, Omer Vandsburger, Nadav Bar, Ido Dagan
Visual Question Decomposition on Multimodal Large Language Models
Haowei Zhang, Jianzhe Liu, Zhen Han, Shuo Chen, Bailan He, Volker Tresp, zhiqiang xu, Jindong Gu
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs
Jingming Zhuo, Songyang Zhang, Xinyu Fang, Haodong Duan, Dahua Lin, Kai Chen
Layerwise Importance Matters: Less Memory for Better Performance in Parameter-efficient Fine-tuning of Large Language Models
Kai Yao, Penglei Gao, Lichun Li, Yuan Zhao, Xiaofeng Wang, Wei Wang, Jianke Zhu
Abstraction-of-Thought Makes Language Models Better Reasoners
Ruixin Hong, Hongming Zhang, Xiaoman Pan, Dong Yu, Changshui Zhang
LLMs Cannot (Yet) Match the Specificity and Simplicity of Online Communities in Long Form Question Answering
Kris-Fillip Kahl, Tolga Buz, Russa Biswas, Gerard de Melo
Automated Tone Transcription and Clustering with Tone2Vec
Yi Yang, Yiming Wang, ZhiQiang Tang, Jiahong Yuan
CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity
Moshe Berchansky, Daniel Fleischer, Moshe Wasserblat, Peter Izsak
Multi-dimensional Evaluation of Empathetic Dialogue Responses
Zhichao Xu, Jiepu Jiang
Translation of Multifaceted Data without Re-Training of Machine Translation Systems
Hyeonseok Moon, Seungyoon Lee, SeongTae Hong, Seungjun Lee, Chanjun Park, Heuiseok Lim
Offline RLHF Methods Need More Accurate Supervision Signals
Shiqi Wang, Zhengze Zhang, Rui Zhao, Fei Tan, Nguyen Cam-Tu
AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories
Yifan Song, Weimin Xiong, Xiutian Zhao, Dawei Zhu, Wenhao Wu, Ke Wang, Cheng LI, Wei Peng, Sujian Li
Are LLMs Aware that Some Questions are not Open-ended?
Dongjie Yang, hai zhao
Conditioned Language Policy: A General Framework For Steerable Multi-Objective Finetuning
Kaiwen Wang, Rahul Kidambi, Ryan Sullivan, Alekh Agarwal, Christoph Dann, Andrea Michi, Marco Gelmi, Yunxuan Li, Raghav Gupta, Kumar Avinava Dubey, Alexandre Rame, Johan Ferret, Geoffrey Cideron, Le Hou, Hongkun Yu, Amr Ahmed, Aranyak Mehta, Leonard Hussenot, Olivier Bachem, Edouard Leurent
DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer’s Disease Questions with Scientific Literature
Dawei Li, Shu Yang, Zhen Tan, Jae Young Baik, Sukwon Yun, Joseph Lee, Aaron Chacko, Bojian Hou, Duy Duong-Tran, Ying Ding, huan liu, Li Shen, Tianlong Chen
Can AI Relate: Testing Large Language Model Response for Mental Health Support
Saadia Gabriel, Isha Puri, Xuhai Xu, Matteo Malgaroli, Marzyeh Ghassemi
Towards Robust Extractive Question Answering Models: Rethinking the Training Methodology
Son Quoc Tran, Matt Kretchmar
SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Jielin Qiu, Andrea Madotto, Zhaojiang Lin, Paul A. Crook, Yifan Ethan Xu, Xin Luna Dong, Christos Faloutsos, Lei Li, Babak Damavandi, Seungwhan Moon
Enhancing Polyglot Voices by Leveraging Cross-Lingual Fine-Tuning in Any-to-One Voice Conversion
Giuseppe Ruggiero, Matteo Testa, Jurgen Van de Walle, Luigi Di Caro
IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce
Wenxuan Ding, Weiqi Wang, Sze Heng Douglas Kwok, Minghao LIU, Tianqing Fang, Jiaxin Bai, Xin Liu, Changlong Yu, Zheng Li, Chen Luo, Qingyu Yin, Bing Yin, Junxian He, Yangqiu Song
Draft on the Fly: Adaptive Self-Speculative Decoding using Cosine Similarity
Michael R. Metel, Peng Lu, Boxing Chen, Mehdi Rezagholizadeh, Ivan Kobyzev
EconLogicQA: A Question-Answering Benchmark for Evaluating Large Language Models in Economic Sequential Reasoning
Yinzhu Quan, Zefang Liu
The Base-Rate Effect on LLM Benchmark Performance: Disambiguating Test-Taking Strategies from Benchmark Performance
Kyle Moore, Jesse Roberts, Thao Pham, Oseremhen Ewaleifoh, Douglas Fisher
Can LLM Graph Reasoning Generalize beyond Pattern Memorization?
Yizhuo Zhang, Heng Wang, Shangbin Feng, Zhaoxuan Tan, Xiaochuang Han, Tianxing He, Yulia Tsvetkov
Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets
Sathish Reddy Indurthi, Wenxuan Zhou, Shamil Chollampatt, Ravi Agrawal, Kaiqiang Song, Lingxiao Zhao, Chenguang Zhu
ASTE-Transformer: Modelling Dependencies in Aspect-Sentiment Triplet Extraction
Iwo Naglik, Mateusz Lango
Faithful and Plausible Natural Language Explanations for Image Classification: A Pipeline Approach
Adam Wojciechowski, Mateusz Lango, Ondrej Dusek
SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQA
Siyue Zhang, Anh Tuan Luu, Chen Zhao
Exploring Open Graph Models with Large Language Models
Lianghao Xia, Ben Kao, Chao Huang
Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting Framework
Lu Chen, Ruqing Zhang, Jiafeng Guo, Yixing Fan, Xueqi Cheng
Learning to Paraphrase for Alignment with Model Preference
Junbo Fu, Guoshuai Zhao, Yimin Deng, Yunqi Mi, Xueming Qian
Mirror-Consistency: Harnessing Inconsistency in Majority Voting
Siyuan Huang, Zhiyuan Ma, Jintao Du, Changhua Meng, Weiqiang Wang, Zhouhan Lin
Adaptive Contrastive Decoding in Retrieval-Augmented Generation for Handling Noisy Contexts
Youna Kim, Hyuhng Joon Kim, Cheonbok Park, Choonghyun Park, Hyunsoo Cho, Junyeob Kim, Kang Min Yoo, Sang-goo Lee, Taeuk Kim
SRAP-Agent: Simulating and Optimizing Scarce Resource Allocation Policy with LLM-based Agent
Jiarui Ji, Yang Li, Hongtao Liu, Zhicheng Du, Zhewei Wei, Qi Qi, Weiran Shen, Yankai Lin
AnyTrans: Translate AnyText in the Image with Large Scale Models
Zhipeng Qian, Pei Zhang, Baosong Yang, Kai Fan, Yiwei Ma, Derek F. Wong, Xiaoshuai Sun, Rongrong Ji
In-Context Former: Lightning-fast Compressing Context for Large Language Model
Xiangfeng Wang, Zaiyi Chen, Tong Xu, Zheyong Xie, Yongyi He, Enhong Chen
How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States
Zhenhong Zhou, Haiyang Yu, Xinghua Zhang, Rongwu Xu, Fei Huang, Yongbin Li
A Coarse-to-Fine Prototype Learning Approach for Multi-Label Few-Shot Intent Detection
Xiaotong Zhang, Xinyi Li, Feng Zhang, Zhiyi Wei, Junfeng Liu, Han Liu
Can Large Language Models Understand DL-Lite Ontologies? An Empirical Study
Keyu Wang, Guilin Qi, Jiaqi Li, Songlin Zhai
Enhancing Healthcare LLM Trust with Atypical Presentations Recalibration
Jeremy Qin, Bang Liu, Quoc Dinh Nguyen
EvoR: Evolving Retrieval for Code Generation
Hongjin SU, Shuyang Jiang, Yuhang Lai, Haoyuan Wu, Boao Shi, Che Liu, Qian Liu, Tao Yu
Head-wise Shareable Attention for Large Language Models
zouying cao, Yifei Yang, hai zhao
Divide-or-Conquer? Which Part Should You Distill Your LLM?
Zhuofeng Wu, Richard He Bai, Aonan Zhang, Jiatao Gu, V.G.Vinod Vydiswaran, Navdeep Jaitly, Yizhe Zhang
Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models
Yuqing Zhou, Ruixiang Tang, Ziyu Yao, Ziwei Zhu
Privacy Evaluation Benchmarks for NLP Models
Wei Huang, Yinggui Wang, Cen Chen
MM-ChatAlign: A Novel Multimodal Reasoning Framework based on Large Language Models for Entity Alignment
Xuhui Jiang, Yinghan Shen, Zhichao Shi, Chengjin Xu, Wei Li, Huang Zihe, Jian Guo, Yuanzhuo Wang
Towards Explainable Computerized Adaptive Testing with Large Language Model
Cheng Cheng, GuanHao Zhao, Zhenya Huang, Yan Zhuang, Zhaoyuan Pan, Qi Liu, Xin Li, Enhong Chen
Multi-view Content-aware Indexing for Long Document Retrieval
Kuicai Dong, Derrick Goh Xin Deik, Yi Quan Lee, Hao Zhang, Xiangyang Li, Cong Zhang, Yong Liu
Ukrainian Resilience: A Dataset for Detection of Help-Seeking Signals Amidst the Chaos of War
MSVPJ Sathvik, Abhilash Dowpati, Srreyansh Sethi
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems
Kentaro Mitsui, Koh Mitsuda, Toshiaki Wakatsuki, Yukiya Hono, Kei Sawada
Correct after Answer: Enhancing Multi-Span Question Answering with Post-Processing Method
Jiayi Lin, Chenyang Zhang, Haibo Tong, Dongyu Zhang, Qingqing Hong, Bingxuan Hou, Junli Wang
Are Large Language Models (LLMs) Good Social Predictors?
Kaiqi Yang, Hang Li, Hongzhi Wen, Tai-Quan Peng, Jiliang Tang, Hui Liu
Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTS.
Onkar Kishor Susladkar, Vishesh Tripathi, Biddwan Ahmed
Selective Annotation via Data Allocation: These Data Should Be Triaged to Experts for Annotation Rather Than the Model
Chen Huang, Yang Deng, Wenqiang Lei, Jiancheng Lv, Ido Dagan
MINERS: Multilingual Language Models as Semantic Retrievers
Genta Indra Winata, Ruochen Zhang, David Ifeoluwa Adelani
BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?
Zongmeng Zhang, Jinhua Zhu, Wengang Zhou, Xiang Qi, peng zhang, Houqiang Li
McCrolin: Multi-consistency Cross-lingual Training for Retrieval Question Answering
Peerat Limkonchotiwat, Wuttikorn Ponwitayarat, Lalita Lowphansirikul, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong
A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios
Samuel Ackerman, Ella Rabinovich, Eitan Farchi, Ateret Anaby Tavor
Learning Musical Representations for Music Performance Question Answering
Xingjian Diao, Chunhui Zhang, Tingxuan Wu, Ming Cheng, Zhongyu Ouyang, Weiyi Wu, Soroush Vosoughi, Jiang Gui
Transfer Learning for Text Classification via Model Risk Analysis
Yujie Sun, Chuyi Fan, Qun Chen
Document Hashing with Multi-Grained Prototype-Induced Hierarchical Generative Model
Qian Zhang, Qinliang Su, Jiayang Chen, Zhenpeng Song
Typos that Broke the RAG’s Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations
Sukmin Cho, Soyeong Jeong, Jeongyeon Seo, Taeho Hwang, Jong C. Park
Enhancing Temporal Modeling of Video LLMs via Time Gating
Zi-Yuan Hu, Yiwu Zhong, Shijia Huang, Michael Lyu, Liwei Wang
AlignedCoT: Prompting Large Language Models via Native-Speaking Demonstrations
Zhicheng Yang, Yinya Huang, Jing Xiong, Liang Feng, Xiaodan Liang, Yiwei Wang, Jing Tang
Predictive Multiplicity of Knowledge Graph Embeddings in Link Prediction
Yuqicheng Zhu, Nico Potyka, Mojtaba Nayyeri, Bo Xiong, Yunjie He, Evgeny Kharlamov, Steffen Staab
On the Empirical Complexity of Reasoning and Planning in LLMs
Liwei Kang, Zirui Zhao, David Hsu, Wee Sun Lee
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Xinyan Chen, Jiaxin Ge, Tianjun Zhang, Jiaming Liu, Shanghang Zhang
Are modern neural ASR architectures robust for polysynthetic languages?
Eric Le Ferrand, Zoey Liu, Antti Arppe, Emily Prud’hommeaux
A Notion of Complexity for Theory of Mind via Discrete World Models
X. Angelo Huang, Emanuele La Malfa, Samuele Marro, Andrea Asperti, Anthony G. Cohn, Michael J. Wooldridge
Learning Dynamic Multi-attribute Interest for Personalized Product Search
Yutong Bai, Zhicheng Dou, Ji-Rong Wen
Evaluating Automatic Metrics with Incremental Machine Translation Systems
Guojun Wu, Shay B Cohen, Rico Sennrich
Temporal Fact Reasoning over Hyper-Relational Knowledge Graphs
Zifeng Ding, Jingcheng Wu, Jingpei Wu, Yan Xia, Bo Xiong, Volker Tresp
LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble
Yujeong Lee, Sangwoo Shin, Wei-Jin Park, Honguk Woo
GREEN: Generative Radiology Report Evaluation and Error Notation
Sophie Ostmeier, Justin Xu, Zhihong Chen, Maya Varma, Louis Blankemeier, Christian Bluethgen, Arne Edward Michalson MD, Michael Moseley, Curtis Langlotz, Akshay S Chaudhari, Jean-Benoit Delbrouck
Self-Renewal Prompt Optimizing with Implicit Reasoning
Zihan Liang, Ben Chen, Zhuoran Ran, ZihanWang, Huangyu Dai, Yufei Ma, Dehong Gao, Xiaoyan Cai, Libin Yang
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
Jiaming Li, Lei Zhang, Yunshui Li, Ziqiang Liu, yuelin bai, Run Luo, Longze Chen, Min Yang
Women Are Beautiful, Men Are Leaders: Gender Stereotypes in Machine Translation and Language Modeling
Matúš Pikuliak, Stefan Oresko, Andrea Hrckova, Marian Simko
Recent Trends in Linear Text Segmentation: A Survey
Iacopo Ghinassi, Lin Wang, Chris Newell, Matthew Purver
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding
Anwen Hu, Haiyang Xu, Jiabo Ye, Ming Yan, Liang Zhang, Bo Zhang, Ji Zhang, Qin Jin, Fei Huang, Jingren Zhou
Exploring Question Guidance and Answer Calibration for Visually Grounded Video Question Answering
Yuanxing Xu, Yuting Wei, Shuai Zhong, Xinming chen, Jinsheng Qi, Bin Wu
LoRAN: Improved Low-Rank Adaptation by a Non-Linear Transformation
Yinqiao Li, Linqi Song, Hanxu Hou
Limited Out-of-Context Knowledge Reasoning in Large Language Models
Peng Hu, Changjiang Gao, Ruiqi Gao, Jiajun Chen, Shujian Huang
BiKT: Enabling Bidirectional Knowledge Transfer Between Pretrained Models and Sequential Downstream Tasks
Hang Zeng, Chaoyue Niu, Fan Wu, Shaojie Tang, Leihao Pei, chengfei lv, Guihai Chen
Double-Checker: Large Language Model as a Checker for Few-shot Named Entity Recognition
Wei Chen, Lili Zhao, Zhi Zheng, Tong Xu, Yang Wang, Enhong Chen
XRec: Large Language Models for Explainable Recommendation
Qiyao Ma, Xubin Ren, Chao Huang
Scaling Sentence Embeddings with Large Language Models
Ting Jiang, Shaohan Huang, Zhongzhi Luan, deqing wang, Fuzhen Zhuang
Exploring the Relationship between In-Context Learning and Instruction Tuning
Hanyu Duan, Yixuan Tang, Yi Yang, Ahmed Abbasi, KAR YAN TAM
Granular Entity Mapper: Advancing Fine-grained Multimodal Named Entity Recognition and Grounding
ziqi wang, Chen Zhu, Zhi Zheng, Xinhang Li, Tong Xu, Yongyi He, Qi Liu, Ying Yu, Enhong Chen
JobFair: A Framework for Benchmarking Gender Hiring Bias in Large Language Models
Ze Wang, Zekun Wu, Xin Guan, Michael Thaler, Adriano Koshiyama, Skylar Lu, Sachin Beepath, Ediz Ertekin, Maria Perez-Ortiz
Contrastive Token Learning with Similarity Decay for Repetition Suppression in Machine Translation
Huangyu Dai, Ben Chen, Kaidi Chen, Ying Han, Zihan Liang, Wen Jiang
A Psycholinguistic Evaluation of Language Models’ Sensitivity to Argument Roles
Eun-Kyoung Rosa Lee, Sathvik Nair, Naomi Feldman
Tending Towards Stability: Convergence Challenges in Small Language Models
Richard Diehl Martinez, Pietro Lesci, Paula Buttery
Be a Multitude to Itself: A Prompt Evolution Framework for Red Teaming
Rui Li, Peiyi Wang, Jingyuan Ma, Di Zhang, Lei Sha, Zhifang Sui
Modeling News Interactions and Influence for Financial Market Prediction
Mengyu Wang, Shay B Cohen, Tiejun Ma
Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation
Yuhang Zhou, Jing Zhu, Paiheng Xu, Xiaoyu Liu, Xiyao Wang, Danai Koutra, Wei Ai, Furong Huang
Are Large Vision Language Models up to the Challenge of Chart Comprehension and Reasoning
Mohammed Saidul Islam, Raian Rahman, Ahmed Masry, Md Tahmid Rahman Laskar, Mir Tafseer Nayeem, Enamul Hoque
HoneyComb: A Flexible LLM-Based Agent System for Materials Science
Huan Zhang, Yu Song, Ziyu Hou, Santiago Miret, Bang Liu
Revealing COVID-19’s Social Dynamics: Diachronic Semantic Analysis of Vaccine and Symptom Discourse on Twitter
Zeqiang Wang, Jiageng Wu, Yuqi Wang, Wei Wang XJTLU, Jie Yang, Nishanth R. Sastry, Jon Johnson, Suparna De
Divide and Conquer: Legal Concept-guided Criminal Court View Generation
Qi Xu, Xiao Wei, Hang Yu, Qian Liu, Hao Fei
Data Diversity Matters for Robust Instruction Tuning
Alexander Bukharin, Shiyang Li, Zhengyang Wang, Jingfeng Yang, Bing Yin, Xian Li, Chao Zhang, Tuo Zhao, Haoming Jiang
LLM Questionnaire Completion for Automatic Psychiatric Assessment
Gony Rosenman, Talma Hendler, Lior Wolf
GE2PE: Persian End-to-End Grapheme-to-Phoneme Conversion
Elnaz Rahmati, Hossein Sameti
Characterizing LLM Abstention Behavior in Science QA with Context Perturbations
Bingbing Wen, Bill Howe, Lucy Lu Wang
Plausibly Problematic Questions in Multiple-Choice Benchmarks for Commonsense Reasoning
Shramay Palta, Nishant Balepur, Peter A. Rankel, Sarah Wiegreffe, Marine Carpuat, Rachel Rudinger
Cost-Efficient Subjective Task Annotation and Modeling through Few-Shot Annotator Adaptation
Preni Golazizian, Alireza Salkhordeh Ziabari, Ali Omrani, Morteza Dehghani
EDEN: Empathetic Dialogues for English learning
Siyan Li, Teresa Shao, Zhou Yu, Julia Hirschberg
Language Models Still Struggle to Zero-shot Reason about Time Series
Mike A Merrill, Mingtian Tan, Vinayak Gupta, Thomas Hartvigsen, Tim Althoff
Enhancing Agent Learning through World Dynamics Modeling
Zhiyuan Sun, Haochen Shi, Marc-Alexandre Côté, Glen Berseth, Xingdi Yuan, Bang Liu
NormTab: Improving Symbolic Reasoning in LLMs Through Tabular Data Normalization
Md Mahadi Hasan Nahid, Davood Rafiei
Zero-Resource Hallucination Prevention for Large Language Models
Junyu Luo, Cao Xiao, Fenglong Ma
Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals
Yanai Elazar, Bhargavi Paranjape, Hao Peng, Sarah Wiegreffe, Khyathi Chandu, Vivek Srikumar, Sameer Singh, Noah A. Smith
Disordered-DABS: A Benchmark for Dynamic Aspect-Based Summarization in Disordered Texts
Xiaobo Guo, Soroush Vosoughi
LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning
Zifan Xu, Haozhu Wang, Dmitriy Bespalov, Xian Wu, Peter Stone, Yanjun Qi
TROPE: TRaining-Free Object-Part Enhancement for Seamlessly Improving Fine-Grained Zero-Shot Image Captioning
Joshua Feinglass, Yezhou Yang
The Craft of Selective Prediction: Towards Reliable Case Outcome Classification - An Empirical Study on European Court of Human Rights Cases
Santosh T.Y.S.S, Irtiza Chowdhury, Shanshan Xu, Matthias Grabmair
InfuserKI: Enhancing Large Language Models with Knowledge Graphs via Infuser-Guided Knowledge Integration
Fali Wang, Runxue Bao, Suhang Wang, Wenchao Yu, Yanchi Liu, Wei Cheng, Haifeng Chen
SummaCoz: A Dataset for Improving the Interpretability of Factual Consistency Detection for Summarization
Ge Luo, Weisi Fan, Miaoran Li, Guoruizhe Sun, Runlong Zhang, Chenyu Xu, Forrest Sheng Bao
Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model
Sheng Cheng, Maitreya Patel, Yezhou Yang
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning
Akshara Prabhakar, Thomas L. Griffiths, R. Thomas McCoy
Self-contradictory reasoning evaluation and detection
Ziyi Liu, Soumya Sanyal, Isabelle Lee, Yongkang Du, Rahul Gupta, Yang Liu, Jieyu Zhao
Incorporating Precedents for Legal Judgement Prediction on European Court of Human Rights Cases
Santosh T.Y.S.S, Mohamed Hesham Elganayni, Stanisław Sójka, Matthias Grabmair
Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification
Anisha Gunjal, Greg Durrett
MoleculeQA: A Dataset to Evaluate Factual Accuracy in Molecular Comprehension
Xingyu Lu, He CAO, Zijing Liu, Shengyuan Bai, leqingchen, Yuan Yao, Hai-Tao Zheng, Yu Li
Walia-LLM: Enhancing Amharic-LLaMA by Integrating Task-Specific and Generative Datasets
Israel Abebe Azime, Atnafu Lambebo Tonja, Tadesse Destaw Belay, Mitiku Yohannes Fuge, Aman Kassahun Wassie, Eyasu Shiferaw Jada, Yonas Chanie, Walelign Tewabe Sewunetie, Seid Muhie Yimam
Sanitizing Large Language Models in Bug Detection with Data-Flow
Chengpeng Wang, Wuqi Zhang, Zian Su, Xiangzhe Xu, Xiangyu Zhang
Scaling Behavior for Large Language Models regarding Numeral Systems: An Example using Pythia
Zhejian Zhou, JIayu Wang, Dahua Lin, Kai Chen
When and Where Did it Happen? An Encoder-Decoder Model to Identify Scenario Context
Enrique Noriega-Atala, Robert Vacareanu, Salena Torres Ashton, Adarsh Pyarelal, Clayton T Morrison, Mihai Surdeanu
Enhancing Incremental Summarization with Structured Representations
EunJeong Hwang, Yichao Zhou, James Bradley Wendt, Beliz Gunel, Nguyen Vo, Jing Xie, Sandeep Tata
Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models
Songtao Jiang, Tuo zheng, Yan Zhang, YEYING JIN, Li Yuan, Zuozhu Liu
Multiple Knowledge-Enhanced Interactive Graph Network for Multimodal Conversational Emotion Recognition
Geng Tu, Jun Wang, Zhenyu Li, Shiwei Chen, Bin Liang, Xi Zeng, Min Yang, Ruifeng Xu
AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation
Jia Fu, Xiaoting Qin, Fangkai Yang, Lu Wang, Jue Zhang, Qingwei Lin, Yubo Chen, Dongmei Zhang, Saravan Rajmohan, Qi Zhang
Unleashing the Potential of Large Language Models through Spectral Modulation
Peng Sun, Yao Zhu, Yunjian Zhang, Xiu Yan, Zizhe Wang, Xiangyang Ji
LinguAlchemy: Fusing Typological and Geographical Elements for Unseen Language Generalization
Muhammad Farid Adilazuarda, Samuel Cahyawijaya, Genta Indra Winata, Ayu Purwarianti, Alham Fikri Aji
QUEST: Efficient Extreme Multi-Label Text Classification with Large Language Models on Commodity Hardware
Chuang Zhou, Junnan Dong, Xiao Huang, Zirui Liu, Kaixiong Zhou, Zhaozhuo Xu
UniSumEval: Towards Unified, Fine-grained, Multi-dimensional Summarization Evaluation for LLMs
Yuho Lee, Taewon Yun, Jason Cai, Hang Su, Hwanjun Song
Enhancing Arguments Recognition for Financial Mathematical Reasoning over Hybrid Data
Jinsu Lim, Yechan Hwang, Young-Jun Lee, Ho-Jin Choi
Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check
Haiming Wu, Hanqing Zhang, richeng xuan, Dawei Song
CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models
Zexuan Qiu, Jingjing Li, Shijue Huang, Xiaoqi Jiao, Wanjun Zhong, Irwin King
Guided Profile Generation Improves Personalization with Large Language Models
Jiarui Zhang
MABC: Multi-Agent Blockchain-inspired Collaboration for Root Cause Analysis in Micro-Services Architecture
Wei Zhang, Hongcheng Guo, Jian Yang, Zhoujin Tian, Yi Zhang, Yan Chaoran, Zhoujun Li, Tongliang Li, xu Shi, liangfan zheng, Bo Zhang
Taking a Deep Breath: Enhancing Language Modeling of Large Language Models with Sentinel Tokens
Weiyao Luo, Suncong Zheng, Heming Xia, weikang wang, Yan Lei, Tianyu Liu, Shuang Chen, Zhifang Sui
Are LLMs Good Annotators for Discourse-level Event Relation Extraction?
Kangda Wei, Aayush Gautam, Ruihong Huang
Reward Modeling Requires Automatic Adjustment Based on Data Quality
Binghai Wang, Rui Zheng, Lu Chen, Zhiheng Xi, Wei Shen, Yuhao Zhou, Dong Yan, Tao Gui, Qi Zhang, Xuanjing Huang
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference
Zhongwei Wan, ZiangWu, Che Liu, Jinfa Huang, Zhihong Zhu, Peng Jin, Longyue Wang, Li Yuan
The Fall of ROME: Understanding the Collapse of LLMs in Model Editing
Wanli Yang, Fei Sun, Jiajun Tan, Xinyu Ma, Du Su, Dawei Yin, Huawei Shen
OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs
Jintian Zhang, Cheng Peng, Mengshu Sun, Xiang Chen, Lei Liang, Zhiqiang Zhang, JUN ZHOU, Huajun Chen, Ningyu Zhang
Can Large Language Models Identify Authorship?
Baixiang Huang, Canyu Chen, Kai Shu
Self-Evolution Fine-Tuning for Policy Optimization
Ruijun Chen, Jiehao Liang, Shiping Gao, Fanqi Wan, Xiaojun Quan
Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Qingyu Yin, Xuzheng He, Chak Tou Leong, Fan Wang, Yanzhao Yan, Xiaoyu Shen, Qiang Zhang
Adaptive Feature-based Low-Rank Compression of Large Language Models via Bayesian Optimization
Yixin Ji, Yang Xiang, Juntao Li, Qingrong Xia, Zi Ye, Xinyu Duan, Zhefeng Wang, Kehai Chen, Min Zhang
Emosical: An Emotion Annotated Musical Theatre Dataset
Hayoon Kim, Ahyeon Choi, Sungho Lee, Hyun Jin Jung, Kyogu Lee
TransLLaMa: LLM-based Simultaneous Translation System
Roman Koshkin, Katsuhito Sudoh, Satoshi Nakamura
Inference-Time Language Model Alignment via Integrated Value Guidance
Zhixuan Liu, Zhanhui Zhou, Yuanfu Wang, Chao Yang, Yu Qiao
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models
Jiahuan Cao, Dezhi Peng, Peirong Zhang, Yongxin Shi, Yang Liu, Kai Ding, Lianwen Jin
NegotiationToM: A Benchmark for Stress-testing Machine Theory of Mind on Negotiation Surrounding
Chunkit Chan, Cheng Jiayang, Yauwai Yim, Zheye Deng, Wei Fan, Haoran Li, Xin Liu, Hongming Zhang, Weiqi Wang, Yangqiu Song
A Robust Dual-debiasing VQA Model based on Counterfactual Causal Effect
Lingyun Song, Chengkun Yang, Xuanyu Li, Xuequn Shang
PyramidCodec: Hierarchical Codec for Long-form Music Generation in Audio Domain
Jianyi Chen, Zheqi DAI, Zhen Ye, Xu Tan, Qifeng Liu, Yike Guo, Wei Xue
Beyond Persuasion: Towards Conversational Recommender System with Credible Explanations
Peixin Qin, Chen Huang, Yang Deng, Wenqiang Lei, Tat-Seng Chua
Axis Tour: Word Tour Determines the Order of Axes in ICA-transformed Embeddings
Hiroaki Yamagiwa, Yusuke Takase, Hidetoshi Shimodaira
Revisiting Query Variation Robustness of Transformer Models
Tim Hagen, Harrisen Scells, Martin Potthast
Revisiting Catastrophic Forgetting in Large Language Model Tuning
Hongyu Li, Liang Ding, Meng Fang, Dacheng Tao
M5 – A Diverse Benchmark to Assess the Performance of Large Multimodal Models Across Multilingual and Multicultural Vision-Language Tasks
Florian Schneider, Sunayana Sitaram
Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models
Flor Miriam Plaza-del-Arco, Amanda Cercas Curry, Susanna Paoli, Alba Cercas Curry, Dirk Hovy
Boosting Large Language Models with Continual Learning for Aspect-based Sentiment Analysis
Xuanwen Ding, Jie Zhou, Liang Dou, Qin Chen, Yuanbin Wu, Arlene Chen, Liang He
ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context
Zirui Wu, Yansong Feng
Granularity is crucial when applying differential privacy to text
Doan Nam Long Vu, Timour Igamberdiev, Ivan Habernal
An Open-Source Data Contamination Report for Large Language Models
YUCHENG LI, YUNHAO GUO, Frank Guerin, Chenghua Lin
Recent Advances in Online Hate Speech Moderation: Multimodality and the Role of Large Models
Ming Shan Hee, Shivam Sharma, RUI CAO, Palash Nandi, Preslav Nakov, Tanmoy Chakraborty, Roy Ka-Wei Lee
Quantifying Generative Media Bias with a Corpus of Real-world and Generated News Articles
Filip Trhlík, Pontus Stenetorp
OEE-CFC: A Dataset for Open Event Extraction from Chinese Financial Commentary
Qizhi Wan, Changxuan Wan, Rong Hu, Dexi Liu, XuWenwu, Kang Xu, Zou Meihua, LiuTao, 杨杰, xiongzhenwei
Graph-tree Fusion Model with Bidirectional Information Propagation for Long Document Classification
Sudipta Singha Roy, Xindi Wang, Robert Mercer, Frank Rudzicz
BookWorm: A Dataset for Character Description and Analysis
Argyrios Papoudakis, Mirella Lapata, Frank Keller
Leveraging Grammar Induction for Language Understanding and Generation
Jushi Kai, Shengyuan Hou, Yusheng Huang, Zhouhan Lin
SH2: Self-Highlighted Hesitation Helps You Decode More Truthfully
Jushi Kai, Tianhang Zhang, Hai Hu, Zhouhan Lin
RoQLlama: A Lightweight Romanian Adapted Language Model
George-Andrei Dima, Andrei-Marius Avram, Cristian-George Craciun, Dumitru-Clementin Cercel
Reference-free Hallucination Detection for Large Vision-Language Models
Qing Li, Jiahui Geng, Chenyang Lyu, Derui Zhu, Maxim Panov, Fakhri Karray
WavLLM: Towards Robust and Adaptive Speech Large Language Model
Shujie HU, Long Zhou, Shujie LIU, Sanyuan Chen, Lingwei Meng, Hongkun Hao, Jing Pan, Xunying Liu, Jinyu Li, Sunit Sivasankaran, Linquan Liu, Furu Wei
Learning from Implicit User Feedback, Emotions and Demographic Information in Task-Oriented Document-Grounded Dialogues
Dominic Petrak, Thy Thy Tran, Iryna Gurevych
Improving Argument Effectiveness Across Ideologies using Instruction-tuned Large Language Models
Roxanne El Baff, Khalid Al Khatib, Milad Alshomary, Kai Konen, Benno Stein, Henning Wachsmuth
KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches
Jiayi Yuan, Hongyi Liu, Shaochen Zhong, Yu-Neng Chuang, Songchen Li, Guanchu Wang, Duy Le, Hongye Jin, Vipin Chaudhary, Zhaozhuo Xu, Zirui Liu, Xia Hu
An Evaluation Mechanism of LLM-based Agents on Manipulating APIs
Bing Liu, Zhou Jianxiang, Dan Meng, Haonan Lu
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
WENHAO SHI, Zhiqiang Hu, Yi Bin, Junhua Liu, Yang Yang, See-Kiong Ng, Lidong Bing, Roy Ka-Wei Lee
Navigating the Nuances: A Fine-grained Evaluation of Vision-Language Navigation
Zehao Wang, Minye Wu, Yixin Cao, Yubo Ma, Meiqi Chen, Tinne Tuytelaars
Re-Invoke: Tool Invocation Rewriting for Zero-Shot Tool Retrieval
Yanfei Chen, Jinsung Yoon, Devendra Singh Sachan, Qingze Wang, Vincent Cohen-Addad, Mohammadhossein Bateni, Chen-Yu Lee, Tomas Pfister
Rethinking Evaluation Methods for Machine Unlearning
Leon Wichert, Sandipan Sikdar
Evaluating Moral Beliefs across LLMs through a Pluralistic Framework
Xuelin Liu, Yanfei Zhu, Shucheng Zhu, Pengyuan Liu, Ying Liu, Dong Yu
Knowledge Editing in Language Models via Adapted Direct Preference Optimization
Amit Rozner, Barak Battash, Lior Wolf, Ofir Lindenbaum
Meta-Prompting Efficient Task-Adaptive Query Generator for Retrieval
Yoonsang Lee, Minsoo Kim, seung-won hwang
Reap the Wild Wind: Detecting Media Storms in Large-Scale News Corpora
Dror Kris Markus, Effi Levi, Tamir Sheafer, Shaul Rafael Shenhav
A Survey on Natural Language Counterfactual Generation
Yongjie Wang, Xiaoqi Qiu, Yu Yue, Xu Guo, Zhiwei Zeng, Yuhong Feng, Zhiqi Shen
Geneverse: A Collection of Open-source Multimodal Large Language Models for Genomic and Proteomic Research
Tianyu Liu, Yijia Xiao, Xiao Luo, Hua Xu, Wenjin Zheng, Hongyu Zhao
QRMeM: Unleash the Length Limitation through Question then Reflection Memory Mechanism
Bo Wang, Heyan Huang, Yixin Cao, Jiahao Ying, Wei Tang, Chong Feng
$LONG^{2}RAG$: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall
Zehan Qi, Rongwu Xu, Zhijiang Guo, Cunxiang Wang, Hao Zhang, Wei Xu
IndoCL: Benchmarking Indonesian Language Development Assessment
Nankai Lin, Hongyan Wu, Weixiong Zheng, Xingming Liao, Shengyi Jiang, Aimin Yang, Lixian Xiao
Context-Driven Index Trimming: A Data Quality Perspective to Enhancing Precision of RALMs
Kexin Ma, Ruochun Jin, Wang Haotian, Wang Xi, Huan Chen, Yuhua Tang, Qian Wang
Few shot chain-of-thought driven reasoning to prompt LLMs for open ended medical question answering
Saeel Sandeep Nachane, Ojas Gramopadhye, Prateek Chanda, Ganesh Ramakrishnan, Kshitij Sharad Jadhav, Yatin Nandwani, Dinesh Raghu, Sachindra Joshi
Counter Turing Test ($CT^2$): Investigating AI-Generated Text Detection for Hindi - Ranking LLMs based on Hindi AI Detectability Index ($ADI_{hi}$)
Ishan Kavathekar, Anku Rani, Ashmit Chamoli, Ponnurangam Kumaraguru, Amit P. Sheth, Amitava Das
Generating Media Background Checks for Automated Source Critical Reasoning
Michael Sejr Schlichtkrull
In Defense of Structural Sparse Adapters for Concurrent LLM Serving
Junda Su, Zirui Liu, Zeju Qiu, Weiyang Liu, Zhaozhuo Xu
CONSTRUCTURE: Benchmarking CONcept STRUCTUre REasoning for Multimodal Large Language Models
Zhiwei Zha, Xiangru Zhu, Yuanyi Xu, Chenghua Huang, Jingping Liu, Zhixu Li, Xuwu Wang, Yanghua Xiao, Bei Yang, Xiaoxiao Xu
Stanceformer: Target-Aware Transformer for Stance Detection
Krishna Garg, Cornelia Caragea
Learning Autonomous Driving Tasks via Human Feedbacks with Large Language Models
Yunsheng Ma, Xu Cao, Wenqian Ye, Can Cui, Kai Mei, Ziran Wang
CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies
Weiyan Shi, Ryan Li, Yutong Zhang, Caleb Ziems, Sunny Yu, Raya Horesh, Rogério Abreu de Paula, Diyi Yang
TOOLVERIFIER: Generalization to New Tools via Self-Verification
Dheeraj Mekala, Jason E Weston, Jack Lanchantin, Roberta Raileanu, Maria Lomeli, Jingbo Shang, Jane Dwivedi-Yu
FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models
Liqiang Jing, Ruosen Li, Yunmo Chen, Xinya Du
Learning to Ask Informative Questions: Enhancing LLMs with Preference Optimization and Expected Information Gain
Davide Mazzaccara, Alberto Testoni, Raffaella Bernardi
Advancing Cross-Lingual Entity Alignment with Large Language Models: Tailored Sample Segmentation and Zero-Shot Prompts
Linyan Yang, Jingwei Cheng, Fu Zhang