THE 2024 JOINT INTERNATIONAL CONFERENCE ON COMPUTATIONAL LINGUISTICS, LANGUAGE
RESOURCES AND EVALUATION

20-25 MAY, 2024 / TORINO, ITALIA

List of Accepted Papers

/

/

List of Accepted Papers

List of Accepted Papers

Submission ID Authors Title
1 Min Zeng, Jiexin Kuang, Mengyang Qiu, Jayoung Song and Jungyeul Park Evaluating Prompting Strategies for Grammatical Error Correction Based on Language Proficiency
2 Tollef Emil Jørgensen and Andre Kåsen Aligning the Norwegian UD Treebank with Entity and Coreference Information
4 Amir Hazem, Kazuyuki Motohashi and chen zhu From Technology to Market. Bilingual Corpus on the Evaluation of Technology Opportunity Discovery
16 Matiss Rikters and Toshiaki Nakazawa Revisiting Context Choices for Context-aware Machine Translation
18 Yubing Ren, Yanan Cao, Hao Li, yingjie li, Zixuan ZM Ma, Fang Fang, Ping Guo and Wei Ma DEIE: Benchmarking Document-level Event Information Extraction with a Large-scale Chinese News Dataset
19 Hongfei Xu, Yang Song, Qiuhui Liu, Josef van Genabith and Deyi Xiong Rewiring the Transformer with Depth-Wise LSTMs
20 Yige Chen, Jae Ihn, KyungTae Lim and Jungyeul Park Towards Standardized Annotation and Parsing for Korean FrameNet
21 Dongqi Pu, Yifan Wang, Jia E. Loy and Vera Demberg SciNews: From Scholarly Complexities to Public Narratives -- A Dataset for Scientific News Report Generation
22 Minghua Nuo and Chaofan Guo Hybrid of Spans and Table-Filling for Aspect-Level Sentiment Triplet Extraction
27 Ojas Nimase and Sanghyun Hong When Do "More Contexts" Help with Sarcasm Recognition?
28 YuHong Sun, Zhangyue Yin, Qipeng Guo, Jiawen Wu, Xipeng Qiu and Hui Zhao Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem
29 Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Xiaonan Li, Tianxiang Sun, Cheng Chang, Qinyuan Cheng, Ding Wang, Xiaofeng Mou, Xipeng Qiu and Xuanjing Huang Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models
30 Hyeonho Song, Jisu Hong, Chani Jung, Hyojin Chin, Mingi Shin, Yubin Choi, Junghoi Choi and Meeyoung Cha Detecting Offensive Language in an Open Chatbot Platform
35 Rikito Takahashi, Hirokazu Kiyomaru, Chenhui Chu and Sadao Kurohashi Abstractive Multi-Video Captioning: Benchmark Dataset Construction and Extensive Evaluation
37 Bichen Wang, Yuzhe Zi, Yanyan Zhao, Pengfei Deng and Bing Qin ESDM: Early Sensing Depression Model in Social Media Streams
38 Marc Feger and Stefan Dietze TACO – Twitter Arguments from COnversations
39 Jonathan Dunn and Lane Edwards-Brown Geographically-Informed Language Identification
40 Jonathan Dunn Validating and Exploring Large Geographic Corpora
41 Jonathan Dunn, Benjamin Adams and Harish Tayyar Madabushi Pre-Trained Language Models Represent Some Geographic Populations Better Than Others
42 Jun Xu, Mengshu Sun, Zhiqiang Zhang and Jun Zhou ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models
44 Ramona Christen, Anastassia Shaitarova, Matthias Stürmer and Joel Niklaus Resolving Legalese: A Multilingual Exploration of Negation Scope Resolution in Legal Documents
45 Yida Mu, Xingyi Song, Kalina Bontcheva and Nikolaos Aletras Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets
46 Haven Kim, Jongmin Jung, Dasaem Jeong and Juhan Nam K-pop Lyric Translation: Dataset, Analysis, and Neural-Modelling
59 Guijin Son, Hanwool Lee, suwan kim, Huiseo Kim, Jae cheol Lee, Je Won Yeom, Jihyu Jung, Jung woo Kim and Songseong Kim HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models
64 Stella Markantonatou, Vivian Stamou, Christina Christodoulou, Georgia Apostolopoulou, Antonis Balas and George Ioannakis The Corpus AIKIA: using ranking annotation for Offensive Language Detection in Modern Greek
69 Wenxin Guo, Lei Zhang, Kun Zhang, Yi Liu and Zhendong Mao Visual-Linguistic Dependency Encoding for Image-Text Retrieval
72 Gérard Bailly, Romain Legrand, Martin Lenglet, Frédéric Elisei, Maëva Hueber and Olivier Perrotin Emotags: Computer-Assisted Verbal Labelling of Expressive Audiovisual Utterances for Expressive Multimodal TTS
73 Vladimir Araujo, Maria Mihaela Trusca, Rodrigo Tufiño and Marie-Francine Moens Sequence-to-Sequence Spanish Pre-trained Language Models
79 Nigel Ward and Divette Marco A Collection of Pragmatic-Similarity Judgments over Spoken Dialog Utterances
82 Guangmin Zheng, Jin Wang, Xiaobing Zhou and Xuejie Zhang Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling
85 Joosung Lee and Jinhong Kim Enhanced Facet Generation with LLM Editing
87 Rustem Yeshpanov and Huseyin Atakan Varol KazSAnDRA: Kazakh Sentiment Analysis Dataset of Reviews and Attitudes
91 Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun and Qi Zhang Calibrating LLM-Based Evaluator
93 Junbing Yan, Chengyu Wang, Taolin Zhang, XIAOFENG HE, jun huang, Wei Zhang, Longtao Huang and hui xue TRELM: Towards Robust and Efficient Pre-training for Knowledge-Enhanced Language Models
94 Wenjun Kong and Yamei Xia CARE: Co-Attention Network for Joint Entity and Relation Extraction
101 Danqing Luo, Chen Zhang, Yan Zhang and Haizhou Li CrossTune: Black-Box Few-Shot Classification with Label Enhancement
104 ZhongXiang Sun, Kepu Zhang, Weijie Yu, Haoyu Wang and Jun Xu Logic Rules as Explanations for Legal Case Retrieval
108 David Gimeno-Gómez and Carlos-D. Martínez-Hinarejos Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition
109 Stephanie M. Lukin, Claire Bonial, Matthew Marge, Taylor A. Hudson, Cory J. Hayes, Kimberly Pollard, Anthony Baker, Ashley N. Foots, Ron Artstein, Felix Gervits, Mitchell Abrams, Cassidy Henry, Lucia Donatelli, Anton Leuski, Susan G. Hill, David Traum and Clare Voss SCOUT: A Situated and Multi-Modal Human-Robot Dialogue Corpus
110 Unggi Lee, Sungjun Yoon, Joon Seo Yun, Kyoungsoo Park, YoungHoon Jung, Damji Stratton and Hyeoncheol Kim Difficulty-Focused Contrastive Learning for Knowledge Tracing with a Large Language Model-Based Difficulty Prediction
118 Li-Ming Zhan, Bo LIU and Xiao-Ming Wu VI-OOD: A Unified Framework of Representation Learning for Textual Out-of-distribution Detection
119 Nina Markl, Lauren Hall-Lew and Catherine Lai Language Technologies as if People Mattered: Centering Communities in Language Technology Development
130 Yida Mu, Mali Jin, Kalina Bontcheva and Xingyi Song Examining Temporalities on Stance Detection Towards COVID-19 Vaccination
131 Huixuan Zhang and Xiaojun Wan Image Matters: A New Dataset and Empirical Study for Multimodal Hyperbole Detection
135 Marta Lango, Borys Naglik, Mateusz Lango and Iwo Naglik Polish-ASTE: Aspect-Sentiment Triplet Extraction Datasets for Polish
136 Gabriel de Jesus and Sérgio Sobral Nunes Data Collection Pipeline for Low-Resource Languages: A Case Study on Constructing a Tetun Text Corpus
138 Anton F. Thielmann, Christoph Weisser and Benjamin Säfken Human in the loop: How to effectively create coherent topics by manually labeling only a few documents per class
139 Andy Luecking, Giuseppe Abrami, Leon Hammerla, Marc Rahn, Daniel Baumartz, Steffen Eger and Alexander Mehler Dependencies over Times and Tools (DoTT)
142 Fuhan Cai, Duo Liu, Zhongqiang Zhang, Ge Liu, Xiaozhe Yang and Xiangzhong Fang NER-guided Comprehensive Hierarchy-aware Prompt Tuning for Hierarchical Text Classification
143 José-M. Acosta-Triana, David Gimeno-Gómez and Carlos-D. Martínez-Hinarejos AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies
145 Siyu Duan, Jun Wang and Qi Su Restoring Ancient Ideograph: A Multimodal Multitask Neural Network Approach
148 Yingying Zhang, Xian Wu, Yu Zhang and Yefeng Zheng Knowledge-aware Attention Network for Medication Effectiveness Prediction
154 Yiping Jin, Leo Wanner and Alexander Shvets GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection?
157 Paulo Cavalin and Claudio Santos Pinhanez Theoretical and Empirical Advantages of Dense-Vector to One-Hot Encoding of Intent Classes in Open-World Scenarios
162 Yuhong He, Yongqi Zhang, Shizhu He and Jun Wan BP4ER: Bootstrap Prompting for Explicit Reasoning in Medical Dialogue Generation
170 Yugo Murawaki Principal Component Analysis as a Sanity Check for Bayesian Phylolinguistic Reconstruction
173 Guozheng Li, Wenjun Ke, Peng Wang, Zijie Xu, Ke Ji, Jiajun Liu, Ziyu Shang and Qiqing Luo Unlocking Instructive In-Context Learning with Tabular Prompting for Relational Triple Extraction
174 Zhihong Zhu, Xuxin Cheng, Hao An, Zhichang Wang, Dongsheng Chen and Zhiqi Huang Zero-Shot Spoken Language Understanding via Large Language Models: A Preliminary Study
176 Jennifer A. Bishop, Sophia Ananiadou and Qianqian Xie LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation
177 Sixing Yu, Juan Pablo Munoz and Ali Jannesari Federated Foundation Models: Privacy-Preserving and Collaborative Learning for Large Models
180 Kai Xu, Zhengyu Wang, Yuxuan Long and Qiaona Zhao Deep Reinforcement Learning-based Dialogue Policy with Graph Convolutional Q-network
183 lianyu hu, Liqing Gao, Zekang Liu and Wei Feng Dynamic Spatial-Temporal Aggregation for Skeleton-Aware Sign Language Recognition
185 Atsushi Kojima Sub-Table Rescorer for Table Question Answering
190 Shizhou Huang, Bo Xu, Changqun Li, Jiabo Ye and xin Lin MNER-MI: A Multi-image Dataset for Multimodal Named Entity Recognition in Social Media
191 Peixin Huang, Xiang Zhao, Minghao Hu, Zhen Tan and Weidong Xiao Distill, Fuse, Pre-train: Towards Effective Event Causality Identification with Commonsense-Aware Pre-trained Model
194 Yoshinari Nagai, Teruaki Oka and Mamoru Komachi A Document-Level Text Simplification Dataset for Japanese
195 Markus Bayer, Markus Neiczer, Maximilian Samsinger, Björn Buchhold and Christian Reuter XAI-Attack: Utilizing Explainable AI to Find Incorrectly Learned Patterns for Black-Box Adversarial Example Creation
197 hailay Teklehaimanot, Wolfgang Nejdl and Niloy Ganguly TIGQA: An Expert-Annotated Question-Answering Dataset in Tigrinya
200 Paul Grundmann, Jens-Michalis Papaioannou, Tom Oberhauser, Thomas Steffek, Amy Siu, Wolfgang Nejdl and Alexander Loeser Data Drift in Clinical Outcome Prediction from Admission Notes
206 Weiyao Luo, Junfeng Ran, Zailong Tian, Sujian Li and Zhifang Sui FaGANet: An Evidence-Based Fact-Checking Model with Integrated Encoder Leveraging Contextual Information
212 Mert Inan and Malihe Alikhani Seeing Eye-to-Eye: Cross-Modal Coherence Relations Inform Eye-gaze Patterns During Comprehension & Production
214 Connor Heaton and Prasenjit Mitra Deriving Entity-Specific Embeddings From Multi-Entity Sequences
216 Avril Gazeau and Francois Lareau Flexible Lexicalization in Rule-based Text Realization
219 Luke Gessler PrOnto: Language Model Evaluations for 859 Languages
223 Peiyu Liu, Ze-Feng Gao, Xiao Zhang, Wayne Xin Zhao and Ji-Rong Wen Enhancing Parameter-efficient Fine-tuning with Simple Calibration based on Stable Rank
227 Junyi He and Xia Li Zero-shot Cross-lingual Automated Essay Scoring
229 Yi-Cheng Wang, Hsin-Wei Wang, Bi-Cheng Yan, Chi-Han Lin and Berlin Chen DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
231 Yucheng Cai, Wentao Ma, Yuchuan Wu, Shuzheng Si, yuan shao, Zhijian Ou and Yongbin Li UniPCM: Universal Pre-trained Conversation Model with Task-aware Automatic Prompt
237 Iben Nyholm Debess, Annika Simonsen and Hafsteinn Einarsson Good or Bad News? Exploring GPT-4 for Sentiment Analysis for Faroese on a Public News Corpora
239 Christopher Weiss, Frauke Kreuter and Ivan Habernal To Share or Not to Share: What Risks Would Laypeople Accept to Give Sensitive Data to Differentially-Private NLP Systems?
243 Viktor Hangya and Alexander Fraser How to Solve Few-Shot Abusive Content Detection Using the Data We Actually Have
244 Truong Dinh Do, Phuong Minh Nguyen and Minh Nguyen ZeLa: Advancing Zero-Shot Multilingual Semantic Parsing with Large Language Models and Chain-of-Thought Strategies
246 Yuzhuang Xu, Shuo Wang, Peng Li, Xuebo Liu, Xiaolong Wang, Weidong Liu and Yang Liu Pluggable Neural Machine Translation Models via Memory-augmented Adapters
248 Abdullatif Koksal, Silvia Severini and Hinrich Schütze SilverAlign: MT-Based Silver Data Algorithm For Evaluating Word Alignment
252 Bo Lv, Xin Liu, Kaiwen Wei, Ping Luo and Yue Yu TAeKD: Teacher Assistant Enhanced Knowledge Distillation for Closed-Source Multilingual Neural Machine Translation
255 Hongcheng Liu, Pingjie Wang, Zhiyuan Zhu, Yanfeng Wang and Yu Wang CE-VDG: Counterfactual Entropy-based Bias Reduction for Video-grounded Dialogue Generation
258 Sohom Ghosh, Arnab Maji, Aswartha Narayana and Sudip Kumar Naskar IndicFinNLP: Financial Natural Language Processing for Indian Languages
259 Yuang Li, Yinglu Li, Min Zhang, Chang Su, Jiawei Yu, Mengyao Piao, Xiaosong Qiao, Miaomiao Ma, Yanqing Zhao and Hao Yang CB-Whisper: Contextual Biasing Whisper using Open-Vocabulary Keyword-Spotting
260 Jimin An, YunSeok Choi and Jee-Hyong Lee Code Defect Detection using Pre-trained Language Models with Encoder-Decoder via Line-Level Defect Localization
262 Haoyu Xiong, Xinchun Zhang, Leixin Yang, Yu Xiang and Gang Fang STAF: Pushing the Boundaries of Test-Time Adaptation Towards Practical Noise Scenarios
264 Ryo Nagata, Yoshifumi Kawasaki, Naoki Otani and Hiroya Takamura A Computational Approach to Quantifying Grammaticization of English Deverbal Prepositions
265 Zhixiong Cao, Hai-Tao Zheng, Yangning Li, Jin Xu, Rongsheng Li and Hong-Gee Kim Depth Aware Hierarchical Replay Continual Learning for Knowledge Based Question Answering
268 Shunyu Liu, Jie Zhou, Qunxi Zhu, Qin Chen, Qingchun Bai, Jun Xiao and Liang He Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models
269 Xinbei Ma, Yeyun Gong, Pengcheng He, Hai Zhao and Nan Duan PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive Summarization
274 Amir Hossein Kargaran, François Yvon and Hinrich Schütze GlotScript: A Resource and Tool for Low Resource Writing System Identification
278 Masoud Monajatipoor, Zi-Yi Dou, Aichi Chien, Nanyun Peng and Kai-Wei Chang Medical Vision-Language Pre-Training for Brain Abnormalities
279 Ang Li, Yiquan Wu, Yifei Liu, Kun Kuang, Fei Wu and Ming Cai Enhancing Court View Generation with Knowledge Injection and Guidance
284 Xiaotong Feng, Meng-Fen Chiang, Wang-Chien Lee and Zixin Kuang Evidence-guided Inference for Neutralized Zero-shot Transfer
286 Eunkyul Leah Jo, Angela Yoonseo Park, Grace Tianjiao Zhang, Izia Xiaoxiao Wang, Junrui Wang, MingJia Mao and Jungyeul Park An Untold Story of Preprocessing Task Evaluation: An Alignment-based Joint Evaluation Approach
287 Binling Nie, Yiming Shao and Yigang Wang Know-Adapter: Towards Knowledge-Aware Parameter-Efficient Transfer Learning for Few-shot Named Entity Recognition
290 Daizong Liu, Xiaoye Qu, Xiang Fang, Jianfeng Dong, Pan Zhou, Guoshun Nan, Keke Tang, Wanlong Fang and Yu Cheng Towards Robust Temporal Activity Localization Learning with Noisy Labels
291 Feihong Lu, Xiaocui Yang, Qian Li, Qingyun Sun, Ke Jiang, Cheng Ji and Jianxin Li Few-Shot Multimodal Named Entity Recognition based on Mutlimodal Causal Intervention Graph
292 Yijun Liu, Feifei Dai, Xiaoyan Gu, Minghui Zhai, Bo Li and Meiou Zhang Domain-aware and Co-adaptive Feature Transformation for Domain Adaption Few-shot Relation Extraction
294 Junyu Luo, Xiaochen Wang, Jiaqi Wang, Aofei Chang, Yaqing Wang and Fenglong Ma CoRelation: Boosting Automatic ICD Coding Through Contextualized Code Relation Learning
305 Yufeng Wang, Chao Chen, Zhou Yang, shuhui wang and Xiangwen Liao CTSM: Combining Trait and State Emotions for Empathetic Response Model
310 Yo Sato Disambiguating homographs and homophones simultaneously: a regrouping method for Japanese
312 Ping Guo, Yue Hu, Yubing Ren, Yunpeng Li, jiarui zhang, Xingsheng Zhang and Heyan Huang Teaching Large Language Models to Translate on Low-resource Languages with Textbook Prompting
313 Heegon Jin, Seonil Son, Jemin Park, Youngseok Kim, Hyungjong Noh and Yeonsoo Lee Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation
316 Wei Zhou, Heike Adel, Hendrik Schuff and Ngoc Thang Vu Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings
319 Abhidip Bhattacharyya, Martha Palmer and Christoffer Heckman ReCAP: Semantic Role Enhanced Caption Generation
323 Subhradeep Kayal, Alexander Rakhlin, Ali Dashti and Serguei Stepaniants How Far is Too Far? Studying the Effects of Domain Discrepancy on Masked Language Models
327 Yongxiu Xu, Hao Xu, Heyan Huang, Shiyao Cui, Minghao Tang, Longzheng Wang and Hongbo Xu An Effective Span-based Multimodal Named Entity Recognition with Consistent Cross-Modal Alignment
329 Itsugun Cho, Ryota Takahashi, Yusaku Yanase and Hiroaki Saito Deep Reinforcement Learning with Hierarchical Action Exploration for Dialogue Generation
335 Xiao Cui, Yulei Qin, Yuting Gao, Enwei Zhang, Zihan Xu, Tong Wu, Ke Li, Xing Sun, Wengang Zhou and Houqiang Li Sinkhorn Distance Minimization for Knowledge Distillation
337 Zhen Wang, Peide Zhu and Jie Yang ControversialQA: Exploring Controversy in Question Answering
338 Terufumi Morishita, Atsuki Yamaguchi, Gaku Morio, Hikaru Tomonari, Osamu Imaichi and Yasuhiro Sogawa JFLD: A Japanese Benchmark for Deductive Reasoning based on Formal Logic
341 Hakyung Sung and Gyu-Ho Shin Constructing a Dependency Treebank for Second Language Learners of Korean
347 Tianqi Hu, Lishuang Li, Xueyang Qin and Yubo Feng Event Representation Learning with Multi-Grained Contrastive Learning and Triple-Mixture of Experts
348 Mingmin Wu, Guixin Su, Yongcheng Zhang, Zhongqiang Huang and Ying Sha Refining Idioms Semantics Comprehension via Contrastive Learning and Cross-Attention
353 Ning Bian, Xianpei Han, Le Sun, Hongyu Lin, Yaojie Lu, Ben He, Shanshan Jiang and Bin Dong ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models
354 Karen Fort, Laura Alonso Alemany, Luciana Benotti, Julien Bezançon, Claudia Borg, Marthese Borg, Yongjian Chen, Fanny Ducel, Yoann Dupont, Guido Ivetta, Zhijian Li, Margot Mieskes, Marco Naguib, Yuyan Qian, Matteo Radaelli, Wolfgang S. Schmeisser-Nieto, Emma Raimundo Schulz, Thiziri Saci, Sarah Saidi, Javier Torroba Marchante, Shilin Xie, Sergio E. Zanotto and Aurélie Névéol Your Stereotypical Mileage may Vary: Practical Challenges of Evaluating Biases in Multiple Languages and Cultural Contexts
359 Siyu Ren and Kenny Q. Zhu Low-Rank Prune-And-Factorize for Language Model Compression
360 Adam Przepiórkowski, Magdalena Borysiak and Adam Głowacki An Argument for Symmetric Coordination from Dependency Length Minimization: A Replication Study
361 Shulin Huang, Shirong Ma, Yinghui Li, Mengzuo Huang, wuhe zou, Weidong Zhang and Haitao Zheng LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles
362 Sayed Muddashir Hossain, Jan Alexandersson and Philipp Müller M3TCM: Multi-modal Multi-task Context Model for Utterance Classification in Motivational Interviews
363 Anne Marte Haug Olstad, Anna Smolander, Sofia Strömbergsson, Sari Ylinen, Minna Lehtonen, Mikko Kurimo, Yaroslav Getman, Tamás Grósz, Xinwei Cao, Torbjørn Svendsen and Giampiero Salvi Collecting Linguistic Resources for Assessing Children's Pronunciation of Nordic Languages
368 Guanlin Li, Xuechen Zhao, Amir Jafari, Wenhao Shao, Reza Farahbakhsh and Noel Crespi Improving Cross-lingual Transfer with Contrastive Negative Learning and Self-training
369 Shichen Li, Zhongqing Wang, Yanzhi Xu and Guodong Zhou Structure-aware Generation Model for Cross-Domain Aspect-based Sentiment Classification
373 Valentina Dragos, Delphine Battistelli, Fatou Sow and Aline Etienne Exploring the Emotional Dimension of French Online Toxic Content
374 Pierre Nugues Linking Named Entities in Diderot's Encyclopédie to Wikidata
378 Chuyao Ding, Yu Hong and Jianmin Yao SGCM: Salience-Guided Context Modeling for Question Generation
379 Jinming Zhao, Katsuhito Sudoh, Satoshi Nakamura, Yuka Ko, Kosuke Doi and Ryo Fukuda NAIST-SIC-Aligned: an Aligned English-Japanese Simultaneous Interpretation Corpus
380 Vilém Zouhar, Kalvin Chang, Chenxuan Cui, Nate B. Carlson, Nathaniel Romney Robinson, Mrinmaya Sachan and David R. Mortensen PWESuite: Phonetic Word Embeddings and Tasks They Facilitate
383 Philip Blair and Kfir Bar JRC-Names-Retrieval: A Standardized Benchmark for Name Search
385 Marco Gaido, Sara Papi, Matteo Negri and Luisa Bentivogli How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena
387 Houcemeddine Turki, Abraham Toluwase Owodunni, Mohamed Ali Hadj Taieb, René Fabrice Bile and Mohamed Ben Aouicha A Decade of Scholarly Research on Open Knowledge Graphs
390 Kai Zhang, Pengcheng Li, Kaisong Song, Xurui Li, Yangyang Kang, Xuhong Zhang and Xiaozhong Liu Knowledge Triplets Derivation from Scientific Publications via Dual-Graph Resonance
391 Ali Al-Laith, Alexander Conroy, Jens Bjerring-Hansen and Daniel Hershcovich Development and Evaluation of Pre-trained Language Models for Historical Danish and Norwegian Literary Texts
394 Hoang Nguyen, Chenwei Zhang, Ye Liu, Natalie Parde, Eugene Rohrbaugh and Philip S. Yu CORI: CJKV Benchmark with Romanization Integration - A step towards Cross-lingual Transfer Beyond Textual Scripts
395 Pranav Arora, Selen Pehlivan and Jorma Laaksonen Text-to-Multimodal Retrieval with Bimodal Input Fusion in Shared Cross-Modal Transformer
400 Kaixuan Wu, Yanghao Lin, Donglin Cao and Dazhen Lin Interpretable Short Video Rumor Detection based on Modality Tampering
403 Zhihong Zhu, Xuxin Cheng, Guimin Hu, Yaowei Li, Zhiqi Huang and Yuexian Zou Towards Multi-modal Sarcasm Detection via Disentangled Multi-grained Multi-modal Distilling
404 Zhigang Chen, Benjia Zhou, Jun Li, Jun Wan, Zhen Lei, Ning Jiang, Quan Lu and Guoqing Zhao Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation
406 Vincent P. Martin and Jean-Luc Rouas Why Voice Biomarkers of Psychiatric Disorders are not used in Clinical Practice? Deconstructing the Myth of the Need for Objective Diagnosis
408 Sijie Li, Sha Li, Hao Zhang, Shuyang Li, Kai Chen, Jianyong Yuan, Yi Cao and Lvqing Yang EpiGEN: An Efficient Multi-Api Code GENeration Framework under Enterprise Scenario
409 Cam-Van Thi Nguyen, Cao-Bach Nguyen, Duc-Trong Le and Quang-Thuy Ha Curriculum Learning Meets Directed Acyclic Graph for Multimodal Emotion Recognition
410 Jingyao Tang, Lishuang Li, Hongbin Lu, Xueyang Qin, Beibei Zhang and Haiming Wu Prototype-based Prompt-Instance Interaction with Causal Intervention for Few-shot Event Detection
413 Bin Cao, Kai Jiang, Fayu Pan, Chenlei Bao and Jing Fan Improving Grammatical Error Correction by Correction Acceptability Discrimination
416 Yifei Yuan, Chen Shi, Wang Runze, Liyi Chen, Renjun Hu, zengming zhang, Feijun Jiang and Wai Lam CO3: Low-resource Contrastive Co-training for Generative Conversational Query Rewrite
417 Shouhui Wang and Biao Qin No Need for Large-Scale Search: Exploring Large Language Models in Complex Knowledge Base Question Answering
419 Yongliang Lin, Zhen Zhang, Mengting Hu, Yufei Sun and Yuzhi Zhang Modalities Should be Appropriately Leveraged: Uncertainty Guidance for Multimodal Chinese Spelling Correction
421 Qiuyu Liang, Weihua Wang, Feilong Bao and Guanglai Gao L^2GC:Lorentzian Linear Graph Convolutional Networks For Node Classification
424 Yan Ge, Victor Junqiu Wei, Yuanfeng Song, Jason Chen Zhang and Raymond Chi-Wing Wong Automatic Data Visualization Generation from Chinese Natural Language Questions
426 Haiyang Wang, Zhiliang Tian, Xin Song, Yue Zhang, Yuchen Pan, Hongkui Tu, Minlie Huang and Bin Zhou Intent-Aware and Hate-Mitigating Counterspeech Generation via Dual-Discriminator Guided LLMs
428 Chuanpeng Yang, Fuqing Zhu, Yaxin Liu, Jizhong Han and Songlin Hu Uncertainty-Aware Cross-Modal Alignment for Hate Speech Detection
429 Seungyoon Lee, Chanjun Park, DaHyun Jung, Hyeonseok Moon, Jaehyung Seo, Sugyeong Eo and Heuiseok Lim Leveraging Pre-existing Resources for Data-Efficient Counter-Narrative Generation in Korean
431 Yan Xiao, Yaochu Jin and Kuangrong Hao Federated Document-Level Biomedical Relation Extraction with Localized Context Contrast
432 Gregor Donabauer and Udo Kruschwitz Challenges in Pre-Training Graph Neural Networks for Context-Based Fake News Detection: An Evaluation of Current Strategies and Resource Limitations
433 Maria Berger, Sebastian Michael Reimann and Nieke Marie Kiwitt Applying Transfer Learning to German Metaphor Prediction
434 Chaojun Xiao, Yutao Sun, Yuan Yao, Xu Han, Wenbin Zhang, Zhiyuan Liu and Maosong Sun Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training
435 Dominik Andreas Kowieski, Michael Hellwig and Thomas Feilhauer TAPASGO: Transfer Learning towards a German-Language Tabular Question Answering Model
436 Shuvam Shiwakoti, Surendrabikram Thapa, Kritesh Rauniyar, Akshyat Shah, Aashish Bhandari and Usman Naseem Analyzing the Dynamics of Climate Change Discourse on Twitter: A New Annotated Corpus and Multi-Aspect Classification
439 Hongchuan Zeng, Hongshen Xu, Lu Chen and Kai Yu Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind
440 Eujene Nikka V. Boquio and Prospero C. Naval, Jr. Beyond Canonical Fine-tuning: Leveraging Hybrid Multi-Layer Pooled Representations of BERT for Automated Essay Scoring
441 Huitong Pan, Qi Zhang, Cornelia Caragea, Eduard Dragut and Longin Jan Latecki SciDMT: A Large-Scale Corpus for Detecting Scientific Mentions
442 Sun Wei, Mingxiao Li, Jingyuan Sun, Jesse Davis and Marie-Francine Moens DMON: A Simple yet Effective Approach for Argument Structure Learning
443 Sondre Wold, Petter Mæhlum and Oddbjørn Hove Estimating Lexical Complexity from Document-Level Distributions
450 Jinan Zou, Maihao Guo, Yu Tian, Yuhao Lin, Haiyao Cao, Lingqiao Liu, Ehsan Abbasnejad and Javen Qinfeng Shi Semantic Role Labeling Guided Out-of-distribution Detection
451 Shiwen Ni, Minghuan Tan, Yuelin Bai, Fuqiang Niu, Min Yang, Bowen Zhang, Ruifeng Xu, Xiaojun Chen, Chengming Li and Xiping Hu MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property
452 Qiushi Sun, Chengcheng Han, Nuo Chen, Renyu Zhu, Jingyang Gong, Xiang Li and Ming Gao Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives
455 Tom Roth, Inigo Jauregi Unanue, Alsharif Abuadbba and Massimo Piccardi XVD: Cross-Vocabulary Differentiable Training for Generative Adversarial Attacks
456 Tomoya Mizumoto, Takato Yamazaki, Katsumasa Yoshikawa, Masaya Ohagi, Toshiki Kawamoto and Toshinori Sato Dialogue Systems Can Generate Appropriate Responses without the Use of Question Marks?-- A Study of the Effects of ``?'' for Spoken Dialogue Systems --
457 Hao Lang, Yinhe Zheng, Binyuan Hui, Fei Huang and Yongbin Li Out-of-Domain Intent Detection Considering Multi-Turn Dialogue Contexts
458 Zihan Wang, Peiyi Wang and Houfeng Wang Utilizing Local Hierarchy with Adversarial Training for Hierarchical Text Classification
462 Yichi Zhang, Zhuo Chen, Lei Liang, Huajun Chen and Wen Zhang Unleashing the Power of Imbalanced Modality Information for Multi-modal Knowledge Graph Completion
463 Shun Inadumi, Seiya Kawano, Akishige Yuguchi, Yasutomo Kawanishi and Koichiro Yoshino A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions
466 Sho Hoshino, Akihiko Kato, Soichiro Murakami and Peinan Zhang Cross-lingual Transfer or Machine Translation? On Data Augmentation for Monolingual Semantic Textual Similarity
469 Wooyoung Kim, TaeYong Kim, Byeongjin KIM, Myeong Jin MJ Lee, Gitaek Lee, kirok kim, Jisoo Cha and Wooju Kim Korean Disaster Safety Information Sign Language Translation Benchmark Dataset
470 Shasha Guo, Jing Zhang, Xirui Ke, Cuiping Li and Hong Chen Diversifying Question Generation over Knowledge Base via External Natural Questions
472 Eileen Wemmer, Sofie Labat and Roman Klinger EmoProgress: Cumulated Emotion Progression Analysis in Dreams and Customer Service Dialogues
477 Zhiyu Fang, Jingyan Qin, Xiaobin Zhu, Chun Yang and Xu-Cheng Yin Arbitrary Time Information Modeling via Polynomial Approximation for Temporal Knowledge Graph Embedding
478 Jan Odijk A Canonical Form for Flexible Multiword Expressions
483 Julia Krebs, Evguenia A. Malaia, Isabella Fessl, Hans-Peter Wiesinger, Dietmar Roehm, Ronnie Wilbur and Hermann Schwameder Motion Capture Analysis of Verb and Adjective Types in Austrian Sign Language (ÖGS)
489 Duyoung Jeon, Junho Lee and Cheongtag Kim User Guide for KOTE: Korean Online That-gul Emotions Dataset
492 Mohamad MZ Elzohbi and Richard Zhao ContrastWSD: Enhancing Metaphor Detection with Word Sense Disambiguation Following the Metaphor Identification Procedure
495 Slawomir Dadas, Michał Perełkiewicz and Rafał Poświata PIRB: A Comprehensive Benchmark of Polish Dense and Hybrid Text Retrieval Methods
498 Jose Diego Suarez and Luis Chiruzzo Null Subjects in Spanish as a Machine Translation Problem
503 Yifan Ding, Qingkai Zeng and Tim Weninger ChatEL: Entity Linking with Chatbots
507 Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Li Qiuxia and Jun Zhao Tug-of-War Between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models
510 Savitha Sam Abraham, Marjan Alirezaie and Luc De Raedt CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments
512 Yizhi Jiang, Jinlong Li and huanhuan chen Relation Classification via Bidirectional Prompt Learning with Data Augmentation by Large Language Model
513 You Zhang, Jin Wang, Liang-Chih Yu, Dan Xu and Xuejie Zhang Improving Personalized Sentiment Representation with Knowledge-enhanced and Parameter-efficient Layer Normalization
514 Jiri Martinek, Pavel Kral, Ladislav Lenc and Josef Baloun COMICORDA: Dialogue Act Recognition in Comic Books
516 Xumeng Liu, Wenya Guo, Ying Zhang, Xubo Liu, Yu Zhao, Shenglong Yu and Xiaojie Yuan Look before You Leap: Dual Logical Verification for Knowledge-based Visual Question Generation
519 kyungho kim, Seongmin Park and Jihwa Lee RT-VQ2A2: Real Time Vector Quantized Question Answering with ASR
523 Donovan Ong, Shuo Sun, Jian Su and Bin Chen Mitigating Linguistic Artifacts in Emotion Recognition for Conversations from TV Scripts to Daily Conversations
527 Shuhei Tateishi, Makoto Nakatsuji and Yasuhito Osugi Word-Aware Modality Stimulation for Multimodal Fusion
528 Jan Odijk, Martin Kroon, Tijmen Baarda, Ben Bonfil and Sheean Spoel MWE-Finder: A Demonstration
529 Dingxin Hu, Xuanyu Zhang, Xingyue Zhang, Yiyang Li, Dongsheng Chen, Marina Litvak, Natalia Vanetik, Qing Yang, Dongliang Xu, Yanquan Zhou, Lei Li, Yuze Li and Yingqi Zhu Improving Factual Consistency in Abstractive Summarization with Sentence Structure Pruning
533 Ahmet Gunduz, Kamer Ali Yuksel, Kareem Darwish, Golara Javadi, Fabio Minazzi, Nicola Sobieski and Sébastien Bratières An Automated End-to-End Open-Source Software for High-Quality Text-to-Speech Dataset Generation
534 Samin Mahdizadeh Sani, Malak Rassem, Chris W. Jenkins, Filip Miletić and Sabine Schulte im Walde What Can Diachronic Contexts and Topics Tell Us About the Present-Day Compositionality of English Noun Compounds?
536 Andres Pineiro-Martin, Carmen Garcia-Mateo, Laura Docio-Fernandez, Maria del Carmen Lopez-Perez and Jose Gandarela-Rodriguez FalAI: A Dataset for End-to-end Spoken Language Understanding in a Low-Resource Scenario
537 Muhammad Huzaifah, Weihua Zheng, Nattapol Chanpaisit and Kui Wu Evaluating Code-Switching Translation with Large Language Models
538 Huawen Feng, Jingsong Yan, Junlong Liu, Junhao Zheng and Qianli Ma Well Begun is Half Done: An Implicitly Augmented Generative Framework with Distribution Modification for Hierarchical Text Classification
543 Georgios Velentzas, Andrew Caines, Rita Borgo, Erin Pacquetet, Clive Hamilton, Taylor Arnold, Diane Nicholls, Paula Buttery, Thomas Gaillat, Nicolas Ballier and Helen Yannakoudakis Logging Keystrokes in Writing by English Learners
546 Leonardo Zilio, Shenbin Qian, Diptesh Kanojia and Constantin Orasan Character-level language models for abbreviation and long-form detection
547 Weiran Chen, Xin Li, Jiaqi Su, Guiqian Zhu, Ying Li, Yi JI and Chunping Liu TARN-VIST: Topic Aware Reinforcement Network for Visual Storytelling
551 Chenhao Wu, Ruifang He, chang liu and Bo Wang Continuous Relational Diffusion driven Topic Model with Multi-grained Text for Microblog
552 Zecheng Wang, Chunshan Li, Zhao Yang, Qingbin Liu, Yanchao Hao, Xi Chen, Dianhui Chu and Dianbo Sui Analyzing Chain-of-thought Prompting in Black-Box Large Language Models via Estimated V-information
554 Ting Zhou, Ying Shen and Yinghui Li GCNet: Global-and-Context Collaborative Learning for Aspect-Based Sentiment Analysis
555 Anais Ollagnier CyberAgressionAdo-v2: Leveraging Pragmatic-Level Information to Decipher Online Hate in French Multiparty Chats
558 Zhiming Li, Yanzhou Li, Tianlin Li, Mengnan Du, bozhi wu, Yushi Cao, Junzhe Jiang and Yang Liu Unveiling Project-Specific Bias in Neural Code Models
559 Quang Anh Nguyen, Nadi Tomeh, Mustapha Lebbah, Thierry Charnois, Hanene Azzag and Santiago Cordoba Muñoz Enhancing Few-Shot Topic Classification with Verbalizers. A Study on Automatic Verbalizer and Ensemble Methods
561 Ge Gao, Jongin Kim, Sejin Paik, Ekaterina Novozhilova, Yi Liu, Sarah T. Bonna, Margrit Betke and Derry Tanti Wijaya Enhancing Emotion Prediction in News Headlines: Insights from ChatGPT and Seq2Seq Models for Free-Text Generation
564 Sebastian Steindl, Ulrich Schäfer and Bernd Ludwig Counterfactual Dialog Mixing as Data Augmentation for Task-Oriented Dialog Systems
566 Van-Tuan Bui and Agata Savary Cross-type French Multiword Expression Identification with Pre-trained Masked Language Models
567 Junzhe Liang, Haifeng Sun, Zirui Zhuang, Qi Qi, Jingyu Wang and Jianxin Liao Distantly Supervised Contrastive Learning for Low-Resource Scripting Language Summarization
568 Flor Miriam Plaza-del-Arco, Alba A. Cercas Curry, Amanda Cercas Curry and Dirk Hovy Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions
571 Yanis Labrak, Mickael Rouvier and Richard Dufour A Zero-shot and Few-shot Study of Instruction-Finetuned Large Language Models Applied to Clinical and Biomedical Tasks
572 Longxiang Zhang, Caleb D. Hart, Susanne Burger and Thomas Schaaf Annotate the Way You Think: An Incremental Note Generation Framework for the Summarization of Medical Conversations
577 Julius Monsen and Arne Jonsson Controllable Sentence Simplification in Swedish using Control Prefixes and Mined Paraphrases
578 David M. Chan, Yiming Ni, David Ross, Sudheendra Vijayanarasimhan, Austin Myers and John Canny Distribution Aware Metrics for Conditional Natural Language Generation
579 Yanis Labrak, Adrien Bazoge, Oumaima El Khettari, Mickael Rouvier, pacome constant dit beaufils, Natalia Grabar, Béatrice Daille, Solen Quiniou, Emmanuel Morin, Pierre-Antoine Gourraud and Richard Dufour DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical Domain
581 Fan Huang, Haewoon Kwak, Kunwoo Park and Jisun An ChatGPT Rates Natural Language Explanation Quality Like Humans: But on Which Scales?
583 kyungho kim, Seongmin Park, junseo lee and Jihwa Lee Non-Essential is NEcessary: Order-agnostic Multi-hop Question Generation
584 Zhiyuan Ma, Jintao Du, Changhua Meng and weiqiang wang Enhancing Distantly Supervised Named Entity Recognition with Strong Label Guided Lottery Training
588 Chengfeng Dou, Ying Zhang, Yanyuan Chen, Zhi Jin, Wenpin Jiao, Haiyan Zhao and Yu Huang Detection, Diagnosis, and Explanation: A Benchmark for Chinese Medial Hallucination Evaluation
589 Baohang Zhou, Ying Zhang, Kehui Song, Hongru Wang, Yu Zhao, Xuhui Sui and Xiaojie Yuan MCIL: Multimodal Counterfactual Instance Learning for Low-resource Entity-based Multimodal Information Extraction
590 Zhaoqi Zhang, Pasquale Balsebre, Siqiang Luo, Zhen Hai and Jiangping Huang StructAM: Enhancing Address Matching through Semantic Understanding of Structure-aware Information
592 Chen Zhang, Yang Yang, Qiuchi Li, Jingang Wang and Dawei Song Task-agnostic Distillation of Encoder-Decoder Language Models
597 Siyu Wang, Jianhui Jiang, Shengran Dai and Jiangtao Qiu A Hierarchical Sequence-to-Set Model with Coverage Mechanism for Aspect Category Sentiment Analysis
600 Baijun Ji, Xiangyu Duan, Zhenyu Qiu, Tong Zhang, Junhui Li, Hao Yang and Min Zhang Submodular-based In-context Example Selection for LLMs-based Machine Translation
602 Keyaki Ohno, Hirotaka Kameko, Keisuke Shirai, Taichi Nishimura and Shinsuke Mori Automatic Construction of a Large-Scale Corpus for Geoparsing Using Wikipedia Hyperlinks
603 Joanna Dolińska and Delphine Bernhard POS Tagging for the Endangered Dagur Language
604 Yi Zhang, Fei Yang, Shuang Peng, Fangyu Wang and Aimin Pan FlattenQuant: Breaking Through the Inference Compute-bound for Large Language Models with Per-tensor Quantization
606 Donghee Choi, Mogan Gim, Donghyeon Park, Mujeen Sung, Hyunjae Kim, Jaewoo Kang and Jihun Choi CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions
607 Fujun Zhang, Xiangdong Su, Jiang Li, Rong Yan and Guanglai Gao EpLSA: Synergy of Expert-prefix Mixtures and Task-Oriented Latent Space Adaptation for Diverse Generative Reasoning
611 Hafida Le Cloirec - Ait Yahya, Olga Seminck and Pascal Amsili FReND: A French Resource of Negation Data
612 Alvin C. Grissom II, Jo Shoemaker, Benjamin Goldman, Ruikang Shi, Craig Stewart, C. Anton Rytting, Leah Findlater and Jordan Boyd-Graber Rapidly Piloting Real-time Linguistic Assistance for Simultaneous Interpreters with Untrained Bilingual Surrogates
613 Mitja Nikolaus, Abhishek Agrawal, Petros Kaklamanis, Alex Warstadt and Abdellah Fourtassi Automatic Annotation of Grammaticality in Child-Caregiver Conversations
614 Piotr Rybak, Piotr Przybyła and Maciej Ogrodniczuk PolQA: Polish Question Answering Dataset
619 Shangkang Wang and Li Pan Target-Adaptive Consistency Enhanced Prompt-Tuning for Multi-Domain Stance Detection
620 Jamil Zaghir, Mina Bjelogrlic, Jean-Philippe Goldman, Soukaïna Aananou, Christophe Gaudet-Blavignac and Christian Lovis FRASIMED: a Clinical French Annotated Resource Produced through Crosslingual BERT-Based Annotation Projection
621 Piotr Rybak and Maciej Ogrodniczuk Silver Retriever: Advancing Neural Passage Retrieval for Polish Question Answering
622 Jennifer Ecker Labeling Results of Topic Models: Word Sense Disambiguation as Key Method for Automatic Topic Labeling with GermaNet
623 Qiushi Sun, Nuo Chen, Jianing Wang, Ming Gao and Xiang Li TransCoder: Towards Unified Transferable Code Representation Learning Inspired by Human Skills
624 Piotr Rybak Transferring BERT Capabilities from High-Resource to Low-Resource Languages Using Vocabulary Matching
626 Nathanael Carraz Rakotonirina and Marco Baroni MemoryPrompt: A Light Wrapper to Improve Context Tracking in Pre-trained Language Models
634 Jianyu Liu, Sheng Bi and Guilin Qi PRIMO: Progressive Induction for Multi-hop Open Rule Generation
636 Yusheng Huang, Ning Hu, Kunping Li, Nan Wang and Zhouhan Lin Extracting Financial Events from Raw Texts via Matrix Chunking
637 Jakub Šmíd, Pavel Přibáň and Ondrej Prazak Czech Dataset for Complex Aspect-Based Sentiment Analysis Tasks
638 Jutta Stock, Volha Petukhova and Dietrich Klakow Annotating Customer-Oriented Behaviour in Call Centre Sales Dialogues
639 Jun Cheng Yang, Zuchao Li, Shuai Xie, Wei Yu, Shijun Li and Bo Du Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning
640 Núria Gala, Brigitte BIGI and Marie Bauer Automatically Estimating Textual and Phonemic Complexity for Cued Speech: How to See the Sounds from French Texts
641 Yosuke Miyanishi and Minh Le Nguyen Causal Intersectionality and Dual Form of Gradient Descent for Multimodal Analysis: a Case Study on Hateful Memes
643 Jieun Han, Haneul Yoo, Junho Myung, Minsun Kim, Tak Yeon Lee, So-Yeon Ahn and Alice Oh RECIPE4U: Student-ChatGPT Interaction Dataset in EFL Writing Education
649 Robert Östling, Katarina Gillholm, Murathan Kurfalı, Marie Mattson and Mats Wirén Evaluation of Really Good Grammatical Error Correction
650 Jonathan Heitz, Gerold Schneider and Nicolas Langer The Influence of Automatic Speech Recognition on Linguistic Features and Automatic Alzheimer's Disease Detection from Spontaneous Speech
654 Christine Pinney, Casey Kennington, Maria Soledad Pera, Katherine Landau Wright and Jerry Alan Fails Incorporating Word-level Phonemic Decoding into Readability Assessment
655 Benjamin Winter, Alexei Gustavo Figueroa Rosero, Alexander Loeser, Felix Alexander Gers, Nancy Katerina Figueroa Rosero and Ralf Krestel DDxGym: Online Transformer Policies in a Knowledge Graph Based Natural Language Environment
656 Dandan Huang, Lu Cao, Zhenting Li and Yue Zhang Which Sense Dominates Multisensory Semantic Understanding? A Brain Decoding Study
659 Alice Millour, Yoann Dupont, Karen Fort and Liam Duignan Unveiling Strengths and Weaknesses of NLP Systems Based on a Rich Evaluation Corpus: the Case of NER in French
660 Gaifan Zhang, Yi Zhou and Danushka Bollegala Evaluating Unsupervised Dimensionality Reduction Methods for Pretrained Sentence Embeddings
663 Harry Walsh, Ben Saunders and Richard Bowden Select and Reorder: A Novel Approach for Neural Sign Language Production
664 Jasper Degraeuwe and Patrick Goethals LexComSpaL2: A Lexical Complexity Corpus for Spanish as a Foreign Language
673 Chanho Park, Mingjie Chen and Thomas Hain Automatic Speech Recognition System-Independent Word Error Estimation
674 Liisi Jakobson, Jelena Kallas and Erko Jakobson Leveraging Domain Corpora for Enhanced Terminology: The Case of Estonian-English Remote Sensing Termbase
675 Väinö Aleksi Yrjänäinen, Fredrik Mohammadi Norén, Robert Borges, Johan Jarlbrink, Lotta Åberg Brorsson, Anders P. Olsson, Pelle Snickars and Måns Magnusson The Swedish Parliament Corpus 1867 – 2022
678 Xiaotong Song, Huiping Lin, Jiatao Zhu and Xinyi Gong CAGK: Collaborative Aspect Graph Enhanced Knowledge-based Recommendation
679 Wei-Yu Kao and An-Zi Yen MAGIC: Multi-Argument Generation with Self-Refinement for Domain Generalization in Automatic Fact-Checking
682 Mohamed Elaraby, Yang Zhong, Diane Litman, Ahmed Ashraf Butt and Muhsin Menekse ReflectSumm: A Benchmark for Course Reflection Summarization
684 Santosh T.Y.S.S, Mahmoud Aly and Matthias Grabmair LexAbSumm: Aspect-based Summarization of Legal Decisions
685 Cameron R. Jones and Sean Trott Multimodal Language Models Show Evidence of Embodied Simulation
687 Santosh T.Y.S.S, Elvin A. Quero Hernandez and Matthias Grabmair Query-driven Relevant Paragraph Extraction from Legal Judgments
688 Jaap Kruijt, Peggy van Minkelen, Lucia Donatelli, Piek T.J.M. Vossen, Elly Konijn and Thomas Baier SPOTTER: A Framework for Investigating Convention Formation in a Visually Grounded Human-Robot Reference Task
689 Albert Sawczyn, Jakub Binkowski, Piotr Bielak and Tomasz Kajdanowicz Empowering Small-Scale Knowledge Graphs: A Strategy of Leveraging General-Purpose Knowledge Graphs for Enriched Embeddings
690 Maria Andreevna Petrova, Alexandra M. Ivoylova and Anastasia Tishchenkova CoBaLD Annotation: the Enrichment of the Enhanced Universal Dependencies with the Semantical Pattern
692 Khyati Mahajan and Samira Shaikh Persona-aware Multi-party Conversation Response Generation
696 Daniel Dakota and Sandra Kübler Bits and Pieces: Investigating the Effects of Subwords in Multi-task Parsing Across Languages and Domains
697 Xindi Wang, Robert E. Mercer and Frank Rudzicz Auxiliary Knowledge-Induced Learning for Automatic Multi-Label Medical Document Classification
698 Youmi Ma, An Wang and Naoaki Okazaki Building a Japanese Document-Level Relation Extraction Dataset Assisted by Cross-Lingual Transfer
705 Zhipeng Xie and Yahe Li Discriminative Language Model as Semantic Consistency Scorer for Prompt-based Few-Shot Text Classification
712 Arianne Reimerink, Melania Cabezas-García, Pilar León-Araúz and Pamela Faber Ideological Knowledge Representation: Framing Climate Change in EcoLexicon
715 Santosh T.Y.S.S, Hassan Sarwat, Ahmed Mohamed Abdelaal Abdou and Matthias Grabmair Mind Your Neighbours: Leveraging Analogous Instances for Rhetorical Role Labeling for Legal Documents
718 Zhihong Sun, Chen Lyu, Bolun Li, Yao Wan, Hongyu Zhang, Ge Li and Zhi Jin Enhancing Code Generation Performance of Smaller Models by Distilling the Reasoning Ability of LLMs
721 Santosh T.Y.S.S, Kristina Kaiser and Matthias Grabmair CuSINeS: Curriculum-driven Structure Induced Negative Sampling for Statutory Article Retrieval
724 Santosh T.Y.S.S, Rashid Haddad and Matthias Grabmair ECtHR-PCR: A Dataset for Precedent Understanding and Prior Case Retrieval in the European Court of Human Rights
725 Khai Le-Duc VietMed: A Dataset and Benchmark for Automatic Speech Recognition of Vietnamese in the Medical Domain
726 Atsushi Keyaki and Ribeka Keyaki Coarse-Tuning for Ad-hoc Document Retrieval Using Pre-trained Language Models
730 Wen Yin, Cencen Liu, YI XU, Ahmad Raza Wahla, Huang Yiting and Dezhang Zheng SynPrompt: Syntax-aware Enhanced Prompt Engineering for Aspect-based Sentiment Analysis
732 Haiyang Zhang, Qiuyi Chen, Yanjie Zou, Jia Wang, Yushan Pan and Mark Stevenson Document Set Expansion with Positive-Unlabeled Learning Using Intractable Density Estimation
733 Quan Wang, Licheng Zhang, Zikang Guo and Zhendong Mao IDEATE: Detecting AI-Generated Text using Internal and External Factual Structures
736 Santosh T.Y.S.S, Nina Baumgartner, Matthias Stürmer, Matthias Grabmair and Joel Niklaus Towards Explainability and Fairness in Swiss Judgement Prediction: Benchmarking on a Multilingual Dataset
740 Xiangci Li, Linfeng Song, Lifeng Jin, Haitao Mi, Jessica Ouyang and Dong Yu A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation
743 Yike Wu, Yang Xiao, Mengting Hu, Mengying Liu, Pengcheng Wang and Mingming Liu Towards Robust Evidence-Aware Fake News Detection via Improving Semantic Perception
751 Wenpeng Lu, Guobiao Zhang, Xueping Peng, Hongjiao Guan and Shoujin Wang Medical Entity Disambiguation with Medical Mention Relation and Fine-grained Entity Knowledge
756 Takuto Asakura and Yusuke Miyao What Is Needed for Intra-document Disambiguation of Math Identifiers?
758 Shenshen Bu, Yujie Song, Taiji Li and Zhiming Dai Dynamic Knowledge Prompt for Chest X-ray Report Generation
760 Md Rashad Al Hasan Rony, Sudipto Kumar Shaha, Rakib Al Hasan Joy, Sumon Kanti Dey, amzad Hossain rafi, Ashraf Hasan Sirajee and Jens Lehmann BanglaQuAD: A Bengali Open-domain Question Answering Dataset
763 Lucie Polakova, Jiří Mírovský, Šárka Zikánová and Eva Hajicova Developing a Rhetorical Structure Theory Treebank for Czech
771 Yang Bai, Anthony Colas, Christan Grant and Zhe Wang M3: A Multi-Task Mixed-Objective Learning Framework for Open-Domain Multi-Hop Dense Sentence Retrieval
772 Yuchen Wei and Milton King Sense of the Day: Short Timeframe Temporal-Aware Word Sense Disambiguation
773 Tyler K. Bikaun, Tim French, Michael Stewart, Wei Liu and Melinda Hodkiewicz MaintIE: A Fine-Grained Annotation Schema and Benchmark for Information Extraction from Maintenance Short Texts
775 Hongru Wang, Boyang XUE, Baohang Zhou, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang and Kam-Fai Wong UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval
779 Hongchun Yu, Wei Pan, Xing Fan and Hanqi Li Multi-Granularity Fusion Text Semantic Matching Based on WoBERT
782 minjun zhu, Yixuan Weng, Shizhu He, Kang Liu, Haifeng Liu, yang jun jun and Jun Zhao Towards Graph-hop Retrieval and Reasoning in Complex Question Answering over Textual Database
785 Yuqi Liu, Guanyi Chen and Kees van Deemter Computational Modelling of Plurality and Definiteness in Chinese Noun Phrases
786 Zirui Zhang, Yiyu Yang and Benhui Chen Prompt Tuning for Few-shot Relation Extraction via Modeling Global and Local Graphs
789 Jinpeng Li, Jiaze Chen, Huadong Chen, Dongyan Zhao and Rui Yan Multilingual Generation in Abstractive Summarization: A Comparative Study
790 Maximos Skandalis, Richard Moot, Christian Retoré and Simon Robillard New Datasets for Automatic Detection of Textual Entailment and of Contradictions between Sentences in French
792 Zhitao He, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun and Jun Zhao Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning
794 Jianhui Pang, Baosong Yang, Derek F. Wong, Dayiheng Liu, Xiangpeng Wei, Jun Xie and Lidia S. Chao MoNMT: Modularly Leveraging Monolingual and Bilingual Knowledge for Neural Machine Translation
800 Taiga Someya, Yushi Sugimoto and Yohei Oseki JCoLA: Japanese Corpus of Linguistic Acceptability
801 Ying Zhang, Xinying Qian, Yu Zhao, Baohang Zhou, Kehui Song and Xiaojie Yuan Bring Invariant To Variant: A Contrastive Prompt-based Framework for Temporal Knowledge Graph Forecasting
802 Alba M. Mármol Romero, Adrián Moreno Muñoz, Flor Miriam Plaza-del-Arco, M. Dolores Molina González, María-Teresa Martín-Valdivia, L. Alfonso Ureña-López and Arturo Montejo-Ráez MentalRiskES: A New Corpus for Early Detection of Mental Disorders in Spanish
803 Francois Meyer, Haiyue Song, Abhisek Chakrabarty, Jan Buys, Raj Dabre and Hideki Tanaka NGLUEni: Benchmarking and Adapting Pretrained Language Models for Nguni Languages
804 haoyu gao, Ting-En Lin, Hangyu Li, Min Yang, Yuchuan Wu, Wentao Ma, Fei Huang and Yongbin Li Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models
805 Feng Zhao, Wan Xianlin, Cheng Yan and Chu Kiong Loo Correcting Language Model Bias for Text Classification in True Zero-Shot Learning
806 Yunxin Li, Baotian Hu, Wenhan Luo, Lin Ma, Yuxin Ding and Min Zhang A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation
807 Francesco Antici, Federico Ruggeri, Andrea Galassi, Katerina Korre, Arianna Muti, Alessandra Bardi, Alice Fedotova and Alberto Barrón-Cedeño A Corpus for Sentence-Level Subjectivity Detection on English News Articles
810 Yunlong Feng, Bohan Li, Libo Qin, Xiao Xu and Wanxiang Che A Two-Stage Framework with Self-Supervised Distillation for Cross-Domain Text Classification
812 Patrizia Paggio, Manex Agirrezabal, Costanza Navarretta and Leo Vitasovic Multimodal behaviour in an online environment: The GEHM Zoom corpus collection
813 Hans Ole Hatzel and Chris Biemann Tell me again! A Large-Scale Dataset of Multiple Summaries for the Same Story
815 Zhendong Liu, Changhong Xia, Wei He and Chongjun Wang Trustworthiness and Self-awareness in Large Language Models: An Exploration through the Think-Solve-Verify Framework
816 Iacopo Ghinassi, Lin Wang, Chris Newell and Matthew Purver When Cohesion Lies in the Embedding Space: Embedding-Based Reference-Free Metrics for Topic Segmentation
817 Maria Francis, Julius Steuer, Dietrich Klakow and Volha Petukhova Who Did You Blame When Your Project Failed? Designing a Corpus for Presupposition Generation in Cross-Examination Dialogues
818 Aleksandr Riaposov and Elena Lazarenko Corpus Services: a Framework to Curate XML Corpus Data
821 Joanna Kruyt, Róbert Sabo, Katarína Polónyiová, Daniela Ostatníková and Štefan Beňuš The Slovak Autistic and Non-Autistic Child Speech Corpus:Task-Oriented Child-Adult Interactions
822 Huimin Chen, Chengyu Wang, Yanhao Wang, Cen CHEN and Yinggui Wang TaiChi: Improving the Robustness of NLP Models by Seeking Common Ground While Reserving Differences
831 Md. Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker and Sheak Rashed Haider Noori Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis
833 Hee-Soo Choi, Priyansh Trivedi, Mathieu Constant, Karen Fort and Bruno Guillaume Beyond Model Performance: Can Link Prediction Enrich French Lexical Graphs?
834 Shadi Manafi and Nikhil Krishnaswamy Cross-Lingual Transfer Robustness to Lower-Resource Languages on Adversarial Datasets
836 Mohammadamin Kanaani Triple-R: Automatic Reasoning for Fact Verification Using Language Models
846 Xiang Wei, Yufeng Chen, Ning Cheng, Xingyu Cui, Jinan Xu and Wenjuan Han CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction
851 Ruiting Li, Peiyan Wang, Libang Wang, Danqingxin Yang and Dongfeng Cai A Corpus and Method for Chinese Named Entity Recognition in Manufacturing
852 Yirong Zeng, Xiao Ding, Yi Zhao, Xiangyu Li, Jie Zhang, Chao Yao, Ting Liu and Bing Qin RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict
853 Yan Lei, Liang Pang, Yuanzhuo Wang, Huawei Shen and Xueqi Cheng Qsnail: A Questionnaire Dataset for Sequential Question Generation
860 Yunqi Zhang, Yubo Chen, jingzhe zhu, Jinyu Xu, shuai yang, zhaoliang wu, liang huang, Yongfeng Huang and Shuai Chen KnowVrDU: A Unified Knowledge-aware Prompt-Tuning Framework for Visually-rich Document Understanding
861 Ramon Ruiz-Dolz, CHR-JR CHIU, Chung-Chi Chen, Noriko Kando and Hsin-Hsi Chen Learning Strategies for Robust Argument Mining: An Analysis of Variations in Language and Domain
863 Jiawei Chen, Hongyu Lin, Xianpei Han, Yaojie Lu, Shanshan Jiang, Bin Dong and Le Sun Few-shot Named Entity Recognition via Superposition Concept Discrimination
864 Robert Forkel, Daniel G. Swanson and Steven Moran Converting legacy data to CLDF: A FAIR exit strategy for linguistic web apps
867 Jung-Ho Kim, Mathew John Huerta-Enochian, Changyong Ko and Du Hui Lee SignBLEU: Automatic Evaluation of Multi-channel Sign Language Translation
871 Junyu Lu, Bo Xu, Xiaokun Zhang, Kaiyuan Liu, Dongyu Zhang, Liang Yang and Hongfei LIN Take its Essence, Discard its Dross! Debiasing for Toxic Language Detection via Counterfactual Causal Effect
872 Lei Li, Yongfeng Zhang, Dugang Liu and Li Chen Large Language Models for Generative Recommendation: A Survey and Visionary Discussions
874 Steven Coats CoANZSE Audio: Creation of an Online Corpus for Linguistic and Phonetic Analysis of Australian and New Zealand Englishes
875 Seonwoo Lee, Jihyun Mun, Sunhee Kim and Minhwa Chung Speech Corpus for Korean Children with Autism Spectrum Disorder: Towards Automatic Assessment Systems
878 Xin Zheng, Qiming Zhu, Hongyu Lin, Yaojie Lu, Xianpei Han and Le Sun Executing Natural Language-Described Algorithms with Large Language Models: An Investigation
881 Yunfei Yin, Congrui Zou, Zheng Yuan and Xianjian Bao MLDSP-MA: Multidimensional Attention for Multi-Round Long Dialogue Sentiment Prediction
884 Jorge Palomar-Giner, Jose Javier Saiz, Ferran Espuña, Mario Mina, Severino Da Dalt, Joan Llop, Malte Ostendorff, Pedro Ortiz Suarez, Georg Rehm, Aitor Gonzalez-Agirre and Marta Villegas A CURATEd CATalog: Rethinking the Extraction of Pretraining Corpora for Mid-Resourced Languages
891 Maxime Arens, Lucile Callebert, Mohand Boughanem and Jose G. Moreno Rebalancing Label Distribution while Eliminating Inherent Waiting Time in Multi Label Active Learning applied to Transformers
893 Zhenxiao Cheng, Jie Zhou, Wen Wu, Qin Chen and Liang He Learning Intrinsic Dimension via Information Bottleneck for Explainable Aspect-based Sentiment Analysis
898 Xiaolong Wang, Yile Wang, Sijie Cheng, Peng Li and Yang Liu DEEM: Dynamic Experienced Expert Modeling for Stance Detection
900 Ang Li, Qiangchao Chen, Yiquan Wu, Xiang Zhou, Kun Kuang, Fei Wu and Ming Cai From Graph to Word Bag: Introducing Domain Knowledge to Confusing Charge Prediction
902 Shi Yu, Chenghao Fan, Chenyan Xiong, David Jin, Zhiyuan Liu and Zhenghao Liu Fusion-in-T5: Unifying Variant Signals for Simple and Effective Document Ranking with Attention Fusion
905 Minzheng Wang, Nan Xu, Jiahao Zhao, Yin Luo and Wenji Mao PromISe: Releasing the Capabilities of LLMs with Prompt Introspective Search
906 Jan Nehring, Aleksandra Gabryszak, Pascal Jürgens, Aljoscha Burchardt, Stefan Schaffer, Matthias Spielkamp and Birgit Stark Large Language Models are Echo Chambers
909 Kedi Chen, Jie Zhou, Qin Chen, Shunyu Liu and Liang He A Regularization-based Transfer Learning Method for Information Extraction via Instructed Graph Decoder
911 Fanheng Kong, Peidong Wang, Shi Feng, Daling Wang and Yifei Zhang TIGER: A Unified Generative Model Framework for Multimodal Dialogue Response Generation
912 Xiaohua Wang, Wenlong Fei, Min Hu, Qingyu Zhang and Aoqiang Zhu MEVTR: A Multilingual Model Enhanced With Visual Text Representations
913 Velizar Shulev and Khalil Sima'an Continual Reinforcement Learning for Controlled Text Generation
915 Yanis Labrak, Adrien Bazoge, Béatrice Daille, Mickael Rouvier and Richard Dufour How Important Is Tokenization in French Medical Masked Language Models?
917 xudong zhu, zhao kang and Bei Hui FCDS: Fusing Constituency and Dependency Syntax into Document-Level Relation Extraction
918 Jiangming Liu Model-Agnostic Cross-Lingual Training for Discourse Representation Structure Parsing
924 Koji Inoue, Bing'er Jiang, Erik Ekstedt, Tatsuya Kawahara and Gabriel Skantze Multilingual Turn-taking Prediction Using Voice Activity Projection
928 Shafiuddin Rehan Ahmed, George Arthur Baker, Evi Judge, Michael Reagan, Kristin Wright-Bettner, Martha Palmer and James H. Martin Linear Cross-document Event Coreference Resolution with X-AMR
929 Zechen Sun, Yisheng Xiao, Juntao Li, Yixin Ji, Wenliang Chen and Min Zhang Exploring and Mitigating Shortcut Learning for Generative Large Language Models
931 Pedro Fernandes, Sérgio Nunes and Luís Santos A Community-Driven Data-to-Text Platform for Football Match Summaries
932 Naoya Ueda, Masato Mita, Teruaki Oka and Mamoru Komachi Token-length Bias in Minimal-pair Paradigm Datasets
933 Georg Rehm, Stelios Piperidis, Khalid Choukri, Andrejs Vasiļjevs, Katrin Marheinecke, Victoria Arranz, Aivars Bērziņš, Miltos Deligiannis, Dimitris Galanis, Maria Giagkou, Katerina Gkirtzou, Dimitris Gkoumas, Annika Grützner-Zahn, Athanasia Kolovou, Penny Labropoulou, Andis Lagzdiņš, Elena Leitner, Valérie Mapelli, Hélène Mazo, Simon Ostermann, Stefania Racioppa, Mickaël Rigault and Leon Voukoutis Common European Language Data Space
935 Di Wu, Wasi U. Ahmad and Kai-Wei Chang On Leveraging Encoder-only Pre-trained Language Models for Effective Keyphrase Generation
941 xiaowei Zhao, Yong Zhou and xiujuan xu Dual Encoder: Exploiting the Potential of Syntactic and Semantic for Aspect Sentiment Triplet Extraction
942 Katherine Atwell, Mert Inan, Anthony B. Sicilia and Malihe Alikhani Combining Discourse Coherence with Large Language Models for More Inclusive, Equitable, and Robust Task-Oriented Dialogue
943 Tom S Juzek The Syntactic Acceptability Dataset (Preview): A Resource for Machine Learning and Linguistic Analysis of English
946 Carlos Daniel Hernandez Mena, Þorsteinn Daði Gunnarsson and Jon Gudnason Samrómur Milljón: An ASR Corpus of One Million Verified Read Prompts in Icelandic
947 Siyin Wang, Jie Zhou, Qin Chen, Qi Zhang, Tao Gui and Xuanjing Huang Domain Generalization via Causal Adjustment for Cross-Domain Sentiment Analysis
949 Wei Li, Shutan Huang and Yanqiu Shao An Unsupervised Framework for Adaptive Context-aware Simplified-Traditional Chinese Character Conversion
950 wenjie xu, yidan Chen and jianquan Ouyang A Streamlined Span-based Factorization Method for Few Shot Named Entity Recognition
953 Jing Jin and Houfeng Wang Select High-quality Synthetic QA Pairs to Augment Training Data in MRC Under the Reward Guidance of Generative Language Models
954 Yiding Liu, Jingjing Wang, Jiamin Luo, Tao Zeng and Guodong Zhou ChatASU: Evoking LLM's Reflexion to Truly Understand Aspect Sentiment in Dialogues
957 Jianhao Yan, Jin Xu, Fandong Meng, Jie Zhou and Yue Zhang DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
959 Nguyen Quang Chieu, Quang-Minh Tran and Khac-Hoai Nam Bui SynTOD: Augmented Response Synthesis for Robust End-to-End Task-Oriented Dialogue System
960 Md Nayem Uddin, Enfa Rose George, Eduardo Blanco and Steven R. Corman Asking and Answering Questions to Extract Event-Argument Structures
961 Xiangyu Lei, Junhui Li, shimin tao and Hao Yang Evaluation Dataset for Lexical Translation Consistency in Chinese-to-English Document-level Translation
962 Wenfeng Feng, Chuzhan Hao, Yuewei Zhang, Yu Han and Hao Wang Mixture-of-LoRAs: An Efficient Multitask Tuning Method for Large Language Models
963 Paramita Mirza, Viju Sudhi, Soumya Ranjan Sahoo and Sinchana Ramakanth Bhat ILLUMINER: Instruction-tuned Large Language Models as Few-shot Intent Classifier and Slot Filler
965 Kian Ahrabian, Alon Benhaim, Barun Patra, Jay Pujara, Saksham Singhal and Xia Song On The Adaptation of Unlimiformer for Decoder-Only Transformers
971 Xinshuo Hu, Dongfang Li, Xiaoguang Li, Yuxiang Wu, Lifeng Shang and Baotian Hu Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer
974 Yoshihiko Hayashi Reassessing Semantic Knowledge Encoded in Large Language Models through the Word-in-Context Task
977 Erxin Yu, Jing Li and Chunpu Xu PopALM: Popularity-Aligned Language Models for Social Media Trendy Response Prediction
978 Shiwen Ni, Min Yang, Ruifeng Xu, Chengming Li and Xiping Xiping Hu Layer-wise Regularized Dropout for Neural Language Models
979 Kyohoon Jin, Junho Lee, Juhwan Choi, Sangmin Song and Youngbin Kim Enhancing Effectiveness and Robustness in a Low-Resource Regime via Decision-Boundary-aware Data Augmentation
983 Eunsu Kim, Juyoung Suk, Philhoon Oh, Haneul Yoo, James Thorne and Alice Oh CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
985 Yuan Gao, Yiheng Zhu, Yuanbin Cao, Yinzhi Zhou, Zhen Wu, Yujie Chen, Shenglan Wu, Haoyuan Hu and Xinyu Dai Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open Domain Multi-Hop Question Answering
986 Xuemei Tang, Qi Su, Jun Wang and Zekun Deng CHisIEC: An Information Extraction Corpus for Ancient Chinese History
988 Dejan Stosic, Saša Marjanović, Delphine Bernhard, Myriam Bras, Laurent Kevers, Stella Retali-Medori, Marianne Vergez-Couret and Carole Werner The ParCoLab Parallel Corpus and its Extension to Four Regional Languages of France
989 Silin Li, Ruoyu Song, Tianwei Lan, Zeming Liu and Yuhang Guo TED-EL: A Corpus for Speech Entity Linking
990 Cherifa Ben Khelil, Jean-Yves Antoine, Anaïs Halftermeyer, Frédéric Rayar, Lisa Hoiry, Mathieu Thebaud and Mathieu Raynal Adapting AAC for Young Users: A Preliminary Study on the Influence of Age and Language Register on Word Prediction
992 Mengkang Hu, Haoyu Dong, Ping Luo, Shi Han and Dongmei Zhang KET-QA: A Dataset for Knowledge Enhanced Table Question Answering
994 Shengkun Ma, Jiale Han, Yi Liang and Bo Cheng Making Pre-trained Language Models Better Continual Few-Shot Relation Extractors
996 Aitor Gonzalez-Agirre, Montserrat Marimon, Carlos Rodriguez-Penagos, Javier Aula-Blasco, Irene Baucells, Carme Armentano-Oller, Jorge Palomar-Giner, Baybars Kulebi and Marta Villegas Building a Data Infrastructure for a Mid-Resource Language: The Case of Catalan
999 Fynn Petersen-Frey and Chris Biemann Dataset of Quotation Attribution in German News Articles
1000 Bashar Alhafni, Reem Hazim, Juan David Pineros Liberato, Muhamed Al Khalil and Nizar Habash The SAMER Arabic Text Simplification Corpus
1005 Zhihong Zhu, Yunyan Zhang, Xuxin Cheng, Zhiqi Huang, Derong Xu, Xian Wu and Yefeng Zheng Alignment before Awareness: Towards Visual Question Localized-Answering in Robotic Surgery via Optimal Transport and Answer Semantics
1006 Xue Gu, Zhihan Zhou, Ziyao Meng, Jian Li, Tiago Gomes, Adriano Tavares and Hao Xu EmoPrompt-ECPE: Emotion knowledge-aware Prompt-tuning for Emotion-Cause Pair Extraction
1007 Qiao Wang and Zheng Yuan Assessing the Efficacy of Grammar Error Correction: A Human Evaluation Approach in the Japanese Context
1008 shuai yang, Yu Hong, Shiming He, Qingting Xu and Jianmin Yao Word-level Commonsense Knowledge Selection for Event Detection
1010 Adal Abilbekov, Saida Mussakhojayeva, Rustem Yeshpanov and Huseyin Atakan Varol KazEmoTTS: A Dataset for Kazakh Emotional Text-to-Speech Synthesis
1011 Xingwu Sun, Zhen Yang, Ruobing Xie, Fengzong Lian, Zhanhui Kang and Chengzhong Xu LightVLP: A Lightweight Vision-Language Pre-training via Gated Interactive Masked AutoEncoders
1012 Zishuo Zhao, Ziyang Ma, Zhenzhou Lin, Jingyou Xie, Yinghui Li and Ying Shen Source-free Domain Adaptation for Aspect-based Sentiment Analysis
1014 Carme Armentano-Oller, Montserrat Marimon and Marta Villegas Becoming a High-Resource Language in Speech: The Catalan Case in the Common Voice Corpus
1017 Joykirat Singh, Sehban Fazili, Rohan Jain and Md. Shad Akhtar EROS:Entity-Driven Controlled Policy Document Summarization
1022 Nuria Bel, Marta Punsola and Valle Ruíz-Fernández EsCoLA: Spanish Corpus of Linguistic Acceptability
1023 Jiayi Wu, Renyu Zhu, Nuo Chen, Qiushi Sun, Xiang Li and Ming Gao Structure-aware Fine-tuning for Code Pre-trained Models
1026 Edwin Thomas and Sowmya Vajjala Keyphrase Generation: Lessons from a Reproducibility Study
1028 Sungjoo Byun, Jiseung Hong, Sumin Park, Dongjun Jang, Jean Seo, Minseok Kim, Chaeyoung OH and Hyopil Shin Korean Bio-Medical Corpus (KBMC) for Medical Named Entity Recognition
1030 Imen Laouirine, Rami Kammoun and Fethi Bougares TunArTTS: Tunisian Arabic Text-To-Speech Corpus
1031 Barbara Scalvini and Iben Nyholm Debess Evaluating the potential of language-family-specific generative models for low-resource data augmentation: a Faroese case study
1033 Yunlong Feng, Yang Xu, Libo Qin, Yasheng Wang and Wanxiang Che Improving Language Model Reasoning with Self-motivated Learning
1034 Guisheng Liu, Yi Li, Zhengcong Fei, Haiyan Fu, Xiangyang Luo and Yanqing Guo Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning
1035 Linzhi Wu, Xingyu Zhang, Yakun Zhang, Changyan Zheng, Tiejun Liu, Liang Xie, Ye Yan and Erwei Yin Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization
1036 Carinne Cherf and Yuval Pinter BiVert: Bidirectional Vocabulary Evaluation using Relations for Machine Translation
1038 Dimitra Anastasiou, Carole Blond-Hanten and Marie Gallais A Luxembourgish corpus as a Gender Bias Evaluation Testset
1041 Brigitte Krenn, Johann Petrak, Marina Kubina and Christian Burger GERMS-AT: A Sexism/Misogyny Dataset of Forum Comments from an Austrian Online Newspaper
1045 Damien Sileo, Kanimozhi Uma and Marie-Francine Moens Generating multiple-choice questions for medical question answering with distractors and cue-masking
1046 Jiashuo Sun, Hang Zhang, Chen Lin, Xiangdong Su, Yeyun Gong and Jian Guo APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning
1051 Mina Schütz, Daniela Pisoiu, Daria Liakhovets, Alexander Schindler and Melanie Siegel GerDISDETECT: A German Multilabel Dataset for Disinformation Detection
1053 Alessandra Teresa Cignarella, Manuela Sanguinetti, Simona Frenda, Andrea Marra, Cristina Bosco and Valerio Basile QUEEREOTYPES: A Multi-Source Italian Corpus of Stereotypes towards LGBTQIA+ Community Members
1055 Armand Stricker and Patrick Paroubek Chitchat as Interference: Adding User Backstories to Task-Oriented Dialogues
1059 Manuel V. Loureiro, Steven Derby and Tri Kurniawan Wijaya Topics as Entity Clusters: Entity-based Topics from Large Language Models and Graph Neural Networks
1060 Siyao Peng, Zihang Sun, Huangyan Shan, Marie Kolm, Verena Blaschke, Ekaterina Artemova and Barbara Plank Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data
1062 Marcel Gohsen, Matthias Hagen, Martin Potthast and Benno Stein Task-Oriented Paraphrase Analytics
1063 Kei Sawada, Tianyu Zhao, Makoto Shing, Kentaro Mitsui, Akio Kaga, Yukiya Hono, Toshiaki Wakatsuki and Koh Mitsuda Release of Pre-Trained Models for the Japanese Language
1065 Julia Rozanova, Marco Valentino and André Freitas Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models
1068 Khadige Abboud and Gokmen Oz Towards Equitable Natural Language Understanding Systems for Dialectal Cohorts: Debiasing Training Data
1071 Shiva Taslimipoor, Luca Benedetto, Mariano Felice and Paula Buttery Distractor Generation Using Generative and Discriminative Capabilities of Transformer-based Models
1075 Mali Jin, Daniel Preotiuc-Pietro, A. Seza Doğruöz and Nikolaos Aletras Who is bragging more online? A large scale analysis of bragging in social media
1076 Chia-Wen Lu, Ching-Wen Yang and Wei-Yun Ma Automatic Construction of a Chinese Review Dataset for Aspect Sentiment Triplet Extraction via Iterative Weak Supervision
1078 Jianyou Wang, Kaicheng Wang, Xiaoyue Wang, Weili Cao, Ramamohan Paturi and Leon Bergen IR2: Information Regularization for Information Retrieval
1079 Zirui He, Huiqi Deng, Haiyan Zhao, Ninghao Liu and Mengnan Du Mitigating Shortcuts in Language Models with Soft Label Encoding
1080 Rob van der Goot, Zoey Liu and Max Müller-Eberstein Enough is Enough! A Case Study on the Effect of Data Size for Evaluation Using Universal Dependencies
1082 Abhijnan Nath, Huma Jamil, Shafiuddin Rehan Ahmed, George Arthur Baker, Rahul Ghosh, James H. Martin, Nathaniel Blanchard and Nikhil Krishnaswamy Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles
1085 Amilleah Rodriguez, Shaonan Wang and Liina Pylkkänen Do Neural Language Models Inferentially Compose Concepts the Way Humans Can?
1090 Biswesh Mohapatra, Seemab Hassan, Laurent Romary and Justine Cassell Conversational Grounding: Annotation and Analysis of Grounding Acts and Grounding Units
1091 Faizad Ullah, Ali Faheem, Ubaid Azam, Muhammad Sohaib Ayub, Faisal Kamiran and Asim Karim Detecting Cybercrimes in Accordance with Pakistani Law: Dataset and Evaluation using PLMs
1092 Do June Min, Veronica Perez-Rosas, ken resnicow and Rada Mihalcea Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation
1093 Ivana Filipović Petrović, Miguel López Otal and Slobodan Beliga Croatian Idioms Integration: Enhancing the LIdioms Multilingual Linked Idioms Dataset
1097 Christopher D. Sapp, Elliott Evans, Rex Sprouse and Daniel Dakota Introducing a Parsed Corpus of Historical High German
1101 Sylvain Coulange, Marie-Hélène Fries, Monica Masperi and Solange Rossato A corpus of spontaneous L2 English speech for real-situation speaking assessment
1103 ChangSu Choi, Yongbin Jeong, Seoyoon Park, InHo Won, HyeonSeok Lim, SangMin Kim, Yejee Kang, Chanhyuk Yoon, Jaewan Park, Yiseul Lee, HyeJin Lee, Younggyun Hahm, Hansaem Kim and KyungTae Lim Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean
1104 Yejin Jeon, Yunsu Kim and Gary Geunbae Lee Leveraging the Interplay Between Syntactic and Acoustic Cues for Optimizing Korean TTS Pause Formation
1105 Charles Lam, Chaak-ming Lau and Jackson L. Lee Multi-Tiered Cantonese Word Segmentation
1107 Deepak Gupta, Kush Attal and Dina Demner-Fushman Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches
1109 Viet Dac Lai, Duy Ngoc Pham, Jonathan Steinberg, Jamie Mikeska and Thien Huu Nguyen CAMAL: A Novel Dataset for Multi-label Conversational Argument Move Analysis
1110 Ileana Rugina, Rumen Dangovski, Li Jing, Preslav Nakov and Marin Soljacic Data-Informed Global Sparseness in Attention Mechanisms for Deep Neural Networks
1111 Shengjie Ji and Fang Kong A Novel Three-stage Framework for Few-shot Named Entity Recognition
1113 Chihiro Yano, Akihiko Fukuchi, Shoko Fukasawa, Hideyuki Tachibana and Yotaro Watanabe Multilingual Sentence-T5: Scalable Sentence Encoders for Multilingual Applications
1117 Ben Foley, Peter Sefton, Simon Musgrave and Moises Sacal Bonequi Access control framework for language collections
1119 Pengwei Zhan, Jing Yang, He Wang, Chao Zheng and Liming Wang Rethinking Word-level Adversarial Attack: The Trade-off Between Efficiency, Effectiveness, and Imperceptibility
1120 Kartik Kartik, Sanjana Soni, Anoop Kunchukuttan, Tanmoy Chakraborty and Md. Shad Akhtar Synthetic Data Generation and Joint Learning for Robust Code-Mixed Translation
1123 Liqiang Niu, Fandong Meng and Jie Zhou UMTIT: Unifying Recognition, Translation, and Generation for Multimodal Text Image Translation
1125 Chandrai Kayal, Sayantan Chattopadhyay, Aryan Gupta, Satyen Abrol and Archie Gugol JLBert: Japanese Light BERT for Cross-Domain Short Text Classification
1129 Bo Xu, Yifei Wu, Shouang Wei, Ming Du and Hongya Wang Adaptive Reinforcement Tuning Language Models as Hard Data Generators for Sentence Representation
1132 Leonardo Campillos-Llanos, Ana Rosa Terroba, Rocío Bartolomé, Ana Valverde-Mateos, Cristina González, Adrián Capllonch-Carrión and Jonathan Heras Replace, Paraphrase or Fine-tune? Evaluating Automatic Simplification for Medical Texts in Spanish
1133 Bo-Han Lu, Yi-Hsuan Lin, Annie Lee and Richard Tzong-Han Tsai Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems
1138 Jin Cui, Fumiyo Fukumoto, Xinfeng Wang, Yoshimi Suzuki, Jiyi Li, Noriko Tomuro and Wanzeng Kong Enhanced Coherence-Aware Network with Hierarchical Disentanglement for Aspect-Category Sentiment Analysis
1141 Pavlína Synková, Jiří Mírovský, Lucie Poláková and Magdaléna Rysová Announcing the Prague Discourse Treebank 3.0
1147 Kazumasa Omura, Fei Cheng and Sadao Kurohashi An Empirical Study of Synthetic Data Generation for Implicit Discourse Relation Recognition
1153 Junjia Feng, Mingqian Lin, Lin Shang and Xiaoying Gao Autonomous Aspect-Image Instruction A2II: Q-Former Guided Multimodal Sentiment Classification
1160 Hwichan Kim, Shota Sasaki, Sho Hoshino and Ukyo Honda A Single Linear Layer Yields Task-Adapted Low-Rank Matrices
1161 Jian-Tao Huang, Chung-Chi Chen, Hen-Hsen Huang and Hsin-Hsi Chen NumHG: A Dataset for Number-Focused Headline Generation
1163 Robert Forkel, Johann-Mattis List, Christoph Rzymski and Guillaume Segerer Linguistic Survey of India and Polyglotta Africana: Two Retrostandardized Digital Editions of Large Historical Collections of Multilingual Wordlists
1166 Quentin Brabant, Lina M. Rojas Barahona, Gwénolé Lecorvé and Claire Gardent KGConv, a Conversational Corpus grounded in Wikidata
1168 Zhipeng Liu, Xiaoming Zhang, Litian Zhang and Zelong Yu MDS: A Fine-Grained Dataset for Multi-Modal Dialogue Summarization
1169 Shaoxiong Ji, Timothee Mickus, Vincent Segonne and Jörg Tiedemann Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning?
1176 Leon Ackermann and Xenia Isabel Ohmer On the Relationship between Skill Neurons and Robustness in Prompt Tuning
1177 Ming Zhang, Ke Chang and Yunfang Wu Multi-modal Semantic Understanding with Contrastive Cross-modal Feature Alignment
1178 Zdravko Dugonjić, Adrien Pupier, Benjamin Lecouteux and Maximin Coavoux What has LeBenchmark learnt about French Syntax?
1179 Chung-Chi Chen and Hiroya Takamura Term-Driven Forward-Looking Claim Synthesis in Earnings Calls
1181 Nurul Fajrin Ariyani, Zied Bouraoui, Richard Booth and Steven Schockaert Can Language Models Learn Embeddings of Propositional Logic Assertions?
1183 Pingjie Wang, Hongcheng Liu, Yanfeng Wang and Yu Wang Pruning before Fine-tuning: A Retraining-free Compression Framework for Pre-trained Language Models
1184 Laura Mascarell, Ribin Chalumattu and Annette Rios German also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset
1185 Michal Mochtak, Peter Rupnik and Nikola Ljubešić The ParlaSent Multilingual Training Dataset for Sentiment Identification in Parliamentary Proceedings
1187 Giulia D'Agostino, Chris A. Reed and Daniele Puccinelli Segmentation of Complex Question Turns for Argument Mining: A Corpus-based Study in the Financial Domain
1188 Nikola Ljubešić and Taja Kuzman CLASSLA-web: Comparable Web Corpora of South Slavic Languages Enriched with Linguistic and Genre Annotation
1190 María Estrella Vallecillo Rodríguez, María Victoria Cantero Romero, Isabel Cabrera De Castro, Arturo Montejo Ráez and María Teresa Martín Valdivia CONAN-MT-SP: A Spanish Corpus for Counternarrative using GPT Models
1195 Zi Xiong, Lizhi Qing, Yangyang Kang, Jiawei Liu, Hongsong Li, Changlong Sun, Xiaozhong Liu and Wei Lu Enhance Robustness of Language Models Against Variation Attack through Graph Integration
1197 Guangming Huang, Yunfei Long, Cunjin Luo, Jiaxing Shen and Xia Sun Prompting Explicit and Implicit Knowledge for Multi-hop Question Answering Based on Human Reading Process
1199 Chuanqi Dong, Wenjie Zhou, Xiangyu Duan, Yuqi Zhang and Min Zhang Multimodal Cross-lingual Phrase Retrieval
1200 Loic De Langhe, Orphee De Clercq and Veronique Hoste Enhancing Unrestricted Cross-Document Event Coreference with Graph Reconstruction Networks
1202 Michele Pulini and Johann-Mattis List First Steps Towards the Integration of Resources on Historical Glossing Traditions in the History of Chinese: A Collection of Standardized Fǎnqiè Spellings from the Guǎngyùn
1203 Marcio Lima Inacio, Gabriela Wick-Pedro, Renata Ramisch, Luís Espírito Santo, Xiomara S. Q. Chacon, Roney Santos, Rogério Sousa, Rafael Anchiêta and Hugo Goncalo Oliveira Puntuguese: A Corpus of Puns in Portuguese with Micro-edits
1205 Guochao Jiang, Ziqin Luo, Yuchen Shi, Dixuan Wang, Jiaqing Liang and Deqing Yang ToNER: Type-oriented Named Entity Recognition with Generative Language Model
1207 Giulia Rambelli and marianna bolognesi The Contextual Variability of English Nouns: The Impact of Categorical Specificity beyond Conceptual Concreteness
1208 Jaya Caporusso, Damar Hoogland, Mojca Brglez, Boshko Koloski, Matthew Purver and Senja Pollak A Computational Analysis of the Dehumanisation of Migrants from Syria and Ukraine in Slovene News Media
1209 Tim Czerniak and Elaine Uí Dhonnchadha Towards Semantic Tagging for Irish
1210 Cécile Macaire, Chloé Dion, Jordan Arrigo, Claire Lemaire, Emmanuelle Esperanca-Rodier, Benjamin Lecouteux and Didier Schwab A Multimodal French Corpus of Aligned Speech, Text, and Pictogram Sequences for Speech-to-Pictogram Machine Translation
1211 Adnan Al Ali and Jindřich Libovický How Gender Interacts with Political Values: A Case Study on Czech BERT Models
1215 Tomáš Horych, Martin Paul Wessel, Jan Philip Wahle, Terry Ruas, Jerome Waßmuth, André Greiner-Petter, Akiko Aizawa, Bela Gipp and Timo Spinde MAGPIE: Multi-Task Analysis of Media-Bias Generalization with Pre-Trained Identification of Expressions
1218 Tim Fischer, Florian Schneider, Fynn Petersen-Frey, Anja Silvia Mollah Haque, Isabel Eiser, Gertraud Koch and Chris Biemann Extending the Discourse Analysis Tool Suite with Whiteboards for Visual Qualitative Analysis
1219 Marie Bexte, Andrea Horbach and Torsten Zesch EVil-Probe - A Composite Benchmark for Extensive Visio-Linguistic Probing
1220 Colin Swaelens, Ilse De Vos and Els Lefever Lemmatisation of Medieval Greek: Against the Limits of Transformer's Capabilities?
1221 Vincent Segonne, Aidan Mannion, Laura Cristina Alonzo Canul, Alexandre Daniel AUDIBERT, Xingyu Liu, Cécile Macaire, Adrien Pupier, Yongxin Zhou, Mathilde Aguiar, Felix E. Herron, Magali Norré, Massih R Amini, Pierrette Bouillon, Iris Eshkol-Taravella, Emmanuelle Esperança-Rodier, Thomas François, Lorraine Goeuriot, Jérôme Goulian, Mathieu Lafourcade, Benjamin Lecouteux, François Portet, Fabien Ringeval, Vincent Vandeghinste, Maximin Coavoux, Marco Dinarelli and Didier Schwab Jargon: A Suite of Language Models and Evaluation Tasks for French Specialized Domains
1222 Shan Zhang, Bin Cao and Jing Fan KCL: Few-shot Named Entity Recognition with Knowledge Graph and Contrastive Learning
1223 Alfarabi Imashev, Nurziya Oralbayeva, Gulmira Baizhanova and Anara Sandygulova Comparative Analysis of Sign Language Interpreting Agents Perception: A Study of the Deaf
1228 Jiang Li, Xiangdong Su, Fujun Zhang and Guanglai Gao TransERR: Translation-based Knowledge Graph Embedding via Efficient Relation Rotation
1230 Nils-Jonathan Schaller, Andrea Horbach, Lars Ingver Höft, Yuning Ding, Jan Luca Bahr, Jennifer Meyer and Thorben Jansen DARIUS: A Comprehensive Learner Corpus for Argument Mining in German-Language Essays
1233 Mingyang Cai, Zhen Yang and Ping Jian Improving Implicit Discourse Relation Recognition with Semantics Confrontation
1234 Xiaocheng Zhang, Chang Wang, Guoping Zhao and Xiaohong Su LI4: Label-Infused Iterative Information Interacting based Fact Verification in Question-answering Dialogue
1236 Guowei Ge, Kuangrong Hao and Lingguang Hao IDC: Boost Text-to-image Retrieval via Indirect and Direct Connections
1237 Damien Sileo tasksource: A Large Collection of NLP tasks with a Structured Dataset Preprocessing Framework
1242 Richard Johansson What Happens to a Dataset Transformed by a Projection-based Concept Removal Method?
1243 Ke Liang, Chu-Ren Huang and Xin-Lan Jiang From Text to Historical Ecological Knowledge: The Construction and Application of the Shan Jing Knowledge Base
1246 Yang Yiyuan, Guodong Long, Michael Blumenstein, Xiubo Geng, Chongyang Tao, Tao Shen and Daxin Jiang Pre-training Cross-Modal Retrieval by Expansive Lexicon-Patch Alignment
1248 Asma Farajidizaji, Vatsal Raina and Mark Gales Is it Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models
1250 Lianxi Wang, Yujia Tian and Zhuowei Chen Enhancing Hindi Feature Representation Through Fusion of Dual-Script Word Embeddings
1251 Alexander Prochnow, Johannes E. Bendler, Caroline Lange, Foivos Ioannis Tzavellos, Bas Marco Göritzer, Marijn ten Thij and Riza Batista-Navarro IDEM: The IDioms with EMotions Dataset for Emotion Recognition
1252 John Pavlopoulos, Ryan Sandell, Maria Konstantinidou and Chiara Bozzone HoLM: Analyzing the Linguistic Unexpectedness in Homeric Poetry
1254 Yongqi Li, Mayi Xu, Xin Miao, Shen Zhou and Tieyun Qian Prompting Large Language Models for Counterfactual Generation: An Empirical Study
1255 Ginevra Martinelli, Paola Impicciché, Elisabetta Fersini, Francesco Mambrini and Marco Passarotti Exploring Neural Topic Modeling on a Classical Latin Corpus
1256 Alexander Yom Din, Taelin Karidi, Leshem Choshen and Mor Geva Jump to Conclusions: Short-Cutting Transformers with Linear Transformations
1258 Jonathan Kamp, Lisa Beinborn and Antske Fokkens The Role of Syntactic Span Preferences in Post-Hoc Explanation Disagreement
1260 Zhenfei Yang, Beiming Yu, Yuan Cui, Shi Feng, Daling Wang and Yifei Zhang BERT-BC: A Unified Alignment and Interaction Model over Hierarchical BERT for Response Selection
1261 Yongxin Zhou, Fabien Ringeval and François Portet PSentScore: Evaluating Sentiment Polarity in Dialogue Summarization
1263 Jing Han Sun and Ali Emami EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries
1264 Yuanyuan Xu, Linhai Zhang and Deyu Zhou TECA: A Two-stage Approach with Controllable Attention Soft Prompt for Few-shot Nested Named Entity Recognition
1265 Kangchen Zhu, Zhiliang Tian, Jingyu Wei, Ruifeng Luo, YIPING SONG and Xiaoguang Mao StyleFlow: Disentangle Latent Representations via Normalizing Flow for Unsupervised Text Style Transfer
1266 Wenbo Qiao, Peng Zhang and ZengLai Ma A Quantum-Inspired Matching Network with Linguistic Theories for Metaphor Detection
1267 Da Luo, Run Lin, Qiao Liu, Yuxiang Cai, Xueyi Liu, Yanglei Gan and Rui Hou Synergetic Interaction Network with Cross-task Attention for Joint Relational Triple Extraction
1268 Honggang Zhao, Chunling Xiao, Jiayi Yang, Guozhu Jin and Mingyong Li MccSTN: Multi-Scale Contrast and Fine-Grained Feature Fusion Networks for Subject-driven Style Transfer
1270 Larry Heck, Simon Heck and Anirudh S. Sundar mForms : Multimodal Form Filling with Question Answering
1272 Giulia Pensa, Begoña Altuna and Itziar Gonzalez-Dios A Multi-layered Approach to Physical Commonsense Understanding: Creation and Evaluation of an Italian Dataset
1277 Xiaojing Du, hanjie Zhao, danyan Xing, Yuxiang Jia and Hongying Zan MRC-based Nested Medical NER with Co-prediction and Adaptive Pre-training
1278 Zepeng Ding, Wenhao Huang, Jiaqing Liang, Yanghua Xiao and Deqing Yang Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction
1280 Seiji Shimizu, Lis Pereira, Shuntaro Yada and Eiji ARAMAKI QA-based Event Start-Points Ordering for Clinical Temporal Relation Annotation
1281 Angelo Basile, Marc Franco-Salvador and Paolo Rosso PyRater: A Python Toolkit for Annotation Analysis
1283 Khang Ly, Yury Kashnitsky, Savvas Chamezopoulos and Valeria Krzhizhanovskaya Article Classification with Graph Neural Networks and Multigraphs
1284 Gabriele Sarti and Malvina Nissim IT5: Text-to-text Pretraining for Italian Language Understanding and Generation
1286 Bohao Yang, Chen Tang, Kun Zhao, Chenghao Xiao and Chenghua Lin Effective Distillation of Table-based Reasoning Ability from LLMs
1295 Jinliang Lu and Jiajun Zhang Improving Unsupervised Neural Machine Translation via Training Data Self-Correction
1297 Manu Narayanan and Noëmi Aepli A Tulu Resource for Machine Translation
1298 Noof Abdullah Alfear, Dimitar Kazakov and Hend Al-Khalifa Meta-Evaluation of Sentence Simplification Metrics
1299 Bo LIU, Li-Ming Zhan, Zexin Lu, Yujie Feng, Lei Xue and Xiao-Ming Wu How Good Are LLMs at Out-of-Distribution Detection?
1300 Dhaivat J. Bhatt, Seyed Ahmad Abdollahpouri Hosseini, Federico Fancellu and Afsaneh Fazly End-to-end Parsing of Procedural Text into Flow Graphs
1301 Yangruibo Ding, Zijian Wang, Wasi U. Ahmad, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth and Bing Xiang CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context
1303 Niyati Bafna, Cristina España-Bonet, Josef van Genabith, Benoît Sagot and Rachel Bawden When your Cousin has the Right Connections: Unsupervised Bilingual Lexicon Induction for Related Data-Imbalanced Languages
1304 Claudiu Daniel Hromei, Daniele Margiotta, Danilo Croce and Roberto Basili MM-IGLU: Multi-Modal Interactive Grounded Language Understanding
1307 Derong Xu, Ziheng Zhang, Zhenxi Lin, Xian Wu, Zhihong Zhu, Tong Xu, Xiangyu Zhao, Yefeng Zheng and Enhong Chen Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models
1308 Milad Alshomary, Felix Lange, Meisam Booshehri, Meghdut Sengupta, Philipp Cimiano and Henning Wachsmuth Modeling the Quality of Dialogical Explanations
1309 Elena Shushkevich, Long Thanh Mai, Manuel V. Loureiro, Steven Derby and Tri Kurniawan Wijaya SPICED: News Similarity Detection Dataset with Multiple Topics and Complexity Levels
1310 Wenxuan Zhang, Min Huang, Zhuoyang Song and Qinghai Miao DimA: A Parameter-efficient Fine-tuning Method with Knowledge transfer based on Transformer
1312 Hongfei Xue, Linyan Xu, Yu Tong, Rui Li, Jiali Lin and Dazhi Jiang Breakthrough from Nuance and Inconsistency: Enhancing Multimodal Sarcasm Detection with Context-Aware Self-Attention Fusion and Word Weight Calculation.
1313 Taiji Li, Zhi Li and Yin Zhang Improving Faithfulness of Large Language Models in Summarization via Sliding Generation and Self-Consistency
1314 Marco Braga, Alessandro Raganato and Gabriella Pasi AdaKron: an Adapter-based Parameter Efficient Model Tuning with Kronecker Product
1316 Yana Nikolova Evaluating Word Expansion for Multilingual Sentiment Analysis of Parliamentary Speech
1317 Rémi Uro, Marie Tahon, Jane Wottawa, David Doukhan, Albert Rilliard and Antoine LAURENT Annotation of Transition-Relevance Places and Interruptions for the Description of Turn-Taking in conversations in French Media Content
1319 Hanyu Zhang, Xiting Wang, Xiang Ao and Qing He Distillation with Explanations from Large Language Models
1321 Adil Soubki and Owen Rambow Intention and Face in Dialog
1322 Francois Meyer and Jan Buys Triples-to-isiXhosa (T2X): Addressing the Challenges of Low-Resource Agglutinative Data-to-Text Generation
1323 Aditya Narayan Sankaran, Vigneshwaran Shankaran, Sampath Lonka and Rajesh Sharma Revisiting The Classics: A Study on Identifying and Rectifying Gender Stereotypes in Rhymes and Poems
1325 Enes Yavuz Ugan, Ngoc-Quan Pham and Alexander Waibel DECM: Evaluating Bilingual ASR Performance on a Code-switching/mixing Benchmark
1328 Gustave Cortal Sequence-to-Sequence Language Models for Character and Emotion Detection in Dream Narratives
1333 Andrea Gregor de Varda and Marco Marelli The Emergence of Semantic Units in Massively Multilingual Models
1337 Stephanie Brandl, Oliver Eberle, Tiago Ribeiro, Anders Søgaard and Nora Hollenstein Evaluating Webcam-based Gaze Data as an Alternative for Human Rationale Annotations
1340 Francesca Grasso, Stefano Locci, Giovanni Siragusa and Luigi Di Caro EcoVerse: An Annotated Twitter Dataset for Eco-Relevance Classification, Environmental Impact Analysis, and Stance Detection
1343 Yujuan Fu, Giridhar Kaushik Ramachandran, Nicholas J. Dobbins, Namu Park, Michael Leu, Abby R. Rosenberg, Kevin Lybarger, Fei Xia, Özlem Uzuner and Meliha Yetisgen Extracting Social Determinants of Health from Pediatric Patient Notes Using Large Language Models: Novel Corpus and Methods
1344 Camila Antonio Barros, Jorge Francisco Ciprián-Sánchez and Saulo Mendes Santos A Tool for Determining Distances and Overlaps between Multimodal Annotations
1346 Van-Thuy Phi, Hiroki Teranishi, Yuji Matsumoto, Hiroyuki Oka and Masashi Ishii PolyNERE: A Novel Ontology and Corpus for Named Entity Recognition and Relation Extraction in Polymer Science Domain
1347 Ryan Soh-Eun Shim, Kalvin Chang and David R. Mortensen Phonotactic Complexity across Dialects
1348 Isar Nejadgholi, Kathleen C. Fraser, Anna Kerkhof and Svetlana Kiritchenko Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes
1351 Ziqiang Liu, Shujie Li, Zefeng Cai, Xiangyu Li, Yunshui Li, Chengming Li, Xiping Hu, Ruifeng Xu and Min Yang TP-Link: Fine-grained Pre-Training for Text-to-SQL Parsing with Linking Information
1356 Ariel Cohen, Alexandrine Lanson, Emmanuelle Kempf and Xavier Tannier Leveraging Information Redundancy of Real-World Data Through Distant Supervision
1357 Belen Alastruey, Aleix Sant, Gerard I. Gállego, David Dale and Marta R. Costa-jussà SpeechAlign: a Framework for Speech Translation Alignment Evaluation
1358 Anna-Katharina Dick, Matthias Drews, Valentin Pickard and Victoria Pierz GIL-GALaD: Gender Inclusive Language - German Auto-Assembled Large Database
1359 Axel Ahlin, Alfred Myrne Blåder and Pierre Nugues Mapping the Past: Geographically Linking an Early 20th Century Swedish Encyclopedia with Wikidata
1360 Masahiro Kaneko and Naoaki Okazaki Controlled Generation with Prompt Insertion for Natural Language Explanations in Grammatical Error Correction
1362 Renzo Arturo Alva Principe, Nicola Chiarini and Marco Viviani An LCF-IDF Document Representation Model Applied to Long Document Classification
1364 Jaemin Kim, Yohan Na, Kangmin Kim, Sang-Rak Lee and Dong-Kyu Chae SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity
1365 Jacob Collard, Valeria de Paiva and Eswaran Subrahmanian Mathematical Entities: Corpora and Benchmarks
1368 Garry Kuwanto, Eno-Abasi E. Urua, Priscilla Amondi Amuok, Shamsuddeen Hassan Muhammad, Anuoluwapo Aremu, Verrah Otiende, Loice Emma Nanyanga, Teresiah W. Nyoike, Aniefon D. Akpan, Nsima Ab Udouboh, Idongesit Udeme Archibong, Idara Effiong Moses, Ifeoluwatayo A. Ige, Benjamin Ajibade, Olumide Benjamin Awokoya, Idris Abdulmumin, Saminu Mohammad Aliyu, Ruqayya Nasir Iro, Ibrahim Said Ahmad, Deontae Smith, Praise-EL Michaels, David Ifeoluwa Adelani, Derry Tanti Wijaya and Anietie Andy Mitigating Translationese in Low-resource Languages: The Storyboard Approach
1370 Lucas Consolin Dezotti, Marco Passarotti and Francesco Mambrini Modelling and Linking an Old Latin-Portuguese Dictionary to the LiLa Knowledge Base
1372 Yanfei Lu, Patrick Littell and Keren Rice Empowering Oneida Language Revitalization: Development of An Oneida Verb Conjugator
1373 Kexin Luo, Yue Mao, Bei Zhang and Sophie Hao Reflecting the Male Gaze: Quantifying Female Objectification in 19th and 20th Century Novels
1374 Niclas Hertzberg and Anna Lokrantz MedQA-SWE - A Clinical Question & Answer Dataset for Swedish
1375 Antoni Brosa-Rodríguez and Sylvain Kahane New Proposal of Greenberg's Universal 14 from Typometrics
1377 Felipe Bravo-Marquez and Maria Jose Zambrano Unpacking Bias: An Empirical Study of Bias Measurement Metrics, Mitigation Algorithms, and their Interactions
1378 Marcos Zampieri, Kai North, Tommi Jauhiainen, Mariano Felice, Neha Kumari, Nishant Nair and Yash Mahesh Bangera Language Variety Identification with True Labels
1379 Christian Hauptmann, Adrian Krenzer, Antonia Krause and Frank Puppe ADEA: An Argumentative Dialogue Dataset on Ethical Issues concerning Future A.I. Applications
1380 Soline Felice, Solene Virginie Evain, Solange Rossato and François Portet Audiocite.net : A Large Spoken Read Dataset in French
1381 Yifeng Xie, Zhihong Zhu, Xuan Lu, Zhiqi Huang and Haoran Xiong InfoEnh: Towards Multimodal Sentiment Analysis via Information Bottleneck Filter and Optimal Transport Alignment
1385 Maarten Janssen UDMorph: Morphosyntactically Tagged UD Corpora
1388 Krenare Pireva Nuci, Paul Landes and Barbara Di Eugenio RoBERTa Low Resource Fine Tuning for Sentiment Analysis in Albanian
1389 Raia Abu Ahmad, Ekaterina Borisova and Georg Rehm FoRC4CL: A Fine-grained Field of Research Classification and Annotated Dataset of NLP Articles
1391 Ange Richard, Laura Cristina Alonzo Canul and François Portet FRACAS: a FRench Annotated Corpus of Attribution relations in newS
1396 Yutong Han, Yan Yuan and Lili Mou A Dual-View Approach to Classifying Radiology Reports by Co-Training
1398 Carla Perez Almendros and Jose Camacho-Collados Do Large Language Models Understand Mansplaining? Well, actually...
1399 Isabelle Lorge, Li Zhang, Xiaowen Dong and Janet Pierrehumbert STEntConv: Predicting Disagreement between Reddit Users with Stance Detection and a Signed Graph Convolutional Network
1401 Dongheng Li, Yongchang Hao and Lili Mou LLMR: Knowledge Distillation with a Large Language Model-Induced Reward
1402 Harry Bunt ISO 24617-12: A New Standard for Semantic Annotation
1403 Ben Cohen, Moreah Zisquit, Stav Yosef, Doron Friedman and Kfir Bar Motivational Interviewing Transcripts Annotated with Global Scores
1405 Margot Madina, Itziar Gonzalez-Dios and Melanie Siegel A Preliminary Study of ChatGPT for Spanish E2R Text Adaptation
1407 Jón Daðason and Hrafn Loftsson Text Filtering Classifiers for Medium-Resource Languages
1408 Myrthe Reuver, Suzan Verberne and Antske Fokkens Investigating the Robustness of Modelling Decisions for Few-Shot Cross-Topic Stance Detection: A Preregistered Study
1410 Ali Mousavi, Xin Zhan, He Bai, Peng Shi, Theodoros Rekatsinas, Benjamin Han, Yunyao Li, Jeffrey Pound, Joshua M. Susskind, Natalie Schluter, Ihab F. Ilyas and Navdeep Jaitly Construction of Paired Knowledge Graph - Text Datasets Informed by Cyclic Evaluation
1413 Henning Wachsmuth, Gabriella Lapesa, Elena Cabrio, Anne Lauscher, Joonsuk Park, Eva Maria Vecchi, Serena Villata and Timon Ziegenbein Argument Quality Assessment in the Age of Instruction-Following Large Language Models
1415 Neema Kotonya and Francesca Toni Towards a Framework for Evaluating Explanations in Automated Fact Verification
1417 Thuat Nguyen, Chien Van Nguyen, Viet Dac Lai, Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Ryan A. Rossi and Thien Huu Nguyen CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
1420 Dalmo Buzato and Evandro Cunha Agent-based Modeling of Language Change in a Small-world Network
1421 Emil Svoboda and Magda Sevcikova PaReNT (Parent Retrieval Neural Tool): A Deep Dive into Word Formation Across Languages
1422 Hieu Man, Chien Van Nguyen, Nghia Trung Ngo, Linh Ngo, Franck Dernoncourt and Thien Huu Nguyen Hierarchical Selection of Important Context for Generative Event Causality Identification with Optimal Transports
1423 Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Dragomir Radev, Yejin Choi and Noah A. Smith A Call for Clarity in Beam Search: How It Works and When It Stops
1426 Martina Katalin Szabó, Veronika Vincze, Bernadett Dam, Csenge Guba, Anita Bagi and István Szendi Predictive and distinctive linguistic features in Schizophrenia-Bipolar Spectrum Disorders
1427 Masato Hagiwara and Joshua B. Tanner Project MOSLA: Recording Every Moment of Second Language Acquisition
1436 Lizzy Brans and Jelke Bloem SimLex-999 for Dutch
1438 Bangze Pan, Yang Li, Suge Wang, Xiaoli Li, Deyu Li, Jian Liao and Jianxing Zheng Document-Level Event Extraction via Information Interaction Based on Event Relation and Argument Correlation
1439 Joe Huamani-Malca, Miguel Rodriguez Mondoñedo, Francisco Cerna-Herrera, Gissella Bejarano, Carlos Vásquez Roque, Cesar Augusto Ramos Cantu and Sabina Oporto Pérez Lessons from Deploying the First Bilingual Peruvian Sign Language - Spanish Online Dictionary
1440 Sonu Gupta, Geetika Gopi, Harish Balaji, Ellen Poplavska, Nora O'Toole, Siddhant Arora, Thomas Norton, Norman Sadeh and Shomir Wilson Creation and Analysis of an International Corpus of Privacy Laws
1441 Yejin Kim, Scott Rome, Kevin Foley, Mayur Nankani, Rimon Melamed, Javier Morales, Abhay K. Yadav, Maria Peifer, Sardar Hamidian and H. Howie Huang Improving Content Recommendation: Knowledge Graph-Based Semantic Contrastive Learning for Diversity and Cold-Start Users
1442 Bin Wang, Fuyong Xu, Peiyu Liu and Zhenfang Zhu HyperMR: Hyperbolic Hypergraph Multi-hop Reasoning for Knowledge-based Visual Question Answering
1443 Yuanzhen Luo, Qingyu Zhou and Feng Zhou Enhancing Phrase Representation by Information Bottleneck Guided Text Diffusion Process for Keyphrase Extraction
1448 Xin Wu, Yi Cai and Ho-fung Leung Abstract-level Deductive Reasoning for Pre-trained Language Models
1449 Li Yuan, Yi Cai, Haopeng Ren and Jiexin Wang A Logical Pattern Memory Pre-trained Model for Entailment Tree Generation
1453 Youheng W. Wong, Natalie Parde and Erdem Koyuncu Humanistic Buddhism Corpus: A Challenging Domain-Specific Dataset of English Translations for Classical and Modern Chinese
1454 Baris Karacan, Ankit Aich, Avery Quynh, Amy Pinkham, Philip Harvey, Colin Depp and Natalie Parde Towards Comprehensive Language Analysis for Clinically Enriched Spontaneous Dialogue
1455 Hayato Tsukagoshi, Tsutomu Hirao, Makoto Morishita, Katsuki Chousa, Ryohei Sasano and Koichi Takeda WikiSplit++: Easy Data Refinement for Split and Rephrase
1456 Xi Wang, Hongliang Dai, Shen Gao and Piji Li Characteristic AI Agents via Large Language Models
1458 Fan Hu, Yanlin Wang, Lun Du, Hongyu Zhang, Dongmei Zhang and Xirong Li Tackling Long Code Search with Splitting, Encoding, and Aggregating
1460 Supryadi Supryadi, Leiyu Pan and Deyi Xiong An Empirical Study on the Robustness of Massively Multilingual Neural Machine Translation
1461 Eiki Murata and Daisuke Kawahara Time-aware COMET: a Commonsense Knowledge Model with Temporal Knowledge
1463 Maxwell A. Weinzierl and Sanda M. Harabagiu The Impact of Stance Object Type on the Quality of Stance Detection
1471 Panatchakorn Anantaprayoon, Masahiro Kaneko and Naoaki Okazaki Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels
1472 Zhongquan Jian, Ante Wang, Jinsong Su, Junfeng Yao, Meihong Wang and Qingqiang Wu EmoTrans: Emotional Transition-based Model for Emotion Recognition in Conversation
1474 Bin Li, Yunlong Fan, Yikemaiti Sataer, Chuanqi Shi, Miao Gao and Zhiqiang Gao Few-Shot Semantic Dependency Parsing via Graph Contrastive Learning
1476 Yao Sun, Anastasiia Tatlubaeva, Zhihan Li and Chester Palen-Michel What are the implications of your question? Non-Information Seeking Question-Type Identification in CNN Transcripts
1477 Dojun Park and Sebastian Padó Multi-Dimensional Machine Translation Evaluation: Model Evaluation and Resource for Korean
1478 Yilin Wang, Minghao Hu, Zhen Huang, Dongsheng Li, Dong Yang and Xicheng Lu KC-GenRe: A Knowledge-constrained Generative Re-ranking Method Based on Large Language Models for Knowledge Graph Completion
1479 Sayar Ghosh Roy and Jiawei Han ILCiteR: Evidence-grounded Interpretable Local Citation Recommendation
1480 Yingting Li, Rishabh Bhardwaj, Ambuj Mehrish, Bo Cheng and Soujanya Poria HYPERTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks
1482 Rossana Cunha, Thiago Castro Ferreira, Adriana Pagano and Fabio Alves A Persona-Based Corpus in the Diabetes Self-Care Domain - Applying a Human-Centered Approach to a Low-Resource Context
1483 Yaqi Chen, Hao Zhang, Xukui Yang, Wenlin Zhang and Dan Qu Meta-Adapter for Self-Supervised Speech Models: A Solution to Low-Resource Speech Recognition Challenges
1484 Sungjin Nam, Kevyn Collins-Thompson, David Jurgens and Xin Tong Finding Educationally Supportive Contexts for Vocabulary Learning with Attention-Based Models
1486 Liang Lu, Jingzhi Wang and David R. Mortensen Improved Neural Protoform Reconstruction via Reflex Prediction
1492 Sungjun Han and Sebastian Padó Towards Understanding the Relationship between In-context Learning and Compositional Generalization
1493 Kehan Long, Shasha Li, Pancheng Wang, Chenlong Bao, Jintao Tang and Ting Wang Recommending Missed Citations Identified by Reviewers: A New Task, Dataset and Baselines
1494 Chester Palen-Michel, Lizzie Liang, Zhe Wu and Constantine Lignos QueryNER: Segmentation of E-commerce Queries
1495 Parth Patwa, Simone Filice, Zhiyu Chen, Giuseppe Castellucci, Oleg Rokhlenko and Shervin Malmasi Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data
1496 Bipesh Subedi, Sunil Regmi, Bal Krishna Bal and Praveen Acharya Exploring the Potential of Large Language Models (LLMs) for Low-resource Languages: A Study on Named-Entity Recognition (NER) and Part-Of-Speech (POS) Tagging for Nepali Language
1498 Yigeng Zhang, Mahsa Shafaei, Fabio Gonzalez and Thamar Solorio Positive and Risky Message Assessment for Music Products
1499 Yigeng Zhang, Fabio Gonzalez and Thamar Solorio Interpreting Themes from Educational Stories
1501 Chieko Nishimura, Shuhei Kurita and Yohei Seki Text360Nav: 360-Degree Image Captioning
1502 Eunike Andriani Kardinata, Hiroki Ouchi and Taro Watanabe Constructing Indonesian-English Travelogue Dataset
1503 Ruochen Zhang and Carsten Eickhoff CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched Summarization
1504 Yunhua Zhou, Pengyu wang, Peiju Liu, Yuxin Wang and Xipeng Qiu The Open-World Lottery Ticket Hypothesis for OOD Intent Classification
1505 Azmine Toushik Wasi, Taki Hasan Rafi, Raima Islam and Dong-Kyu Chae BanglaAutoKG: Automatic Bangla Knowledge Graph Construction with Semantic Neural Graph Filtering
1507 Tan Yue, Xuzhao Shi, Rui Mao, Zonghai Hu and Erik Cambria SarcNet: A Multilingual Multimodal Sarcasm Detection Dataset
1510 Omar Kallas, Go Inoue and Nizar Habash EMAD: A Bridge Tagset for Unifying Arabic POS Annotations
1511 Jiaying Gong and Hoda Eldardiry Few-Shot Relation Extraction with Hybrid Visual Evidence
1512 Rustem Yeshpanov, Alina Polonskaya and Huseyin Atakan Varol KazParC: Kazakh Parallel Corpus for Machine Translation
1514 Thibault Clerice Detecting Sexual Content at the Sentence Level in First Millennium Latin Texts
1518 xiujuan xu, Xiaoxiao Shi, Zhehuan Zhao and Yu Liu ESCP: Enhancing Emotion Recognition in Conversation with Speech and Contextual Prefixes
1519 Fan Xu, Lei Zeng, Bowei Zou, AiTi Aw and Huan Rong CLFFRD: Curriculum Learning and Fine-grained Fusion for Multimodal Rumor Detection
1521 Haopeng Zhang, Hayate Iso, Sairam Gurajada and Nikita Bhutani XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates
1522 Rui Mao, Guanyi Chen, Xulang Zhang, Frank Guerin and Erik Cambria GPTEval: A Survey on Assessments of ChatGPT and GPT-4
1526 Mengyi Huang, Meng Xiao, Ludi Wang and Yi Du DP-CRE: Continual Relation Extraction via Decoupled Contrastive Learning and Memory Structure Preservation
1528 Zhouhao Sun, Xiao Ding, Li Du, Bibo Cai, Jinglong Gao, Ting Liu and Bing Qin Towards Generalizable and Faithful Logic Reasoning over Natural Language via Resolution Refutation
1530 Agrima Seth, Sanchit Ahuja, Kalika Bali and Sunayana Sitaram DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures
1531 Xinyue Liu, Jianan Zhang, Chi Ma, Wenxin Liang, Bo Xu and Linlin Zong Temporal Knowledge Graph Reasoning with Dynamic Hypergraph Embedding
1532 Jonne Saleva and Constantine Lignos ParaNames 1.0: Creating an Entity Name Corpus for 400+ Languages using Wikidata
1535 LIN LI, Shaopeng Tang and Renwei Wu Majority Rules Guided Aspect-Category based Sentiment Analysis via Label Prior Knowledge
1538 Thennal D K, Ganesh Nathan and Suchithra M S Fisher Mask Nodes for Language Model Merging
1540 Frederikus Hudi, Zhi Qu, Hidetaka Kamigaito and Taro Watanabe Disentangling Pretrained Representation to Leverage Low-Resource Languages in Multilingual Machine Translation
1543 Heyang Liu, Yanfeng Wang and Yu Wang Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview
1548 Yuki Hironaka, Tomoyuki Kajiwara and Takashi Ninomiya Transfer Fine-tuning for Quality Estimation of Text Simplification
1552 Maxim Konca, Andy Luecking and Alexander Mehler German SRL: Corpus Construction and Model Training
1553 Guicai Xie, Ke Zhang, Lei Duan, Wei Zhang and Zeqian Huang Typos Correction Training Against Misspellings from Text-to-Text Transformers
1555 Tianyu Zheng, Ge Zhang, Xingwei Qu, Ming Kuang, Wenhao Huang and Zhaofeng He MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces
1558 Zhenxi Lin, Ziheng Zhang, Xian Wu and Yefeng Zheng Biomedical Entity Linking as Multiple Choice Question Answering
1559 Chen Yang, Bin Cao and Jing Fan HS-GC: Holistic Semantic Embedding and Global Contrast for Effective Text Clustering
1561 Martyna Wiącek, Piotr Rybak, Łukasz Pszenny and Alina Wróblewska NLPre: a revised approach towards language-centric benchmarking of Natural Language Preprocessing systems
1562 Mersad Esalati, Mohammad Javad Dousti and Heshaam Faili Esposito: An English-Persian Scientific Parallel Corpus for Machine Translation
1563 Yue Wang, Zilong Zheng, Juntao Li, zhihui liu, Jinxiong Chang, Qishen Zhang, Zhongyi Liu, Guannan Zhang and Min Zhang Towards More Realistic Chinese Spell Checking with New Benchmark and Specialized Expert Model
1564 Puneet Mathur, Vlad I. Morariu, Aparna Garimella, Franck Dernoncourt, Jiuxiang Gu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha and Rajiv Jain DocScript: Document-level Script Event Prediction
1565 Kristina Kobrock, Xenia Isabel Ohmer, Elia Bruni and Nicole Gotzner Context Shapes Emergent Communication about Concepts at Different Levels of Abstraction
1566 Dongjun Jang, Sungjoo Byun and Hyopil Shin A Study on How Attention Scores in the BERT Model are Aware of Lexical Categories in Syntactic and Semantic Tasks on the GLUE Benchmark
1568 Yaxin Fan, Feng Jiang, Peifeng Li and Haizhou Li Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue: An Empirical Study
1570 Tu-Anh Tran and Yusuke Miyao Integrating Headedness Information into an Auto-generated Multilingual CCGbank for Improved Semantic Interpretation
1571 Marta Bañón, Gema Ramírez-Sánchez, Jaume Zaragoza-Bernabeu and Sergio Ortiz Rojas FastSpell: the LangId Magic Spell
1573 Seongbo Jang, Seonghyeon Lee and Hwanjo Yu KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark
1575 Yuya Ogasa, Tomoyuki Kajiwara and Yuki Arase Controllable Paraphrase Generation for Semantic and Lexical Similarities
1578 Hadeel Saadany, Constantin Orasan, Sophie Walker and Catherine Breslin Linking Judgement Text to Court Hearing Videos: UK Supreme Court as a Case Study
1585 Mengsha Liu, Daoyuan Chen, Yaliang Li, Guian Fang and Ying Shen ChartThinker: A Contextual Chain-of-Thought Approach to Optimized Chart Summarization
1589 Wen-wai Yim, Yujuan Fu, Asma Ben Abacha and Meliha Yetisgen To Err is Human, How About Medical Large Language Models? Comparing Pre-trained Language Models for Medical Assessment Errors and Reliability
1591 Jiří Mírovský, Pavlína Synková, Lucie Polakova and Marie Paclíková Cost-Effective Discourse Annotation in the Prague Czech–English Dependency Treebank
1594 Akash Anil, Victor Gutierrez-Basulto, Yazmin Ibanez-Garcia and Steven Schockaert Inductive Knowledge Graph Completion with GNNs and Rules: An Analysis
1597 Lingbing Guo, Zhuo Chen, Jiaoyan Chen, Qiang Zhang and Huajun Chen DET: A Dual-Encoding Transformer for Relational Graph Embedding
1600 Felipe Gonzalez-Pizarro and Giuseppe Carenini Neural Multimodal Topic Modeling: A Comprehensive Evaluation
1601 Ruilin Luo, Jiayi Li, Jianghangfan Zhang, Jing Xiao and Yujiu Yang Prior Relational Schema Assists Effective Contrastive Learning for Inductive Knowledge Graph Completion
1606 Dancheng Xin, Jiawei Yuan and Yang Li Diffusion based Counterfactual Augmentation for Dual Sentiment Classification
1608 Jooyoung Lee, Fan Yang, Thanh Tran, Qian Hu, Emre Barut and Kai-Wei Chang Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought
1610 Ronja Laarmann-Quante, Marco Müller and Eva Belke Automatic Extraction of Nominal Phrases from German Learner Texts of Different Proficiency Levels
1612 Yerin Hwang, Yongil Kim, Hyunkyung Bae, Jeesoo Bang, Hwanhee Lee and Kyomin Jung Kosmic: Korean Text Similarity Metric Reflecting Honorific Distinctions
1613 Antoine Jamelot, Solen Quiniou and Sophie Hamon Improving Text Readability through Segmentation into Rheses
1614 Elena Benzoni, Matteo Pellegrini, Francesco Dedè and Marco Passarotti Representing Compounding with OntoLex. An Evaluation of Vocabularies for Word Formation Resources
1615 Iker García-Ferrero, Rodrigo Agerri, Aitziber Atutxa Salazar, Elena Cabrio, Iker de la Iglesia, Alberto Lavelli, Bernardo Magnini, Benjamin Molinet, Johana Ramirez-Romero, German Rigau, Jose Maria Villa-Gonzalez, Serena Villata and Andrea Zaninello MedMT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain
1616 Miriam Schirmer, Christian Brechenmacher and Juergen Pfeffer GENTRAC: A Tool for Tracing Trauma in Genocide and Mass Atrocity Court Transcripts
1619 Davide Picca and John Pavlopoulos Deciphering Emotional Landscapes in the Iliad: A Novel French-Annotated Dataset for Emotion Recognition
1620 Harshita Diddee, Anurag Shukla, Tanuja Ganu, Vivek Seshadri, Sandipan Dandapat, Monojit Choudhury and Kalika Bali INMT-Lite: Accelerating Low-Resource Language Data Collection via Offline Interactive Neural Machine Translation
1621 Xiaohan Ma, Rize Jin and Tae-Sun Chung Multi-Channel Spatio-Temporal Transformer for Sign Language Production
1625 Tatiana Passali and Grigorios Tsoumakas Topic-Controllable Summarization: Topic-Aware Evaluation and Transformer Methods
1626 Chenyang Lyu, Zefeng Du, Jitao Xu, Yitao Duan, Minghao Wu, Teresa Lynn, Alham Fikri Aji, Derek F. Wong and Longyue Wang A Paradigm Shift: The Future of Machine Translation Lies with Large Language Models
1628 Anna Kuznetsova and Carlo Strapparava Multimodal and Multilingual Laughter Detection in Stand-Up Comedy Videos
1631 Takeru Isaka, Atsushi Otsuka and Iwaki Toshima Analysis of Sensation-transfer Dialogues in Motorsports
1633 Toru Urakawa, Yuya Taguchi, Takuro Niitsuma and Hideaki Tamori A Japanese News Simplification Corpus with Faithfulness
1635 Shirin Dabbaghi Varnosfaderani, Canasai Kruengkrai, Ramin Yahyapour and Junichi Yamagishi Bridging Textual and Tabular Worlds for Fact Verification: A Lightweight, Attention-Based Model
1637 Olga Zamaraeva, Lorena S. Allegue and Carlos Gómez-Rodríguez Spanish Resource Grammar version 2023
1638 Anni Eskelinen, Amanda Myntti, Erik Henriksson, Sampo Pyysalo and Veronika Laippala Building Question-Answer Data Using Web Register Identification
1639 Yash Jain, David M. Chan, PRANAV DHERAM, Aparna Khare, Olabanji Shonibare, Venkatesh Ravichandran and Shalini Ghosh Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
1641 Thi-Nhung Nguyen, Bang Tien Tran, Trong-Nghia Luu, Thien Huu Nguyen and Kiem-Hieu Nguyen BKEE: Pioneering Event Extraction in the Vietnamese Language
1642 Audrey Mash, Carlos Escolano, Aleix Sant, Maite Melero and Francesca De Luca Fornaciari Unmasking Biases: Exploring Gender Bias in English-Catalan Machine Translation through Tokenization Analysis and Novel Dataset
1645 Ya Gao, Shaoxiong Ji and Pekka Marttinen Knowledge-augmented Graph Neural Networks with Concept-aware Attention for Adverse Drug Event Detection
1646 Hongzheng Li, Ruojin Wang, Ge Shi, Xing Lv, Lei Lei, Chong Feng, Fang Liu, Jinkun Lin, Yangguang Mei and Linnan Xu RAAMove: A Corpus for Analyzing Moves in Research Article Abstracts
1647 Zhenyu Qian, Yiming Qian, Yuting Song, Fei Gao, Hai Jin, Chen Yu and Xia Xie Harnessing the Power of Large Language Model for Uncertainty Aware Graph Processing
1648 Maria Barrett, Max Müller-Eberstein, Elisa Bassignana, Amalie Brogaard Pauli, Mike Zhang and Rob van der Goot Can Humans Identify Domains?
1659 Nurbanu Aksoy, Nishant Ravikumar and Serge Sharoff Enhancing Image-to-Text Generation in Radiology Reports through Cross-modal Multi-Task Learning
1661 Chunlei Xin, Yaojie Lu, Hongyu Lin, Shuheng Zhou, Huijia Zhu, weiqiang wang, Zhongyi Liu, Xianpei Han and Le Sun Beyond Full Fine-tuning: Harnessing the Power of LoRA for Multi-Task Instruction Tuning
1662 Longyin Zhang, Bowei Zou and Ai Ti Aw Empowering Tree-structured Entailment Reasoning: Rhetorical Perception and LLM-driven Interpretability
1663 Lisa Raithel, Hui-Syuan Yeh, Shuntaro Yada, Cyril Grouin, Thomas Lavergne, Aurélie Névéol, Patrick Paroubek, Philippe Thomas, Tomohiro Nishiyama, Sebastian Möller, Eiji ARAMAKI, Yuji Matsumoto, Roland Roller and Pierre Zweigenbaum A Dataset for Pharmacovigilance in German, French, and Japanese: Annotating Adverse Drug Reactions across Languages
1664 Marie Mikulova Fine-grained Classification of Circumstantial Meanings within the Prague Dependency Treebank Annotation Scheme
1666 Injy Hamed, Fadhl Eryani, David Palfreyman and Nizar Habash ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus
1667 Neele Falk and Gabriella Lapesa Stories and personal experiences in the COVID-19 Discourse
1668 Elisa Bassignana, Viggo Unmack Gascou, Frida Nøhr Laustsen, Gustav Kristensen, Marie Haahr Petersen, Rob van der Goot and Barbara Plank How to Encode Domain Information in Relation Classification
1669 Feifan Song, Bowen Yu, Hao Lang, Haiyang Yu, Fei Huang, Houfeng Wang and Yongbin Li Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment
1674 Boxi Cao, Qiaoyu Tang, Hongyu Lin, Shanshan Jiang, Bin Dong, Xianpei Han, Jiawei Chen, Tianshu Wang and Le Sun Retentive or Forgetful? Diving into the Knowledge Memorizing Mechanism of Language Models
1675 Johanna CORDOVA Towards Universal Dependencies For Ancash Quechua
1677 Yuting Yang, pei huang, Feifei Ma, Juan Cao and Jintao Li PAD: A Robustness Enhancement Ensemble Method via Promoting Attention Diversity
1679 Pierre Lepagnol, Thomas Gerald, Sahar Ghannay, Christophe Servan and Sophie Rosset Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification
1681 Aleksandra Edwards and Jose Camacho-Collados Language Models for Text Classification: Is In-Context Learning Enough?
1682 Annerose Eichel, Tana Deeg, Andre Blessing, Milena Belosevic, Sabine Arndt-Lappe and Sabine Schulte im Walde Willkommens-Merkel, Chaos-Johnson, and Tore-Klose: Modeling the Evaluative Meaning of German Personal Name Compounds
1683 Benjamin Icard, François Maine, Morgane Casanova, Géraud Faye, Julien Chanson, Guillaume Gadek, Ghislain Atemezing, François Bancilhon and Paul Égré A Multi-Label Dataset of French Fake News: Human and Machine Insights
1684 Ramona Kühn, Khouloud Saadi, Jelena Mitrović and Michael Granitzer Using Pre-Trained Language Models in an End-to-End Pipeline for Antithesis Detection
1687 Veronika Grigoreva, Anastasiia Ivanova, Ilseyar Alimova and Ekaterina Artemova RuBia: A Russian Language Bias Detection Dataset
1695 Sérgio Nunes, Alípio Mario Jorge, Evelin Amorim, Hugo Sousa, António Leal, Purificação Moura Silvano, Inês Cantante and Ricardo Campos Text2Story Lusa: A Dataset for Narrative Analysis in European Portuguese News Articles
1697 Feiyan Liu, Liangzhi Li, Xiaoli Wang, Feng Luo, Chang Liu, Jinsong Su and Yiming Qian MHGRL: An Effective Representation Learning Model for Electronic Health Records
1699 Mokanarangan Thayaparan, Marco Valentino and André Freitas A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference
1701 Oliver Cakebread-Andrews Error Analysis of NLP Models and Non-Native Speakers of English Identifying Sarcasm in Reddit Comments
1704 Ingrid Espinoza, Steffen Frenzel, Laurin Friedrich, Wassiliki Siskou, Steffen Eckhard and Annette Hautli-Janisz PSE v1.0: The first open access corpus of public service encounters
1708 Marion Weller-Di Marco and Alexander Fraser Analyzing the Understanding of Morphologically Complex Words in Large Language Models
1709 Jue Hou, Anisia Katinskaia, Lari Kotilainen, Sathianpong Trangcasanchai, Anh-Duc Vu and Roman Yangarber What do Transformers Know about Government?
1710 Dmitry Zmitrovich, Aleksandr Abramov, Andrey Kalmykov, Vitaly Kadulin, Maria Tikhonova, Ekaterina Taktasheva, Danil Astafurov, Mark Baushenko, Artem Snegirev, Tatiana Shavrina, Sergei S. Markov, Vladislav Mikhailov and Alena Fenogenova A Family of Pretrained Transformer Language Models for Russian
1711 Muhammed AbuOdeh, Long Phan, Ahmed Farouk Zakaria Elshabrawy and Nizar Habash Palmyra 3.0: A User-Friendly Cloud-Based Platform for Morphology and Dependency Syntax Annotation
1714 Jianyu Zheng, Fengfei Fan and Jianquan Li Incorporating Lexical and Syntactic Knowledge for Unsupervised Cross-Lingual Transfer
1715 Martin Popel, Lucie Polakova, Michal Novák, Jindřich Helcl, Jindřich Libovický, Pavel Straňák, Tomas Krabac, Jaroslava Hlavacova, Mariia Anisimova and Tereza Chlanova Charles Translator: A Machine Translation System between Ukrainian and Czech
1716 Sergey Kramp, Giovanni Cassani and Chris Emmery BigNLI: Native Language Identification with Big Bird Embeddings
1718 Weihao Zeng, Keqing He, Yejie Wang, Dayuan Fu and Weiran Xu BootTOD: Bootstrap Task-oriented Dialogue Representations by Aligning Diverse Responses
1719 Marie Iversdatter Røsok and Ingerid Løyning Dale NB Uttale: A Norwegian Pronunciation Lexicon with Dialect Variation
1720 Agnieszka Falenska, Eva Maria Vecchi and Gabriella Lapesa Self-reported demographics and discourse dynamics in a persuasive online forum
1722 Yuhan Liu, Xiuying Chen, GAO XING, Ji Zhang and Rui Yan IAD: In-Context Learning Ability Decoupler of Large Language Models in Meta-Training
1723 Iacopo Ghinassi, Simone Tedeschi, Paola Marongiu, Roberto Navigli and Barbara McGillivray Language Pivoting from Parallel Corpora for Word Sense Disambiguation of Historical Languages: a Case Study on Latin
1724 Yikun Sun, Zhen Wan, Nobuhiro Ueda, Sakiko Yahata, Fei Cheng, Chenhui Chu and Sadao Kurohashi Rapidly Developing High-quality Instruction Data and Evaluation Benchmark for Large Language Models with Minimal Human Effort: A Case Study on Japanese
1725 Jens Nevens, Robin De Haes, Rachel Ringe, Mihai Pomarlan, Robert Porzel, Katrien Beuls and Paul Van Eecke A Benchmark for Recipe Understanding in Artificial Agents
1727 Rashid Nizamani, Sebastian Schuster and Vera Demberg SIGA: A Naturalistic NLI Dataset of English Scalar Implicatures with Gradable Adjectives
1731 Xiaotian Lu, Jiyi Li, Zhen Wan, Xiaofeng Lin, Koh Takeuchi and Hisashi Kashima Evaluating Saliency Explanations in NLP by Crowdsourcing
1733 Jiamin Luo, Jianing Zhao, Jingjing Wang and Guodong Zhou How to Understand "Support"? An Implicit-enhanced Causal Inference Approach for Weakly-supervised Phrase Grounding
1739 Jiamin Luo, Jingjing Wang and Guodong Zhou TopicDiff: A Topic-enriched Diffusion Approach for Multimodal Conversational Emotion Detection
1743 Dimitar Trajanov, Elena Apostol, Radovan Garabík, Katerina Gkirtzou, Dagmar Gromann, Chaya Liebeskind, Cosimo Palma, Michael Rosner, Alexia Sampri, Gilles Sérasset, Blerina Spahiu, Ciprian-Octavian Truică and Giedre Valunaite Oleskeviciene From Linguistic Linked Data to Big Data
1744 Masaaki Nagata, Makoto Morishita, Katsuki Chousa and Norihito Yasuda JaParaPat: A Large-Scale Japanese-English Parallel Patent Application Corpus
1746 Abdelhak Kelious, Mathieu Constant and Christophe Coeur Complex Word Identification: a Comparative Study Between ChatGPT and a Dedicated Model for this Task
1747 Johanna Gerlach, Pierrette Bouillon, Jonathan Mutal and Hervé Spechbach A Concept Based Approach for Translation of Medical Dialogues into Pictographs
1748 Hongbin Na CBT-LLM: A Chinese Large Language Model for Cognitive Behavioral Therapy-based Mental Health Question Answering
1749 Madalina Chitez, Mihai Dascalu, Aura Cristina Udrea, Cosmin Strilețchi, Karla Csürös, Roxana Rogobete and Alexandru Oravițan Towards Building the LEMI Readability Platform for Children's Literature in the Romanian Language
1753 Yantao Liu, Zijun Yao, Xin Lv, Yuchen Fan, Shulin Cao, Jifan Yu, Lei Hou and Juanzi Li Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models
1754 Bolette Pedersen, Nathalie Sørensen, Sussi Olsen, Sanni Nimb and Simon Gray Towards a Danish Semantic Reasoning Benchmark - Compiled from Lexical-Semantic Resources for Assessing Selected Language Understanding Capabilities of Large Language Models
1755 Xuefei Li, Huiwei Zhou, Weihong Yao, Wenchu Li, Yingyu Lin and Lei Du Sequential and Repetitive Pattern Learning for Temporal Knowledge Graph Reasoning
1757 Anna Rogers, Marzena Karpinska, Ankita Gupta, Vladislav Lialin, Gregory Smelkov and Anna Rumshisky NarrativeTime: Dense Temporal Annotation on a Timeline
1758 Chengyuan Liu, Fubang Zhao, Kun Kuang, Yangyang Kang, Zhuoren Jiang, Changlong Sun and Fei Wu Evolving Knowledge Distillation with Large Language Models and Active Learning
1759 Quan Tu, Chongyang Tao and Rui Yan Multi-Grained Conversational Graph Network for Retrieval-based Dialogue Systems
1760 Andrew Rueda, Elena Alvarez-Mellado and Constantine Lignos CoNLL#: Fine-grained Error Analysis and a Corrected Test Set for CoNLL-03 English
1761 Jifan Yu, Xiaohan Zhang, Yifan Xu, Xuanyu Lei, Zijun Yao, Jing Zhang, Lei Hou and Juanzi Li A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation
1762 Soëlie Lerch, Patrice Bellot, Elisabeth Murisasco and Emmanuel Bruno EMOLIS App and Dataset to Find Emotionally Close Cartoons
1763 Samia Touileb, Jeanett Murstad, Petter Mæhlum, Lubos Steskal, Lilja Charlotte Storset, Huiling You and Lilja Øvrelid EDEN: A Dataset for Event Detection in Norwegian News
1764 Georg Rehm, Stelios Piperidis, Dimitris Galanis, Penny Labropoulou, Maria Giagkou, Miltos Deligiannis, Leon Voukoutis, Martin Courtois, Julian Moreno-Schneider and Katrin Marheinecke European Language Grid: One Year After
1770 Evelin Amorim, Ricardo Campos, Alipio Jorge, Pedro Mota and Rúben Almeida text2story: A Python Toolkit to Extract and Visualize Story Components of Narrative Text
1773 JingJie Zeng, Liang Yang, Jiahao Kang, Yufeng Diao, Zhihao Yang and Hongfei LIN "Barking Up the Right Tree", a GAN-Based Pun Generation Model through Semantic Pruning
1777 Zhenwen Liang, Dian Yu, Xiaoman Pan, Wenlin Yao, Qingkai Zeng, Xiangliang Zhang and Dong Yu MinT: Boosting Generalization in Mathematical Reasoning via Multi-view Fine-tuning
1781 Karolina Zaczynska, Peter Bourgonje and Manfred Stede How Diplomats Dispute: The UN Security Council Conflict Corpus
1782 Songbo Hu, Ivan Vulić, Fangyu Liu and Anna Korhonen Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems
1783 Mario Perez-Enriquez, Jose Manuel Masiello-Ruiz, Jose Luis Lopez-Cuadrado, Israel Gonzalez-Carrasco, Paloma Martinez-Fernandez and Belen Ruiz-Mezcua Automatic Punctuation Model for Spanish Live Transcriptions
1786 Christian Khairallah, Salam Khalifa, Reham Marzouk, Mayar Mohamadein Nassar and Nizar Habash Camel Morph MSA: A Large-Scale Open-Source Morphological Analyzer for Modern Standard Arabic
1787 Tom Bourgeade, Zongmin Li, Farah Benamara, Véronique MORICEAU, Jian Su and Aixin Sun Humans Need Context, What About Machines? Investigating Conversational Context in Abusive Language Detection
1792 Denis Kokosinskii and Nikolay Arefyev Multilingual Substitution-based Word Sense Induction
1795 Enrique Amigó, Jorge Carrillo-de-Albornoz, Andrés Fernández, Julio Gonzalo, Guillermo Marco, Roser Morante, Laura Plaza and Jacobo Pedrosa A Web Portal about the State of the Art of NLP Tasks in Spanish
1798 Manfred Klenner and Dylan Massey Is Gender Reference Gender-specific? Studies in a Polar Domain
1803 Kate Thompson, Julie Hunter and Nicholas Asher Discourse Structure for the Minecraft Corpus
1805 Chi Hu, Yuan Ge, Xiangnan Ma, Hang Cao, Qiang Li, Yonghua Yang, Tong Xiao and Jingbo Zhu RankPrompt: Step-by-Step Comparisons Make Language Models Better Reasoners
1806 Ye Tao, Chaofeng Lu, Meng Liu, Kai Xu, Tianyu Liu, Yunlong Tian and Yongjie Du A Fast and High-quality Text-to-Speech Method with Compressed Auxiliary Corpus and Limited Target Speaker Corpus
1809 Anton Chernyavskiy, Svetlana Shomova, Irina Dushakova, Ilya Kiriya and Dmitry Ilvovsky ZenPropaganda: A Comprehensive Study on Identifying Propaganda Techniques in Russian Coronavirus-Related Media
1810 Di Wang, Yuzheng He, Xiao Liang, Yumin Tian, Shaofeng Li and Lin Zhao TMFN: A Target-oriented Multi-grained Fusion Network for End-to-end Aspect-based Multimodal Sentiment Analysis
1812 Jianxiang Xiang, Zhenhua Liu, Haodong Liu, Yin Bai, Jia Cheng and Wenliang Chen DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space
1815 Jiyao Wei, Saiping Guan, Xiaolong Jin, Jiafeng Guo and Xueqi Cheng Few-shot Link Prediction on Hyper-relational Facts
1816 Henrik Voigt, Kai Lawonn and Sina Zarrieß Plots Made Quickly: An Efficient Approach for Generating Visualizations from Natural Language Queries
1817 Martijn Bentum, Eric Sanders, Antal P.J. van den Bosch, Douwe Zeldenrust and Henk van den Heuvel Corpus Creation and Automatic Alignment of Historical Dutch Dialect Speech
1818 Shweta Misra and Johan Boye Nested Noun Phrase Identification using BERT
1821 Jorge Osés Grijalba, L. Alfonso Ureña-López, Eugenio Martínez Cámara and Jose Camacho-Collados Question Answering over Tabular Data with DataBench: A Large-Scale Empirical Evaluation of LLMs
1822 Aaron Maladry, Alessandra Teresa Cignarella, Els Lefever, Cynthia Van Hee and Veronique Hoste Human and System Perspectives on the Expression of Irony: an Analysis of Likelihood Labels and Rationales
1823 Pascal Tilli and Ngoc Thang Vu Intrinsic Subgraph Generation for Interpretable Graph based Visual Question Answering
1825 Jeremy Robichaud and Paul Cook WaCadie: Towards an Acadian French Corpus
1834 Augusto R. Mendes and Helena Caseli Identifying Fine-grained Depression Signs in Social Media Posts
1836 Arianna Graciotti, Valentina Presutti and Rocco Tripodi Latent vs Explicit Knowledge Representation: How ChatGPT Answers Questions about Low-Frequency Entities
1837 Ivan Sedykh, Nikita Sorokin, Dmitry Abulkhanov, Sergey I. Nikolenko and Valentin Malykh Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets
1838 Marianne Vergez-Couret, Myriam Bras, Aleksandra Miletić and Clamença Poujade Loflòc: A Morphological Lexicon for Occitan using Universal Dependencies
1840 Ashwathy T Revi, Stuart E. Middleton and David E. Millard Rationale-based Learning using Self-Supervised Narrative Events for Text Summarisation of Interactive Digital Narratives
1841 Fengkai Liu and John S. Y. Lee CSSWiki: A Chinese Sentence Simplification Dataset with Linguistic and Content Operations
1842 Noémi Ligeti-Nagy, Gergő Ferenczi, Enikő Héja, László János Laki, Noémi Vadász, Zijian Győző Yang and Tamás Váradi HuLU: Hungarian Language Understanding Benchmark Kit
1845 Kelvin Wey Han Chan, Christopher Bryant, Li Nguyen, Andrew Caines and Zheng Yuan Grammatical Error Correction for Code-Switched Sentences by Learners of English
1847 Zhuorui Liu, Chen Zhang and Dawei Song How Speculative Can Speculative Decoding Be?
1848 Adnen Abdessaied, Manuel Hochmeister and Andreas Bulling OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog
1849 Serhii Hamotskyi, Nata Kozaeva and Christian Hänig FinCorpus-DE10k: A Corpus for the German Financial Domain
1850 Xiang Luo, Zhiwen Tang, Jin Wang and Xuejie Zhang DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues
1852 Shuo Yang and Gjergji Kasneci Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization
1853 Yue Wang, Hua Zheng, Yaqi Yin, 王 涵思, Qiliang Liang and Yang Liu Morpheme Sense Disambiguation: A New Task Aiming for Understanding the Language at Character Level
1855 Gaurish Thakkar, Sherzod Hakimov and Marko Tadić M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets
1856 Qingqing Gao, Jiuxin Cao, Biwei Cao, Xin Guan and Bo Liu CEPT: a Contrast-Enhanced Prompt-Tuning Framework for Emotion Recognition in Conversation
1857 Andres Garcia-Silva, Cristian Berrio and Jose Manuel Gomez-Perez SPACE-IDEAS: A Dataset for Salient Information Detection in Space Innovation
1859 Haotian Xu, Yuhua Wang and Jiahui Fan Self-Knowledge Distillation for Knowledge Graph Embedding
1864 Xinyu Ma, Xuebo Liu, Derek F. Wong, Jun Rao, Bei Li, Liang Ding, Lidia S. Chao, Dacheng Tao and Min Zhang 3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset
1866 Olli Kuparinen Murre24: Dialect Identification of Finnish Internet Forum Messages
1870 Lukas Lange, Marc Müller, Ghazaleh Haratinezhad Torbati, Dragan Milchevski, Patrick Grau, Subhash Chandra Pujari and Annemarie Friedrich AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports
1873 Zhuoqun Li, Hongyu Lin, Yaojie Lu, Hao Xiang, Xianpei Han and Le Sun Meta-Cognitive Analysis: Evaluating Declarative and Procedural Knowledge in Datasets and Large Language Models
1874 Rik van Noord, Taja Kuzman, Peter Rupnik, Nikola Ljubešić, Miquel Esplà-Gomis, Gema Ramírez-Sánchez and Antonio Toral Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages
1876 Emmett Strickland, Anne Lacheret-Dujour, Sylvain Kahane, Marc Evrard, Perrine Quennehen, Bernard Caron, Francis Egbokhare and Bruno Guillaume New Methods for Exploring Intonosyntax: Introducing an Intonosyntactic Treebank for Nigerian Pidgin
1878 Tianbao Song, Jingbo Sun, Xin Liu and Weiming Peng Scale-VAE: Preventing Posterior Collapse in Variational Autoencoder
1881 Sebastian Reimann and Tatjana Scheffler Metaphors in Online Religious Communication: a Detailed Dataset and Cross-Genre Metaphor Detection
1882 Kira Droganova and Daniel Zeman Towards a Unified Taxonomy of Deep Syntactic Relations
1884 Yaqi Yin, Yue Wang and Yang Liu Chinese Morpheme-informed Evaluation of Large Language Models
1885 Mingxiu Cai, Daling Wang, Shi Feng and Yifei Zhang EmpCRL: Controllable Empathetic Response Generation via In-Context Commonsense Reasoning and Reinforcement Learning
1888 tianxiang wu, Han Chen, Luozheng Qin, Ziqiang Cao and Chunhui Ai Improving Copy-oriented Text Generation via EDU Copy Mechanism
1894 Ruina Bai and Qi Bai Improving multi-view document clustering: leveraging multi-structure processor and hybrid ensemble clustering module
1895 Liyan Wang, Haotong Wang and Yves Lepage Continued Pre-training on Sentence Analogies for Translation with Small Data
1898 Baptiste Blouin, Cécile Armand and Christian Henriot A Dataset for Named Entity Recognition and Entity Linking in Chinese Historical Newspapers
1899 Pierre Magistry, Ilaine Wang and Ty Eng Lim Experiments on Speech Synthesis for Teochew, Can Taiwanese Help ?
1901 Somaiyeh Dehghan and Berrin Yanıkoğlu Multi-domain Hate Speech Detection Using Dual Contrastive Learning and Paralinguistic Features
1902 Eric Sanders, Sara Petrollino, Gilles R. Scheifer, Henk van den Heuvel and Christopher Handy FAIRification of LeiLanD
1903 Ariel Ekgren, Amaru Cuba Gyllensten, Felix Stollenwerk, Joey Öhman, Tim Isbister, Evangelia Gogoulou, Fredrik Carlsson, Judit Casademont and Magnus Sahlgren GPT-SW3: An Autoregressive Language Model for the Scandinavian Languages
1904 Punyajoy Saha, Aalok Agrawal, Abhik Jana, Chris Biemann and Animesh Mukherjee On Zero-Shot Counterspeech Generation by LLMs
1908 Hao Gu, Jiangyan Yi, Zheng Lian, Jianhua Tao and Xinrui Yan NLoPT: N-gram Enhanced Low-Rank Task Adaptive Pre-training for Efficient Language Model Adaption
1911 Shun Katada, Ryu Takeda and Kazunori Komatani Collecting Human-Agent Dialogue Dataset with Frontal Brain Signal toward Capturing Unexpressed Sentiment
1913 Yige Chen, KyungTae Lim and Jungyeul Park A Linguistically-Informed Annotation Strategy for Korean Semantic Role Labeling
1914 Zhicheng Lin, HeGang Chen, Yuyin Lu, Yanghui Rao, Hao Xu and Hanjiang Lai Hierarchical Topic Modeling via Contrastive Learning and Hyperbolic Embedding
1916 Wajdi Zaghouani, Hamdy Mubarak and Md. Rafiul Biswas So Hateful! Building a Multi-Label Hate Speech Annotated Arabic Dataset
1918 Nezih Younsi, Catherine Pelachaud and Laurence Chaby Beyond Words: Decoding Facial Expression Dynamics in Motivational Interviewing
1921 Ke Zhang, Yimiao Feng and Jie Zheng Prompt-based Generation of Natural Language Explanations of Synthetic Lethality for Cancer Drug Discovery
1922 Miriam Winkler, Virginija Juozapaityte, Rob van der Goot and Barbara Plank Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants
1923 Nesrine Bannour, Christophe Servan, Aurélie Névéol and Xavier Tannier A Benchmark Evaluation of Clinical Named Entity Recognition in French
1924 Alina Karakanta, Mauro Cettolo, Matteo Negri and Luisa Bentivogli Evaluating Automatic Subtitling: Correlating Post-editing Effort and Automatic Metrics
1927 Qiwei Peng, Yekun Chai and Xuhong Li HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization
1928 Tomáš Sourada, Jana Straková and Rudolf Rosa OOVs in the Spotlight: How to Inflect them?
1930 Joshua Miles Jansen van Vüren, Febe De Wet and Thomas Niesler Automatic Partitioning of a Code-Switched Speech Corpus Using Mixed-Integer Programming
1931 Hossam Zawbaa, Wael Rashwan, Sourav Dutta and Haytham Assem Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification
1934 Alessio Miaschi, Felice Dell'Orletta and Giulia Venturi Linguistic Knowledge Can Enhance Encoder-Decoder Models (If You Let It)
1937 Chuang Liu, Renren Jin, Yuqi Ren and Deyi Xiong LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models
1942 Xin Sun, Jiahuan Pei, Jan de Wit, Mohammad Aliannejadi, Emiel Krahmer, Jos T.P. Dobber and Jos A. Bosch Eliciting Motivational Interviewing Skill Codes in Psychotherapy with LLMs: A Bilingual Dataset and Analytical Study
1943 Da Ren and Qing Li Releasing the Capacity of GANs in Non-Autoregressive Image Captioning
1944 Katarzyna Krasnowska-Kieraś and Marcin Woliński Parsing Headed Constituencies
1948 Mithun Das, Saurabh Kumar Pandey and Animesh Mukherjee Evaluating ChatGPT Against Functionality Tests for Hate Speech Detection
1954 Shulin Zhang, John Hale, Margaret Renwick, Zvjezdana Vrzić and Keith Langston An Evaluation of Croatian ASR Models for Čakavian Transcription
1956 Sacha Beniamine, Mari Aigro, Matthew Baerman, Jules Bouton and Maria Copot Eesthetic: A Paralex Lexicon of Estonian Paradigms
1959 Elizaveta Korotkova, Taido Purason, Agnes Luhtaru and Mark Fishel Multilinguality or Back-translation? A Case Study with Estonian
1963 Ibrahim Khalil Khebour, Kenneth Lai, Mariah Bradford, Yifan Zhu, Richard A. Brutti, Christopher Tam, Jingxuan Tu, Benjamin A. Ibarra, Nathaniel Blanchard, Nikhil Krishnaswamy and James Pustejovsky Common Ground Tracking in Multimodal Dialogue
1966 Gustav Ryberg Smidt, Els Lefever and Katrien De Graef At the Crossroad of Cuneiform and NLP: Challenges for Fine-grained Part-of-speech Tagging
1968 Michaela Regneri, Alhassan Abdelhalim and Soeren Laue Detecting Conceptual Abstraction in LLMs
1969 Sugyeong Eo, Jungwoo Lim, Chanjun Park, DaHyun Jung, Seonmin Koo, Hyeonseok Moon, Jaehyung Seo and Heuiseok Lim Detecting Critical Errors Considering Cross-Cultural Factors in English-Korean Translation
1973 Yuansen Zhang, Xiao Wang, Zhiheng Xi, Han Xia, Tao Gui, Qi Zhang and Xuanjing Huang RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions
1979 Seonjeong Hwang, Yunsu Kim and Gary Geunbae Lee Explainable Multi-hop Question Generation: An End-to-End Approach without Intermediate Question Labeling
1982 Shaolin Zhu, Menglong Cui and Deyi Xiong Towards Robust In-Context Learning for Machine Translation with Large Language Models
1983 Ai Ishii, Naoya Inoue, Hisami Suzuki and Satoshi Sekine JEMHopQA: Dataset for Japanese Explainable Multi-Hop Question Answering
1985 Rahul Ponnusamy, Kathiravan Pannerselvam, Saranya R, Prasanna Kumar Kumaresan, Sajeetha Thavareesan, Bhuvaneswari S, Anshid K.A, Susminu S Kumar, Paul Buitelaar and Bharathi Raja Chakravarthi From Laughter to Inequality: Annotated Dataset for Misogyny Detection in Tamil and Malayalam Memes
1987 Prasanna Kumar Kumaresan, Rahul Ponnusamy, Dhruv Sharma, Paul Buitelaar and Bharathi Raja Chakravarthi Dataset for Identification of Homophobia and Transphobia for Telugu, Kannada, and Gujarati
1988 Abhishek Agrawal, Mitja Nikolaus, Benoit Favre and Abdellah Fourtassi Automatic Coding of Contingency in Child-Caregiver Conversations
1990 Janire Arana, Mikel Idoyaga, Maitane Urruela, Elisa Espina, Aitziber Atutxa Salazar and Koldo Gojenola A Virtual Patient Dialogue System Based on Question-Answering on Clinical Records
1993 Olia Toporkov and Rodrigo Agerri Evaluating Shortest Edit Script Methods for Contextual Lemmatization
1994 Daria Romanovna Ledneva and Denis Pavlovich Kuznetsov Reimagining Intent Prediction: Insights from Graph-Based Dialogue Modeling and Sentence Encoders
1997 Ziqian Zeng, Runyu Wu, Yuxiang Xiao, Xiaoda Zhong, Hanlin Wang, Zhengdong Lu and Huiping Zhuang Zero-shot Event Detection using a Textual Entailment Model as an Enhanced Annotator
1998 Wenjie Zhou, Qiang Wang, Mingzhou Xu, MING CHEN and Xiangyu Duan Revisiting the Self-Consistency Challenges in Multi-Choice Question Formats for Large Language Model Evaluation
1999 Bo Xu, Longjiao Li, Wei Luo, Mehdi Naseriparsa, Zhehuan Zhao, Hongfei Lin and Feng Xia Beyond Linguistic Cues: Fine-grained Conversational Emotion Recognition via Belief-Desire Modelling
2000 Adrian Cosma, Ioan-Bogdan Iordache and Paolo Rosso RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian
2001 Paulina Garcia Corral, Hanna Bechara, Ran Zhang and Slava Jankin PolitiCause: An Annotation Scheme and Corpus for Causality in Political Texts
2002 Yang Gao, Ji Ma, Ivan Korotkov, Keith Hall, Dana Alon and Donald Metzler OpenMSD: Towards Multilingual Scientific Documents Similarity Measurement
2004 Olivier Ferret Language Models and Semantic Relations: a Dual Relationship
2005 Léane Isabelle Jourdan, Florian Boudin, Nicolas Hernandez and Richard Dufour CASIMIR: A Corpus of Scientific Articles enhanced with Multiple Author-Integrated Revisions
2006 Zhaobo Zhang, Rui Gan, Pingpeng Yuan and Hai Jin Correcting Pronoun Homophones with Subtle Semantics in Chinese Speech Recognition
2008 Moreno La Quatra, Alkis Koudounas, Elena Baralis and Sabato Marco Siniscalchi Speech Analysis of Language Varieties in Italy
2009 Claudia Collacciani, Andrea Amelio Ravelli and marianna bolognesi Specifying Genericity through Inclusiveness and Abstractness Continuous Scales
2010 Jianwei Wang, Tianyin Wang and Ziqian Zeng On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction
2011 Junlin Li, Bo Peng and Yu-Yin Hsu Emstremo: Adapting Emotional Support Response with Enhanced Emotion-Strategy Integrated Selection
2014 Mariana O. Silva and Mirella M. Moro PPORTAL_ner: An Annotated Corpus of Portuguese Literary Entities
2015 Iqra Ali, Hidetaka Kamigaito and Taro Watanabe Monolingual Paraphrase Detection Corpus for Low Resource Pashto Language at Sentence Level
2016 Scott Friedman, Joan Zheng and Hillel Steinmetz Debiasing Multi-Entity Aspect-Based Sentiment Analysis with Norm-Based Data Augmentation
2017 Nazmuddoha Ansary, Quazi Adibur Rahman Adib, Tahsin Reasat, Asif Shahriyar Sushmit, Ahmed Imtiaz Humayun, Sazia Mehnaz, Kanij Fatema, Mohammad Mamun Or Rashid and Farig Sadeque Unicode Normalization and Grapheme Parsing of Indic Languages
2023 Martin Lebourdais, Marie Tahon, Antoine LAURENT and Sylvain Meignier Automatic Speech Interruption Detection: Analysis, Corpus, and System
2026 Sergio E. Zanotto, Qi Yu, Miriam Butt and Diego Frassinelli GRIT: A Dataset of Group Reference Recognition in Italian
2027 Ilaria Fiorentini, Marco Forlano and Nicholas Nese Towards the WhAP Corpus: A resource for the study of Italian on WhatsApp
2029 Sebastian Vincent, Rowanne Sumner, Alice Dowek, Charlotte Prescott, Emily Preston, Chris Bayliss, Chris Oakley and Carolina Scarton Reference-less Analysis of Context Specificity in Translation with Personalised Language Models
2031 Artem Abzaliev, Humberto Perez-Espinosa and Rada Mihalcea Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark Classification
2034 Linda Wiechetek, Flammie A. Pirinen, Børre Gaup, Trond Trosterud, Maja Lisa Kappfjell and Sjur Moshagen The Ethical Question -- Use of Indigenous Corpora for Large Language Models
2035 Xulong Du, Xingnan Zhang, Dandan Wang, Yingying Xu, Zhiyuan Wu, Shiqing Zhang*, Xiaoming Zhao*, Jun Yu and Liangliang Lou Integrating Representation Subspace Mapping with Unimodal Auxiliary Loss for Attention-based Multimodal Emotion Recognition
2037 Nadège Alavoine, Gaëlle Laperrière, Christophe Servan, Sahar Ghannay and Sophie Rosset New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark
2039 Chihiro Taguchi, Jefferson Saransig, Dayana Velásquez and David Chiang Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information
2040 Jan Gorisch and Thomas Schmidt Evaluating Workflows for Creating Orthographic Transcripts for Oral Corpora by Transcribing from Scratch or Correcting ASR-Output
2042 Mohammad Mohammadamini, Driss Matrouf, Michael Rouvier, Jean-Francois Bonastre, Romain Serizel and Theophile Gonos RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot
2043 Feiteng Fang, Liang Zhu, Xi Feng, Jinchang Hou, Qixuan Zhao, Chengming Li, Xiping Hu, Ruifeng Xu and Min Yang CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment
2044 Ana Cimitan, Ana Alves Pinto and Michaela Geierhos Curation of Benchmark Templates for Measuring Gender Bias in Named Entity Recognition Models
2045 Tuan Nguyen, Corinne Fredouille, Alain Ghio, Mathieu Balaguer and Virginie Woisard Exploring Pathological Speech Quality Assessment with ASR-Powered Wav2Vec2 in Data-Scarce Context
2046 Hao Wang, Tang Li, Chenhui Chu, Rui Wang and Pinpin Zhu Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents
2050 Seunghee Han, Sunhee Kim and Minhwa Chung Constructing Korean Learners' L2 Speech Corpus of Seven Languages for Automatic Pronunciation Assessment
2052 Rui Gao, Miaomiao Cheng, Xu Han and Wei Song High-Order Semantic Alignment for Unsupervised Fine-Grained Image-Text Retrieval
2055 Jiaxin Duan, Fengyu Lu and Junfei Liu Alleviating Exposure Bias in Abstractive Summarization via Sequentially Generating and Revising
2056 Ruize Yuan, Xiang Ao, Li Zeng and Qing He DRAMA: Dynamic Multi-Granularity Graph Estimate Retrieval over Tabular and Textual Question Answering
2057 Leiyu Pan, Yongqi Leng and Deyi Xiong Can Large Language Models Learn Translation Robustness from Noisy-Source In-context Demonstrations?
2061 Nobuhiro Ueda, Hideko Habe, Akishige Yuguchi, Seiya Kawano, Yasutomo Kawanishi, Sadao Kurohashi and Koichiro Yoshino J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution
2063 Xiang Li, Zhenyu Li, Chen Shi, Yong Xu, Qing Du, Mingkui Tan and jun huang AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework
2065 Guanhua Chen, Yutong Yao, Derek F. Wong and Lidia S. Chao A Two-Stage Prediction-Aware Contrastive Learning Framework for Multi-Intent NLU
2071 Anisia Popescu, Lori Lamel and Ioana Vasilescu Using Speech Technology to test Theories of Phonetic and Phonological Typology
2074 Byungha Kang and Youhyun Shin Improving Low-Resource Keyphrase Generation through Unsupervised Title Phrase Generation
2076 Jiaxin Duan, Fengyu Lu and Junfei Liu Prophecy Distillation for Boosting Abstractive Summarization
2077 Angus Addlesee, Oliver Lemon and Arash Eshghi Clarifying Completions: Evaluating How LLMs Respond to Incomplete Questions
2078 Lingxing Kong, Yougang Chu, Zheng Ma, Jianbing Zhang, Liang He and Jiajun Chen MixRED: A Mix-lingual Relation Extraction Dataset
2079 Xiao Zhang, Heqi Zheng, Yuxiang Nie, Heyan Huang and Xian-Ling Mao SciMRC: Multi-perspective Scientific Machine Reading Comprehension
2084 Zhaolin Li, Monika Rind-Pawlowski and Jan Niehues Speech Recognition Corpus of the Khinalug Language for Documenting Endangered Languages
2087 Eliot Maës, Hossam Boudraa, Philippe Blache and Leonor Becerra-Bonache Did You Get It? A Zero-Shot Approach to Locate Information Transfers in Conversations
2092 Lorenzo Lupo, Paul Bose, Mahyar Habibi, Dirk Hovy and Carlo Schwarz DADIT: A Dataset for Demographic Classification of Italian Twitter Users and a Comparison of Prediction Methods
2093 Massimo Poesio, Maciej Ogrodniczuk, Vincent Ng, Sameer Pradhan, Juntao Yu, Nafise Sadat Moosavi, Silviu Paun, Amir Zeldes, Anna Nedoluzhko, Michal Novák, Martin Popel, Zdeněk Žabokrtský and Daniel Zeman Universal Anaphora: The First Three Years
2095 Daniel G. Swanson, Bryce D. Bussert and Francis Tyers Producing a Parallel Universal Dependencies Treebank of Ancient Hebrew and Ancient Greek via Cross-Lingual Projection
2097 Marcello Ferro, Claudia Marzi, Andrea Nadalini, Loukia Taxitari, Alessandro Lento and Vito Pirrelli ReadLet: a Dataset for Oral, Visual and Tactile Text Reading Data of Early and Mature Readers
2100 Asahi Yoshida, Yoshihide Kato and Shigeki Matsubara Negation Scope Conversion: Towards a Unified Negation-Annotated Dataset
2102 Xiaoyan Zhao, Lingzhi Wang, Zhanghao Wang, Hong Cheng, Rui Zhang and Kam-Fai Wong PACAR: Automated Fact-Checking with Planning and Customized Action Reasoning using Large Language Models
2104 Aswathy Velutharambath, Roman Klinger and Amelie Wührl Can Factual Statements be Deceptive? The DeFaBel Corpus of Belief-based Deception
2105 Wissam Antoun, Benoît Sagot and Djamé Seddah From Text to Source: Results in Detecting Large Language Model-Generated Content
2107 Mikel Zubillaga, Oscar Sainz, Ainara Estarrona, Oier Lopez de Lacalle and Eneko Agirre Event Extraction in Basque: Typologically motivated Cross-Lingual Transfer-Learning Analysis
2109 Leran Zhang and Nora Hollenstein Eye-Tracking Features Masking Transformer Attention in Question-Answering Tasks
2111 Huacheng Song and Hongzhi Xu Benchmarking the Performance of Machine Translation Evaluation Metrics with Chinese Multiword Expressions
2112 Dan Li, Vikrant Yadav, Zi Long Zhu, Maziar Moradi Fard, Zubair Afzal and George Tsatsaronis Scalable Patent Classification with Aggregated Multi-View Ranking
2114 Christophe Servan, Sahar Ghannay and Sophie Rosset mALBERT: Is a Compact Multilingual BERT Model Still Worth It?
2117 Elisa Di Nuovo, Manuela Sanguinetti, Pier Felice Balestrucci, Luca Anselma, Cristian Bernareggi and Alessandro Mazzei Educational Dialogue Systems for Visually Impaired Students: Introducing a Task-Oriented User-Agent Corpus
2118 Stephen Joseph Meisenbacher, Nihildev Nandakumar, Alexandra Klymenko and Florian Matthes A Comparative Analysis of Word-Level Metric Differential Privacy: Benchmarking The Privacy-Utility Trade-off
2120 Verena Blaschke, Barbara Kovačić, Siyao Peng, Hinrich Schütze and Barbara Plank MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
2121 Huadai Liu, XU WENQIANG, xuan lin, Jingjing Huo, Hong nullpointer Chen and Zhou Zhao AntCritic: Argument Mining for Free-Form and Visually-Rich Financial Comments
2124 Yuning Ding, Omid Kashefi, Swapna Somasundaran and Andrea Horbach When Argumentation Meets Cohesion: Enhancing Automatic Feedback in Student Writing
2125 Honglin Mu, Yang Xu, Yunlong Feng, Xiaofeng Han, Yitong Li, Yutai Hou and Wanxiang Che Beyond Static Evaluation: A Dynamic Approach to Assessing AI Assistants' API Invocation Capabilities
2130 Yue Li and Carolina Scarton Can We Identify Stance Without Target Arguments? A Study for Rumour Stance Classification
2131 Fabrizio Nunnari, Eleftherios Avramidis, Cristina España-Bonet, Marco González, Anna Hennes and Patrick Gebhard DGS-Fabeln-1: A Multi-Angle Parallel Corpus of Fairy Tales between German Sign Language and German Text
2134 Fuqiang Niu, Min Yang, Ang Li, Baoquan Zhang, Xiaojiang Peng and Bowen Zhang A Challenge Dataset and Effective Models for Conversational Stance Detection
2135 Lorenzo Proietti, Stefano Perrella, Simone Tedeschi, Giulia Vulpis, Leonardo Lavalle, Andrea Sanchietti, Andrea Ferrari and Roberto Navigli Analyzing Homonymy Disambiguation Capabilities of Pretrained Language Models
2136 Camille Challant and Michael Filhol Extending AZee with Non-manual Gesture Rules for French Sign Language
2141 Nathan Godey, Éric de la Clergerie and Benoît Sagot On the Scaling Laws of Geographical Representation in Language Models
2143 Sondes Abderrazek, Corinne Fredouille, Alain Ghio, muriel lalain, Christine Meunier, Mathieu Balaguer and Virginie Woisard Interpretable Assessment of Speech Intelligibility using Deep Learning: A Case Study on Speech Disorders due to Head and Neck Cancers
2144 Shuoran Jiang, Qingcai Chen, Yang Xiang, Youcheng Pan and Yukang Lin Linguistic Rule Induction Improves Adversarial and OOD Robustness in Large Language Models
2145 Eleanor Chodroff, Blaž Pažon, Annie Baker and Steven Moran Phonetic Segmentation of the UCLA Phonetics Lab Archive
2147 Sebastian Schuster, Ayesha Ansar, Om Agarwal and Vera Demberg SpreadNaLa: A Naturalistic Code Generation Evaluation Dataset of Spreadsheet Formulas
2148 Ines Reinig, Ines Rehbein and Simone Paolo Ponzetto How to do politics with words: Investigating speech acts in parliamentary debates
2151 Chunyan Zheng, Keke Sun, Wenhao Zhao, Haibo Zhou, Lixing Jiang, Shaoyang Song and Chunlai Zhou Locally Differentially Private In-Context Learning
2152 Yuchen Fan, Yantao Liu, Zijun Yao, Jifan Yu, Lei Hou and Juanzi Li Evaluating Generative Language Models in Information Extraction as Subjective Question Correction
2153 Weihao Zhao, Weidong He, Hao Wang, Haoyang Bi, Han Wu, Chen Zhu, Tong Xu and Enhong Chen MRT: Multi-modal Short- and Long-range Temporal Convolutional Network for Time-sync Comment Video Behavior Prediction
2158 Sandy Ritchie, Daan van Esch, Uche Okonkwo, Shikhar Vashishth and Emily Drummond LinguaMeta: Unified Metadata for Thousands of Languages
2161 Hichem Ammar Khodja, Frederic Bechet, Quentin Brabant, Alexis Nasr and Gwénolé Lecorvé WikiFactDiff: A Large, Realistic, and Temporally Adaptable Dataset for Atomic Factual Knowledge Update in Causal Language Models
2162 Francesca Zermiani, Prajit Dhar, Ekta Sood, Fabian Kögel, Andreas Bulling and Maria Wirzberger InteRead: An Eye Tracking Dataset of Interrupted Reading
2163 Jian Zhang, Changlin Yang, Haiping Zhu, Qika Lin, Fangzhi Xu and Jun Liu A Semantic Mention Graph Augmented Model for Document-Level Event Argument Extraction
2166 Mustafa Jarrar and Tymaa Hasanain Hammouda Qabas: An Open-Source Arabic Lexicographic Database
2167 Maxim K. Surkov and Ivan P. Yamshchikov Vygotsky Distance: Measure for Benchmark Task Similarity
2168 Xin Liu, Hongwei Sun, Shaojie Dai, Bo Lv, Youcheng Pan, Hui Wang and Yue Yu A Lifelong Multilingual Multi-granularity Semantic Alignment Approach via Maximum Co-occurrence Probability
2170 Natalia Loukachevitch, Andrey Sakhovskiy and Elena Tutubalina Biomedical Concept Normalization over Nested Entities with Partial UMLS Terminology in Russian
2171 Chenhao Wang, Pengfei Cao, Jiachun Li, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Li Qiuxia and Jun Zhao Leros: Learning Explicit Reasoning on Synthesized Data for Commonsense Question Answering
2172 Naziya Mahamdul Shaikh, Jyoti D. Pawar and Mubarak Banu Sayed Konidioms Corpus: A Dataset of Idioms in Konkani Language
2173 Pin-Jie Lin, Merel Scholman, Muhammed Saeed and Vera Demberg Modeling Orthographic Variation Improves NLP Performance for Nigerian Pidgin
2174 Hay Man Htun, Ye Kyaw Thu, Hutchatai Chanlekha, Kotaro Funakoshi and Thepchai Supnithi myMediCon: End-to-End Burmese Automatic Speech Recognition for Medical Conversations
2175 Nadège Alavoine, Maximin Coavoux, Emmanuelle Esperanca-Rodier, Romane Gallienne, carlos gonzalez gallardo, Jérôme Goulian, Jose G. Moreno, Aurélie Névéol, Didier Schwab, Vincent Segonne and johanna simoens Limitations of Human Identification of Automatically Generated Text
2176 Eleni Metheniti, Philippe Muller, Chloé Braud and Margarita Hernández Casas Zero-shot learning for multilingual discourse relation classification
2178 Daan van Esch, Sandy Ritchie, Sebastian Ruder, Julia Kreutzer, Clara Rivera, Ishank Saxena and Isaac Caswell Connecting Language Technologies with Rich, Diverse Data Sources Covering Thousands of Languages
2179 Phil Sidney Ostheimer, Mayank Kumar Nagda, Marius Kloft and Sophie Fellenz Text Style Transfer Evaluation Using Large Language Models
2180 Zifan Jiang, Anne Göhring, Amit Moryossef, Rico Sennrich and Sarah Ebling SwissSLi: the Multi-parallel Sign Language Corpus for Switzerland
2186 Agnieszka Karlinska, Cezary Rosiński, Marek Kubis, Patryk Hubar and Jan Wieczorek Using Bibliodata LODification to Create Metadata-Enriched Literary Corpora in Line with FAIR Principles
2187 Abelardo Carlos Martinez Lorenzo and Roberto Navigli Efficient AMR parsing with CLAP: Compact Linearization with an Adaptable Parser
2189 Fahad Khan, Maxim Ionov, Christian Chiarcos, Laurent Romary, Gilles Sérasset and Besim Kabashi On Modelling Corpus Citations in Computational Lexical Resources
2191 Andrea Gulli, Francesco Costantini, Diego Sidraschi and Emanuela Li Destri Fine-Tuning a Pre-Trained Wav2Vec2 Model for Automatic Speech Recognition- Experiments with de zahrar sproche
2198 Shuo Yang A Trusted Multi-View Evidential Fusion Framework for Commonsense Reasoning
2199 Ona de Gibert, Graeme Nail, Nikolay Arefyev, Marta Bañón, Jelmer van der Linde, Shaoxiong Ji, Jaume Zaragoza-Bernabeu, Mikko Aulamo, Gema Ramírez-Sánchez, Andrey Kutuzov, Sampo Pyysalo, Stephan Oepen and Jörg Tiedemann A New Massive Multilingual Dataset for High-Performance Language Technologies
2200 Matej Klemen, Aleš Žagar, Jaka Čibej and Marko Robnik-Šikonja SI-NLI: A Slovene Natural Language Inference Dataset and its Evaluation
2201 Alice Millour, Lorenza Brasile, Alberto Ghia and Laurent Kevers Agettivu, Aggitivu o Aghjettivu? POS Tagging Corsican Dialects
2203 Josef Ruppenhofer, Matthias Schwendemann, Annette Portmann, Katrin Wisniewski and Torsten Zesch Every Verb in its Right Place? A Roadmap for Operationalizing Developmental Stages in the Acquisition of L2 German
2204 Atilla Kaan Alkan, Felix Grezes, Cyril Grouin, Fabian Schussler and Pierre Zweigenbaum Enriching a Time-Domain Astrophysics Corpus with Named Entity, Coreference and Astrophysical Relationship Annotations
2206 Recep Firat Cekinel, Çağrı Çöltekin and Pinar Karagoz Cross-Lingual Learning vs. Low-Resource Fine-Tuning: A Case Study with Fact-Checking in Turkish
2207 Chloe SEKKAT, Fanny Leroy, salima mdhaffar, Blake Perry Smith, Yannick Estève, Joseph Dureau and Alice Coucke Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants
2208 Qianlong Wang, Hongling Xu, Keyang Ding, Bin Liang and Ruifeng Xu In-Context Example Retrieval from Multi-Perspectives for Few-Shot Aspect-Based Sentiment Analysis
2213 Nhu Vo, Dat Quoc Nguyen, Dung D. Le, Massimo Piccardi and Wray Buntine Improving Vietnamese-English Medical Machine Translation
2214 Wajdi Zaghouani, Abdelhamid Ahmed, Xiao Zhang and Lameya Rezk QCAW 1.0: Building a Qatari Corpus of Student Argumentative Writing
2218 Gennaro Nolano, Moritz Blum, Basil Ell and Philipp Cimiano Pointing out the Shortcomings of Relation Extraction Models with Semantically Motivated Adversarials
2219 Rian Touchent and Éric de la Clergerie CamemBERT-bio: Leveraging Continual Pre-training for Cost-Effective Models on French Biomedical Data
2220 Darinka Verdonik, Kaja Dobrovoljc, Tomaž Erjavec and Nikola Ljubešić Gos 2: A New Reference Corpus of Spoken Slovenian
2224 Jin-seo Kim, Anna Seo Gyeong Choi and Sunghye Cho KoFREN: Comprehensive Korean Word Frequency Norms Derived from Large Scale Free Speech Corpora
2225 Hamdy Mubarak, Hend Al-Khalifa and Khaloud Suliman Alkhalefah Halwasa: Quantify and Analyze Hallucinations in Large Language Models: Arabic as a Case Study
2226 Amanda Cercas Curry, Zeerak Talat and Dirk Hovy Impoverished Language Technology: The Lack of (Social) Class in NLP
2227 Wonkee Lee, Seong-Hwan Heo and Jong-Hyeok Lee Advancing Semi-Supervised Learning for Automatic Post-Editing: Data-Synthesis by Mask-Infilling with Erroneous Terms
2229 Filip Dobranić, Bojan Evkoski and Nikola Ljubešić A Lightweight Approach to a Giga-Corpus of Historical Periodicals: The Story of a Slovenian Historical Newspaper Collection
2231 D. Fortuné KPONOU, Fréjus A. A. Laleye and Eugène Cokou Ezin FFSTC: Fongbe to French Speech Translation Corpus
2232 Yuqing Zhang, Tessa Verhoef, Gertjan van Noord and Arianna Bisazza Endowing Neural Language Learners with Human-like Biases: A Case Study on Dependency Length Minimization
2235 Pietro Giovanni Bizzaro, Elena Della Valentina, Maurizio Napolitano, Nadia Mana and Massimo Zancanaro Annotation and Classification of Relevant Clauses in Terms-and-Conditions Contracts
2237 Jaione Bengoetxea, Yi-Ling Chung, Marco Guerini and Rodrigo Agerri Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation
2239 Antonio F. G. Sevilla, José María Lahoz-Bengoechea and ALBERTO DIAZ Automated Extraction of Prosodic Structure from Unannotated Sign Language Video
2240 Samee Arif, Sualeha Farid, Awais Athar and Agha Ali Raza UQA: Corpus for Urdu Question Answering
2241 Peteris Paikens, Lauma Pretkalniņa and Laura Rituma A Computational Model of Latvian Morphology
2242 Ulla Petti and Anna Korhonen LoSST-AD: A Longitudinal Corpus for Tracking Alzheimer's Disease Related Changes in Spontaneous Speech