Submission ID | Authors | Title |
1 | Min Zeng, Jiexin Kuang, Mengyang Qiu, Jayoung Song and Jungyeul Park | Evaluating Prompting Strategies for Grammatical Error Correction Based on Language Proficiency |
2 | Tollef Emil Jørgensen and Andre Kåsen | Aligning the Norwegian UD Treebank with Entity and Coreference Information |
4 | Amir Hazem, Kazuyuki Motohashi and Chen Zhu | From Technology to Market. Bilingual Corpus on the Evaluation of Technology Opportunity Discovery |
16 | Matiss Rikters and Toshiaki Nakazawa | Revisiting Context Choices for Context-aware Machine Translation |
18 | Yubing Ren, Yanan Cao, Hao Li, Yingjie Li, Zixuan ZM Ma, Fang Fang, Ping Guo and Wei Ma | DEIE: Benchmarking Document-level Event Information Extraction with a Large-scale Chinese News Dataset |
19 | Hongfei Xu, Yang Song, Qiuhui Liu, Josef van Genabith and Deyi Xiong | Rewiring the Transformer with Depth-Wise LSTMs |
20 | Yige Chen, Jae Ihn, KyungTae Lim and Jungyeul Park | Towards Standardized Annotation and Parsing for Korean FrameNet |
21 | Dongqi Pu, Yifan Wang, Jia E. Loy and Vera Demberg | SciNews: From Scholarly Complexities to Public Narratives -- A Dataset for Scientific News Report Generation |
22 | Minghua Nuo and Chaofan Guo | Hybrid of Spans and Table-Filling for Aspect-Level Sentiment Triplet Extraction |
27 | Ojas Nimase and Sanghyun Hong | When Do "More Contexts" Help with Sarcasm Recognition? |
28 | YuHong Sun, Zhangyue Yin, Qipeng Guo, Jiawen Wu, Xipeng Qiu and Hui Zhao | Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem |
29 | Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Xiaonan Li, Tianxiang Sun, Cheng Chang, Qinyuan Cheng, Ding Wang, Xiaofeng Mou, Xipeng Qiu and Xuanjing Huang | Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models |
30 | Hyeonho Song, Jisu Hong, Chani Jung, Hyojin Chin, Mingi Shin, Yubin Choi, Junghoi Choi and Meeyoung Cha | Detecting Offensive Language in an Open Chatbot Platform |
35 | Rikito Takahashi, Hirokazu Kiyomaru, Chenhui Chu and Sadao Kurohashi | Abstractive Multi-Video Captioning: Benchmark Dataset Construction and Extensive Evaluation |
37 | Bichen Wang, Yuzhe Zi, Yanyan Zhao, Pengfei Deng and Bing Qin | ESDM: Early Sensing Depression Model in Social Media Streams |
38 | Marc Feger and Stefan Dietze | TACO – Twitter Arguments from COnversations |
39 | Jonathan Dunn and Lane Edwards-Brown | Geographically-Informed Language Identification |
40 | Jonathan Dunn | Validating and Exploring Large Geographic Corpora |
41 | Jonathan Dunn, Benjamin Adams and Harish Tayyar Madabushi | Pre-Trained Language Models Represent Some Geographic Populations Better Than Others |
42 | Jun Xu, Mengshu Sun, Zhiqiang Zhang and Jun Zhou | ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models |
44 | Ramona Christen, Anastassia Shaitarova, Matthias Stürmer and Joel Niklaus | Resolving Legalese: A Multilingual Exploration of Negation Scope Resolution in Legal Documents |
45 | Yida Mu, Xingyi Song, Kalina Bontcheva and Nikolaos Aletras | Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets |
46 | Haven Kim, Jongmin Jung, Dasaem Jeong and Juhan Nam | K-pop Lyric Translation: Dataset, Analysis, and Neural-Modelling |
59 | Guijin Son, Hanwool Lee, Suwan Kim, Huiseo Kim, Jae cheol Lee, Je Won Yeom, Jihyu Jung, Jung woo Kim and Songseong Kim | HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models |
64 | Stella Markantonatou, Vivian Stamou, Christina Christodoulou, Georgia Apostolopoulou, Antonis Balas and George Ioannakis | The Corpus AIKIA: using ranking annotation for Offensive Language Detection in Modern Greek |
69 | Wenxin Guo, Lei Zhang, Kun Zhang, Yi Liu and Zhendong Mao | Visual-Linguistic Dependency Encoding for Image-Text Retrieval |
72 | Gérard Bailly, Romain Legrand, Martin Lenglet, Frédéric Elisei, Maëva Hueber and Olivier Perrotin | Emotags: Computer-Assisted Verbal Labelling of Expressive Audiovisual Utterances for Expressive Multimodal TTS |
73 | Vladimir Araujo, Maria Mihaela Trusca, Rodrigo Tufiño and Marie-Francine Moens | Sequence-to-Sequence Spanish Pre-trained Language Models |
79 | Nigel Ward and Divette Marco | A Collection of Pragmatic-Similarity Judgments over Spoken Dialog Utterances |
82 | Guangmin Zheng, Jin Wang, Xiaobing Zhou and Xuejie Zhang | Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling |
85 | Joosung Lee and Jinhong Kim | Enhanced Facet Generation with LLM Editing |
87 | Rustem Yeshpanov and Huseyin Atakan Varol | KazSAnDRA: Kazakh Sentiment Analysis Dataset of Reviews and Attitudes |
91 | Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun and Qi Zhang | Calibrating LLM-Based Evaluator |
93 | Junbing Yan, Chengyu Wang, Taolin Zhang, Xiaofeng He, Jun Huang, Wei Zhang, Longtao Huang and Hui Xue | TRELM: Towards Robust and Efficient Pre-training for Knowledge-Enhanced Language Models |
94 | Wenjun Kong and Yamei Xia | CARE: Co-Attention Network for Joint Entity and Relation Extraction |
101 | Danqing Luo, Chen Zhang, Yan Zhang and Haizhou Li | CrossTune: Black-Box Few-Shot Classification with Label Enhancement |
104 | ZhongXiang Sun, Kepu Zhang, Weijie Yu, Haoyu Wang and Jun Xu | Logic Rules as Explanations for Legal Case Retrieval |
108 | David Gimeno-Gómez and Carlos-D. Martínez-Hinarejos | Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition |
109 | Stephanie M. Lukin, Claire Bonial, Matthew Marge, Taylor A. Hudson, Cory J. Hayes, Kimberly Pollard, Anthony Baker, Ashley N. Foots, Ron Artstein, Felix Gervits, Mitchell Abrams, Cassidy Henry, Lucia Donatelli, Anton Leuski, Susan G. Hill, David Traum and Clare Voss | SCOUT: A Situated and Multi-Modal Human-Robot Dialogue Corpus |
110 | Unggi Lee, Sungjun Yoon, Joon Seo Yun, Kyoungsoo Park, YoungHoon Jung, Damji Stratton and Hyeoncheol Kim | Difficulty-Focused Contrastive Learning for Knowledge Tracing with a Large Language Model-Based Difficulty Prediction |
118 | Li-Ming Zhan, Bo Liu and Xiao-Ming Wu | VI-OOD: A Unified Framework of Representation Learning for Textual Out-of-distribution Detection |
119 | Nina Markl, Lauren Hall-Lew and Catherine Lai | Language Technologies as if People Mattered: Centering Communities in Language Technology Development |
130 | Yida Mu, Mali Jin, Kalina Bontcheva and Xingyi Song | Examining Temporalities on Stance Detection Towards COVID-19 Vaccination |
131 | Huixuan Zhang and Xiaojun Wan | Image Matters: A New Dataset and Empirical Study for Multimodal Hyperbole Detection |
135 | Marta Lango, Borys Naglik, Mateusz Lango and Iwo Naglik | Polish-ASTE: Aspect-Sentiment Triplet Extraction Datasets for Polish |
136 | Gabriel de Jesus and Sérgio Sobral Nunes | Data Collection Pipeline for Low-Resource Languages: A Case Study on Constructing a Tetun Text Corpus |
138 | Anton F. Thielmann, Christoph Weisser and Benjamin Säfken | Human in the loop: How to effectively create coherent topics by manually labeling only a few documents per class |
139 | Andy Luecking, Giuseppe Abrami, Leon Hammerla, Marc Rahn, Daniel Baumartz, Steffen Eger and Alexander Mehler | Dependencies over Times and Tools (DoTT) |
142 | Fuhan Cai, Duo Liu, Zhongqiang Zhang, Ge Liu, Xiaozhe Yang and Xiangzhong Fang | NER-guided Comprehensive Hierarchy-aware Prompt Tuning for Hierarchical Text Classification |
143 | José-M. Acosta-Triana, David Gimeno-Gómez and Carlos-D. Martínez-Hinarejos | AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies |
145 | Siyu Duan, Jun Wang and Qi Su | Restoring Ancient Ideograph: A Multimodal Multitask Neural Network Approach |
148 | Yingying Zhang, Xian Wu, Yu Zhang and Yefeng Zheng | Knowledge-aware Attention Network for Medication Effectiveness Prediction |
154 | Yiping Jin, Leo Wanner and Alexander Shvets | GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection? |
157 | Paulo Cavalin and Claudio Santos Pinhanez | Theoretical and Empirical Advantages of Dense-Vector to One-Hot Encoding of Intent Classes in Open-World Scenarios |
162 | Yuhong He, Yongqi Zhang, Shizhu He and Jun Wan | BP4ER: Bootstrap Prompting for Explicit Reasoning in Medical Dialogue Generation |
170 | Yugo Murawaki | Principal Component Analysis as a Sanity Check for Bayesian Phylolinguistic Reconstruction |
173 | Guozheng Li, Wenjun Ke, Peng Wang, Zijie Xu, Ke Ji, Jiajun Liu, Ziyu Shang and Qiqing Luo | Unlocking Instructive In-Context Learning with Tabular Prompting for Relational Triple Extraction |
174 | Zhihong Zhu, Xuxin Cheng, Hao An, Zhichang Wang, Dongsheng Chen and Zhiqi Huang | Zero-Shot Spoken Language Understanding via Large Language Models: A Preliminary Study |
176 | Jennifer A. Bishop, Sophia Ananiadou and Qianqian Xie | LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation |
177 | Sixing Yu, Juan Pablo Munoz and Ali Jannesari | Federated Foundation Models: Privacy-Preserving and Collaborative Learning for Large Models |
180 | Kai Xu, Zhengyu Wang, Yuxuan Long and Qiaona Zhao | Deep Reinforcement Learning-based Dialogue Policy with Graph Convolutional Q-network |
183 | Lianyu Hu, Liqing Gao, Zekang Liu and Wei Feng | Dynamic Spatial-Temporal Aggregation for Skeleton-Aware Sign Language Recognition |
185 | Atsushi Kojima | Sub-Table Rescorer for Table Question Answering |
190 | Shizhou Huang, Bo Xu, Changqun Li, Jiabo Ye and Xin Lin | MNER-MI: A Multi-image Dataset for Multimodal Named Entity Recognition in Social Media |
191 | Peixin Huang, Xiang Zhao, Minghao Hu, Zhen Tan and Weidong Xiao | Distill, Fuse, Pre-train: Towards Effective Event Causality Identification with Commonsense-Aware Pre-trained Model |
194 | Yoshinari Nagai, Teruaki Oka and Mamoru Komachi | A Document-Level Text Simplification Dataset for Japanese |
195 | Markus Bayer, Markus Neiczer, Maximilian Samsinger, Björn Buchhold and Christian Reuter | XAI-Attack: Utilizing Explainable AI to Find Incorrectly Learned Patterns for Black-Box Adversarial Example Creation |
197 | Hailay Teklehaimanot, Wolfgang Nejdl and Niloy Ganguly | TIGQA: An Expert-Annotated Question-Answering Dataset in Tigrinya |
200 | Paul Grundmann, Jens-Michalis Papaioannou, Tom Oberhauser, Thomas Steffek, Amy Siu, Wolfgang Nejdl and Alexander Loeser | Data Drift in Clinical Outcome Prediction from Admission Notes |
206 | Weiyao Luo, Junfeng Ran, Zailong Tian, Sujian Li and Zhifang Sui | FaGANet: An Evidence-Based Fact-Checking Model with Integrated Encoder Leveraging Contextual Information |
212 | Mert Inan and Malihe Alikhani | Seeing Eye-to-Eye: Cross-Modal Coherence Relations Inform Eye-gaze Patterns During Comprehension & Production |
214 | Connor Heaton and Prasenjit Mitra | Deriving Entity-Specific Embeddings From Multi-Entity Sequences |
216 | Avril Gazeau and Francois Lareau | Flexible Lexicalization in Rule-based Text Realization |
219 | Luke Gessler | PrOnto: Language Model Evaluations for 859 Languages |
223 | Peiyu Liu, Ze-Feng Gao, Xiao Zhang, Wayne Xin Zhao and Ji-Rong Wen | Enhancing Parameter-efficient Fine-tuning with Simple Calibration based on Stable Rank |
227 | Junyi He and Xia Li | Zero-shot Cross-lingual Automated Essay Scoring |
229 | Yi-Cheng Wang, Hsin-Wei Wang, Bi-Cheng Yan, Chi-Han Lin and Berlin Chen | DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition |
231 | Yucheng Cai, Wentao Ma, Yuchuan Wu, Shuzheng Si, Yuan Shao, Zhijian Ou and Yongbin Li | UniPCM: Universal Pre-trained Conversation Model with Task-aware Automatic Prompt |
237 | Iben Nyholm Debess, Annika Simonsen and Hafsteinn Einarsson | Good or Bad News? Exploring GPT-4 for Sentiment Analysis for Faroese on a Public News Corpora |
239 | Christopher Weiss, Frauke Kreuter and Ivan Habernal | To Share or Not to Share: What Risks Would Laypeople Accept to Give Sensitive Data to Differentially-Private NLP Systems? |
243 | Viktor Hangya and Alexander Fraser | How to Solve Few-Shot Abusive Content Detection Using the Data We Actually Have |
244 | Truong Dinh Do, Phuong Minh Nguyen and Minh Nguyen | ZeLa: Advancing Zero-Shot Multilingual Semantic Parsing with Large Language Models and Chain-of-Thought Strategies |
246 | Yuzhuang Xu, Shuo Wang, Peng Li, Xuebo Liu, Xiaolong Wang, Weidong Liu and Yang Liu | Pluggable Neural Machine Translation Models via Memory-augmented Adapters |
248 | Abdullatif Koksal, Silvia Severini and Hinrich Schütze | SilverAlign: MT-Based Silver Data Algorithm For Evaluating Word Alignment |
252 | Bo Lv, Xin Liu, Kaiwen Wei, Ping Luo and Yue Yu | TAeKD: Teacher Assistant Enhanced Knowledge Distillation for Closed-Source Multilingual Neural Machine Translation |
255 | Hongcheng Liu, Pingjie Wang, Zhiyuan Zhu, Yanfeng Wang and Yu Wang | CE-VDG: Counterfactual Entropy-based Bias Reduction for Video-grounded Dialogue Generation |
258 | Sohom Ghosh, Arnab Maji, Aswartha Narayana and Sudip Kumar Naskar | IndicFinNLP: Financial Natural Language Processing for Indian Languages |
259 | Yuang Li, Yinglu Li, Min Zhang, Chang Su, Jiawei Yu, Mengyao Piao, Xiaosong Qiao, Miaomiao Ma, Yanqing Zhao and Hao Yang | CB-Whisper: Contextual Biasing Whisper using Open-Vocabulary Keyword-Spotting |
260 | Jimin An, YunSeok Choi and Jee-Hyong Lee | Code Defect Detection using Pre-trained Language Models with Encoder-Decoder via Line-Level Defect Localization |
262 | Haoyu Xiong, Xinchun Zhang, Leixin Yang, Yu Xiang and Gang Fang | STAF: Pushing the Boundaries of Test-Time Adaptation Towards Practical Noise Scenarios |
264 | Ryo Nagata, Yoshifumi Kawasaki, Naoki Otani and Hiroya Takamura | A Computational Approach to Quantifying Grammaticization of English Deverbal Prepositions |
265 | Zhixiong Cao, Hai-Tao Zheng, Yangning Li, Jin Xu, Rongsheng Li and Hong-Gee Kim | Depth Aware Hierarchical Replay Continual Learning for Knowledge Based Question Answering |
268 | Shunyu Liu, Jie Zhou, Qunxi Zhu, Qin Chen, Qingchun Bai, Jun Xiao and Liang He | Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models |
269 | Xinbei Ma, Yeyun Gong, Pengcheng He, Hai Zhao and Nan Duan | PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive Summarization |
274 | Amir Hossein Kargaran, François Yvon and Hinrich Schütze | GlotScript: A Resource and Tool for Low Resource Writing System Identification |
278 | Masoud Monajatipoor, Zi-Yi Dou, Aichi Chien, Nanyun Peng and Kai-Wei Chang | Medical Vision-Language Pre-Training for Brain Abnormalities |
279 | Ang Li, Yiquan Wu, Yifei Liu, Kun Kuang, Fei Wu and Ming Cai | Enhancing Court View Generation with Knowledge Injection and Guidance |
284 | Xiaotong Feng, Meng-Fen Chiang, Wang-Chien Lee and Zixin Kuang | Evidence-guided Inference for Neutralized Zero-shot Transfer |
286 | Eunkyul Leah Jo, Angela Yoonseo Park, Grace Tianjiao Zhang, Izia Xiaoxiao Wang, Junrui Wang, MingJia Mao and Jungyeul Park | An Untold Story of Preprocessing Task Evaluation: An Alignment-based Joint Evaluation Approach |
287 | Binling Nie, Yiming Shao and Yigang Wang | Know-Adapter: Towards Knowledge-Aware Parameter-Efficient Transfer Learning for Few-shot Named Entity Recognition |
290 | Daizong Liu, Xiaoye Qu, Xiang Fang, Jianfeng Dong, Pan Zhou, Guoshun Nan, Keke Tang, Wanlong Fang and Yu Cheng | Towards Robust Temporal Activity Localization Learning with Noisy Labels |
291 | Feihong Lu, Xiaocui Yang, Qian Li, Qingyun Sun, Ke Jiang, Cheng Ji and Jianxin Li | Few-Shot Multimodal Named Entity Recognition based on Multimodal Causal Intervention Graph |
292 | Yijun Liu, Feifei Dai, Xiaoyan Gu, Minghui Zhai, Bo Li and Meiou Zhang | Domain-aware and Co-adaptive Feature Transformation for Domain Adaption Few-shot Relation Extraction |
294 | Junyu Luo, Xiaochen Wang, Jiaqi Wang, Aofei Chang, Yaqing Wang and Fenglong Ma | CoRelation: Boosting Automatic ICD Coding Through Contextualized Code Relation Learning |
305 | Yufeng Wang, Chao Chen, Zhou Yang, Shuhui Wang and Xiangwen Liao | CTSM: Combining Trait and State Emotions for Empathetic Response Model |
310 | Yo Sato | Disambiguating homographs and homophones simultaneously: a regrouping method for Japanese |
312 | Ping Guo, Yue Hu, Yubing Ren, Yunpeng Li, Jiarui Zhang, Xingsheng Zhang and Heyan Huang | Teaching Large Language Models to Translate on Low-resource Languages with Textbook Prompting |
313 | Heegon Jin, Seonil Son, Jemin Park, Youngseok Kim, Hyungjong Noh and Yeonsoo Lee | Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation |
316 | Wei Zhou, Heike Adel, Hendrik Schuff and Ngoc Thang Vu | Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings |
319 | Abhidip Bhattacharyya, Martha Palmer and Christoffer Heckman | ReCAP: Semantic Role Enhanced Caption Generation |
323 | Subhradeep Kayal, Alexander Rakhlin, Ali Dashti and Serguei Stepaniants | How Far is Too Far? Studying the Effects of Domain Discrepancy on Masked Language Models |
327 | Yongxiu Xu, Hao Xu, Heyan Huang, Shiyao Cui, Minghao Tang, Longzheng Wang and Hongbo Xu | An Effective Span-based Multimodal Named Entity Recognition with Consistent Cross-Modal Alignment |
329 | Itsugun Cho, Ryota Takahashi, Yusaku Yanase and Hiroaki Saito | Deep Reinforcement Learning with Hierarchical Action Exploration for Dialogue Generation |
335 | Xiao Cui, Yulei Qin, Yuting Gao, Enwei Zhang, Zihan Xu, Tong Wu, Ke Li, Xing Sun, Wengang Zhou and Houqiang Li | Sinkhorn Distance Minimization for Knowledge Distillation |
337 | Zhen Wang, Peide Zhu and Jie Yang | ControversialQA: Exploring Controversy in Question Answering |
338 | Terufumi Morishita, Atsuki Yamaguchi, Gaku Morio, Hikaru Tomonari, Osamu Imaichi and Yasuhiro Sogawa | JFLD: A Japanese Benchmark for Deductive Reasoning based on Formal Logic |
341 | Hakyung Sung and Gyu-Ho Shin | Constructing a Dependency Treebank for Second Language Learners of Korean |
347 | Tianqi Hu, Lishuang Li, Xueyang Qin and Yubo Feng | Event Representation Learning with Multi-Grained Contrastive Learning and Triple-Mixture of Experts |
348 | Mingmin Wu, Guixin Su, Yongcheng Zhang, Zhongqiang Huang and Ying Sha | Refining Idioms Semantics Comprehension via Contrastive Learning and Cross-Attention |
353 | Ning Bian, Xianpei Han, Le Sun, Hongyu Lin, Yaojie Lu, Ben He, Shanshan Jiang and Bin Dong | ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models |
354 | Karen Fort, Laura Alonso Alemany, Luciana Benotti, Julien Bezançon, Claudia Borg, Marthese Borg, Yongjian Chen, Fanny Ducel, Yoann Dupont, Guido Ivetta, Zhijian Li, Margot Mieskes, Marco Naguib, Yuyan Qian, Matteo Radaelli, Wolfgang S. Schmeisser-Nieto, Emma Raimundo Schulz, Thiziri Saci, Sarah Saidi, Javier Torroba Marchante, Shilin Xie, Sergio E. Zanotto and Aurélie Névéol | Your Stereotypical Mileage may Vary: Practical Challenges of Evaluating Biases in Multiple Languages and Cultural Contexts |
359 | Siyu Ren and Kenny Q. Zhu | Low-Rank Prune-And-Factorize for Language Model Compression |
360 | Adam Przepiórkowski, Magdalena Borysiak and Adam Głowacki | An Argument for Symmetric Coordination from Dependency Length Minimization: A Replication Study |
361 | Shulin Huang, Shirong Ma, Yinghui Li, Mengzuo Huang, Wuhe Zou, Weidong Zhang and Haitao Zheng | LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles |
362 | Sayed Muddashir Hossain, Jan Alexandersson and Philipp Müller | M3TCM: Multi-modal Multi-task Context Model for Utterance Classification in Motivational Interviews |
363 | Anne Marte Haug Olstad, Anna Smolander, Sofia Strömbergsson, Sari Ylinen, Minna Lehtonen, Mikko Kurimo, Yaroslav Getman, Tamás Grósz, Xinwei Cao, Torbjørn Svendsen and Giampiero Salvi | Collecting Linguistic Resources for Assessing Children's Pronunciation of Nordic Languages |
368 | Guanlin Li, Xuechen Zhao, Amir Jafari, Wenhao Shao, Reza Farahbakhsh and Noel Crespi | Improving Cross-lingual Transfer with Contrastive Negative Learning and Self-training |
369 | Shichen Li, Zhongqing Wang, Yanzhi Xu and Guodong Zhou | Structure-aware Generation Model for Cross-Domain Aspect-based Sentiment Classification |
373 | Valentina Dragos, Delphine Battistelli, Fatou Sow and Aline Etienne | Exploring the Emotional Dimension of French Online Toxic Content |
374 | Pierre Nugues | Linking Named Entities in Diderot's Encyclopédie to Wikidata |
378 | Chuyao Ding, Yu Hong and Jianmin Yao | SGCM: Salience-Guided Context Modeling for Question Generation |
379 | Jinming Zhao, Katsuhito Sudoh, Satoshi Nakamura, Yuka Ko, Kosuke Doi and Ryo Fukuda | NAIST-SIC-Aligned: an Aligned English-Japanese Simultaneous Interpretation Corpus |
380 | Vilém Zouhar, Kalvin Chang, Chenxuan Cui, Nate B. Carlson, Nathaniel Romney Robinson, Mrinmaya Sachan and David R. Mortensen | PWESuite: Phonetic Word Embeddings and Tasks They Facilitate |
383 | Philip Blair and Kfir Bar | JRC-Names-Retrieval: A Standardized Benchmark for Name Search |
385 | Marco Gaido, Sara Papi, Matteo Negri and Luisa Bentivogli | How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena |
387 | Houcemeddine Turki, Abraham Toluwase Owodunni, Mohamed Ali Hadj Taieb, René Fabrice Bile and Mohamed Ben Aouicha | A Decade of Scholarly Research on Open Knowledge Graphs |
390 | Kai Zhang, Pengcheng Li, Kaisong Song, Xurui Li, Yangyang Kang, Xuhong Zhang and Xiaozhong Liu | Knowledge Triplets Derivation from Scientific Publications via Dual-Graph Resonance |
391 | Ali Al-Laith, Alexander Conroy, Jens Bjerring-Hansen and Daniel Hershcovich | Development and Evaluation of Pre-trained Language Models for Historical Danish and Norwegian Literary Texts |
394 | Hoang Nguyen, Chenwei Zhang, Ye Liu, Natalie Parde, Eugene Rohrbaugh and Philip S. Yu | CORI: CJKV Benchmark with Romanization Integration - A step towards Cross-lingual Transfer Beyond Textual Scripts |
395 | Pranav Arora, Selen Pehlivan and Jorma Laaksonen | Text-to-Multimodal Retrieval with Bimodal Input Fusion in Shared Cross-Modal Transformer |
400 | Kaixuan Wu, Yanghao Lin, Donglin Cao and Dazhen Lin | Interpretable Short Video Rumor Detection based on Modality Tampering |
403 | Zhihong Zhu, Xuxin Cheng, Guimin Hu, Yaowei Li, Zhiqi Huang and Yuexian Zou | Towards Multi-modal Sarcasm Detection via Disentangled Multi-grained Multi-modal Distilling |
404 | Zhigang Chen, Benjia Zhou, Jun Li, Jun Wan, Zhen Lei, Ning Jiang, Quan Lu and Guoqing Zhao | Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation |
406 | Vincent P. Martin and Jean-Luc Rouas | Why Voice Biomarkers of Psychiatric Disorders are not used in Clinical Practice? Deconstructing the Myth of the Need for Objective Diagnosis |
408 | Sijie Li, Sha Li, Hao Zhang, Shuyang Li, Kai Chen, Jianyong Yuan, Yi Cao and Lvqing Yang | EpiGEN: An Efficient Multi-Api Code GENeration Framework under Enterprise Scenario |
409 | Cam-Van Thi Nguyen, Cao-Bach Nguyen, Duc-Trong Le and Quang-Thuy Ha | Curriculum Learning Meets Directed Acyclic Graph for Multimodal Emotion Recognition |
410 | Jingyao Tang, Lishuang Li, Hongbin Lu, Xueyang Qin, Beibei Zhang and Haiming Wu | Prototype-based Prompt-Instance Interaction with Causal Intervention for Few-shot Event Detection |
413 | Bin Cao, Kai Jiang, Fayu Pan, Chenlei Bao and Jing Fan | Improving Grammatical Error Correction by Correction Acceptability Discrimination |
416 | Yifei Yuan, Chen Shi, Wang Runze, Liyi Chen, Renjun Hu, Zengming Zhang, Feijun Jiang and Wai Lam | CO3: Low-resource Contrastive Co-training for Generative Conversational Query Rewrite |
417 | Shouhui Wang and Biao Qin | No Need for Large-Scale Search: Exploring Large Language Models in Complex Knowledge Base Question Answering |
419 | Yongliang Lin, Zhen Zhang, Mengting Hu, Yufei Sun and Yuzhi Zhang | Modalities Should be Appropriately Leveraged: Uncertainty Guidance for Multimodal Chinese Spelling Correction |
421 | Qiuyu Liang, Weihua Wang, Feilong Bao and Guanglai Gao | L^2GC: Lorentzian Linear Graph Convolutional Networks For Node Classification |
424 | Yan Ge, Victor Junqiu Wei, Yuanfeng Song, Jason Chen Zhang and Raymond Chi-Wing Wong | Automatic Data Visualization Generation from Chinese Natural Language Questions |
426 | Haiyang Wang, Zhiliang Tian, Xin Song, Yue Zhang, Yuchen Pan, Hongkui Tu, Minlie Huang and Bin Zhou | Intent-Aware and Hate-Mitigating Counterspeech Generation via Dual-Discriminator Guided LLMs |
428 | Chuanpeng Yang, Fuqing Zhu, Yaxin Liu, Jizhong Han and Songlin Hu | Uncertainty-Aware Cross-Modal Alignment for Hate Speech Detection |
429 | Seungyoon Lee, Chanjun Park, DaHyun Jung, Hyeonseok Moon, Jaehyung Seo, Sugyeong Eo and Heuiseok Lim | Leveraging Pre-existing Resources for Data-Efficient Counter-Narrative Generation in Korean |
431 | Yan Xiao, Yaochu Jin and Kuangrong Hao | Federated Document-Level Biomedical Relation Extraction with Localized Context Contrast |
432 | Gregor Donabauer and Udo Kruschwitz | Challenges in Pre-Training Graph Neural Networks for Context-Based Fake News Detection: An Evaluation of Current Strategies and Resource Limitations |
433 | Maria Berger, Sebastian Michael Reimann and Nieke Marie Kiwitt | Applying Transfer Learning to German Metaphor Prediction |
434 | Chaojun Xiao, Yutao Sun, Yuan Yao, Xu Han, Wenbin Zhang, Zhiyuan Liu and Maosong Sun | Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training |
435 | Dominik Andreas Kowieski, Michael Hellwig and Thomas Feilhauer | TAPASGO: Transfer Learning towards a German-Language Tabular Question Answering Model |
436 | Shuvam Shiwakoti, Surendrabikram Thapa, Kritesh Rauniyar, Akshyat Shah, Aashish Bhandari and Usman Naseem | Analyzing the Dynamics of Climate Change Discourse on Twitter: A New Annotated Corpus and Multi-Aspect Classification |
439 | Hongchuan Zeng, Hongshen Xu, Lu Chen and Kai Yu | Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind |
440 | Eujene Nikka V. Boquio and Prospero C. Naval, Jr. | Beyond Canonical Fine-tuning: Leveraging Hybrid Multi-Layer Pooled Representations of BERT for Automated Essay Scoring |
441 | Huitong Pan, Qi Zhang, Cornelia Caragea, Eduard Dragut and Longin Jan Latecki | SciDMT: A Large-Scale Corpus for Detecting Scientific Mentions |
442 | Sun Wei, Mingxiao Li, Jingyuan Sun, Jesse Davis and Marie-Francine Moens | DMON: A Simple yet Effective Approach for Argument Structure Learning |
443 | Sondre Wold, Petter Mæhlum and Oddbjørn Hove | Estimating Lexical Complexity from Document-Level Distributions |
450 | Jinan Zou, Maihao Guo, Yu Tian, Yuhao Lin, Haiyao Cao, Lingqiao Liu, Ehsan Abbasnejad and Javen Qinfeng Shi | Semantic Role Labeling Guided Out-of-distribution Detection |
451 | Shiwen Ni, Minghuan Tan, Yuelin Bai, Fuqiang Niu, Min Yang, Bowen Zhang, Ruifeng Xu, Xiaojun Chen, Chengming Li and Xiping Hu | MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property |
452 | Qiushi Sun, Chengcheng Han, Nuo Chen, Renyu Zhu, Jingyang Gong, Xiang Li and Ming Gao | Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives |
455 | Tom Roth, Inigo Jauregi Unanue, Alsharif Abuadbba and Massimo Piccardi | XVD: Cross-Vocabulary Differentiable Training for Generative Adversarial Attacks |
456 | Tomoya Mizumoto, Takato Yamazaki, Katsumasa Yoshikawa, Masaya Ohagi, Toshiki Kawamoto and Toshinori Sato | Dialogue Systems Can Generate Appropriate Responses without the Use of Question Marks? -- A Study of the Effects of "?" for Spoken Dialogue Systems -- |
457 | Hao Lang, Yinhe Zheng, Binyuan Hui, Fei Huang and Yongbin Li | Out-of-Domain Intent Detection Considering Multi-Turn Dialogue Contexts |
458 | Zihan Wang, Peiyi Wang and Houfeng Wang | Utilizing Local Hierarchy with Adversarial Training for Hierarchical Text Classification |
462 | Yichi Zhang, Zhuo Chen, Lei Liang, Huajun Chen and Wen Zhang | Unleashing the Power of Imbalanced Modality Information for Multi-modal Knowledge Graph Completion |
463 | Shun Inadumi, Seiya Kawano, Akishige Yuguchi, Yasutomo Kawanishi and Koichiro Yoshino | A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions |
466 | Sho Hoshino, Akihiko Kato, Soichiro Murakami and Peinan Zhang | Cross-lingual Transfer or Machine Translation? On Data Augmentation for Monolingual Semantic Textual Similarity |
469 | Wooyoung Kim, TaeYong Kim, Byeongjin Kim, Myeong Jin MJ Lee, Gitaek Lee, Kirok Kim, Jisoo Cha and Wooju Kim | Korean Disaster Safety Information Sign Language Translation Benchmark Dataset |
470 | Shasha Guo, Jing Zhang, Xirui Ke, Cuiping Li and Hong Chen | Diversifying Question Generation over Knowledge Base via External Natural Questions |
472 | Eileen Wemmer, Sofie Labat and Roman Klinger | EmoProgress: Cumulated Emotion Progression Analysis in Dreams and Customer Service Dialogues |
477 | Zhiyu Fang, Jingyan Qin, Xiaobin Zhu, Chun Yang and Xu-Cheng Yin | Arbitrary Time Information Modeling via Polynomial Approximation for Temporal Knowledge Graph Embedding |
478 | Jan Odijk | A Canonical Form for Flexible Multiword Expressions |
483 | Julia Krebs, Evguenia A. Malaia, Isabella Fessl, Hans-Peter Wiesinger, Dietmar Roehm, Ronnie Wilbur and Hermann Schwameder | Motion Capture Analysis of Verb and Adjective Types in Austrian Sign Language (ÖGS) |
489 | Duyoung Jeon, Junho Lee and Cheongtag Kim | User Guide for KOTE: Korean Online That-gul Emotions Dataset |
492 | Mohamad MZ Elzohbi and Richard Zhao | ContrastWSD: Enhancing Metaphor Detection with Word Sense Disambiguation Following the Metaphor Identification Procedure |
495 | Slawomir Dadas, Michał Perełkiewicz and Rafał Poświata | PIRB: A Comprehensive Benchmark of Polish Dense and Hybrid Text Retrieval Methods |
498 | Jose Diego Suarez and Luis Chiruzzo | Null Subjects in Spanish as a Machine Translation Problem |
503 | Yifan Ding, Qingkai Zeng and Tim Weninger | ChatEL: Entity Linking with Chatbots |
507 | Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Li Qiuxia and Jun Zhao | Tug-of-War Between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models |
510 | Savitha Sam Abraham, Marjan Alirezaie and Luc De Raedt | CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments |
512 | Yizhi Jiang, Jinlong Li and Huanhuan Chen | Relation Classification via Bidirectional Prompt Learning with Data Augmentation by Large Language Model |
513 | You Zhang, Jin Wang, Liang-Chih Yu, Dan Xu and Xuejie Zhang | Improving Personalized Sentiment Representation with Knowledge-enhanced and Parameter-efficient Layer Normalization |
514 | Jiri Martinek, Pavel Kral, Ladislav Lenc and Josef Baloun | COMICORDA: Dialogue Act Recognition in Comic Books |
516 | Xumeng Liu, Wenya Guo, Ying Zhang, Xubo Liu, Yu Zhao, Shenglong Yu and Xiaojie Yuan | Look before You Leap: Dual Logical Verification for Knowledge-based Visual Question Generation |
519 | Kyungho Kim, Seongmin Park and Jihwa Lee | RT-VQ2A2: Real Time Vector Quantized Question Answering with ASR |
523 | Donovan Ong, Shuo Sun, Jian Su and Bin Chen | Mitigating Linguistic Artifacts in Emotion Recognition for Conversations from TV Scripts to Daily Conversations |
527 | Shuhei Tateishi, Makoto Nakatsuji and Yasuhito Osugi | Word-Aware Modality Stimulation for Multimodal Fusion |
528 | Jan Odijk, Martin Kroon, Tijmen Baarda, Ben Bonfil and Sheean Spoel | MWE-Finder: A Demonstration |
529 | Dingxin Hu, Xuanyu Zhang, Xingyue Zhang, Yiyang Li, Dongsheng Chen, Marina Litvak, Natalia Vanetik, Qing Yang, Dongliang Xu, Yanquan Zhou, Lei Li, Yuze Li and Yingqi Zhu | Improving Factual Consistency in Abstractive Summarization with Sentence Structure Pruning |
533 | Ahmet Gunduz, Kamer Ali Yuksel, Kareem Darwish, Golara Javadi, Fabio Minazzi, Nicola Sobieski and Sébastien Bratières | An Automated End-to-End Open-Source Software for High-Quality Text-to-Speech Dataset Generation |
534 | Samin Mahdizadeh Sani, Malak Rassem, Chris W. Jenkins, Filip Miletić and Sabine Schulte im Walde | What Can Diachronic Contexts and Topics Tell Us About the Present-Day Compositionality of English Noun Compounds? |
536 | Andres Pineiro-Martin, Carmen Garcia-Mateo, Laura Docio-Fernandez, Maria del Carmen Lopez-Perez and Jose Gandarela-Rodriguez | FalAI: A Dataset for End-to-end Spoken Language Understanding in a Low-Resource Scenario |
537 | Muhammad Huzaifah, Weihua Zheng, Nattapol Chanpaisit and Kui Wu | Evaluating Code-Switching Translation with Large Language Models |
538 | Huawen Feng, Jingsong Yan, Junlong Liu, Junhao Zheng and Qianli Ma | Well Begun is Half Done: An Implicitly Augmented Generative Framework with Distribution Modification for Hierarchical Text Classification |
543 | Georgios Velentzas, Andrew Caines, Rita Borgo, Erin Pacquetet, Clive Hamilton, Taylor Arnold, Diane Nicholls, Paula Buttery, Thomas Gaillat, Nicolas Ballier and Helen Yannakoudakis | Logging Keystrokes in Writing by English Learners |
546 | Leonardo Zilio, Shenbin Qian, Diptesh Kanojia and Constantin Orasan | Character-level language models for abbreviation and long-form detection |
547 | Weiran Chen, Xin Li, Jiaqi Su, Guiqian Zhu, Ying Li, Yi Ji and Chunping Liu | TARN-VIST: Topic Aware Reinforcement Network for Visual Storytelling |
551 | Chenhao Wu, Ruifang He, Chang Liu and Bo Wang | Continuous Relational Diffusion driven Topic Model with Multi-grained Text for Microblog |
552 | Zecheng Wang, Chunshan Li, Zhao Yang, Qingbin Liu, Yanchao Hao, Xi Chen, Dianhui Chu and Dianbo Sui | Analyzing Chain-of-thought Prompting in Black-Box Large Language Models via Estimated V-information |
554 | Ting Zhou, Ying Shen and Yinghui Li | GCNet: Global-and-Context Collaborative Learning for Aspect-Based Sentiment Analysis |
555 | Anais Ollagnier | CyberAgressionAdo-v2: Leveraging Pragmatic-Level Information to Decipher Online Hate in French Multiparty Chats |
558 | Zhiming Li, Yanzhou Li, Tianlin Li, Mengnan Du, Bozhi Wu, Yushi Cao, Junzhe Jiang and Yang Liu | Unveiling Project-Specific Bias in Neural Code Models |
559 | Quang Anh Nguyen, Nadi Tomeh, Mustapha Lebbah, Thierry Charnois, Hanene Azzag and Santiago Cordoba Muñoz | Enhancing Few-Shot Topic Classification with Verbalizers. A Study on Automatic Verbalizer and Ensemble Methods |
561 | Ge Gao, Jongin Kim, Sejin Paik, Ekaterina Novozhilova, Yi Liu, Sarah T. Bonna, Margrit Betke and Derry Tanti Wijaya | Enhancing Emotion Prediction in News Headlines: Insights from ChatGPT and Seq2Seq Models for Free-Text Generation |
564 | Sebastian Steindl, Ulrich Schäfer and Bernd Ludwig | Counterfactual Dialog Mixing as Data Augmentation for Task-Oriented Dialog Systems |
566 | Van-Tuan Bui and Agata Savary | Cross-type French Multiword Expression Identification with Pre-trained Masked Language Models |
567 | Junzhe Liang, Haifeng Sun, Zirui Zhuang, Qi Qi, Jingyu Wang and Jianxin Liao | Distantly Supervised Contrastive Learning for Low-Resource Scripting Language Summarization |
568 | Flor Miriam Plaza-del-Arco, Alba A. Cercas Curry, Amanda Cercas Curry and Dirk Hovy | Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions |
571 | Yanis Labrak, Mickael Rouvier and Richard Dufour | A Zero-shot and Few-shot Study of Instruction-Finetuned Large Language Models Applied to Clinical and Biomedical Tasks |
572 | Longxiang Zhang, Caleb D. Hart, Susanne Burger and Thomas Schaaf | Annotate the Way You Think: An Incremental Note Generation Framework for the Summarization of Medical Conversations |
577 | Julius Monsen and Arne Jonsson | Controllable Sentence Simplification in Swedish using Control Prefixes and Mined Paraphrases |
578 | David M. Chan, Yiming Ni, David Ross, Sudheendra Vijayanarasimhan, Austin Myers and John Canny | Distribution Aware Metrics for Conditional Natural Language Generation |
579 | Yanis Labrak, Adrien Bazoge, Oumaima El Khettari, Mickael Rouvier, Pacome Constant dit Beaufils, Natalia Grabar, Béatrice Daille, Solen Quiniou, Emmanuel Morin, Pierre-Antoine Gourraud and Richard Dufour | DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical Domain |
581 | Fan Huang, Haewoon Kwak, Kunwoo Park and Jisun An | ChatGPT Rates Natural Language Explanation Quality Like Humans: But on Which Scales? |
583 | Kyungho Kim, Seongmin Park, Junseo Lee and Jihwa Lee | Non-Essential is NEcessary: Order-agnostic Multi-hop Question Generation |
584 | Zhiyuan Ma, Jintao Du, Changhua Meng and Weiqiang Wang | Enhancing Distantly Supervised Named Entity Recognition with Strong Label Guided Lottery Training |
588 | Chengfeng Dou, Ying Zhang, Yanyuan Chen, Zhi Jin, Wenpin Jiao, Haiyan Zhao and Yu Huang | Detection, Diagnosis, and Explanation: A Benchmark for Chinese Medical Hallucination Evaluation |
589 | Baohang Zhou, Ying Zhang, Kehui Song, Hongru Wang, Yu Zhao, Xuhui Sui and Xiaojie Yuan | MCIL: Multimodal Counterfactual Instance Learning for Low-resource Entity-based Multimodal Information Extraction |
590 | Zhaoqi Zhang, Pasquale Balsebre, Siqiang Luo, Zhen Hai and Jiangping Huang | StructAM: Enhancing Address Matching through Semantic Understanding of Structure-aware Information |
592 | Chen Zhang, Yang Yang, Qiuchi Li, Jingang Wang and Dawei Song | Task-agnostic Distillation of Encoder-Decoder Language Models |
597 | Siyu Wang, Jianhui Jiang, Shengran Dai and Jiangtao Qiu | A Hierarchical Sequence-to-Set Model with Coverage Mechanism for Aspect Category Sentiment Analysis |
600 | Baijun Ji, Xiangyu Duan, Zhenyu Qiu, Tong Zhang, Junhui Li, Hao Yang and Min Zhang | Submodular-based In-context Example Selection for LLMs-based Machine Translation |
602 | Keyaki Ohno, Hirotaka Kameko, Keisuke Shirai, Taichi Nishimura and Shinsuke Mori | Automatic Construction of a Large-Scale Corpus for Geoparsing Using Wikipedia Hyperlinks |
603 | Joanna Dolińska and Delphine Bernhard | POS Tagging for the Endangered Dagur Language |
604 | Yi Zhang, Fei Yang, Shuang Peng, Fangyu Wang and Aimin Pan | FlattenQuant: Breaking Through the Inference Compute-bound for Large Language Models with Per-tensor Quantization |
606 | Donghee Choi, Mogan Gim, Donghyeon Park, Mujeen Sung, Hyunjae Kim, Jaewoo Kang and Jihun Choi | CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions |
607 | Fujun Zhang, Xiangdong Su, Jiang Li, Rong Yan and Guanglai Gao | EpLSA: Synergy of Expert-prefix Mixtures and Task-Oriented Latent Space Adaptation for Diverse Generative Reasoning |
611 | Hafida Le Cloirec-Ait Yahya, Olga Seminck and Pascal Amsili | FReND: A French Resource of Negation Data |
612 | Alvin C. Grissom II, Jo Shoemaker, Benjamin Goldman, Ruikang Shi, Craig Stewart, C. Anton Rytting, Leah Findlater and Jordan Boyd-Graber | Rapidly Piloting Real-time Linguistic Assistance for Simultaneous Interpreters with Untrained Bilingual Surrogates |
613 | Mitja Nikolaus, Abhishek Agrawal, Petros Kaklamanis, Alex Warstadt and Abdellah Fourtassi | Automatic Annotation of Grammaticality in Child-Caregiver Conversations |
614 | Piotr Rybak, Piotr Przybyła and Maciej Ogrodniczuk | PolQA: Polish Question Answering Dataset |
619 | Shangkang Wang and Li Pan | Target-Adaptive Consistency Enhanced Prompt-Tuning for Multi-Domain Stance Detection |
620 | Jamil Zaghir, Mina Bjelogrlic, Jean-Philippe Goldman, Soukaïna Aananou, Christophe Gaudet-Blavignac and Christian Lovis | FRASIMED: a Clinical French Annotated Resource Produced through Crosslingual BERT-Based Annotation Projection |
621 | Piotr Rybak and Maciej Ogrodniczuk | Silver Retriever: Advancing Neural Passage Retrieval for Polish Question Answering |
622 | Jennifer Ecker | Labeling Results of Topic Models: Word Sense Disambiguation as Key Method for Automatic Topic Labeling with GermaNet |
623 | Qiushi Sun, Nuo Chen, Jianing Wang, Ming Gao and Xiang Li | TransCoder: Towards Unified Transferable Code Representation Learning Inspired by Human Skills |
624 | Piotr Rybak | Transferring BERT Capabilities from High-Resource to Low-Resource Languages Using Vocabulary Matching |
626 | Nathanael Carraz Rakotonirina and Marco Baroni | MemoryPrompt: A Light Wrapper to Improve Context Tracking in Pre-trained Language Models |
634 | Jianyu Liu, Sheng Bi and Guilin Qi | PRIMO: Progressive Induction for Multi-hop Open Rule Generation |
636 | Yusheng Huang, Ning Hu, Kunping Li, Nan Wang and Zhouhan Lin | Extracting Financial Events from Raw Texts via Matrix Chunking |
637 | Jakub Šmíd, Pavel Přibáň and Ondrej Prazak | Czech Dataset for Complex Aspect-Based Sentiment Analysis Tasks |
638 | Jutta Stock, Volha Petukhova and Dietrich Klakow | Annotating Customer-Oriented Behaviour in Call Centre Sales Dialogues |
639 | Jun Cheng Yang, Zuchao Li, Shuai Xie, Wei Yu, Shijun Li and Bo Du | Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning |
640 | Núria Gala, Brigitte Bigi and Marie Bauer | Automatically Estimating Textual and Phonemic Complexity for Cued Speech: How to See the Sounds from French Texts |
641 | Yosuke Miyanishi and Minh Le Nguyen | Causal Intersectionality and Dual Form of Gradient Descent for Multimodal Analysis: a Case Study on Hateful Memes |
643 | Jieun Han, Haneul Yoo, Junho Myung, Minsun Kim, Tak Yeon Lee, So-Yeon Ahn and Alice Oh | RECIPE4U: Student-ChatGPT Interaction Dataset in EFL Writing Education |
649 | Robert Östling, Katarina Gillholm, Murathan Kurfalı, Marie Mattson and Mats Wirén | Evaluation of Really Good Grammatical Error Correction |
650 | Jonathan Heitz, Gerold Schneider and Nicolas Langer | The Influence of Automatic Speech Recognition on Linguistic Features and Automatic Alzheimer's Disease Detection from Spontaneous Speech |
654 | Christine Pinney, Casey Kennington, Maria Soledad Pera, Katherine Landau Wright and Jerry Alan Fails | Incorporating Word-level Phonemic Decoding into Readability Assessment |
655 | Benjamin Winter, Alexei Gustavo Figueroa Rosero, Alexander Loeser, Felix Alexander Gers, Nancy Katerina Figueroa Rosero and Ralf Krestel | DDxGym: Online Transformer Policies in a Knowledge Graph Based Natural Language Environment |
656 | Dandan Huang, Lu Cao, Zhenting Li and Yue Zhang | Which Sense Dominates Multisensory Semantic Understanding? A Brain Decoding Study |
659 | Alice Millour, Yoann Dupont, Karen Fort and Liam Duignan | Unveiling Strengths and Weaknesses of NLP Systems Based on a Rich Evaluation Corpus: the Case of NER in French |
660 | Gaifan Zhang, Yi Zhou and Danushka Bollegala | Evaluating Unsupervised Dimensionality Reduction Methods for Pretrained Sentence Embeddings |
663 | Harry Walsh, Ben Saunders and Richard Bowden | Select and Reorder: A Novel Approach for Neural Sign Language Production |
664 | Jasper Degraeuwe and Patrick Goethals | LexComSpaL2: A Lexical Complexity Corpus for Spanish as a Foreign Language |
673 | Chanho Park, Mingjie Chen and Thomas Hain | Automatic Speech Recognition System-Independent Word Error Estimation |
674 | Liisi Jakobson, Jelena Kallas and Erko Jakobson | Leveraging Domain Corpora for Enhanced Terminology: The Case of Estonian-English Remote Sensing Termbase |
675 | Väinö Aleksi Yrjänäinen, Fredrik Mohammadi Norén, Robert Borges, Johan Jarlbrink, Lotta Åberg Brorsson, Anders P. Olsson, Pelle Snickars and Måns Magnusson | The Swedish Parliament Corpus 1867 – 2022 |
678 | Xiaotong Song, Huiping Lin, Jiatao Zhu and Xinyi Gong | CAGK: Collaborative Aspect Graph Enhanced Knowledge-based Recommendation |
679 | Wei-Yu Kao and An-Zi Yen | MAGIC: Multi-Argument Generation with Self-Refinement for Domain Generalization in Automatic Fact-Checking |
682 | Mohamed Elaraby, Yang Zhong, Diane Litman, Ahmed Ashraf Butt and Muhsin Menekse | ReflectSumm: A Benchmark for Course Reflection Summarization |
684 | Santosh T.Y.S.S, Mahmoud Aly and Matthias Grabmair | LexAbSumm: Aspect-based Summarization of Legal Decisions |
685 | Cameron R. Jones and Sean Trott | Multimodal Language Models Show Evidence of Embodied Simulation |
687 | Santosh T.Y.S.S, Elvin A. Quero Hernandez and Matthias Grabmair | Query-driven Relevant Paragraph Extraction from Legal Judgments |
688 | Jaap Kruijt, Peggy van Minkelen, Lucia Donatelli, Piek T.J.M. Vossen, Elly Konijn and Thomas Baier | SPOTTER: A Framework for Investigating Convention Formation in a Visually Grounded Human-Robot Reference Task |
689 | Albert Sawczyn, Jakub Binkowski, Piotr Bielak and Tomasz Kajdanowicz | Empowering Small-Scale Knowledge Graphs: A Strategy of Leveraging General-Purpose Knowledge Graphs for Enriched Embeddings |
690 | Maria Andreevna Petrova, Alexandra M. Ivoylova and Anastasia Tishchenkova | CoBaLD Annotation: the Enrichment of the Enhanced Universal Dependencies with the Semantical Pattern |
692 | Khyati Mahajan and Samira Shaikh | Persona-aware Multi-party Conversation Response Generation |
696 | Daniel Dakota and Sandra Kübler | Bits and Pieces: Investigating the Effects of Subwords in Multi-task Parsing Across Languages and Domains |
697 | Xindi Wang, Robert E. Mercer and Frank Rudzicz | Auxiliary Knowledge-Induced Learning for Automatic Multi-Label Medical Document Classification |
698 | Youmi Ma, An Wang and Naoaki Okazaki | Building a Japanese Document-Level Relation Extraction Dataset Assisted by Cross-Lingual Transfer |
705 | Zhipeng Xie and Yahe Li | Discriminative Language Model as Semantic Consistency Scorer for Prompt-based Few-Shot Text Classification |
712 | Arianne Reimerink, Melania Cabezas-García, Pilar León-Araúz and Pamela Faber | Ideological Knowledge Representation: Framing Climate Change in EcoLexicon |
715 | Santosh T.Y.S.S, Hassan Sarwat, Ahmed Mohamed Abdelaal Abdou and Matthias Grabmair | Mind Your Neighbours: Leveraging Analogous Instances for Rhetorical Role Labeling for Legal Documents |
718 | Zhihong Sun, Chen Lyu, Bolun Li, Yao Wan, Hongyu Zhang, Ge Li and Zhi Jin | Enhancing Code Generation Performance of Smaller Models by Distilling the Reasoning Ability of LLMs |
721 | Santosh T.Y.S.S, Kristina Kaiser and Matthias Grabmair | CuSINeS: Curriculum-driven Structure Induced Negative Sampling for Statutory Article Retrieval |
724 | Santosh T.Y.S.S, Rashid Haddad and Matthias Grabmair | ECtHR-PCR: A Dataset for Precedent Understanding and Prior Case Retrieval in the European Court of Human Rights |
725 | Khai Le-Duc | VietMed: A Dataset and Benchmark for Automatic Speech Recognition of Vietnamese in the Medical Domain |
726 | Atsushi Keyaki and Ribeka Keyaki | Coarse-Tuning for Ad-hoc Document Retrieval Using Pre-trained Language Models |
730 | Wen Yin, Cencen Liu, Yi Xu, Ahmad Raza Wahla, Huang Yiting and Dezhang Zheng | SynPrompt: Syntax-aware Enhanced Prompt Engineering for Aspect-based Sentiment Analysis |
732 | Haiyang Zhang, Qiuyi Chen, Yanjie Zou, Jia Wang, Yushan Pan and Mark Stevenson | Document Set Expansion with Positive-Unlabeled Learning Using Intractable Density Estimation |
733 | Quan Wang, Licheng Zhang, Zikang Guo and Zhendong Mao | IDEATE: Detecting AI-Generated Text using Internal and External Factual Structures |
736 | Santosh T.Y.S.S, Nina Baumgartner, Matthias Stürmer, Matthias Grabmair and Joel Niklaus | Towards Explainability and Fairness in Swiss Judgement Prediction: Benchmarking on a Multilingual Dataset |
740 | Xiangci Li, Linfeng Song, Lifeng Jin, Haitao Mi, Jessica Ouyang and Dong Yu | A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation |
743 | Yike Wu, Yang Xiao, Mengting Hu, Mengying Liu, Pengcheng Wang and Mingming Liu | Towards Robust Evidence-Aware Fake News Detection via Improving Semantic Perception |
751 | Wenpeng Lu, Guobiao Zhang, Xueping Peng, Hongjiao Guan and Shoujin Wang | Medical Entity Disambiguation with Medical Mention Relation and Fine-grained Entity Knowledge |
756 | Takuto Asakura and Yusuke Miyao | What Is Needed for Intra-document Disambiguation of Math Identifiers? |
758 | Shenshen Bu, Yujie Song, Taiji Li and Zhiming Dai | Dynamic Knowledge Prompt for Chest X-ray Report Generation |
760 | Md Rashad Al Hasan Rony, Sudipto Kumar Shaha, Rakib Al Hasan Joy, Sumon Kanti Dey, Amzad Hossain Rafi, Ashraf Hasan Sirajee and Jens Lehmann | BanglaQuAD: A Bengali Open-domain Question Answering Dataset |
763 | Lucie Polakova, Jiří Mírovský, Šárka Zikánová and Eva Hajicova | Developing a Rhetorical Structure Theory Treebank for Czech |
771 | Yang Bai, Anthony Colas, Christan Grant and Zhe Wang | M3: A Multi-Task Mixed-Objective Learning Framework for Open-Domain Multi-Hop Dense Sentence Retrieval |
772 | Yuchen Wei and Milton King | Sense of the Day: Short Timeframe Temporal-Aware Word Sense Disambiguation |
773 | Tyler K. Bikaun, Tim French, Michael Stewart, Wei Liu and Melinda Hodkiewicz | MaintIE: A Fine-Grained Annotation Schema and Benchmark for Information Extraction from Maintenance Short Texts |
775 | Hongru Wang, Boyang Xue, Baohang Zhou, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang and Kam-Fai Wong | UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval |
779 | Hongchun Yu, Wei Pan, Xing Fan and Hanqi Li | Multi-Granularity Fusion Text Semantic Matching Based on WoBERT |
782 | Minjun Zhu, Yixuan Weng, Shizhu He, Kang Liu, Haifeng Liu, Yang Jun Jun and Jun Zhao | Towards Graph-hop Retrieval and Reasoning in Complex Question Answering over Textual Database |
785 | Yuqi Liu, Guanyi Chen and Kees van Deemter | Computational Modelling of Plurality and Definiteness in Chinese Noun Phrases |
786 | Zirui Zhang, Yiyu Yang and Benhui Chen | Prompt Tuning for Few-shot Relation Extraction via Modeling Global and Local Graphs |
789 | Jinpeng Li, Jiaze Chen, Huadong Chen, Dongyan Zhao and Rui Yan | Multilingual Generation in Abstractive Summarization: A Comparative Study |
790 | Maximos Skandalis, Richard Moot, Christian Retoré and Simon Robillard | New Datasets for Automatic Detection of Textual Entailment and of Contradictions between Sentences in French |
792 | Zhitao He, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun and Jun Zhao | Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning |
794 | Jianhui Pang, Baosong Yang, Derek F. Wong, Dayiheng Liu, Xiangpeng Wei, Jun Xie and Lidia S. Chao | MoNMT: Modularly Leveraging Monolingual and Bilingual Knowledge for Neural Machine Translation |
800 | Taiga Someya, Yushi Sugimoto and Yohei Oseki | JCoLA: Japanese Corpus of Linguistic Acceptability |
801 | Ying Zhang, Xinying Qian, Yu Zhao, Baohang Zhou, Kehui Song and Xiaojie Yuan | Bring Invariant To Variant: A Contrastive Prompt-based Framework for Temporal Knowledge Graph Forecasting |
802 | Alba M. Mármol Romero, Adrián Moreno Muñoz, Flor Miriam Plaza-del-Arco, M. Dolores Molina González, María-Teresa Martín-Valdivia, L. Alfonso Ureña-López and Arturo Montejo-Ráez | MentalRiskES: A New Corpus for Early Detection of Mental Disorders in Spanish |
803 | Francois Meyer, Haiyue Song, Abhisek Chakrabarty, Jan Buys, Raj Dabre and Hideki Tanaka | NGLUEni: Benchmarking and Adapting Pretrained Language Models for Nguni Languages |
804 | Haoyu Gao, Ting-En Lin, Hangyu Li, Min Yang, Yuchuan Wu, Wentao Ma, Fei Huang and Yongbin Li | Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models |
805 | Feng Zhao, Wan Xianlin, Cheng Yan and Chu Kiong Loo | Correcting Language Model Bias for Text Classification in True Zero-Shot Learning |
806 | Yunxin Li, Baotian Hu, Wenhan Luo, Lin Ma, Yuxin Ding and Min Zhang | A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation |
807 | Francesco Antici, Federico Ruggeri, Andrea Galassi, Katerina Korre, Arianna Muti, Alessandra Bardi, Alice Fedotova and Alberto Barrón-Cedeño | A Corpus for Sentence-Level Subjectivity Detection on English News Articles |
810 | Yunlong Feng, Bohan Li, Libo Qin, Xiao Xu and Wanxiang Che | A Two-Stage Framework with Self-Supervised Distillation for Cross-Domain Text Classification |
812 | Patrizia Paggio, Manex Agirrezabal, Costanza Navarretta and Leo Vitasovic | Multimodal behaviour in an online environment: The GEHM Zoom corpus collection |
813 | Hans Ole Hatzel and Chris Biemann | Tell me again! A Large-Scale Dataset of Multiple Summaries for the Same Story |
815 | Zhendong Liu, Changhong Xia, Wei He and Chongjun Wang | Trustworthiness and Self-awareness in Large Language Models: An Exploration through the Think-Solve-Verify Framework |
816 | Iacopo Ghinassi, Lin Wang, Chris Newell and Matthew Purver | When Cohesion Lies in the Embedding Space: Embedding-Based Reference-Free Metrics for Topic Segmentation |
817 | Maria Francis, Julius Steuer, Dietrich Klakow and Volha Petukhova | Who Did You Blame When Your Project Failed? Designing a Corpus for Presupposition Generation in Cross-Examination Dialogues |
818 | Aleksandr Riaposov and Elena Lazarenko | Corpus Services: a Framework to Curate XML Corpus Data |
821 | Joanna Kruyt, Róbert Sabo, Katarína Polónyiová, Daniela Ostatníková and Štefan Beňuš | The Slovak Autistic and Non-Autistic Child Speech Corpus: Task-Oriented Child-Adult Interactions |
822 | Huimin Chen, Chengyu Wang, Yanhao Wang, Cen Chen and Yinggui Wang | TaiChi: Improving the Robustness of NLP Models by Seeking Common Ground While Reserving Differences |
831 | Md. Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker and Sheak Rashed Haider Noori | Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis |
833 | Hee-Soo Choi, Priyansh Trivedi, Mathieu Constant, Karen Fort and Bruno Guillaume | Beyond Model Performance: Can Link Prediction Enrich French Lexical Graphs? |
834 | Shadi Manafi and Nikhil Krishnaswamy | Cross-Lingual Transfer Robustness to Lower-Resource Languages on Adversarial Datasets |
836 | Mohammadamin Kanaani | Triple-R: Automatic Reasoning for Fact Verification Using Language Models |
846 | Xiang Wei, Yufeng Chen, Ning Cheng, Xingyu Cui, Jinan Xu and Wenjuan Han | CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction |
851 | Ruiting Li, Peiyan Wang, Libang Wang, Danqingxin Yang and Dongfeng Cai | A Corpus and Method for Chinese Named Entity Recognition in Manufacturing |
852 | Yirong Zeng, Xiao Ding, Yi Zhao, Xiangyu Li, Jie Zhang, Chao Yao, Ting Liu and Bing Qin | RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict |
853 | Yan Lei, Liang Pang, Yuanzhuo Wang, Huawei Shen and Xueqi Cheng | Qsnail: A Questionnaire Dataset for Sequential Question Generation |
860 | Yunqi Zhang, Yubo Chen, Jingzhe Zhu, Jinyu Xu, Shuai Yang, Zhaoliang Wu, Liang Huang, Yongfeng Huang and Shuai Chen | KnowVrDU: A Unified Knowledge-aware Prompt-Tuning Framework for Visually-rich Document Understanding |
861 | Ramon Ruiz-Dolz, Chr-Jr Chiu, Chung-Chi Chen, Noriko Kando and Hsin-Hsi Chen | Learning Strategies for Robust Argument Mining: An Analysis of Variations in Language and Domain |
863 | Jiawei Chen, Hongyu Lin, Xianpei Han, Yaojie Lu, Shanshan Jiang, Bin Dong and Le Sun | Few-shot Named Entity Recognition via Superposition Concept Discrimination |
864 | Robert Forkel, Daniel G. Swanson and Steven Moran | Converting legacy data to CLDF: A FAIR exit strategy for linguistic web apps |
867 | Jung-Ho Kim, Mathew John Huerta-Enochian, Changyong Ko and Du Hui Lee | SignBLEU: Automatic Evaluation of Multi-channel Sign Language Translation |
871 | Junyu Lu, Bo Xu, Xiaokun Zhang, Kaiyuan Liu, Dongyu Zhang, Liang Yang and Hongfei Lin | Take its Essence, Discard its Dross! Debiasing for Toxic Language Detection via Counterfactual Causal Effect |
872 | Lei Li, Yongfeng Zhang, Dugang Liu and Li Chen | Large Language Models for Generative Recommendation: A Survey and Visionary Discussions |
874 | Steven Coats | CoANZSE Audio: Creation of an Online Corpus for Linguistic and Phonetic Analysis of Australian and New Zealand Englishes |
875 | Seonwoo Lee, Jihyun Mun, Sunhee Kim and Minhwa Chung | Speech Corpus for Korean Children with Autism Spectrum Disorder: Towards Automatic Assessment Systems |
878 | Xin Zheng, Qiming Zhu, Hongyu Lin, Yaojie Lu, Xianpei Han and Le Sun | Executing Natural Language-Described Algorithms with Large Language Models: An Investigation |
881 | Yunfei Yin, Congrui Zou, Zheng Yuan and Xianjian Bao | MLDSP-MA: Multidimensional Attention for Multi-Round Long Dialogue Sentiment Prediction |
884 | Jorge Palomar-Giner, Jose Javier Saiz, Ferran Espuña, Mario Mina, Severino Da Dalt, Joan Llop, Malte Ostendorff, Pedro Ortiz Suarez, Georg Rehm, Aitor Gonzalez-Agirre and Marta Villegas | A CURATEd CATalog: Rethinking the Extraction of Pretraining Corpora for Mid-Resourced Languages |
891 | Maxime Arens, Lucile Callebert, Mohand Boughanem and Jose G. Moreno | Rebalancing Label Distribution while Eliminating Inherent Waiting Time in Multi Label Active Learning applied to Transformers |
893 | Zhenxiao Cheng, Jie Zhou, Wen Wu, Qin Chen and Liang He | Learning Intrinsic Dimension via Information Bottleneck for Explainable Aspect-based Sentiment Analysis |
898 | Xiaolong Wang, Yile Wang, Sijie Cheng, Peng Li and Yang Liu | DEEM: Dynamic Experienced Expert Modeling for Stance Detection |
900 | Ang Li, Qiangchao Chen, Yiquan Wu, Xiang Zhou, Kun Kuang, Fei Wu and Ming Cai | From Graph to Word Bag: Introducing Domain Knowledge to Confusing Charge Prediction |
902 | Shi Yu, Chenghao Fan, Chenyan Xiong, David Jin, Zhiyuan Liu and Zhenghao Liu | Fusion-in-T5: Unifying Variant Signals for Simple and Effective Document Ranking with Attention Fusion |
905 | Minzheng Wang, Nan Xu, Jiahao Zhao, Yin Luo and Wenji Mao | PromISe: Releasing the Capabilities of LLMs with Prompt Introspective Search |
906 | Jan Nehring, Aleksandra Gabryszak, Pascal Jürgens, Aljoscha Burchardt, Stefan Schaffer, Matthias Spielkamp and Birgit Stark | Large Language Models are Echo Chambers |
909 | Kedi Chen, Jie Zhou, Qin Chen, Shunyu Liu and Liang He | A Regularization-based Transfer Learning Method for Information Extraction via Instructed Graph Decoder |
911 | Fanheng Kong, Peidong Wang, Shi Feng, Daling Wang and Yifei Zhang | TIGER: A Unified Generative Model Framework for Multimodal Dialogue Response Generation |
912 | Xiaohua Wang, Wenlong Fei, Min Hu, Qingyu Zhang and Aoqiang Zhu | MEVTR: A Multilingual Model Enhanced With Visual Text Representations |
913 | Velizar Shulev and Khalil Sima'an | Continual Reinforcement Learning for Controlled Text Generation |
915 | Yanis Labrak, Adrien Bazoge, Béatrice Daille, Mickael Rouvier and Richard Dufour | How Important Is Tokenization in French Medical Masked Language Models? |
917 | Xudong Zhu, Zhao Kang and Bei Hui | FCDS: Fusing Constituency and Dependency Syntax into Document-Level Relation Extraction |
918 | Jiangming Liu | Model-Agnostic Cross-Lingual Training for Discourse Representation Structure Parsing |
924 | Koji Inoue, Bing'er Jiang, Erik Ekstedt, Tatsuya Kawahara and Gabriel Skantze | Multilingual Turn-taking Prediction Using Voice Activity Projection |
928 | Shafiuddin Rehan Ahmed, George Arthur Baker, Evi Judge, Michael Reagan, Kristin Wright-Bettner, Martha Palmer and James H. Martin | Linear Cross-document Event Coreference Resolution with X-AMR |
929 | Zechen Sun, Yisheng Xiao, Juntao Li, Yixin Ji, Wenliang Chen and Min Zhang | Exploring and Mitigating Shortcut Learning for Generative Large Language Models |
931 | Pedro Fernandes, Sérgio Nunes and Luís Santos | A Community-Driven Data-to-Text Platform for Football Match Summaries |
932 | Naoya Ueda, Masato Mita, Teruaki Oka and Mamoru Komachi | Token-length Bias in Minimal-pair Paradigm Datasets |
933 | Georg Rehm, Stelios Piperidis, Khalid Choukri, Andrejs Vasiļjevs, Katrin Marheinecke, Victoria Arranz, Aivars Bērziņš, Miltos Deligiannis, Dimitris Galanis, Maria Giagkou, Katerina Gkirtzou, Dimitris Gkoumas, Annika Grützner-Zahn, Athanasia Kolovou, Penny Labropoulou, Andis Lagzdiņš, Elena Leitner, Valérie Mapelli, Hélène Mazo, Simon Ostermann, Stefania Racioppa, Mickaël Rigault and Leon Voukoutis | Common European Language Data Space |
935 | Di Wu, Wasi U. Ahmad and Kai-Wei Chang | On Leveraging Encoder-only Pre-trained Language Models for Effective Keyphrase Generation |
941 | Xiaowei Zhao, Yong Zhou and Xiujuan Xu | Dual Encoder: Exploiting the Potential of Syntactic and Semantic for Aspect Sentiment Triplet Extraction |
942 | Katherine Atwell, Mert Inan, Anthony B. Sicilia and Malihe Alikhani | Combining Discourse Coherence with Large Language Models for More Inclusive, Equitable, and Robust Task-Oriented Dialogue |
943 | Tom S Juzek | The Syntactic Acceptability Dataset (Preview): A Resource for Machine Learning and Linguistic Analysis of English |
946 | Carlos Daniel Hernandez Mena, Þorsteinn Daði Gunnarsson and Jon Gudnason | Samrómur Milljón: An ASR Corpus of One Million Verified Read Prompts in Icelandic |
947 | Siyin Wang, Jie Zhou, Qin Chen, Qi Zhang, Tao Gui and Xuanjing Huang | Domain Generalization via Causal Adjustment for Cross-Domain Sentiment Analysis |
949 | Wei Li, Shutan Huang and Yanqiu Shao | An Unsupervised Framework for Adaptive Context-aware Simplified-Traditional Chinese Character Conversion |
950 | Wenjie Xu, Yidan Chen and Jianquan Ouyang | A Streamlined Span-based Factorization Method for Few Shot Named Entity Recognition |
953 | Jing Jin and Houfeng Wang | Select High-quality Synthetic QA Pairs to Augment Training Data in MRC Under the Reward Guidance of Generative Language Models |
954 | Yiding Liu, Jingjing Wang, Jiamin Luo, Tao Zeng and Guodong Zhou | ChatASU: Evoking LLM's Reflexion to Truly Understand Aspect Sentiment in Dialogues |
957 | Jianhao Yan, Jin Xu, Fandong Meng, Jie Zhou and Yue Zhang | DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding |
959 | Nguyen Quang Chieu, Quang-Minh Tran and Khac-Hoai Nam Bui | SynTOD: Augmented Response Synthesis for Robust End-to-End Task-Oriented Dialogue System |
960 | Md Nayem Uddin, Enfa Rose George, Eduardo Blanco and Steven R. Corman | Asking and Answering Questions to Extract Event-Argument Structures |
961 | Xiangyu Lei, Junhui Li, Shimin Tao and Hao Yang | Evaluation Dataset for Lexical Translation Consistency in Chinese-to-English Document-level Translation |
962 | Wenfeng Feng, Chuzhan Hao, Yuewei Zhang, Yu Han and Hao Wang | Mixture-of-LoRAs: An Efficient Multitask Tuning Method for Large Language Models |
963 | Paramita Mirza, Viju Sudhi, Soumya Ranjan Sahoo and Sinchana Ramakanth Bhat | ILLUMINER: Instruction-tuned Large Language Models as Few-shot Intent Classifier and Slot Filler |
965 | Kian Ahrabian, Alon Benhaim, Barun Patra, Jay Pujara, Saksham Singhal and Xia Song | On The Adaptation of Unlimiformer for Decoder-Only Transformers |
971 | Xinshuo Hu, Dongfang Li, Xiaoguang Li, Yuxiang Wu, Lifeng Shang and Baotian Hu | Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer |
974 | Yoshihiko Hayashi | Reassessing Semantic Knowledge Encoded in Large Language Models through the Word-in-Context Task |
977 | Erxin Yu, Jing Li and Chunpu Xu | PopALM: Popularity-Aligned Language Models for Social Media Trendy Response Prediction |
978 | Shiwen Ni, Min Yang, Ruifeng Xu, Chengming Li and Xiping Hu | Layer-wise Regularized Dropout for Neural Language Models |
979 | Kyohoon Jin, Junho Lee, Juhwan Choi, Sangmin Song and Youngbin Kim | Enhancing Effectiveness and Robustness in a Low-Resource Regime via Decision-Boundary-aware Data Augmentation |
983 | Eunsu Kim, Juyoung Suk, Philhoon Oh, Haneul Yoo, James Thorne and Alice Oh | CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean |
985 | Yuan Gao, Yiheng Zhu, Yuanbin Cao, Yinzhi Zhou, Zhen Wu, Yujie Chen, Shenglan Wu, Haoyuan Hu and Xinyu Dai | Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open Domain Multi-Hop Question Answering |
986 | Xuemei Tang, Qi Su, Jun Wang and Zekun Deng | CHisIEC: An Information Extraction Corpus for Ancient Chinese History |
988 | Dejan Stosic, Saša Marjanović, Delphine Bernhard, Myriam Bras, Laurent Kevers, Stella Retali-Medori, Marianne Vergez-Couret and Carole Werner | The ParCoLab Parallel Corpus and its Extension to Four Regional Languages of France |
989 | Silin Li, Ruoyu Song, Tianwei Lan, Zeming Liu and Yuhang Guo | TED-EL: A Corpus for Speech Entity Linking |
990 | Cherifa Ben Khelil, Jean-Yves Antoine, Anaïs Halftermeyer, Frédéric Rayar, Lisa Hoiry, Mathieu Thebaud and Mathieu Raynal | Adapting AAC for Young Users: A Preliminary Study on the Influence of Age and Language Register on Word Prediction |
992 | Mengkang Hu, Haoyu Dong, Ping Luo, Shi Han and Dongmei Zhang | KET-QA: A Dataset for Knowledge Enhanced Table Question Answering |
994 | Shengkun Ma, Jiale Han, Yi Liang and Bo Cheng | Making Pre-trained Language Models Better Continual Few-Shot Relation Extractors |
996 | Aitor Gonzalez-Agirre, Montserrat Marimon, Carlos Rodriguez-Penagos, Javier Aula-Blasco, Irene Baucells, Carme Armentano-Oller, Jorge Palomar-Giner, Baybars Kulebi and Marta Villegas | Building a Data Infrastructure for a Mid-Resource Language: The Case of Catalan |
999 | Fynn Petersen-Frey and Chris Biemann | Dataset of Quotation Attribution in German News Articles |
1000 | Bashar Alhafni, Reem Hazim, Juan David Pineros Liberato, Muhamed Al Khalil and Nizar Habash | The SAMER Arabic Text Simplification Corpus |
1005 | Zhihong Zhu, Yunyan Zhang, Xuxin Cheng, Zhiqi Huang, Derong Xu, Xian Wu and Yefeng Zheng | Alignment before Awareness: Towards Visual Question Localized-Answering in Robotic Surgery via Optimal Transport and Answer Semantics |
1006 | Xue Gu, Zhihan Zhou, Ziyao Meng, Jian Li, Tiago Gomes, Adriano Tavares and Hao Xu | EmoPrompt-ECPE: Emotion knowledge-aware Prompt-tuning for Emotion-Cause Pair Extraction |
1007 | Qiao Wang and Zheng Yuan | Assessing the Efficacy of Grammar Error Correction: A Human Evaluation Approach in the Japanese Context |
1008 | Shuai Yang, Yu Hong, Shiming He, Qingting Xu and Jianmin Yao | Word-level Commonsense Knowledge Selection for Event Detection |
1010 | Adal Abilbekov, Saida Mussakhojayeva, Rustem Yeshpanov and Huseyin Atakan Varol | KazEmoTTS: A Dataset for Kazakh Emotional Text-to-Speech Synthesis |
1011 | Xingwu Sun, Zhen Yang, Ruobing Xie, Fengzong Lian, Zhanhui Kang and Chengzhong Xu | LightVLP: A Lightweight Vision-Language Pre-training via Gated Interactive Masked AutoEncoders |
1012 | Zishuo Zhao, Ziyang Ma, Zhenzhou Lin, Jingyou Xie, Yinghui Li and Ying Shen | Source-free Domain Adaptation for Aspect-based Sentiment Analysis |
1014 | Carme Armentano-Oller, Montserrat Marimon and Marta Villegas | Becoming a High-Resource Language in Speech: The Catalan Case in the Common Voice Corpus |
1017 | Joykirat Singh, Sehban Fazili, Rohan Jain and Md. Shad Akhtar | EROS: Entity-Driven Controlled Policy Document Summarization |
1022 | Nuria Bel, Marta Punsola and Valle Ruíz-Fernández | EsCoLA: Spanish Corpus of Linguistic Acceptability |
1023 | Jiayi Wu, Renyu Zhu, Nuo Chen, Qiushi Sun, Xiang Li and Ming Gao | Structure-aware Fine-tuning for Code Pre-trained Models |
1026 | Edwin Thomas and Sowmya Vajjala | Keyphrase Generation: Lessons from a Reproducibility Study |
1028 | Sungjoo Byun, Jiseung Hong, Sumin Park, Dongjun Jang, Jean Seo, Minseok Kim, Chaeyoung Oh and Hyopil Shin | Korean Bio-Medical Corpus (KBMC) for Medical Named Entity Recognition |
1030 | Imen Laouirine, Rami Kammoun and Fethi Bougares | TunArTTS: Tunisian Arabic Text-To-Speech Corpus |
1031 | Barbara Scalvini and Iben Nyholm Debess | Evaluating the potential of language-family-specific generative models for low-resource data augmentation: a Faroese case study |
1033 | Yunlong Feng, Yang Xu, Libo Qin, Yasheng Wang and Wanxiang Che | Improving Language Model Reasoning with Self-motivated Learning |
1034 | Guisheng Liu, Yi Li, Zhengcong Fei, Haiyan Fu, Xiangyang Luo and Yanqing Guo | Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning |
1035 | Linzhi Wu, Xingyu Zhang, Yakun Zhang, Changyan Zheng, Tiejun Liu, Liang Xie, Ye Yan and Erwei Yin | Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization |
1036 | Carinne Cherf and Yuval Pinter | BiVert: Bidirectional Vocabulary Evaluation using Relations for Machine Translation |
1038 | Dimitra Anastasiou, Carole Blond-Hanten and Marie Gallais | A Luxembourgish corpus as a Gender Bias Evaluation Testset |
1041 | Brigitte Krenn, Johann Petrak, Marina Kubina and Christian Burger | GERMS-AT: A Sexism/Misogyny Dataset of Forum Comments from an Austrian Online Newspaper |
1045 | Damien Sileo, Kanimozhi Uma and Marie-Francine Moens | Generating multiple-choice questions for medical question answering with distractors and cue-masking |
1046 | Jiashuo Sun, Hang Zhang, Chen Lin, Xiangdong Su, Yeyun Gong and Jian Guo | APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning |
1051 | Mina Schütz, Daniela Pisoiu, Daria Liakhovets, Alexander Schindler and Melanie Siegel | GerDISDETECT: A German Multilabel Dataset for Disinformation Detection |
1053 | Alessandra Teresa Cignarella, Manuela Sanguinetti, Simona Frenda, Andrea Marra, Cristina Bosco and Valerio Basile | QUEEREOTYPES: A Multi-Source Italian Corpus of Stereotypes towards LGBTQIA+ Community Members |
1055 | Armand Stricker and Patrick Paroubek | Chitchat as Interference: Adding User Backstories to Task-Oriented Dialogues |
1059 | Manuel V. Loureiro, Steven Derby and Tri Kurniawan Wijaya | Topics as Entity Clusters: Entity-based Topics from Large Language Models and Graph Neural Networks |
1060 | Siyao Peng, Zihang Sun, Huangyan Shan, Marie Kolm, Verena Blaschke, Ekaterina Artemova and Barbara Plank | Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data |
1062 | Marcel Gohsen, Matthias Hagen, Martin Potthast and Benno Stein | Task-Oriented Paraphrase Analytics |
1063 | Kei Sawada, Tianyu Zhao, Makoto Shing, Kentaro Mitsui, Akio Kaga, Yukiya Hono, Toshiaki Wakatsuki and Koh Mitsuda | Release of Pre-Trained Models for the Japanese Language |
1065 | Julia Rozanova, Marco Valentino and André Freitas | Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models |
1068 | Khadige Abboud and Gokmen Oz | Towards Equitable Natural Language Understanding Systems for Dialectal Cohorts: Debiasing Training Data |
1071 | Shiva Taslimipoor, Luca Benedetto, Mariano Felice and Paula Buttery | Distractor Generation Using Generative and Discriminative Capabilities of Transformer-based Models |
1075 | Mali Jin, Daniel Preotiuc-Pietro, A. Seza Doğruöz and Nikolaos Aletras | Who is bragging more online? A large scale analysis of bragging in social media |
1076 | Chia-Wen Lu, Ching-Wen Yang and Wei-Yun Ma | Automatic Construction of a Chinese Review Dataset for Aspect Sentiment Triplet Extraction via Iterative Weak Supervision |
1078 | Jianyou Wang, Kaicheng Wang, Xiaoyue Wang, Weili Cao, Ramamohan Paturi and Leon Bergen | IR2: Information Regularization for Information Retrieval |
1079 | Zirui He, Huiqi Deng, Haiyan Zhao, Ninghao Liu and Mengnan Du | Mitigating Shortcuts in Language Models with Soft Label Encoding |
1080 | Rob van der Goot, Zoey Liu and Max Müller-Eberstein | Enough is Enough! A Case Study on the Effect of Data Size for Evaluation Using Universal Dependencies |
1082 | Abhijnan Nath, Huma Jamil, Shafiuddin Rehan Ahmed, George Arthur Baker, Rahul Ghosh, James H. Martin, Nathaniel Blanchard and Nikhil Krishnaswamy | Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles |
1085 | Amilleah Rodriguez, Shaonan Wang and Liina Pylkkänen | Do Neural Language Models Inferentially Compose Concepts the Way Humans Can? |
1090 | Biswesh Mohapatra, Seemab Hassan, Laurent Romary and Justine Cassell | Conversational Grounding: Annotation and Analysis of Grounding Acts and Grounding Units |
1091 | Faizad Ullah, Ali Faheem, Ubaid Azam, Muhammad Sohaib Ayub, Faisal Kamiran and Asim Karim | Detecting Cybercrimes in Accordance with Pakistani Law: Dataset and Evaluation using PLMs |
1092 | Do June Min, Veronica Perez-Rosas, Ken Resnicow and Rada Mihalcea | Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation |
1093 | Ivana Filipović Petrović, Miguel López Otal and Slobodan Beliga | Croatian Idioms Integration: Enhancing the LIdioms Multilingual Linked Idioms Dataset |
1097 | Christopher D. Sapp, Elliott Evans, Rex Sprouse and Daniel Dakota | Introducing a Parsed Corpus of Historical High German |
1101 | Sylvain Coulange, Marie-Hélène Fries, Monica Masperi and Solange Rossato | A corpus of spontaneous L2 English speech for real-situation speaking assessment |
1103 | ChangSu Choi, Yongbin Jeong, Seoyoon Park, InHo Won, HyeonSeok Lim, SangMin Kim, Yejee Kang, Chanhyuk Yoon, Jaewan Park, Yiseul Lee, HyeJin Lee, Younggyun Hahm, Hansaem Kim and KyungTae Lim | Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean |
1104 | Yejin Jeon, Yunsu Kim and Gary Geunbae Lee | Leveraging the Interplay Between Syntactic and Acoustic Cues for Optimizing Korean TTS Pause Formation |
1105 | Charles Lam, Chaak-ming Lau and Jackson L. Lee | Multi-Tiered Cantonese Word Segmentation |
1107 | Deepak Gupta, Kush Attal and Dina Demner-Fushman | Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches |
1109 | Viet Dac Lai, Duy Ngoc Pham, Jonathan Steinberg, Jamie Mikeska and Thien Huu Nguyen | CAMAL: A Novel Dataset for Multi-label Conversational Argument Move Analysis |
1110 | Ileana Rugina, Rumen Dangovski, Li Jing, Preslav Nakov and Marin Soljacic | Data-Informed Global Sparseness in Attention Mechanisms for Deep Neural Networks |
1111 | Shengjie Ji and Fang Kong | A Novel Three-stage Framework for Few-shot Named Entity Recognition |
1113 | Chihiro Yano, Akihiko Fukuchi, Shoko Fukasawa, Hideyuki Tachibana and Yotaro Watanabe | Multilingual Sentence-T5: Scalable Sentence Encoders for Multilingual Applications |
1117 | Ben Foley, Peter Sefton, Simon Musgrave and Moises Sacal Bonequi | Access control framework for language collections |
1119 | Pengwei Zhan, Jing Yang, He Wang, Chao Zheng and Liming Wang | Rethinking Word-level Adversarial Attack: The Trade-off Between Efficiency, Effectiveness, and Imperceptibility |
1120 | Kartik Kartik, Sanjana Soni, Anoop Kunchukuttan, Tanmoy Chakraborty and Md. Shad Akhtar | Synthetic Data Generation and Joint Learning for Robust Code-Mixed Translation |
1123 | Liqiang Niu, Fandong Meng and Jie Zhou | UMTIT: Unifying Recognition, Translation, and Generation for Multimodal Text Image Translation |
1125 | Chandrai Kayal, Sayantan Chattopadhyay, Aryan Gupta, Satyen Abrol and Archie Gugol | JLBert: Japanese Light BERT for Cross-Domain Short Text Classification |
1129 | Bo Xu, Yifei Wu, Shouang Wei, Ming Du and Hongya Wang | Adaptive Reinforcement Tuning Language Models as Hard Data Generators for Sentence Representation |
1132 | Leonardo Campillos-Llanos, Ana Rosa Terroba, Rocío Bartolomé, Ana Valverde-Mateos, Cristina González, Adrián Capllonch-Carrión and Jonathan Heras | Replace, Paraphrase or Fine-tune? Evaluating Automatic Simplification for Medical Texts in Spanish |
1133 | Bo-Han Lu, Yi-Hsuan Lin, Annie Lee and Richard Tzong-Han Tsai | Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems |
1138 | Jin Cui, Fumiyo Fukumoto, Xinfeng Wang, Yoshimi Suzuki, Jiyi Li, Noriko Tomuro and Wanzeng Kong | Enhanced Coherence-Aware Network with Hierarchical Disentanglement for Aspect-Category Sentiment Analysis |
1141 | Pavlína Synková, Jiří Mírovský, Lucie Poláková and Magdaléna Rysová | Announcing the Prague Discourse Treebank 3.0 |
1147 | Kazumasa Omura, Fei Cheng and Sadao Kurohashi | An Empirical Study of Synthetic Data Generation for Implicit Discourse Relation Recognition |
1153 | Junjia Feng, Mingqian Lin, Lin Shang and Xiaoying Gao | Autonomous Aspect-Image Instruction A2II: Q-Former Guided Multimodal Sentiment Classification |
1160 | Hwichan Kim, Shota Sasaki, Sho Hoshino and Ukyo Honda | A Single Linear Layer Yields Task-Adapted Low-Rank Matrices |
1161 | Jian-Tao Huang, Chung-Chi Chen, Hen-Hsen Huang and Hsin-Hsi Chen | NumHG: A Dataset for Number-Focused Headline Generation |
1163 | Robert Forkel, Johann-Mattis List, Christoph Rzymski and Guillaume Segerer | Linguistic Survey of India and Polyglotta Africana: Two Retrostandardized Digital Editions of Large Historical Collections of Multilingual Wordlists |
1166 | Quentin Brabant, Lina M. Rojas Barahona, Gwénolé Lecorvé and Claire Gardent | KGConv, a Conversational Corpus grounded in Wikidata |
1168 | Zhipeng Liu, Xiaoming Zhang, Litian Zhang and Zelong Yu | MDS: A Fine-Grained Dataset for Multi-Modal Dialogue Summarization |
1169 | Shaoxiong Ji, Timothee Mickus, Vincent Segonne and Jörg Tiedemann | Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning? |
1176 | Leon Ackermann and Xenia Isabel Ohmer | On the Relationship between Skill Neurons and Robustness in Prompt Tuning |
1177 | Ming Zhang, Ke Chang and Yunfang Wu | Multi-modal Semantic Understanding with Contrastive Cross-modal Feature Alignment |
1178 | Zdravko Dugonjić, Adrien Pupier, Benjamin Lecouteux and Maximin Coavoux | What has LeBenchmark learnt about French Syntax? |
1179 | Chung-Chi Chen and Hiroya Takamura | Term-Driven Forward-Looking Claim Synthesis in Earnings Calls |
1181 | Nurul Fajrin Ariyani, Zied Bouraoui, Richard Booth and Steven Schockaert | Can Language Models Learn Embeddings of Propositional Logic Assertions? |
1183 | Pingjie Wang, Hongcheng Liu, Yanfeng Wang and Yu Wang | Pruning before Fine-tuning: A Retraining-free Compression Framework for Pre-trained Language Models |
1184 | Laura Mascarell, Ribin Chalumattu and Annette Rios | German also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset |
1185 | Michal Mochtak, Peter Rupnik and Nikola Ljubešić | The ParlaSent Multilingual Training Dataset for Sentiment Identification in Parliamentary Proceedings |
1187 | Giulia D'Agostino, Chris A. Reed and Daniele Puccinelli | Segmentation of Complex Question Turns for Argument Mining: A Corpus-based Study in the Financial Domain |
1188 | Nikola Ljubešić and Taja Kuzman | CLASSLA-web: Comparable Web Corpora of South Slavic Languages Enriched with Linguistic and Genre Annotation |
1190 | María Estrella Vallecillo Rodríguez, María Victoria Cantero Romero, Isabel Cabrera De Castro, Arturo Montejo Ráez and María Teresa Martín Valdivia | CONAN-MT-SP: A Spanish Corpus for Counternarrative using GPT Models |
1195 | Zi Xiong, Lizhi Qing, Yangyang Kang, Jiawei Liu, Hongsong Li, Changlong Sun, Xiaozhong Liu and Wei Lu | Enhance Robustness of Language Models Against Variation Attack through Graph Integration |
1197 | Guangming Huang, Yunfei Long, Cunjin Luo, Jiaxing Shen and Xia Sun | Prompting Explicit and Implicit Knowledge for Multi-hop Question Answering Based on Human Reading Process |
1199 | Chuanqi Dong, Wenjie Zhou, Xiangyu Duan, Yuqi Zhang and Min Zhang | Multimodal Cross-lingual Phrase Retrieval |
1200 | Loic De Langhe, Orphee De Clercq and Veronique Hoste | Enhancing Unrestricted Cross-Document Event Coreference with Graph Reconstruction Networks |
1202 | Michele Pulini and Johann-Mattis List | First Steps Towards the Integration of Resources on Historical Glossing Traditions in the History of Chinese: A Collection of Standardized Fǎnqiè Spellings from the Guǎngyùn |
1203 | Marcio Lima Inacio, Gabriela Wick-Pedro, Renata Ramisch, Luís Espírito Santo, Xiomara S. Q. Chacon, Roney Santos, Rogério Sousa, Rafael Anchiêta and Hugo Goncalo Oliveira | Puntuguese: A Corpus of Puns in Portuguese with Micro-edits |
1205 | Guochao Jiang, Ziqin Luo, Yuchen Shi, Dixuan Wang, Jiaqing Liang and Deqing Yang | ToNER: Type-oriented Named Entity Recognition with Generative Language Model |
1207 | Giulia Rambelli and Marianna Bolognesi | The Contextual Variability of English Nouns: The Impact of Categorical Specificity beyond Conceptual Concreteness |
1208 | Jaya Caporusso, Damar Hoogland, Mojca Brglez, Boshko Koloski, Matthew Purver and Senja Pollak | A Computational Analysis of the Dehumanisation of Migrants from Syria and Ukraine in Slovene News Media |
1209 | Tim Czerniak and Elaine Uí Dhonnchadha | Towards Semantic Tagging for Irish |
1210 | Cécile Macaire, Chloé Dion, Jordan Arrigo, Claire Lemaire, Emmanuelle Esperança-Rodier, Benjamin Lecouteux and Didier Schwab | A Multimodal French Corpus of Aligned Speech, Text, and Pictogram Sequences for Speech-to-Pictogram Machine Translation |
1211 | Adnan Al Ali and Jindřich Libovický | How Gender Interacts with Political Values: A Case Study on Czech BERT Models |
1215 | Tomáš Horych, Martin Paul Wessel, Jan Philip Wahle, Terry Ruas, Jerome Waßmuth, André Greiner-Petter, Akiko Aizawa, Bela Gipp and Timo Spinde | MAGPIE: Multi-Task Analysis of Media-Bias Generalization with Pre-Trained Identification of Expressions |
1218 | Tim Fischer, Florian Schneider, Fynn Petersen-Frey, Anja Silvia Mollah Haque, Isabel Eiser, Gertraud Koch and Chris Biemann | Extending the Discourse Analysis Tool Suite with Whiteboards for Visual Qualitative Analysis |
1219 | Marie Bexte, Andrea Horbach and Torsten Zesch | EVil-Probe - A Composite Benchmark for Extensive Visio-Linguistic Probing |
1220 | Colin Swaelens, Ilse De Vos and Els Lefever | Lemmatisation of Medieval Greek: Against the Limits of Transformer's Capabilities? |
1221 | Vincent Segonne, Aidan Mannion, Laura Cristina Alonzo Canul, Alexandre Daniel Audibert, Xingyu Liu, Cécile Macaire, Adrien Pupier, Yongxin Zhou, Mathilde Aguiar, Felix E. Herron, Magali Norré, Massih R Amini, Pierrette Bouillon, Iris Eshkol-Taravella, Emmanuelle Esperança-Rodier, Thomas François, Lorraine Goeuriot, Jérôme Goulian, Mathieu Lafourcade, Benjamin Lecouteux, François Portet, Fabien Ringeval, Vincent Vandeghinste, Maximin Coavoux, Marco Dinarelli and Didier Schwab | Jargon: A Suite of Language Models and Evaluation Tasks for French Specialized Domains |
1222 | Shan Zhang, Bin Cao and Jing Fan | KCL: Few-shot Named Entity Recognition with Knowledge Graph and Contrastive Learning |
1223 | Alfarabi Imashev, Nurziya Oralbayeva, Gulmira Baizhanova and Anara Sandygulova | Comparative Analysis of Sign Language Interpreting Agents Perception: A Study of the Deaf |
1228 | Jiang Li, Xiangdong Su, Fujun Zhang and Guanglai Gao | TransERR: Translation-based Knowledge Graph Embedding via Efficient Relation Rotation |
1230 | Nils-Jonathan Schaller, Andrea Horbach, Lars Ingver Höft, Yuning Ding, Jan Luca Bahr, Jennifer Meyer and Thorben Jansen | DARIUS: A Comprehensive Learner Corpus for Argument Mining in German-Language Essays |
1233 | Mingyang Cai, Zhen Yang and Ping Jian | Improving Implicit Discourse Relation Recognition with Semantics Confrontation |
1234 | Xiaocheng Zhang, Chang Wang, Guoping Zhao and Xiaohong Su | LI4: Label-Infused Iterative Information Interacting based Fact Verification in Question-answering Dialogue |
1236 | Guowei Ge, Kuangrong Hao and Lingguang Hao | IDC: Boost Text-to-image Retrieval via Indirect and Direct Connections |
1237 | Damien Sileo | tasksource: A Large Collection of NLP tasks with a Structured Dataset Preprocessing Framework |
1242 | Richard Johansson | What Happens to a Dataset Transformed by a Projection-based Concept Removal Method? |
1243 | Ke Liang, Chu-Ren Huang and Xin-Lan Jiang | From Text to Historical Ecological Knowledge: The Construction and Application of the Shan Jing Knowledge Base |
1246 | Yang Yiyuan, Guodong Long, Michael Blumenstein, Xiubo Geng, Chongyang Tao, Tao Shen and Daxin Jiang | Pre-training Cross-Modal Retrieval by Expansive Lexicon-Patch Alignment |
1248 | Asma Farajidizaji, Vatsal Raina and Mark Gales | Is it Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models |
1250 | Lianxi Wang, Yujia Tian and Zhuowei Chen | Enhancing Hindi Feature Representation Through Fusion of Dual-Script Word Embeddings |
1251 | Alexander Prochnow, Johannes E. Bendler, Caroline Lange, Foivos Ioannis Tzavellos, Bas Marco Göritzer, Marijn ten Thij and Riza Batista-Navarro | IDEM: The IDioms with EMotions Dataset for Emotion Recognition |
1252 | John Pavlopoulos, Ryan Sandell, Maria Konstantinidou and Chiara Bozzone | HoLM: Analyzing the Linguistic Unexpectedness in Homeric Poetry |
1254 | Yongqi Li, Mayi Xu, Xin Miao, Shen Zhou and Tieyun Qian | Prompting Large Language Models for Counterfactual Generation: An Empirical Study |
1255 | Ginevra Martinelli, Paola Impicciché, Elisabetta Fersini, Francesco Mambrini and Marco Passarotti | Exploring Neural Topic Modeling on a Classical Latin Corpus |
1256 | Alexander Yom Din, Taelin Karidi, Leshem Choshen and Mor Geva | Jump to Conclusions: Short-Cutting Transformers with Linear Transformations |
1258 | Jonathan Kamp, Lisa Beinborn and Antske Fokkens | The Role of Syntactic Span Preferences in Post-Hoc Explanation Disagreement |
1260 | Zhenfei Yang, Beiming Yu, Yuan Cui, Shi Feng, Daling Wang and Yifei Zhang | BERT-BC: A Unified Alignment and Interaction Model over Hierarchical BERT for Response Selection |
1261 | Yongxin Zhou, Fabien Ringeval and François Portet | PSentScore: Evaluating Sentiment Polarity in Dialogue Summarization |
1263 | Jing Han Sun and Ali Emami | EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries |
1264 | Yuanyuan Xu, Linhai Zhang and Deyu Zhou | TECA: A Two-stage Approach with Controllable Attention Soft Prompt for Few-shot Nested Named Entity Recognition |
1265 | Kangchen Zhu, Zhiliang Tian, Jingyu Wei, Ruifeng Luo, Yiping Song and Xiaoguang Mao | StyleFlow: Disentangle Latent Representations via Normalizing Flow for Unsupervised Text Style Transfer |
1266 | Wenbo Qiao, Peng Zhang and ZengLai Ma | A Quantum-Inspired Matching Network with Linguistic Theories for Metaphor Detection |
1267 | Da Luo, Run Lin, Qiao Liu, Yuxiang Cai, Xueyi Liu, Yanglei Gan and Rui Hou | Synergetic Interaction Network with Cross-task Attention for Joint Relational Triple Extraction |
1268 | Honggang Zhao, Chunling Xiao, Jiayi Yang, Guozhu Jin and Mingyong Li | MccSTN: Multi-Scale Contrast and Fine-Grained Feature Fusion Networks for Subject-driven Style Transfer |
1270 | Larry Heck, Simon Heck and Anirudh S. Sundar | mForms: Multimodal Form Filling with Question Answering |
1272 | Giulia Pensa, Begoña Altuna and Itziar Gonzalez-Dios | A Multi-layered Approach to Physical Commonsense Understanding: Creation and Evaluation of an Italian Dataset |
1277 | Xiaojing Du, Hanjie Zhao, Danyan Xing, Yuxiang Jia and Hongying Zan | MRC-based Nested Medical NER with Co-prediction and Adaptive Pre-training |
1278 | Zepeng Ding, Wenhao Huang, Jiaqing Liang, Yanghua Xiao and Deqing Yang | Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction |
1280 | Seiji Shimizu, Lis Pereira, Shuntaro Yada and Eiji Aramaki | QA-based Event Start-Points Ordering for Clinical Temporal Relation Annotation |
1281 | Angelo Basile, Marc Franco-Salvador and Paolo Rosso | PyRater: A Python Toolkit for Annotation Analysis |
1283 | Khang Ly, Yury Kashnitsky, Savvas Chamezopoulos and Valeria Krzhizhanovskaya | Article Classification with Graph Neural Networks and Multigraphs |
1284 | Gabriele Sarti and Malvina Nissim | IT5: Text-to-text Pretraining for Italian Language Understanding and Generation |
1286 | Bohao Yang, Chen Tang, Kun Zhao, Chenghao Xiao and Chenghua Lin | Effective Distillation of Table-based Reasoning Ability from LLMs |
1295 | Jinliang Lu and Jiajun Zhang | Improving Unsupervised Neural Machine Translation via Training Data Self-Correction |
1297 | Manu Narayanan and Noëmi Aepli | A Tulu Resource for Machine Translation |
1298 | Noof Abdullah Alfear, Dimitar Kazakov and Hend Al-Khalifa | Meta-Evaluation of Sentence Simplification Metrics |
1299 | Bo Liu, Li-Ming Zhan, Zexin Lu, Yujie Feng, Lei Xue and Xiao-Ming Wu | How Good Are LLMs at Out-of-Distribution Detection? |
1300 | Dhaivat J. Bhatt, Seyed Ahmad Abdollahpouri Hosseini, Federico Fancellu and Afsaneh Fazly | End-to-end Parsing of Procedural Text into Flow Graphs |
1301 | Yangruibo Ding, Zijian Wang, Wasi U. Ahmad, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth and Bing Xiang | CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context |
1303 | Niyati Bafna, Cristina España-Bonet, Josef van Genabith, Benoît Sagot and Rachel Bawden | When your Cousin has the Right Connections: Unsupervised Bilingual Lexicon Induction for Related Data-Imbalanced Languages |
1304 | Claudiu Daniel Hromei, Daniele Margiotta, Danilo Croce and Roberto Basili | MM-IGLU: Multi-Modal Interactive Grounded Language Understanding |
1307 | Derong Xu, Ziheng Zhang, Zhenxi Lin, Xian Wu, Zhihong Zhu, Tong Xu, Xiangyu Zhao, Yefeng Zheng and Enhong Chen | Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models |
1308 | Milad Alshomary, Felix Lange, Meisam Booshehri, Meghdut Sengupta, Philipp Cimiano and Henning Wachsmuth | Modeling the Quality of Dialogical Explanations |
1309 | Elena Shushkevich, Long Thanh Mai, Manuel V. Loureiro, Steven Derby and Tri Kurniawan Wijaya | SPICED: News Similarity Detection Dataset with Multiple Topics and Complexity Levels |
1310 | Wenxuan Zhang, Min Huang, Zhuoyang Song and Qinghai Miao | DimA: A Parameter-efficient Fine-tuning Method with Knowledge transfer based on Transformer |
1312 | Hongfei Xue, Linyan Xu, Yu Tong, Rui Li, Jiali Lin and Dazhi Jiang | Breakthrough from Nuance and Inconsistency: Enhancing Multimodal Sarcasm Detection with Context-Aware Self-Attention Fusion and Word Weight Calculation |
1313 | Taiji Li, Zhi Li and Yin Zhang | Improving Faithfulness of Large Language Models in Summarization via Sliding Generation and Self-Consistency |
1314 | Marco Braga, Alessandro Raganato and Gabriella Pasi | AdaKron: an Adapter-based Parameter Efficient Model Tuning with Kronecker Product |
1316 | Yana Nikolova | Evaluating Word Expansion for Multilingual Sentiment Analysis of Parliamentary Speech |
1317 | Rémi Uro, Marie Tahon, Jane Wottawa, David Doukhan, Albert Rilliard and Antoine Laurent | Annotation of Transition-Relevance Places and Interruptions for the Description of Turn-Taking in Conversations in French Media Content |
1319 | Hanyu Zhang, Xiting Wang, Xiang Ao and Qing He | Distillation with Explanations from Large Language Models |
1321 | Adil Soubki and Owen Rambow | Intention and Face in Dialog |
1322 | Francois Meyer and Jan Buys | Triples-to-isiXhosa (T2X): Addressing the Challenges of Low-Resource Agglutinative Data-to-Text Generation |
1323 | Aditya Narayan Sankaran, Vigneshwaran Shankaran, Sampath Lonka and Rajesh Sharma | Revisiting The Classics: A Study on Identifying and Rectifying Gender Stereotypes in Rhymes and Poems |
1325 | Enes Yavuz Ugan, Ngoc-Quan Pham and Alexander Waibel | DECM: Evaluating Bilingual ASR Performance on a Code-switching/mixing Benchmark |
1328 | Gustave Cortal | Sequence-to-Sequence Language Models for Character and Emotion Detection in Dream Narratives |
1333 | Andrea Gregor de Varda and Marco Marelli | The Emergence of Semantic Units in Massively Multilingual Models |
1337 | Stephanie Brandl, Oliver Eberle, Tiago Ribeiro, Anders Søgaard and Nora Hollenstein | Evaluating Webcam-based Gaze Data as an Alternative for Human Rationale Annotations |
1340 | Francesca Grasso, Stefano Locci, Giovanni Siragusa and Luigi Di Caro | EcoVerse: An Annotated Twitter Dataset for Eco-Relevance Classification, Environmental Impact Analysis, and Stance Detection |
1343 | Yujuan Fu, Giridhar Kaushik Ramachandran, Nicholas J. Dobbins, Namu Park, Michael Leu, Abby R. Rosenberg, Kevin Lybarger, Fei Xia, Özlem Uzuner and Meliha Yetisgen | Extracting Social Determinants of Health from Pediatric Patient Notes Using Large Language Models: Novel Corpus and Methods |
1344 | Camila Antonio Barros, Jorge Francisco Ciprián-Sánchez and Saulo Mendes Santos | A Tool for Determining Distances and Overlaps between Multimodal Annotations |
1346 | Van-Thuy Phi, Hiroki Teranishi, Yuji Matsumoto, Hiroyuki Oka and Masashi Ishii | PolyNERE: A Novel Ontology and Corpus for Named Entity Recognition and Relation Extraction in Polymer Science Domain |
1347 | Ryan Soh-Eun Shim, Kalvin Chang and David R. Mortensen | Phonotactic Complexity across Dialects |
1348 | Isar Nejadgholi, Kathleen C. Fraser, Anna Kerkhof and Svetlana Kiritchenko | Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes |
1351 | Ziqiang Liu, Shujie Li, Zefeng Cai, Xiangyu Li, Yunshui Li, Chengming Li, Xiping Hu, Ruifeng Xu and Min Yang | TP-Link: Fine-grained Pre-Training for Text-to-SQL Parsing with Linking Information |
1356 | Ariel Cohen, Alexandrine Lanson, Emmanuelle Kempf and Xavier Tannier | Leveraging Information Redundancy of Real-World Data Through Distant Supervision |
1357 | Belen Alastruey, Aleix Sant, Gerard I. Gállego, David Dale and Marta R. Costa-jussà | SpeechAlign: a Framework for Speech Translation Alignment Evaluation |
1358 | Anna-Katharina Dick, Matthias Drews, Valentin Pickard and Victoria Pierz | GIL-GALaD: Gender Inclusive Language - German Auto-Assembled Large Database |
1359 | Axel Ahlin, Alfred Myrne Blåder and Pierre Nugues | Mapping the Past: Geographically Linking an Early 20th Century Swedish Encyclopedia with Wikidata |
1360 | Masahiro Kaneko and Naoaki Okazaki | Controlled Generation with Prompt Insertion for Natural Language Explanations in Grammatical Error Correction |
1362 | Renzo Arturo Alva Principe, Nicola Chiarini and Marco Viviani | An LCF-IDF Document Representation Model Applied to Long Document Classification |
1364 | Jaemin Kim, Yohan Na, Kangmin Kim, Sang-Rak Lee and Dong-Kyu Chae | SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity |
1365 | Jacob Collard, Valeria de Paiva and Eswaran Subrahmanian | Mathematical Entities: Corpora and Benchmarks |
1368 | Garry Kuwanto, Eno-Abasi E. Urua, Priscilla Amondi Amuok, Shamsuddeen Hassan Muhammad, Anuoluwapo Aremu, Verrah Otiende, Loice Emma Nanyanga, Teresiah W. Nyoike, Aniefon D. Akpan, Nsima Ab Udouboh, Idongesit Udeme Archibong, Idara Effiong Moses, Ifeoluwatayo A. Ige, Benjamin Ajibade, Olumide Benjamin Awokoya, Idris Abdulmumin, Saminu Mohammad Aliyu, Ruqayya Nasir Iro, Ibrahim Said Ahmad, Deontae Smith, Praise-EL Michaels, David Ifeoluwa Adelani, Derry Tanti Wijaya and Anietie Andy | Mitigating Translationese in Low-resource Languages: The Storyboard Approach |
1370 | Lucas Consolin Dezotti, Marco Passarotti and Francesco Mambrini | Modelling and Linking an Old Latin-Portuguese Dictionary to the LiLa Knowledge Base |
1372 | Yanfei Lu, Patrick Littell and Keren Rice | Empowering Oneida Language Revitalization: Development of An Oneida Verb Conjugator |
1373 | Kexin Luo, Yue Mao, Bei Zhang and Sophie Hao | Reflecting the Male Gaze: Quantifying Female Objectification in 19th and 20th Century Novels |
1374 | Niclas Hertzberg and Anna Lokrantz | MedQA-SWE - A Clinical Question & Answer Dataset for Swedish |
1375 | Antoni Brosa-Rodríguez and Sylvain Kahane | New Proposal of Greenberg's Universal 14 from Typometrics |
1377 | Felipe Bravo-Marquez and Maria Jose Zambrano | Unpacking Bias: An Empirical Study of Bias Measurement Metrics, Mitigation Algorithms, and their Interactions |
1378 | Marcos Zampieri, Kai North, Tommi Jauhiainen, Mariano Felice, Neha Kumari, Nishant Nair and Yash Mahesh Bangera | Language Variety Identification with True Labels |
1379 | Christian Hauptmann, Adrian Krenzer, Antonia Krause and Frank Puppe | ADEA: An Argumentative Dialogue Dataset on Ethical Issues concerning Future A.I. Applications |
1380 | Soline Felice, Solene Virginie Evain, Solange Rossato and François Portet | Audiocite.net: A Large Spoken Read Dataset in French |
1381 | Yifeng Xie, Zhihong Zhu, Xuan Lu, Zhiqi Huang and Haoran Xiong | InfoEnh: Towards Multimodal Sentiment Analysis via Information Bottleneck Filter and Optimal Transport Alignment |
1385 | Maarten Janssen | UDMorph: Morphosyntactically Tagged UD Corpora |
1388 | Krenare Pireva Nuci, Paul Landes and Barbara Di Eugenio | RoBERTa Low Resource Fine Tuning for Sentiment Analysis in Albanian |
1389 | Raia Abu Ahmad, Ekaterina Borisova and Georg Rehm | FoRC4CL: A Fine-grained Field of Research Classification and Annotated Dataset of NLP Articles |
1391 | Ange Richard, Laura Cristina Alonzo Canul and François Portet | FRACAS: a FRench Annotated Corpus of Attribution relations in newS |
1396 | Yutong Han, Yan Yuan and Lili Mou | A Dual-View Approach to Classifying Radiology Reports by Co-Training |
1398 | Carla Perez Almendros and Jose Camacho-Collados | Do Large Language Models Understand Mansplaining? Well, actually... |
1399 | Isabelle Lorge, Li Zhang, Xiaowen Dong and Janet Pierrehumbert | STEntConv: Predicting Disagreement between Reddit Users with Stance Detection and a Signed Graph Convolutional Network |
1401 | Dongheng Li, Yongchang Hao and Lili Mou | LLMR: Knowledge Distillation with a Large Language Model-Induced Reward |
1402 | Harry Bunt | ISO 24617-12: A New Standard for Semantic Annotation |
1403 | Ben Cohen, Moreah Zisquit, Stav Yosef, Doron Friedman and Kfir Bar | Motivational Interviewing Transcripts Annotated with Global Scores |
1405 | Margot Madina, Itziar Gonzalez-Dios and Melanie Siegel | A Preliminary Study of ChatGPT for Spanish E2R Text Adaptation |
1407 | Jón Daðason and Hrafn Loftsson | Text Filtering Classifiers for Medium-Resource Languages |
1408 | Myrthe Reuver, Suzan Verberne and Antske Fokkens | Investigating the Robustness of Modelling Decisions for Few-Shot Cross-Topic Stance Detection: A Preregistered Study |
1410 | Ali Mousavi, Xin Zhan, He Bai, Peng Shi, Theodoros Rekatsinas, Benjamin Han, Yunyao Li, Jeffrey Pound, Joshua M. Susskind, Natalie Schluter, Ihab F. Ilyas and Navdeep Jaitly | Construction of Paired Knowledge Graph - Text Datasets Informed by Cyclic Evaluation |
1413 | Henning Wachsmuth, Gabriella Lapesa, Elena Cabrio, Anne Lauscher, Joonsuk Park, Eva Maria Vecchi, Serena Villata and Timon Ziegenbein | Argument Quality Assessment in the Age of Instruction-Following Large Language Models |
1415 | Neema Kotonya and Francesca Toni | Towards a Framework for Evaluating Explanations in Automated Fact Verification |
1417 | Thuat Nguyen, Chien Van Nguyen, Viet Dac Lai, Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Ryan A. Rossi and Thien Huu Nguyen | CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages |
1420 | Dalmo Buzato and Evandro Cunha | Agent-based Modeling of Language Change in a Small-world Network |
1421 | Emil Svoboda and Magda Sevcikova | PaReNT (Parent Retrieval Neural Tool): A Deep Dive into Word Formation Across Languages |
1422 | Hieu Man, Chien Van Nguyen, Nghia Trung Ngo, Linh Ngo, Franck Dernoncourt and Thien Huu Nguyen | Hierarchical Selection of Important Context for Generative Event Causality Identification with Optimal Transports |
1423 | Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Dragomir Radev, Yejin Choi and Noah A. Smith | A Call for Clarity in Beam Search: How It Works and When It Stops |
1426 | Martina Katalin Szabó, Veronika Vincze, Bernadett Dam, Csenge Guba, Anita Bagi and István Szendi | Predictive and distinctive linguistic features in Schizophrenia-Bipolar Spectrum Disorders |
1427 | Masato Hagiwara and Joshua B. Tanner | Project MOSLA: Recording Every Moment of Second Language Acquisition |
1436 | Lizzy Brans and Jelke Bloem | SimLex-999 for Dutch |
1438 | Bangze Pan, Yang Li, Suge Wang, Xiaoli Li, Deyu Li, Jian Liao and Jianxing Zheng | Document-Level Event Extraction via Information Interaction Based on Event Relation and Argument Correlation |
1439 | Joe Huamani-Malca, Miguel Rodriguez Mondoñedo, Francisco Cerna-Herrera, Gissella Bejarano, Carlos Vásquez Roque, Cesar Augusto Ramos Cantu and Sabina Oporto Pérez | Lessons from Deploying the First Bilingual Peruvian Sign Language - Spanish Online Dictionary |
1440 | Sonu Gupta, Geetika Gopi, Harish Balaji, Ellen Poplavska, Nora O'Toole, Siddhant Arora, Thomas Norton, Norman Sadeh and Shomir Wilson | Creation and Analysis of an International Corpus of Privacy Laws |
1441 | Yejin Kim, Scott Rome, Kevin Foley, Mayur Nankani, Rimon Melamed, Javier Morales, Abhay K. Yadav, Maria Peifer, Sardar Hamidian and H. Howie Huang | Improving Content Recommendation: Knowledge Graph-Based Semantic Contrastive Learning for Diversity and Cold-Start Users |
1442 | Bin Wang, Fuyong Xu, Peiyu Liu and Zhenfang Zhu | HyperMR: Hyperbolic Hypergraph Multi-hop Reasoning for Knowledge-based Visual Question Answering |
1443 | Yuanzhen Luo, Qingyu Zhou and Feng Zhou | Enhancing Phrase Representation by Information Bottleneck Guided Text Diffusion Process for Keyphrase Extraction |
1448 | Xin Wu, Yi Cai and Ho-fung Leung | Abstract-level Deductive Reasoning for Pre-trained Language Models |
1449 | Li Yuan, Yi Cai, Haopeng Ren and Jiexin Wang | A Logical Pattern Memory Pre-trained Model for Entailment Tree Generation |
1453 | Youheng W. Wong, Natalie Parde and Erdem Koyuncu | Humanistic Buddhism Corpus: A Challenging Domain-Specific Dataset of English Translations for Classical and Modern Chinese |
1454 | Baris Karacan, Ankit Aich, Avery Quynh, Amy Pinkham, Philip Harvey, Colin Depp and Natalie Parde | Towards Comprehensive Language Analysis for Clinically Enriched Spontaneous Dialogue |
1455 | Hayato Tsukagoshi, Tsutomu Hirao, Makoto Morishita, Katsuki Chousa, Ryohei Sasano and Koichi Takeda | WikiSplit++: Easy Data Refinement for Split and Rephrase |
1456 | Xi Wang, Hongliang Dai, Shen Gao and Piji Li | Characteristic AI Agents via Large Language Models |
1458 | Fan Hu, Yanlin Wang, Lun Du, Hongyu Zhang, Dongmei Zhang and Xirong Li | Tackling Long Code Search with Splitting, Encoding, and Aggregating |
1460 | Supryadi Supryadi, Leiyu Pan and Deyi Xiong | An Empirical Study on the Robustness of Massively Multilingual Neural Machine Translation |
1461 | Eiki Murata and Daisuke Kawahara | Time-aware COMET: a Commonsense Knowledge Model with Temporal Knowledge |
1463 | Maxwell A. Weinzierl and Sanda M. Harabagiu | The Impact of Stance Object Type on the Quality of Stance Detection |
1471 | Panatchakorn Anantaprayoon, Masahiro Kaneko and Naoaki Okazaki | Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels |
1472 | Zhongquan Jian, Ante Wang, Jinsong Su, Junfeng Yao, Meihong Wang and Qingqiang Wu | EmoTrans: Emotional Transition-based Model for Emotion Recognition in Conversation |
1474 | Bin Li, Yunlong Fan, Yikemaiti Sataer, Chuanqi Shi, Miao Gao and Zhiqiang Gao | Few-Shot Semantic Dependency Parsing via Graph Contrastive Learning |
1476 | Yao Sun, Anastasiia Tatlubaeva, Zhihan Li and Chester Palen-Michel | What are the implications of your question? Non-Information Seeking Question-Type Identification in CNN Transcripts |
1477 | Dojun Park and Sebastian Padó | Multi-Dimensional Machine Translation Evaluation: Model Evaluation and Resource for Korean |
1478 | Yilin Wang, Minghao Hu, Zhen Huang, Dongsheng Li, Dong Yang and Xicheng Lu | KC-GenRe: A Knowledge-constrained Generative Re-ranking Method Based on Large Language Models for Knowledge Graph Completion |
1479 | Sayar Ghosh Roy and Jiawei Han | ILCiteR: Evidence-grounded Interpretable Local Citation Recommendation |
1480 | Yingting Li, Rishabh Bhardwaj, Ambuj Mehrish, Bo Cheng and Soujanya Poria | HYPERTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks |
1482 | Rossana Cunha, Thiago Castro Ferreira, Adriana Pagano and Fabio Alves | A Persona-Based Corpus in the Diabetes Self-Care Domain - Applying a Human-Centered Approach to a Low-Resource Context |
1483 | Yaqi Chen, Hao Zhang, Xukui Yang, Wenlin Zhang and Dan Qu | Meta-Adapter for Self-Supervised Speech Models: A Solution to Low-Resource Speech Recognition Challenges |
1484 | Sungjin Nam, Kevyn Collins-Thompson, David Jurgens and Xin Tong | Finding Educationally Supportive Contexts for Vocabulary Learning with Attention-Based Models |
1486 | Liang Lu, Jingzhi Wang and David R. Mortensen | Improved Neural Protoform Reconstruction via Reflex Prediction |
1492 | Sungjun Han and Sebastian Padó | Towards Understanding the Relationship between In-context Learning and Compositional Generalization |
1493 | Kehan Long, Shasha Li, Pancheng Wang, Chenlong Bao, Jintao Tang and Ting Wang | Recommending Missed Citations Identified by Reviewers: A New Task, Dataset and Baselines |
1494 | Chester Palen-Michel, Lizzie Liang, Zhe Wu and Constantine Lignos | QueryNER: Segmentation of E-commerce Queries |
1495 | Parth Patwa, Simone Filice, Zhiyu Chen, Giuseppe Castellucci, Oleg Rokhlenko and Shervin Malmasi | Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data |
1496 | Bipesh Subedi, Sunil Regmi, Bal Krishna Bal and Praveen Acharya | Exploring the Potential of Large Language Models (LLMs) for Low-resource Languages: A Study on Named-Entity Recognition (NER) and Part-Of-Speech (POS) Tagging for Nepali Language |
1498 | Yigeng Zhang, Mahsa Shafaei, Fabio Gonzalez and Thamar Solorio | Positive and Risky Message Assessment for Music Products |
1499 | Yigeng Zhang, Fabio Gonzalez and Thamar Solorio | Interpreting Themes from Educational Stories |
1501 | Chieko Nishimura, Shuhei Kurita and Yohei Seki | Text360Nav: 360-Degree Image Captioning |
1502 | Eunike Andriani Kardinata, Hiroki Ouchi and Taro Watanabe | Constructing Indonesian-English Travelogue Dataset |
1503 | Ruochen Zhang and Carsten Eickhoff | CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched Summarization |
1504 | Yunhua Zhou, Pengyu Wang, Peiju Liu, Yuxin Wang and Xipeng Qiu | The Open-World Lottery Ticket Hypothesis for OOD Intent Classification |
1505 | Azmine Toushik Wasi, Taki Hasan Rafi, Raima Islam and Dong-Kyu Chae | BanglaAutoKG: Automatic Bangla Knowledge Graph Construction with Semantic Neural Graph Filtering |
1507 | Tan Yue, Xuzhao Shi, Rui Mao, Zonghai Hu and Erik Cambria | SarcNet: A Multilingual Multimodal Sarcasm Detection Dataset |
1510 | Omar Kallas, Go Inoue and Nizar Habash | EMAD: A Bridge Tagset for Unifying Arabic POS Annotations |
1511 | Jiaying Gong and Hoda Eldardiry | Few-Shot Relation Extraction with Hybrid Visual Evidence |
1512 | Rustem Yeshpanov, Alina Polonskaya and Huseyin Atakan Varol | KazParC: Kazakh Parallel Corpus for Machine Translation |
1514 | Thibault Clerice | Detecting Sexual Content at the Sentence Level in First Millennium Latin Texts |
1518 | Xiujuan Xu, Xiaoxiao Shi, Zhehuan Zhao and Yu Liu | ESCP: Enhancing Emotion Recognition in Conversation with Speech and Contextual Prefixes |
1519 | Fan Xu, Lei Zeng, Bowei Zou, AiTi Aw and Huan Rong | CLFFRD: Curriculum Learning and Fine-grained Fusion for Multimodal Rumor Detection |
1521 | Haopeng Zhang, Hayate Iso, Sairam Gurajada and Nikita Bhutani | XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates |
1522 | Rui Mao, Guanyi Chen, Xulang Zhang, Frank Guerin and Erik Cambria | GPTEval: A Survey on Assessments of ChatGPT and GPT-4 |
1526 | Mengyi Huang, Meng Xiao, Ludi Wang and Yi Du | DP-CRE: Continual Relation Extraction via Decoupled Contrastive Learning and Memory Structure Preservation |
1528 | Zhouhao Sun, Xiao Ding, Li Du, Bibo Cai, Jinglong Gao, Ting Liu and Bing Qin | Towards Generalizable and Faithful Logic Reasoning over Natural Language via Resolution Refutation |
1530 | Agrima Seth, Sanchit Ahuja, Kalika Bali and Sunayana Sitaram | DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures |
1531 | Xinyue Liu, Jianan Zhang, Chi Ma, Wenxin Liang, Bo Xu and Linlin Zong | Temporal Knowledge Graph Reasoning with Dynamic Hypergraph Embedding |
1532 | Jonne Saleva and Constantine Lignos | ParaNames 1.0: Creating an Entity Name Corpus for 400+ Languages using Wikidata |
1535 | Lin Li, Shaopeng Tang and Renwei Wu | Majority Rules Guided Aspect-Category based Sentiment Analysis via Label Prior Knowledge |
1538 | Thennal D K, Ganesh Nathan and Suchithra M S | Fisher Mask Nodes for Language Model Merging |
1540 | Frederikus Hudi, Zhi Qu, Hidetaka Kamigaito and Taro Watanabe | Disentangling Pretrained Representation to Leverage Low-Resource Languages in Multilingual Machine Translation |
1543 | Heyang Liu, Yanfeng Wang and Yu Wang | Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview |
1548 | Yuki Hironaka, Tomoyuki Kajiwara and Takashi Ninomiya | Transfer Fine-tuning for Quality Estimation of Text Simplification |
1552 | Maxim Konca, Andy Luecking and Alexander Mehler | German SRL: Corpus Construction and Model Training |
1553 | Guicai Xie, Ke Zhang, Lei Duan, Wei Zhang and Zeqian Huang | Typos Correction Training Against Misspellings from Text-to-Text Transformers |
1555 | Tianyu Zheng, Ge Zhang, Xingwei Qu, Ming Kuang, Wenhao Huang and Zhaofeng He | MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces |
1558 | Zhenxi Lin, Ziheng Zhang, Xian Wu and Yefeng Zheng | Biomedical Entity Linking as Multiple Choice Question Answering |
1559 | Chen Yang, Bin Cao and Jing Fan | HS-GC: Holistic Semantic Embedding and Global Contrast for Effective Text Clustering |
1561 | Martyna Wiącek, Piotr Rybak, Łukasz Pszenny and Alina Wróblewska | NLPre: a revised approach towards language-centric benchmarking of Natural Language Preprocessing systems |
1562 | Mersad Esalati, Mohammad Javad Dousti and Heshaam Faili | Esposito: An English-Persian Scientific Parallel Corpus for Machine Translation |
1563 | Yue Wang, Zilong Zheng, Juntao Li, Zhihui Liu, Jinxiong Chang, Qishen Zhang, Zhongyi Liu, Guannan Zhang and Min Zhang | Towards More Realistic Chinese Spell Checking with New Benchmark and Specialized Expert Model |
1564 | Puneet Mathur, Vlad I. Morariu, Aparna Garimella, Franck Dernoncourt, Jiuxiang Gu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha and Rajiv Jain | DocScript: Document-level Script Event Prediction |
1565 | Kristina Kobrock, Xenia Isabel Ohmer, Elia Bruni and Nicole Gotzner | Context Shapes Emergent Communication about Concepts at Different Levels of Abstraction |
1566 | Dongjun Jang, Sungjoo Byun and Hyopil Shin | A Study on How Attention Scores in the BERT Model are Aware of Lexical Categories in Syntactic and Semantic Tasks on the GLUE Benchmark |
1568 | Yaxin Fan, Feng Jiang, Peifeng Li and Haizhou Li | Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue: An Empirical Study |
1570 | Tu-Anh Tran and Yusuke Miyao | Integrating Headedness Information into an Auto-generated Multilingual CCGbank for Improved Semantic Interpretation |
1571 | Marta Bañón, Gema Ramírez-Sánchez, Jaume Zaragoza-Bernabeu and Sergio Ortiz Rojas | FastSpell: the LangId Magic Spell |
1573 | Seongbo Jang, Seonghyeon Lee and Hwanjo Yu | KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark |
1575 | Yuya Ogasa, Tomoyuki Kajiwara and Yuki Arase | Controllable Paraphrase Generation for Semantic and Lexical Similarities |
1578 | Hadeel Saadany, Constantin Orasan, Sophie Walker and Catherine Breslin | Linking Judgement Text to Court Hearing Videos: UK Supreme Court as a Case Study |
1585 | Mengsha Liu, Daoyuan Chen, Yaliang Li, Guian Fang and Ying Shen | ChartThinker: A Contextual Chain-of-Thought Approach to Optimized Chart Summarization |
1589 | Wen-wai Yim, Yujuan Fu, Asma Ben Abacha and Meliha Yetisgen | To Err is Human, How About Medical Large Language Models? Comparing Pre-trained Language Models for Medical Assessment Errors and Reliability |
1591 | Jiří Mírovský, Pavlína Synková, Lucie Polakova and Marie Paclíková | Cost-Effective Discourse Annotation in the Prague Czech–English Dependency Treebank |
1594 | Akash Anil, Victor Gutierrez-Basulto, Yazmin Ibanez-Garcia and Steven Schockaert | Inductive Knowledge Graph Completion with GNNs and Rules: An Analysis |
1597 | Lingbing Guo, Zhuo Chen, Jiaoyan Chen, Qiang Zhang and Huajun Chen | DET: A Dual-Encoding Transformer for Relational Graph Embedding |
1600 | Felipe Gonzalez-Pizarro and Giuseppe Carenini | Neural Multimodal Topic Modeling: A Comprehensive Evaluation |
1601 | Ruilin Luo, Jiayi Li, Jianghangfan Zhang, Jing Xiao and Yujiu Yang | Prior Relational Schema Assists Effective Contrastive Learning for Inductive Knowledge Graph Completion |
1606 | Dancheng Xin, Jiawei Yuan and Yang Li | Diffusion based Counterfactual Augmentation for Dual Sentiment Classification |
1608 | Jooyoung Lee, Fan Yang, Thanh Tran, Qian Hu, Emre Barut and Kai-Wei Chang | Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought |
1610 | Ronja Laarmann-Quante, Marco Müller and Eva Belke | Automatic Extraction of Nominal Phrases from German Learner Texts of Different Proficiency Levels |
1612 | Yerin Hwang, Yongil Kim, Hyunkyung Bae, Jeesoo Bang, Hwanhee Lee and Kyomin Jung | Kosmic: Korean Text Similarity Metric Reflecting Honorific Distinctions |
1613 | Antoine Jamelot, Solen Quiniou and Sophie Hamon | Improving Text Readability through Segmentation into Rheses |
1614 | Elena Benzoni, Matteo Pellegrini, Francesco Dedè and Marco Passarotti | Representing Compounding with OntoLex. An Evaluation of Vocabularies for Word Formation Resources |
1615 | Iker García-Ferrero, Rodrigo Agerri, Aitziber Atutxa Salazar, Elena Cabrio, Iker de la Iglesia, Alberto Lavelli, Bernardo Magnini, Benjamin Molinet, Johana Ramirez-Romero, German Rigau, Jose Maria Villa-Gonzalez, Serena Villata and Andrea Zaninello | MedMT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain |
1616 | Miriam Schirmer, Christian Brechenmacher and Juergen Pfeffer | GENTRAC: A Tool for Tracing Trauma in Genocide and Mass Atrocity Court Transcripts |
1619 | Davide Picca and John Pavlopoulos | Deciphering Emotional Landscapes in the Iliad: A Novel French-Annotated Dataset for Emotion Recognition |
1620 | Harshita Diddee, Anurag Shukla, Tanuja Ganu, Vivek Seshadri, Sandipan Dandapat, Monojit Choudhury and Kalika Bali | INMT-Lite: Accelerating Low-Resource Language Data Collection via Offline Interactive Neural Machine Translation |
1621 | Xiaohan Ma, Rize Jin and Tae-Sun Chung | Multi-Channel Spatio-Temporal Transformer for Sign Language Production |
1625 | Tatiana Passali and Grigorios Tsoumakas | Topic-Controllable Summarization: Topic-Aware Evaluation and Transformer Methods |
1626 | Chenyang Lyu, Zefeng Du, Jitao Xu, Yitao Duan, Minghao Wu, Teresa Lynn, Alham Fikri Aji, Derek F. Wong and Longyue Wang | A Paradigm Shift: The Future of Machine Translation Lies with Large Language Models |
1628 | Anna Kuznetsova and Carlo Strapparava | Multimodal and Multilingual Laughter Detection in Stand-Up Comedy Videos |
1631 | Takeru Isaka, Atsushi Otsuka and Iwaki Toshima | Analysis of Sensation-transfer Dialogues in Motorsports |
1633 | Toru Urakawa, Yuya Taguchi, Takuro Niitsuma and Hideaki Tamori | A Japanese News Simplification Corpus with Faithfulness |
1635 | Shirin Dabbaghi Varnosfaderani, Canasai Kruengkrai, Ramin Yahyapour and Junichi Yamagishi | Bridging Textual and Tabular Worlds for Fact Verification: A Lightweight, Attention-Based Model |
1637 | Olga Zamaraeva, Lorena S. Allegue and Carlos Gómez-Rodríguez | Spanish Resource Grammar version 2023 |
1638 | Anni Eskelinen, Amanda Myntti, Erik Henriksson, Sampo Pyysalo and Veronika Laippala | Building Question-Answer Data Using Web Register Identification |
1639 | Yash Jain, David M. Chan, Pranav Dheram, Aparna Khare, Olabanji Shonibare, Venkatesh Ravichandran and Shalini Ghosh | Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition |
1641 | Thi-Nhung Nguyen, Bang Tien Tran, Trong-Nghia Luu, Thien Huu Nguyen and Kiem-Hieu Nguyen | BKEE: Pioneering Event Extraction in the Vietnamese Language |
1642 | Audrey Mash, Carlos Escolano, Aleix Sant, Maite Melero and Francesca De Luca Fornaciari | Unmasking Biases: Exploring Gender Bias in English-Catalan Machine Translation through Tokenization Analysis and Novel Dataset |
1645 | Ya Gao, Shaoxiong Ji and Pekka Marttinen | Knowledge-augmented Graph Neural Networks with Concept-aware Attention for Adverse Drug Event Detection |
1646 | Hongzheng Li, Ruojin Wang, Ge Shi, Xing Lv, Lei Lei, Chong Feng, Fang Liu, Jinkun Lin, Yangguang Mei and Linnan Xu | RAAMove: A Corpus for Analyzing Moves in Research Article Abstracts |
1647 | Zhenyu Qian, Yiming Qian, Yuting Song, Fei Gao, Hai Jin, Chen Yu and Xia Xie | Harnessing the Power of Large Language Model for Uncertainty Aware Graph Processing |
1648 | Maria Barrett, Max Müller-Eberstein, Elisa Bassignana, Amalie Brogaard Pauli, Mike Zhang and Rob van der Goot | Can Humans Identify Domains? |
1659 | Nurbanu Aksoy, Nishant Ravikumar and Serge Sharoff | Enhancing Image-to-Text Generation in Radiology Reports through Cross-modal Multi-Task Learning |
1661 | Chunlei Xin, Yaojie Lu, Hongyu Lin, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han and Le Sun | Beyond Full Fine-tuning: Harnessing the Power of LoRA for Multi-Task Instruction Tuning |
1662 | Longyin Zhang, Bowei Zou and Ai Ti Aw | Empowering Tree-structured Entailment Reasoning: Rhetorical Perception and LLM-driven Interpretability |
1663 | Lisa Raithel, Hui-Syuan Yeh, Shuntaro Yada, Cyril Grouin, Thomas Lavergne, Aurélie Névéol, Patrick Paroubek, Philippe Thomas, Tomohiro Nishiyama, Sebastian Möller, Eiji Aramaki, Yuji Matsumoto, Roland Roller and Pierre Zweigenbaum | A Dataset for Pharmacovigilance in German, French, and Japanese: Annotating Adverse Drug Reactions across Languages |
1664 | Marie Mikulova | Fine-grained Classification of Circumstantial Meanings within the Prague Dependency Treebank Annotation Scheme |
1666 | Injy Hamed, Fadhl Eryani, David Palfreyman and Nizar Habash | ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus |
1667 | Neele Falk and Gabriella Lapesa | Stories and personal experiences in the COVID-19 Discourse |
1668 | Elisa Bassignana, Viggo Unmack Gascou, Frida Nøhr Laustsen, Gustav Kristensen, Marie Haahr Petersen, Rob van der Goot and Barbara Plank | How to Encode Domain Information in Relation Classification |
1669 | Feifan Song, Bowen Yu, Hao Lang, Haiyang Yu, Fei Huang, Houfeng Wang and Yongbin Li | Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment |
1674 | Boxi Cao, Qiaoyu Tang, Hongyu Lin, Shanshan Jiang, Bin Dong, Xianpei Han, Jiawei Chen, Tianshu Wang and Le Sun | Retentive or Forgetful? Diving into the Knowledge Memorizing Mechanism of Language Models |
1675 | Johanna Cordova | Towards Universal Dependencies For Ancash Quechua |
1677 | Yuting Yang, Pei Huang, Feifei Ma, Juan Cao and Jintao Li | PAD: A Robustness Enhancement Ensemble Method via Promoting Attention Diversity |
1679 | Pierre Lepagnol, Thomas Gerald, Sahar Ghannay, Christophe Servan and Sophie Rosset | Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification |
1681 | Aleksandra Edwards and Jose Camacho-Collados | Language Models for Text Classification: Is In-Context Learning Enough? |
1682 | Annerose Eichel, Tana Deeg, Andre Blessing, Milena Belosevic, Sabine Arndt-Lappe and Sabine Schulte im Walde | Willkommens-Merkel, Chaos-Johnson, and Tore-Klose: Modeling the Evaluative Meaning of German Personal Name Compounds |
1683 | Benjamin Icard, François Maine, Morgane Casanova, Géraud Faye, Julien Chanson, Guillaume Gadek, Ghislain Atemezing, François Bancilhon and Paul Égré | A Multi-Label Dataset of French Fake News: Human and Machine Insights |
1684 | Ramona Kühn, Khouloud Saadi, Jelena Mitrović and Michael Granitzer | Using Pre-Trained Language Models in an End-to-End Pipeline for Antithesis Detection |
1687 | Veronika Grigoreva, Anastasiia Ivanova, Ilseyar Alimova and Ekaterina Artemova | RuBia: A Russian Language Bias Detection Dataset |
1695 | Sérgio Nunes, Alípio Mario Jorge, Evelin Amorim, Hugo Sousa, António Leal, Purificação Moura Silvano, Inês Cantante and Ricardo Campos | Text2Story Lusa: A Dataset for Narrative Analysis in European Portuguese News Articles |
1697 | Feiyan Liu, Liangzhi Li, Xiaoli Wang, Feng Luo, Chang Liu, Jinsong Su and Yiming Qian | MHGRL: An Effective Representation Learning Model for Electronic Health Records |
1699 | Mokanarangan Thayaparan, Marco Valentino and André Freitas | A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference |
1701 | Oliver Cakebread-Andrews | Error Analysis of NLP Models and Non-Native Speakers of English Identifying Sarcasm in Reddit Comments |
1704 | Ingrid Espinoza, Steffen Frenzel, Laurin Friedrich, Wassiliki Siskou, Steffen Eckhard and Annette Hautli-Janisz | PSE v1.0: The first open access corpus of public service encounters |
1708 | Marion Weller-Di Marco and Alexander Fraser | Analyzing the Understanding of Morphologically Complex Words in Large Language Models |
1709 | Jue Hou, Anisia Katinskaia, Lari Kotilainen, Sathianpong Trangcasanchai, Anh-Duc Vu and Roman Yangarber | What do Transformers Know about Government? |
1710 | Dmitry Zmitrovich, Aleksandr Abramov, Andrey Kalmykov, Vitaly Kadulin, Maria Tikhonova, Ekaterina Taktasheva, Danil Astafurov, Mark Baushenko, Artem Snegirev, Tatiana Shavrina, Sergei S. Markov, Vladislav Mikhailov and Alena Fenogenova | A Family of Pretrained Transformer Language Models for Russian |
1711 | Muhammed AbuOdeh, Long Phan, Ahmed Farouk Zakaria Elshabrawy and Nizar Habash | Palmyra 3.0: A User-Friendly Cloud-Based Platform for Morphology and Dependency Syntax Annotation |
1714 | Jianyu Zheng, Fengfei Fan and Jianquan Li | Incorporating Lexical and Syntactic Knowledge for Unsupervised Cross-Lingual Transfer |
1715 | Martin Popel, Lucie Polakova, Michal Novák, Jindřich Helcl, Jindřich Libovický, Pavel Straňák, Tomas Krabac, Jaroslava Hlavacova, Mariia Anisimova and Tereza Chlanova | Charles Translator: A Machine Translation System between Ukrainian and Czech |
1716 | Sergey Kramp, Giovanni Cassani and Chris Emmery | BigNLI: Native Language Identification with Big Bird Embeddings |
1718 | Weihao Zeng, Keqing He, Yejie Wang, Dayuan Fu and Weiran Xu | BootTOD: Bootstrap Task-oriented Dialogue Representations by Aligning Diverse Responses |
1719 | Marie Iversdatter Røsok and Ingerid Løyning Dale | NB Uttale: A Norwegian Pronunciation Lexicon with Dialect Variation |
1720 | Agnieszka Falenska, Eva Maria Vecchi and Gabriella Lapesa | Self-reported demographics and discourse dynamics in a persuasive online forum |
1722 | Yuhan Liu, Xiuying Chen, Gao Xing, Ji Zhang and Rui Yan | IAD: In-Context Learning Ability Decoupler of Large Language Models in Meta-Training |
1723 | Iacopo Ghinassi, Simone Tedeschi, Paola Marongiu, Roberto Navigli and Barbara McGillivray | Language Pivoting from Parallel Corpora for Word Sense Disambiguation of Historical Languages: a Case Study on Latin |
1724 | Yikun Sun, Zhen Wan, Nobuhiro Ueda, Sakiko Yahata, Fei Cheng, Chenhui Chu and Sadao Kurohashi | Rapidly Developing High-quality Instruction Data and Evaluation Benchmark for Large Language Models with Minimal Human Effort: A Case Study on Japanese |
1725 | Jens Nevens, Robin De Haes, Rachel Ringe, Mihai Pomarlan, Robert Porzel, Katrien Beuls and Paul Van Eecke | A Benchmark for Recipe Understanding in Artificial Agents |
1727 | Rashid Nizamani, Sebastian Schuster and Vera Demberg | SIGA: A Naturalistic NLI Dataset of English Scalar Implicatures with Gradable Adjectives |
1731 | Xiaotian Lu, Jiyi Li, Zhen Wan, Xiaofeng Lin, Koh Takeuchi and Hisashi Kashima | Evaluating Saliency Explanations in NLP by Crowdsourcing |
1733 | Jiamin Luo, Jianing Zhao, Jingjing Wang and Guodong Zhou | How to Understand "Support"? An Implicit-enhanced Causal Inference Approach for Weakly-supervised Phrase Grounding |
1739 | Jiamin Luo, Jingjing Wang and Guodong Zhou | TopicDiff: A Topic-enriched Diffusion Approach for Multimodal Conversational Emotion Detection |
1743 | Dimitar Trajanov, Elena Apostol, Radovan Garabík, Katerina Gkirtzou, Dagmar Gromann, Chaya Liebeskind, Cosimo Palma, Michael Rosner, Alexia Sampri, Gilles Sérasset, Blerina Spahiu, Ciprian-Octavian Truică and Giedre Valunaite Oleskeviciene | From Linguistic Linked Data to Big Data |
1744 | Masaaki Nagata, Makoto Morishita, Katsuki Chousa and Norihito Yasuda | JaParaPat: A Large-Scale Japanese-English Parallel Patent Application Corpus |
1746 | Abdelhak Kelious, Mathieu Constant and Christophe Coeur | Complex Word Identification: a Comparative Study Between ChatGPT and a Dedicated Model for this Task |
1747 | Johanna Gerlach, Pierrette Bouillon, Jonathan Mutal and Hervé Spechbach | A Concept Based Approach for Translation of Medical Dialogues into Pictographs |
1748 | Hongbin Na | CBT-LLM: A Chinese Large Language Model for Cognitive Behavioral Therapy-based Mental Health Question Answering |
1749 | Madalina Chitez, Mihai Dascalu, Aura Cristina Udrea, Cosmin Strilețchi, Karla Csürös, Roxana Rogobete and Alexandru Oravițan | Towards Building the LEMI Readability Platform for Children's Literature in the Romanian Language |
1753 | Yantao Liu, Zijun Yao, Xin Lv, Yuchen Fan, Shulin Cao, Jifan Yu, Lei Hou and Juanzi Li | Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models |
1754 | Bolette Pedersen, Nathalie Sørensen, Sussi Olsen, Sanni Nimb and Simon Gray | Towards a Danish Semantic Reasoning Benchmark - Compiled from Lexical-Semantic Resources for Assessing Selected Language Understanding Capabilities of Large Language Models |
1755 | Xuefei Li, Huiwei Zhou, Weihong Yao, Wenchu Li, Yingyu Lin and Lei Du | Sequential and Repetitive Pattern Learning for Temporal Knowledge Graph Reasoning |
1757 | Anna Rogers, Marzena Karpinska, Ankita Gupta, Vladislav Lialin, Gregory Smelkov and Anna Rumshisky | NarrativeTime: Dense Temporal Annotation on a Timeline |
1758 | Chengyuan Liu, Fubang Zhao, Kun Kuang, Yangyang Kang, Zhuoren Jiang, Changlong Sun and Fei Wu | Evolving Knowledge Distillation with Large Language Models and Active Learning |
1759 | Quan Tu, Chongyang Tao and Rui Yan | Multi-Grained Conversational Graph Network for Retrieval-based Dialogue Systems |
1760 | Andrew Rueda, Elena Alvarez-Mellado and Constantine Lignos | CoNLL#: Fine-grained Error Analysis and a Corrected Test Set for CoNLL-03 English |
1761 | Jifan Yu, Xiaohan Zhang, Yifan Xu, Xuanyu Lei, Zijun Yao, Jing Zhang, Lei Hou and Juanzi Li | A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation |
1762 | Soëlie Lerch, Patrice Bellot, Elisabeth Murisasco and Emmanuel Bruno | EMOLIS App and Dataset to Find Emotionally Close Cartoons |
1763 | Samia Touileb, Jeanett Murstad, Petter Mæhlum, Lubos Steskal, Lilja Charlotte Storset, Huiling You and Lilja Øvrelid | EDEN: A Dataset for Event Detection in Norwegian News |
1764 | Georg Rehm, Stelios Piperidis, Dimitris Galanis, Penny Labropoulou, Maria Giagkou, Miltos Deligiannis, Leon Voukoutis, Martin Courtois, Julian Moreno-Schneider and Katrin Marheinecke | European Language Grid: One Year After |
1770 | Evelin Amorim, Ricardo Campos, Alipio Jorge, Pedro Mota and Rúben Almeida | text2story: A Python Toolkit to Extract and Visualize Story Components of Narrative Text |
1773 | JingJie Zeng, Liang Yang, Jiahao Kang, Yufeng Diao, Zhihao Yang and Hongfei Lin | "Barking Up the Right Tree", a GAN-Based Pun Generation Model through Semantic Pruning |
1777 | Zhenwen Liang, Dian Yu, Xiaoman Pan, Wenlin Yao, Qingkai Zeng, Xiangliang Zhang and Dong Yu | MinT: Boosting Generalization in Mathematical Reasoning via Multi-view Fine-tuning |
1781 | Karolina Zaczynska, Peter Bourgonje and Manfred Stede | How Diplomats Dispute: The UN Security Council Conflict Corpus |
1782 | Songbo Hu, Ivan Vulić, Fangyu Liu and Anna Korhonen | Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems |
1783 | Mario Perez-Enriquez, Jose Manuel Masiello-Ruiz, Jose Luis Lopez-Cuadrado, Israel Gonzalez-Carrasco, Paloma Martinez-Fernandez and Belen Ruiz-Mezcua | Automatic Punctuation Model for Spanish Live Transcriptions |
1786 | Christian Khairallah, Salam Khalifa, Reham Marzouk, Mayar Mohamadein Nassar and Nizar Habash | Camel Morph MSA: A Large-Scale Open-Source Morphological Analyzer for Modern Standard Arabic |
1787 | Tom Bourgeade, Zongmin Li, Farah Benamara, Véronique Moriceau, Jian Su and Aixin Sun | Humans Need Context, What About Machines? Investigating Conversational Context in Abusive Language Detection |
1792 | Denis Kokosinskii and Nikolay Arefyev | Multilingual Substitution-based Word Sense Induction |
1795 | Enrique Amigó, Jorge Carrillo-de-Albornoz, Andrés Fernández, Julio Gonzalo, Guillermo Marco, Roser Morante, Laura Plaza and Jacobo Pedrosa | A Web Portal about the State of the Art of NLP Tasks in Spanish |
1798 | Manfred Klenner and Dylan Massey | Is Gender Reference Gender-specific? Studies in a Polar Domain |
1803 | Kate Thompson, Julie Hunter and Nicholas Asher | Discourse Structure for the Minecraft Corpus |
1805 | Chi Hu, Yuan Ge, Xiangnan Ma, Hang Cao, Qiang Li, Yonghua Yang, Tong Xiao and Jingbo Zhu | RankPrompt: Step-by-Step Comparisons Make Language Models Better Reasoners |
1806 | Ye Tao, Chaofeng Lu, Meng Liu, Kai Xu, Tianyu Liu, Yunlong Tian and Yongjie Du | A Fast and High-quality Text-to-Speech Method with Compressed Auxiliary Corpus and Limited Target Speaker Corpus |
1809 | Anton Chernyavskiy, Svetlana Shomova, Irina Dushakova, Ilya Kiriya and Dmitry Ilvovsky | ZenPropaganda: A Comprehensive Study on Identifying Propaganda Techniques in Russian Coronavirus-Related Media |
1810 | Di Wang, Yuzheng He, Xiao Liang, Yumin Tian, Shaofeng Li and Lin Zhao | TMFN: A Target-oriented Multi-grained Fusion Network for End-to-end Aspect-based Multimodal Sentiment Analysis |
1812 | Jianxiang Xiang, Zhenhua Liu, Haodong Liu, Yin Bai, Jia Cheng and Wenliang Chen | DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space |
1815 | Jiyao Wei, Saiping Guan, Xiaolong Jin, Jiafeng Guo and Xueqi Cheng | Few-shot Link Prediction on Hyper-relational Facts |
1816 | Henrik Voigt, Kai Lawonn and Sina Zarrieß | Plots Made Quickly: An Efficient Approach for Generating Visualizations from Natural Language Queries |
1817 | Martijn Bentum, Eric Sanders, Antal P.J. van den Bosch, Douwe Zeldenrust and Henk van den Heuvel | Corpus Creation and Automatic Alignment of Historical Dutch Dialect Speech |
1818 | Shweta Misra and Johan Boye | Nested Noun Phrase Identification using BERT |
1821 | Jorge Osés Grijalba, L. Alfonso Ureña-López, Eugenio Martínez Cámara and Jose Camacho-Collados | Question Answering over Tabular Data with DataBench: A Large-Scale Empirical Evaluation of LLMs |
1822 | Aaron Maladry, Alessandra Teresa Cignarella, Els Lefever, Cynthia Van Hee and Veronique Hoste | Human and System Perspectives on the Expression of Irony: an Analysis of Likelihood Labels and Rationales |
1823 | Pascal Tilli and Ngoc Thang Vu | Intrinsic Subgraph Generation for Interpretable Graph based Visual Question Answering |
1825 | Jeremy Robichaud and Paul Cook | WaCadie: Towards an Acadian French Corpus |
1834 | Augusto R. Mendes and Helena Caseli | Identifying Fine-grained Depression Signs in Social Media Posts |
1836 | Arianna Graciotti, Valentina Presutti and Rocco Tripodi | Latent vs Explicit Knowledge Representation: How ChatGPT Answers Questions about Low-Frequency Entities |
1837 | Ivan Sedykh, Nikita Sorokin, Dmitry Abulkhanov, Sergey I. Nikolenko and Valentin Malykh | Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets |
1838 | Marianne Vergez-Couret, Myriam Bras, Aleksandra Miletić and Clamença Poujade | Loflòc: A Morphological Lexicon for Occitan using Universal Dependencies |
1840 | Ashwathy T Revi, Stuart E. Middleton and David E. Millard | Rationale-based Learning using Self-Supervised Narrative Events for Text Summarisation of Interactive Digital Narratives |
1841 | Fengkai Liu and John S. Y. Lee | CSSWiki: A Chinese Sentence Simplification Dataset with Linguistic and Content Operations |
1842 | Noémi Ligeti-Nagy, Gergő Ferenczi, Enikő Héja, László János Laki, Noémi Vadász, Zijian Győző Yang and Tamás Váradi | HuLU: Hungarian Language Understanding Benchmark Kit |
1845 | Kelvin Wey Han Chan, Christopher Bryant, Li Nguyen, Andrew Caines and Zheng Yuan | Grammatical Error Correction for Code-Switched Sentences by Learners of English |
1847 | Zhuorui Liu, Chen Zhang and Dawei Song | How Speculative Can Speculative Decoding Be? |
1848 | Adnen Abdessaied, Manuel Hochmeister and Andreas Bulling | OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog |
1849 | Serhii Hamotskyi, Nata Kozaeva and Christian Hänig | FinCorpus-DE10k: A Corpus for the German Financial Domain |
1850 | Xiang Luo, Zhiwen Tang, Jin Wang and Xuejie Zhang | DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues |
1852 | Shuo Yang and Gjergji Kasneci | Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization |
1853 | Yue Wang, Hua Zheng, Yaqi Yin, 王 涵思, Qiliang Liang and Yang Liu | Morpheme Sense Disambiguation: A New Task Aiming for Understanding the Language at Character Level |
1855 | Gaurish Thakkar, Sherzod Hakimov and Marko Tadić | M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets |
1856 | Qingqing Gao, Jiuxin Cao, Biwei Cao, Xin Guan and Bo Liu | CEPT: a Contrast-Enhanced Prompt-Tuning Framework for Emotion Recognition in Conversation |
1857 | Andres Garcia-Silva, Cristian Berrio and Jose Manuel Gomez-Perez | SPACE-IDEAS: A Dataset for Salient Information Detection in Space Innovation |
1859 | Haotian Xu, Yuhua Wang and Jiahui Fan | Self-Knowledge Distillation for Knowledge Graph Embedding |
1864 | Xinyu Ma, Xuebo Liu, Derek F. Wong, Jun Rao, Bei Li, Liang Ding, Lidia S. Chao, Dacheng Tao and Min Zhang | 3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset |
1866 | Olli Kuparinen | Murre24: Dialect Identification of Finnish Internet Forum Messages |
1870 | Lukas Lange, Marc Müller, Ghazaleh Haratinezhad Torbati, Dragan Milchevski, Patrick Grau, Subhash Chandra Pujari and Annemarie Friedrich | AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports |
1873 | Zhuoqun Li, Hongyu Lin, Yaojie Lu, Hao Xiang, Xianpei Han and Le Sun | Meta-Cognitive Analysis: Evaluating Declarative and Procedural Knowledge in Datasets and Large Language Models |
1874 | Rik van Noord, Taja Kuzman, Peter Rupnik, Nikola Ljubešić, Miquel Esplà-Gomis, Gema Ramírez-Sánchez and Antonio Toral | Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages |
1876 | Emmett Strickland, Anne Lacheret-Dujour, Sylvain Kahane, Marc Evrard, Perrine Quennehen, Bernard Caron, Francis Egbokhare and Bruno Guillaume | New Methods for Exploring Intonosyntax: Introducing an Intonosyntactic Treebank for Nigerian Pidgin |
1878 | Tianbao Song, Jingbo Sun, Xin Liu and Weiming Peng | Scale-VAE: Preventing Posterior Collapse in Variational Autoencoder |
1881 | Sebastian Reimann and Tatjana Scheffler | Metaphors in Online Religious Communication: a Detailed Dataset and Cross-Genre Metaphor Detection |
1882 | Kira Droganova and Daniel Zeman | Towards a Unified Taxonomy of Deep Syntactic Relations |
1884 | Yaqi Yin, Yue Wang and Yang Liu | Chinese Morpheme-informed Evaluation of Large Language Models |
1885 | Mingxiu Cai, Daling Wang, Shi Feng and Yifei Zhang | EmpCRL: Controllable Empathetic Response Generation via In-Context Commonsense Reasoning and Reinforcement Learning |
1888 | Tianxiang Wu, Han Chen, Luozheng Qin, Ziqiang Cao and Chunhui Ai | Improving Copy-oriented Text Generation via EDU Copy Mechanism |
1894 | Ruina Bai and Qi Bai | Improving multi-view document clustering: leveraging multi-structure processor and hybrid ensemble clustering module |
1895 | Liyan Wang, Haotong Wang and Yves Lepage | Continued Pre-training on Sentence Analogies for Translation with Small Data |
1898 | Baptiste Blouin, Cécile Armand and Christian Henriot | A Dataset for Named Entity Recognition and Entity Linking in Chinese Historical Newspapers |
1899 | Pierre Magistry, Ilaine Wang and Ty Eng Lim | Experiments on Speech Synthesis for Teochew, Can Taiwanese Help? |
1901 | Somaiyeh Dehghan and Berrin Yanıkoğlu | Multi-domain Hate Speech Detection Using Dual Contrastive Learning and Paralinguistic Features |
1902 | Eric Sanders, Sara Petrollino, Gilles R. Scheifer, Henk van den Heuvel and Christopher Handy | FAIRification of LeiLanD |
1903 | Ariel Ekgren, Amaru Cuba Gyllensten, Felix Stollenwerk, Joey Öhman, Tim Isbister, Evangelia Gogoulou, Fredrik Carlsson, Judit Casademont and Magnus Sahlgren | GPT-SW3: An Autoregressive Language Model for the Scandinavian Languages |
1904 | Punyajoy Saha, Aalok Agrawal, Abhik Jana, Chris Biemann and Animesh Mukherjee | On Zero-Shot Counterspeech Generation by LLMs |
1908 | Hao Gu, Jiangyan Yi, Zheng Lian, Jianhua Tao and Xinrui Yan | NLoPT: N-gram Enhanced Low-Rank Task Adaptive Pre-training for Efficient Language Model Adaption |
1911 | Shun Katada, Ryu Takeda and Kazunori Komatani | Collecting Human-Agent Dialogue Dataset with Frontal Brain Signal toward Capturing Unexpressed Sentiment |
1913 | Yige Chen, KyungTae Lim and Jungyeul Park | A Linguistically-Informed Annotation Strategy for Korean Semantic Role Labeling |
1914 | Zhicheng Lin, HeGang Chen, Yuyin Lu, Yanghui Rao, Hao Xu and Hanjiang Lai | Hierarchical Topic Modeling via Contrastive Learning and Hyperbolic Embedding |
1916 | Wajdi Zaghouani, Hamdy Mubarak and Md. Rafiul Biswas | So Hateful! Building a Multi-Label Hate Speech Annotated Arabic Dataset |
1918 | Nezih Younsi, Catherine Pelachaud and Laurence Chaby | Beyond Words: Decoding Facial Expression Dynamics in Motivational Interviewing |
1921 | Ke Zhang, Yimiao Feng and Jie Zheng | Prompt-based Generation of Natural Language Explanations of Synthetic Lethality for Cancer Drug Discovery |
1922 | Miriam Winkler, Virginija Juozapaityte, Rob van der Goot and Barbara Plank | Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants |
1923 | Nesrine Bannour, Christophe Servan, Aurélie Névéol and Xavier Tannier | A Benchmark Evaluation of Clinical Named Entity Recognition in French |
1924 | Alina Karakanta, Mauro Cettolo, Matteo Negri and Luisa Bentivogli | Evaluating Automatic Subtitling: Correlating Post-editing Effort and Automatic Metrics |
1927 | Qiwei Peng, Yekun Chai and Xuhong Li | HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization |
1928 | Tomáš Sourada, Jana Straková and Rudolf Rosa | OOVs in the Spotlight: How to Inflect them? |
1930 | Joshua Miles Jansen van Vüren, Febe De Wet and Thomas Niesler | Automatic Partitioning of a Code-Switched Speech Corpus Using Mixed-Integer Programming |
1931 | Hossam Zawbaa, Wael Rashwan, Sourav Dutta and Haytham Assem | Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification |
1934 | Alessio Miaschi, Felice Dell'Orletta and Giulia Venturi | Linguistic Knowledge Can Enhance Encoder-Decoder Models (If You Let It) |
1937 | Chuang Liu, Renren Jin, Yuqi Ren and Deyi Xiong | LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models |
1942 | Xin Sun, Jiahuan Pei, Jan de Wit, Mohammad Aliannejadi, Emiel Krahmer, Jos T.P. Dobber and Jos A. Bosch | Eliciting Motivational Interviewing Skill Codes in Psychotherapy with LLMs: A Bilingual Dataset and Analytical Study |
1943 | Da Ren and Qing Li | Releasing the Capacity of GANs in Non-Autoregressive Image Captioning |
1944 | Katarzyna Krasnowska-Kieraś and Marcin Woliński | Parsing Headed Constituencies |
1948 | Mithun Das, Saurabh Kumar Pandey and Animesh Mukherjee | Evaluating ChatGPT Against Functionality Tests for Hate Speech Detection |
1954 | Shulin Zhang, John Hale, Margaret Renwick, Zvjezdana Vrzić and Keith Langston | An Evaluation of Croatian ASR Models for Čakavian Transcription |
1956 | Sacha Beniamine, Mari Aigro, Matthew Baerman, Jules Bouton and Maria Copot | Eesthetic: A Paralex Lexicon of Estonian Paradigms |
1959 | Elizaveta Korotkova, Taido Purason, Agnes Luhtaru and Mark Fishel | Multilinguality or Back-translation? A Case Study with Estonian |
1963 | Ibrahim Khalil Khebour, Kenneth Lai, Mariah Bradford, Yifan Zhu, Richard A. Brutti, Christopher Tam, Jingxuan Tu, Benjamin A. Ibarra, Nathaniel Blanchard, Nikhil Krishnaswamy and James Pustejovsky | Common Ground Tracking in Multimodal Dialogue |
1966 | Gustav Ryberg Smidt, Els Lefever and Katrien De Graef | At the Crossroad of Cuneiform and NLP: Challenges for Fine-grained Part-of-speech Tagging |
1968 | Michaela Regneri, Alhassan Abdelhalim and Soeren Laue | Detecting Conceptual Abstraction in LLMs |
1969 | Sugyeong Eo, Jungwoo Lim, Chanjun Park, DaHyun Jung, Seonmin Koo, Hyeonseok Moon, Jaehyung Seo and Heuiseok Lim | Detecting Critical Errors Considering Cross-Cultural Factors in English-Korean Translation |
1973 | Yuansen Zhang, Xiao Wang, Zhiheng Xi, Han Xia, Tao Gui, Qi Zhang and Xuanjing Huang | RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions |
1979 | Seonjeong Hwang, Yunsu Kim and Gary Geunbae Lee | Explainable Multi-hop Question Generation: An End-to-End Approach without Intermediate Question Labeling |
1982 | Shaolin Zhu, Menglong Cui and Deyi Xiong | Towards Robust In-Context Learning for Machine Translation with Large Language Models |
1983 | Ai Ishii, Naoya Inoue, Hisami Suzuki and Satoshi Sekine | JEMHopQA: Dataset for Japanese Explainable Multi-Hop Question Answering |
1985 | Rahul Ponnusamy, Kathiravan Pannerselvam, Saranya R, Prasanna Kumar Kumaresan, Sajeetha Thavareesan, Bhuvaneswari S, Anshid K.A, Susminu S Kumar, Paul Buitelaar and Bharathi Raja Chakravarthi | From Laughter to Inequality: Annotated Dataset for Misogyny Detection in Tamil and Malayalam Memes |
1987 | Prasanna Kumar Kumaresan, Rahul Ponnusamy, Dhruv Sharma, Paul Buitelaar and Bharathi Raja Chakravarthi | Dataset for Identification of Homophobia and Transphobia for Telugu, Kannada, and Gujarati |
1988 | Abhishek Agrawal, Mitja Nikolaus, Benoit Favre and Abdellah Fourtassi | Automatic Coding of Contingency in Child-Caregiver Conversations |
1990 | Janire Arana, Mikel Idoyaga, Maitane Urruela, Elisa Espina, Aitziber Atutxa Salazar and Koldo Gojenola | A Virtual Patient Dialogue System Based on Question-Answering on Clinical Records |
1993 | Olia Toporkov and Rodrigo Agerri | Evaluating Shortest Edit Script Methods for Contextual Lemmatization |
1994 | Daria Romanovna Ledneva and Denis Pavlovich Kuznetsov | Reimagining Intent Prediction: Insights from Graph-Based Dialogue Modeling and Sentence Encoders |
1997 | Ziqian Zeng, Runyu Wu, Yuxiang Xiao, Xiaoda Zhong, Hanlin Wang, Zhengdong Lu and Huiping Zhuang | Zero-shot Event Detection using a Textual Entailment Model as an Enhanced Annotator |
1998 | Wenjie Zhou, Qiang Wang, Mingzhou Xu, Ming Chen and Xiangyu Duan | Revisiting the Self-Consistency Challenges in Multi-Choice Question Formats for Large Language Model Evaluation |
1999 | Bo Xu, Longjiao Li, Wei Luo, Mehdi Naseriparsa, Zhehuan Zhao, Hongfei Lin and Feng Xia | Beyond Linguistic Cues: Fine-grained Conversational Emotion Recognition via Belief-Desire Modelling |
2000 | Adrian Cosma, Ioan-Bogdan Iordache and Paolo Rosso | RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian |
2001 | Paulina Garcia Corral, Hanna Bechara, Ran Zhang and Slava Jankin | PolitiCause: An Annotation Scheme and Corpus for Causality in Political Texts |
2002 | Yang Gao, Ji Ma, Ivan Korotkov, Keith Hall, Dana Alon and Donald Metzler | OpenMSD: Towards Multilingual Scientific Documents Similarity Measurement |
2004 | Olivier Ferret | Language Models and Semantic Relations: a Dual Relationship |
2005 | Léane Isabelle Jourdan, Florian Boudin, Nicolas Hernandez and Richard Dufour | CASIMIR: A Corpus of Scientific Articles enhanced with Multiple Author-Integrated Revisions |
2006 | Zhaobo Zhang, Rui Gan, Pingpeng Yuan and Hai Jin | Correcting Pronoun Homophones with Subtle Semantics in Chinese Speech Recognition |
2008 | Moreno La Quatra, Alkis Koudounas, Elena Baralis and Sabato Marco Siniscalchi | Speech Analysis of Language Varieties in Italy |
2009 | Claudia Collacciani, Andrea Amelio Ravelli and Marianna Bolognesi | Specifying Genericity through Inclusiveness and Abstractness Continuous Scales |
2010 | Jianwei Wang, Tianyin Wang and Ziqian Zeng | On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction |
2011 | Junlin Li, Bo Peng and Yu-Yin Hsu | Emstremo: Adapting Emotional Support Response with Enhanced Emotion-Strategy Integrated Selection |
2014 | Mariana O. Silva and Mirella M. Moro | PPORTAL_ner: An Annotated Corpus of Portuguese Literary Entities |
2015 | Iqra Ali, Hidetaka Kamigaito and Taro Watanabe | Monolingual Paraphrase Detection Corpus for Low Resource Pashto Language at Sentence Level |
2016 | Scott Friedman, Joan Zheng and Hillel Steinmetz | Debiasing Multi-Entity Aspect-Based Sentiment Analysis with Norm-Based Data Augmentation |
2017 | Nazmuddoha Ansary, Quazi Adibur Rahman Adib, Tahsin Reasat, Asif Shahriyar Sushmit, Ahmed Imtiaz Humayun, Sazia Mehnaz, Kanij Fatema, Mohammad Mamun Or Rashid and Farig Sadeque | Unicode Normalization and Grapheme Parsing of Indic Languages |
2023 | Martin Lebourdais, Marie Tahon, Antoine Laurent and Sylvain Meignier | Automatic Speech Interruption Detection: Analysis, Corpus, and System |
2026 | Sergio E. Zanotto, Qi Yu, Miriam Butt and Diego Frassinelli | GRIT: A Dataset of Group Reference Recognition in Italian |
2027 | Ilaria Fiorentini, Marco Forlano and Nicholas Nese | Towards the WhAP Corpus: A resource for the study of Italian on WhatsApp |
2029 | Sebastian Vincent, Rowanne Sumner, Alice Dowek, Charlotte Prescott, Emily Preston, Chris Bayliss, Chris Oakley and Carolina Scarton | Reference-less Analysis of Context Specificity in Translation with Personalised Language Models |
2031 | Artem Abzaliev, Humberto Perez-Espinosa and Rada Mihalcea | Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark Classification |
2034 | Linda Wiechetek, Flammie A. Pirinen, Børre Gaup, Trond Trosterud, Maja Lisa Kappfjell and Sjur Moshagen | The Ethical Question -- Use of Indigenous Corpora for Large Language Models |
2035 | Xulong Du, Xingnan Zhang, Dandan Wang, Yingying Xu, Zhiyuan Wu, Shiqing Zhang, Xiaoming Zhao, Jun Yu and Liangliang Lou | Integrating Representation Subspace Mapping with Unimodal Auxiliary Loss for Attention-based Multimodal Emotion Recognition |
2037 | Nadège Alavoine, Gaëlle Laperrière, Christophe Servan, Sahar Ghannay and Sophie Rosset | New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark |
2039 | Chihiro Taguchi, Jefferson Saransig, Dayana Velásquez and David Chiang | Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information |
2040 | Jan Gorisch and Thomas Schmidt | Evaluating Workflows for Creating Orthographic Transcripts for Oral Corpora by Transcribing from Scratch or Correcting ASR-Output |
2042 | Mohammad Mohammadamini, Driss Matrouf, Michael Rouvier, Jean-Francois Bonastre, Romain Serizel and Theophile Gonos | RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot |
2043 | Feiteng Fang, Liang Zhu, Xi Feng, Jinchang Hou, Qixuan Zhao, Chengming Li, Xiping Hu, Ruifeng Xu and Min Yang | CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment |
2044 | Ana Cimitan, Ana Alves Pinto and Michaela Geierhos | Curation of Benchmark Templates for Measuring Gender Bias in Named Entity Recognition Models |
2045 | Tuan Nguyen, Corinne Fredouille, Alain Ghio, Mathieu Balaguer and Virginie Woisard | Exploring Pathological Speech Quality Assessment with ASR-Powered Wav2Vec2 in Data-Scarce Context |
2046 | Hao Wang, Tang Li, Chenhui Chu, Rui Wang and Pinpin Zhu | Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents |
2050 | Seunghee Han, Sunhee Kim and Minhwa Chung | Constructing Korean Learners' L2 Speech Corpus of Seven Languages for Automatic Pronunciation Assessment |
2052 | Rui Gao, Miaomiao Cheng, Xu Han and Wei Song | High-Order Semantic Alignment for Unsupervised Fine-Grained Image-Text Retrieval |
2055 | Jiaxin Duan, Fengyu Lu and Junfei Liu | Alleviating Exposure Bias in Abstractive Summarization via Sequentially Generating and Revising |
2056 | Ruize Yuan, Xiang Ao, Li Zeng and Qing He | DRAMA: Dynamic Multi-Granularity Graph Estimate Retrieval over Tabular and Textual Question Answering |
2057 | Leiyu Pan, Yongqi Leng and Deyi Xiong | Can Large Language Models Learn Translation Robustness from Noisy-Source In-context Demonstrations? |
2061 | Nobuhiro Ueda, Hideko Habe, Akishige Yuguchi, Seiya Kawano, Yasutomo Kawanishi, Sadao Kurohashi and Koichiro Yoshino | J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution |
2063 | Xiang Li, Zhenyu Li, Chen Shi, Yong Xu, Qing Du, Mingkui Tan and Jun Huang | AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework |
2065 | Guanhua Chen, Yutong Yao, Derek F. Wong and Lidia S. Chao | A Two-Stage Prediction-Aware Contrastive Learning Framework for Multi-Intent NLU |
2071 | Anisia Popescu, Lori Lamel and Ioana Vasilescu | Using Speech Technology to test Theories of Phonetic and Phonological Typology |
2074 | Byungha Kang and Youhyun Shin | Improving Low-Resource Keyphrase Generation through Unsupervised Title Phrase Generation |
2076 | Jiaxin Duan, Fengyu Lu and Junfei Liu | Prophecy Distillation for Boosting Abstractive Summarization |
2077 | Angus Addlesee, Oliver Lemon and Arash Eshghi | Clarifying Completions: Evaluating How LLMs Respond to Incomplete Questions |
2078 | Lingxing Kong, Yougang Chu, Zheng Ma, Jianbing Zhang, Liang He and Jiajun Chen | MixRED: A Mix-lingual Relation Extraction Dataset |
2079 | Xiao Zhang, Heqi Zheng, Yuxiang Nie, Heyan Huang and Xian-Ling Mao | SciMRC: Multi-perspective Scientific Machine Reading Comprehension |
2084 | Zhaolin Li, Monika Rind-Pawlowski and Jan Niehues | Speech Recognition Corpus of the Khinalug Language for Documenting Endangered Languages |
2087 | Eliot Maës, Hossam Boudraa, Philippe Blache and Leonor Becerra-Bonache | Did You Get It? A Zero-Shot Approach to Locate Information Transfers in Conversations |
2092 | Lorenzo Lupo, Paul Bose, Mahyar Habibi, Dirk Hovy and Carlo Schwarz | DADIT: A Dataset for Demographic Classification of Italian Twitter Users and a Comparison of Prediction Methods |
2093 | Massimo Poesio, Maciej Ogrodniczuk, Vincent Ng, Sameer Pradhan, Juntao Yu, Nafise Sadat Moosavi, Silviu Paun, Amir Zeldes, Anna Nedoluzhko, Michal Novák, Martin Popel, Zdeněk Žabokrtský and Daniel Zeman | Universal Anaphora: The First Three Years |
2095 | Daniel G. Swanson, Bryce D. Bussert and Francis Tyers | Producing a Parallel Universal Dependencies Treebank of Ancient Hebrew and Ancient Greek via Cross-Lingual Projection |
2097 | Marcello Ferro, Claudia Marzi, Andrea Nadalini, Loukia Taxitari, Alessandro Lento and Vito Pirrelli | ReadLet: a Dataset for Oral, Visual and Tactile Text Reading Data of Early and Mature Readers |
2100 | Asahi Yoshida, Yoshihide Kato and Shigeki Matsubara | Negation Scope Conversion: Towards a Unified Negation-Annotated Dataset |
2102 | Xiaoyan Zhao, Lingzhi Wang, Zhanghao Wang, Hong Cheng, Rui Zhang and Kam-Fai Wong | PACAR: Automated Fact-Checking with Planning and Customized Action Reasoning using Large Language Models |
2104 | Aswathy Velutharambath, Roman Klinger and Amelie Wührl | Can Factual Statements be Deceptive? The DeFaBel Corpus of Belief-based Deception |
2105 | Wissam Antoun, Benoît Sagot and Djamé Seddah | From Text to Source: Results in Detecting Large Language Model-Generated Content |
2107 | Mikel Zubillaga, Oscar Sainz, Ainara Estarrona, Oier Lopez de Lacalle and Eneko Agirre | Event Extraction in Basque: Typologically motivated Cross-Lingual Transfer-Learning Analysis |
2109 | Leran Zhang and Nora Hollenstein | Eye-Tracking Features Masking Transformer Attention in Question-Answering Tasks |
2111 | Huacheng Song and Hongzhi Xu | Benchmarking the Performance of Machine Translation Evaluation Metrics with Chinese Multiword Expressions |
2112 | Dan Li, Vikrant Yadav, Zi Long Zhu, Maziar Moradi Fard, Zubair Afzal and George Tsatsaronis | Scalable Patent Classification with Aggregated Multi-View Ranking |
2114 | Christophe Servan, Sahar Ghannay and Sophie Rosset | mALBERT: Is a Compact Multilingual BERT Model Still Worth It? |
2117 | Elisa Di Nuovo, Manuela Sanguinetti, Pier Felice Balestrucci, Luca Anselma, Cristian Bernareggi and Alessandro Mazzei | Educational Dialogue Systems for Visually Impaired Students: Introducing a Task-Oriented User-Agent Corpus |
2118 | Stephen Joseph Meisenbacher, Nihildev Nandakumar, Alexandra Klymenko and Florian Matthes | A Comparative Analysis of Word-Level Metric Differential Privacy: Benchmarking The Privacy-Utility Trade-off |
2120 | Verena Blaschke, Barbara Kovačić, Siyao Peng, Hinrich Schütze and Barbara Plank | MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank |
2121 | Huadai Liu, Xu Wenqiang, Xuan Lin, Jingjing Huo, Hong Chen and Zhou Zhao | AntCritic: Argument Mining for Free-Form and Visually-Rich Financial Comments |
2124 | Yuning Ding, Omid Kashefi, Swapna Somasundaran and Andrea Horbach | When Argumentation Meets Cohesion: Enhancing Automatic Feedback in Student Writing |
2125 | Honglin Mu, Yang Xu, Yunlong Feng, Xiaofeng Han, Yitong Li, Yutai Hou and Wanxiang Che | Beyond Static Evaluation: A Dynamic Approach to Assessing AI Assistants' API Invocation Capabilities |
2130 | Yue Li and Carolina Scarton | Can We Identify Stance Without Target Arguments? A Study for Rumour Stance Classification |
2131 | Fabrizio Nunnari, Eleftherios Avramidis, Cristina España-Bonet, Marco González, Anna Hennes and Patrick Gebhard | DGS-Fabeln-1: A Multi-Angle Parallel Corpus of Fairy Tales between German Sign Language and German Text |
2134 | Fuqiang Niu, Min Yang, Ang Li, Baoquan Zhang, Xiaojiang Peng and Bowen Zhang | A Challenge Dataset and Effective Models for Conversational Stance Detection |
2135 | Lorenzo Proietti, Stefano Perrella, Simone Tedeschi, Giulia Vulpis, Leonardo Lavalle, Andrea Sanchietti, Andrea Ferrari and Roberto Navigli | Analyzing Homonymy Disambiguation Capabilities of Pretrained Language Models |
2136 | Camille Challant and Michael Filhol | Extending AZee with Non-manual Gesture Rules for French Sign Language |
2141 | Nathan Godey, Éric de la Clergerie and Benoît Sagot | On the Scaling Laws of Geographical Representation in Language Models |
2143 | Sondes Abderrazek, Corinne Fredouille, Alain Ghio, Muriel Lalain, Christine Meunier, Mathieu Balaguer and Virginie Woisard | Interpretable Assessment of Speech Intelligibility using Deep Learning: A Case Study on Speech Disorders due to Head and Neck Cancers |
2144 | Shuoran Jiang, Qingcai Chen, Yang Xiang, Youcheng Pan and Yukang Lin | Linguistic Rule Induction Improves Adversarial and OOD Robustness in Large Language Models |
2145 | Eleanor Chodroff, Blaž Pažon, Annie Baker and Steven Moran | Phonetic Segmentation of the UCLA Phonetics Lab Archive |
2147 | Sebastian Schuster, Ayesha Ansar, Om Agarwal and Vera Demberg | SpreadNaLa: A Naturalistic Code Generation Evaluation Dataset of Spreadsheet Formulas |
2148 | Ines Reinig, Ines Rehbein and Simone Paolo Ponzetto | How to do politics with words: Investigating speech acts in parliamentary debates |
2151 | Chunyan Zheng, Keke Sun, Wenhao Zhao, Haibo Zhou, Lixing Jiang, Shaoyang Song and Chunlai Zhou | Locally Differentially Private In-Context Learning |
2152 | Yuchen Fan, Yantao Liu, Zijun Yao, Jifan Yu, Lei Hou and Juanzi Li | Evaluating Generative Language Models in Information Extraction as Subjective Question Correction |
2153 | Weihao Zhao, Weidong He, Hao Wang, Haoyang Bi, Han Wu, Chen Zhu, Tong Xu and Enhong Chen | MRT: Multi-modal Short- and Long-range Temporal Convolutional Network for Time-sync Comment Video Behavior Prediction |
2158 | Sandy Ritchie, Daan van Esch, Uche Okonkwo, Shikhar Vashishth and Emily Drummond | LinguaMeta: Unified Metadata for Thousands of Languages |
2161 | Hichem Ammar Khodja, Frederic Bechet, Quentin Brabant, Alexis Nasr and Gwénolé Lecorvé | WikiFactDiff: A Large, Realistic, and Temporally Adaptable Dataset for Atomic Factual Knowledge Update in Causal Language Models |
2162 | Francesca Zermiani, Prajit Dhar, Ekta Sood, Fabian Kögel, Andreas Bulling and Maria Wirzberger | InteRead: An Eye Tracking Dataset of Interrupted Reading |
2163 | Jian Zhang, Changlin Yang, Haiping Zhu, Qika Lin, Fangzhi Xu and Jun Liu | A Semantic Mention Graph Augmented Model for Document-Level Event Argument Extraction |
2166 | Mustafa Jarrar and Tymaa Hasanain Hammouda | Qabas: An Open-Source Arabic Lexicographic Database |
2167 | Maxim K. Surkov and Ivan P. Yamshchikov | Vygotsky Distance: Measure for Benchmark Task Similarity |
2168 | Xin Liu, Hongwei Sun, Shaojie Dai, Bo Lv, Youcheng Pan, Hui Wang and Yue Yu | A Lifelong Multilingual Multi-granularity Semantic Alignment Approach via Maximum Co-occurrence Probability |
2170 | Natalia Loukachevitch, Andrey Sakhovskiy and Elena Tutubalina | Biomedical Concept Normalization over Nested Entities with Partial UMLS Terminology in Russian |
2171 | Chenhao Wang, Pengfei Cao, Jiachun Li, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Li Qiuxia and Jun Zhao | Leros: Learning Explicit Reasoning on Synthesized Data for Commonsense Question Answering |
2172 | Naziya Mahamdul Shaikh, Jyoti D. Pawar and Mubarak Banu Sayed | Konidioms Corpus: A Dataset of Idioms in Konkani Language |
2173 | Pin-Jie Lin, Merel Scholman, Muhammed Saeed and Vera Demberg | Modeling Orthographic Variation Improves NLP Performance for Nigerian Pidgin |
2174 | Hay Man Htun, Ye Kyaw Thu, Hutchatai Chanlekha, Kotaro Funakoshi and Thepchai Supnithi | myMediCon: End-to-End Burmese Automatic Speech Recognition for Medical Conversations |
2175 | Nadège Alavoine, Maximin Coavoux, Emmanuelle Esperanca-Rodier, Romane Gallienne, Carlos Gonzalez Gallardo, Jérôme Goulian, Jose G. Moreno, Aurélie Névéol, Didier Schwab, Vincent Segonne and Johanna Simoens | Limitations of Human Identification of Automatically Generated Text |
2176 | Eleni Metheniti, Philippe Muller, Chloé Braud and Margarita Hernández Casas | Zero-shot learning for multilingual discourse relation classification |
2178 | Daan van Esch, Sandy Ritchie, Sebastian Ruder, Julia Kreutzer, Clara Rivera, Ishank Saxena and Isaac Caswell | Connecting Language Technologies with Rich, Diverse Data Sources Covering Thousands of Languages |
2179 | Phil Sidney Ostheimer, Mayank Kumar Nagda, Marius Kloft and Sophie Fellenz | Text Style Transfer Evaluation Using Large Language Models |
2180 | Zifan Jiang, Anne Göhring, Amit Moryossef, Rico Sennrich and Sarah Ebling | SwissSLi: the Multi-parallel Sign Language Corpus for Switzerland |
2186 | Agnieszka Karlinska, Cezary Rosiński, Marek Kubis, Patryk Hubar and Jan Wieczorek | Using Bibliodata LODification to Create Metadata-Enriched Literary Corpora in Line with FAIR Principles |
2187 | Abelardo Carlos Martinez Lorenzo and Roberto Navigli | Efficient AMR parsing with CLAP: Compact Linearization with an Adaptable Parser |
2189 | Fahad Khan, Maxim Ionov, Christian Chiarcos, Laurent Romary, Gilles Sérasset and Besim Kabashi | On Modelling Corpus Citations in Computational Lexical Resources |
2191 | Andrea Gulli, Francesco Costantini, Diego Sidraschi and Emanuela Li Destri | Fine-Tuning a Pre-Trained Wav2Vec2 Model for Automatic Speech Recognition - Experiments with de zahrar sproche |
2198 | Shuo Yang | A Trusted Multi-View Evidential Fusion Framework for Commonsense Reasoning |
2199 | Ona de Gibert, Graeme Nail, Nikolay Arefyev, Marta Bañón, Jelmer van der Linde, Shaoxiong Ji, Jaume Zaragoza-Bernabeu, Mikko Aulamo, Gema Ramírez-Sánchez, Andrey Kutuzov, Sampo Pyysalo, Stephan Oepen and Jörg Tiedemann | A New Massive Multilingual Dataset for High-Performance Language Technologies |
2200 | Matej Klemen, Aleš Žagar, Jaka Čibej and Marko Robnik-Šikonja | SI-NLI: A Slovene Natural Language Inference Dataset and its Evaluation |
2201 | Alice Millour, Lorenza Brasile, Alberto Ghia and Laurent Kevers | Agettivu, Aggitivu o Aghjettivu? POS Tagging Corsican Dialects |
2203 | Josef Ruppenhofer, Matthias Schwendemann, Annette Portmann, Katrin Wisniewski and Torsten Zesch | Every Verb in its Right Place? A Roadmap for Operationalizing Developmental Stages in the Acquisition of L2 German |
2204 | Atilla Kaan Alkan, Felix Grezes, Cyril Grouin, Fabian Schussler and Pierre Zweigenbaum | Enriching a Time-Domain Astrophysics Corpus with Named Entity, Coreference and Astrophysical Relationship Annotations |
2206 | Recep Firat Cekinel, Çağrı Çöltekin and Pinar Karagoz | Cross-Lingual Learning vs. Low-Resource Fine-Tuning: A Case Study with Fact-Checking in Turkish |
2207 | Chloe Sekkat, Fanny Leroy, Salima Mdhaffar, Blake Perry Smith, Yannick Estève, Joseph Dureau and Alice Coucke | Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants |
2208 | Qianlong Wang, Hongling Xu, Keyang Ding, Bin Liang and Ruifeng Xu | In-Context Example Retrieval from Multi-Perspectives for Few-Shot Aspect-Based Sentiment Analysis |
2213 | Nhu Vo, Dat Quoc Nguyen, Dung D. Le, Massimo Piccardi and Wray Buntine | Improving Vietnamese-English Medical Machine Translation |
2214 | Wajdi Zaghouani, Abdelhamid Ahmed, Xiao Zhang and Lameya Rezk | QCAW 1.0: Building a Qatari Corpus of Student Argumentative Writing |
2218 | Gennaro Nolano, Moritz Blum, Basil Ell and Philipp Cimiano | Pointing out the Shortcomings of Relation Extraction Models with Semantically Motivated Adversarials |
2219 | Rian Touchent and Éric de la Clergerie | CamemBERT-bio: Leveraging Continual Pre-training for Cost-Effective Models on French Biomedical Data |
2220 | Darinka Verdonik, Kaja Dobrovoljc, Tomaž Erjavec and Nikola Ljubešić | Gos 2: A New Reference Corpus of Spoken Slovenian |
2224 | Jin-seo Kim, Anna Seo Gyeong Choi and Sunghye Cho | KoFREN: Comprehensive Korean Word Frequency Norms Derived from Large Scale Free Speech Corpora |
2225 | Hamdy Mubarak, Hend Al-Khalifa and Khaloud Suliman Alkhalefah | Halwasa: Quantify and Analyze Hallucinations in Large Language Models: Arabic as a Case Study |
2226 | Amanda Cercas Curry, Zeerak Talat and Dirk Hovy | Impoverished Language Technology: The Lack of (Social) Class in NLP |
2227 | Wonkee Lee, Seong-Hwan Heo and Jong-Hyeok Lee | Advancing Semi-Supervised Learning for Automatic Post-Editing: Data-Synthesis by Mask-Infilling with Erroneous Terms |
2229 | Filip Dobranić, Bojan Evkoski and Nikola Ljubešić | A Lightweight Approach to a Giga-Corpus of Historical Periodicals: The Story of a Slovenian Historical Newspaper Collection |
2231 | D. Fortuné Kponou, Fréjus A. A. Laleye and Eugène Cokou Ezin | FFSTC: Fongbe to French Speech Translation Corpus |
2232 | Yuqing Zhang, Tessa Verhoef, Gertjan van Noord and Arianna Bisazza | Endowing Neural Language Learners with Human-like Biases: A Case Study on Dependency Length Minimization |
2235 | Pietro Giovanni Bizzaro, Elena Della Valentina, Maurizio Napolitano, Nadia Mana and Massimo Zancanaro | Annotation and Classification of Relevant Clauses in Terms-and-Conditions Contracts |
2237 | Jaione Bengoetxea, Yi-Ling Chung, Marco Guerini and Rodrigo Agerri | Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation |
2239 | Antonio F. G. Sevilla, José María Lahoz-Bengoechea and Alberto Diaz | Automated Extraction of Prosodic Structure from Unannotated Sign Language Video |
2240 | Samee Arif, Sualeha Farid, Awais Athar and Agha Ali Raza | UQA: Corpus for Urdu Question Answering |
2241 | Peteris Paikens, Lauma Pretkalniņa and Laura Rituma | A Computational Model of Latvian Morphology |
2242 | Ulla Petti and Anna Korhonen | LoSST-AD: A Longitudinal Corpus for Tracking Alzheimer's Disease Related Changes in Spontaneous Speech |
2243 | Anik Das, Milton King and James Alexander Hughes | Exploring BERT-Based Classification Models for Detecting Phobia Subtypes: A Novel Tweet Dataset and Comparative Analysis |
2245 | Auriane Boudin, Stéphane Rauzy, Roxane Bertrand, Magalie Ochs and Philippe Blache | The Distracted Ear: How Listeners Shape Conversational Dynamics |
2247 | Natalia Kalashnikova, Ioana Vasilescu and Laurence Devillers | Linguistic Nudges and Verbal Interaction with Robots, Smart-Speakers, and Humans |
2249 | Yufei Tao, Ameeta Agrawal, Judit Dombi, Tetyana Sydorenko and Jung In Lee | ChatGPT Role-play Dataset: Analysis of User Motives and Model Naturalness |
2251 | Miguel Da Corte and Jorge Baptista | Charting the Linguistic Landscape of Developing Writers: an Annotation Scheme for Enhancing Native Language Proficiency |
2252 | Siyang Wang and Eva Szekely | Evaluating Text-to-Speech Synthesis from a Large Discrete Token-based Speech Language Model |
2253 | Gunjan Bhattarai and Katrin Erk | To Learn or Not to Learn: Replaced Token Detection for Learning the Meaning of Negation |
2255 | Weihang Ye, Peng Zhang, Jing Zhang, Hui Gao and Moyao Wang | Distilling Causal Effect of Data in Continual Few-shot Relation Learning |
2256 | Patrick Haller, Jonas Golde and Alan Akbik | PECC: Problem Extraction and Coding Challenges |
2257 | Loïc Grobol and Mélanie Jouitteau | ARBRES Kenstur: a Breton-French Parallel Corpus Rooted in Field Linguistics |
2258 | S. Magalí López Cortez, Mark Josef Norris and Steve Duman | GMEG-EXP: A Dataset of Human- and LLM-Generated Explanations of Grammatical and Fluency Edits |
2259 | Feng Jiang, Weihao Liu, Xiaomin Chu, Peifeng Li, Qiaoming Zhu and Haizhou Li | Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark |
2261 | Pusheng Liu, Lianwei Wu, Linyong Wang, Sensen Guo and Yang Liu | Step-by-Step: Controlling Arbitrary Style in Text with Large Language Models |
2264 | Gamze Goren and Carlo Strapparava | Context Matters: Enhancing Metaphor Recognition in Proverbs |
2265 | Toyin D. Aguda, Suchetha Siddagangappa, Elena Kochkina, Simerjot Kaur, Dongsheng Wang and Charese Smiley | Large Language Models as Financial Data Annotators: A Study on Effectiveness and Efficiency |
2269 | Raghuveer Thirukovalluru, Nicholas Monath, Bhuwan Dhingra and Sam Wiseman | Sequence Reducible Holdout Loss for Language Model Pretraining |
2271 | Saeed Ahmadnia, Arash Yousefi Jordehi, Mahsa Hosseini Khasheh Heyran, SeyedAbolghasem Mirroshandel and Owen Rambow | Opinion Mining Using Pre-Trained Large Language Models: Identifying the Type, Polarity, Intensity, Expression, and Source of Private States |
2272 | Lucia Pitarch, Carlos Bobed Lisbona, David Abián, Jorge Gracia and Jordi Bernad | Building MUSCLE, a Dataset for MUltilingual Semantic Classification of Links between Entities |
2276 | Sudipta Singha Roy and Robert E. Mercer | Enhancing Scientific Document Summarization with Research Community Perspective and Background Knowledge |
2277 | Haoran Zhang and Tan Yongmei | Enhancing Knowledge Selection via Multi-level Document Semantic Graph |
2281 | Peter Mihajlik, Katalin Mády, Anna Kohári, Fruzsina Sára Fruzsina, Gábor Kiss, Tekla Etelka Gráczi and A. Seza Doğruöz | Is Spoken Hungarian Low-resource?: A Quantitative Survey of Hungarian Speech Data Sets |
2285 | Anis Charfi, Mabrouka Ben-Sghaier, Andria Samy Raouf Atalla, Raghda Akasheh, Sara Al-Emadi and Wajdi Zaghouani | MARASTA: A Multi-dialectal Arabic Cross-domain Stance Corpus |
2286 | Ruiting Shao, Ryan Schwarz, Christopher Clifton and Edward Delp | A Natural Approach for Synthetic Short-Form Text Analysis |
2288 | Fatemah Yousef Almeman, Steven Schockaert and Luis Espinosa Anke | WordNet under Scrutiny: Dictionary Examples in the Era of Large Language Models |
2289 | Špela Arhar Holdt, Tomaž Erjavec, Iztok Kosem and Elena Volodina | Towards an Ideal Tool for Learner Error Annotation |
2290 | Didem Sedefoglu, Allison Claire Lahnala, Jasmin Wagner, Lucie Flek and Sandra Ohly | LeadEmpathy: An Expert Annotated German Dataset of Empathy in Written Leadership Communication |
2293 | Cheril Shah, Yashashree Chandak, Atharv Mahesh Mane, Benjamin Bergen and Tyler A. Chang | Correlations Between Multilingual Language Model Geometry and Crosslingual Transfer Performance |
2295 | Eleni Vligouridou, Inessa Iliadou and Çağrı Çöltekin | A Treebank of Asia Minor Greek |
2297 | Giulia Pucci and Leonardo Ranaldi | Does the Language Matter? Curriculum Learning over Neo-Latin Languages |
2304 | Frances Yung, Merel Scholman, Sarka Zikanova and Vera Demberg | DiscoGeM 2.0: A Parallel Corpus of English, German, French and Czech Implicit Discourse Relations |
2305 | Katsumi Ibaraki, Winston Wu, Lu Wang and Rada Mihalcea | Analyzing Occupational Distribution Representation in Japanese Language Models |
2307 | Aditya Kamlesh Parikh, Louis ten Bosch and Henk van den Heuvel | Ensembles of Hybrid and End-to-End Speech Recognition |
2308 | Zhaomin Xiao, Eduardo Blanco and Yan Huang | Analyzing Large Language Models' Capability in Location Prediction |
2312 | Anna Laskina, Eric Gaussier and Gaelle Calvary | A Closer Look at Clustering Bilingual Comparable Corpora |
2313 | Shaina Ashraf, Isabel Bezzaoui, Ionut Andone, Alexander Markowetz, Jonas Fegert and Lucie Flek | DeFaktS: A German Dataset for Fine-Grained Disinformation Detection through Social Media Framing |
2314 | Mohammed Hossein Jafari Harandi, Fatemeh Azadi, Mohammad Javad Dousti and Heshaam Faili | EPOQUE: An English-Persian Quality Estimation Dataset |
2315 | Rachel Beeson, Dmitry Sityaev and Kris Y. Hong | Comparing Phonemisation Methods for Multidialectal Spanish Large Vocabulary Continuous Speech Recognition |
2316 | W. Victor Yarlott, Anurag Acharya, Diego Castro Estrada, Diana Gomez and Mark Finlayson | GOLEM: GOld standard for Learning and Evaluation of Motifs |
2319 | Christin Müller and Barbara Plank | IndirectQA: Understanding Indirect Answers to Implicit Polar Questions in French and Spanish |
2322 | Mercè Vàzquez | Creating Terminological Resources in the Digital Age for Less-resourced Languages |
2329 | Kiamehr Rezaee, Jose Camacho-Collados and Mohammad Taher Pilehvar | TweetTER: A Benchmark for Target Entity Retrieval on Twitter without Knowledge Bases |
2331 | Matthias Sperber, Ondřej Bojar, Barry Haddow, Dávid Javorský, Xutai Ma, Matteo Negri, Jan Niehues, Peter Polák, Elizabeth Salesky, Katsuhito Sudoh and Marco Turchi | Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation |
2332 | Oana Ignat, Zhijing Jin, Artem Abzaliev, Laura Biester, Santiago Castro, Naihao Deng, Xinyi Gao, Aylin Ece Gunal, Jacky He, Ashkan Kazemi, Muhammad Khalifa, Namho Koh, Andrew Lee, Siyang Liu, Do June Min, Shinka Mori, Joan C. Nwatu, Veronica Perez-Rosas, Siqi Shen, Zekun Wang, Winston Wu and Rada Mihalcea | Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models |
2335 | Corentin Masson and Patrick Paroubek | Evaluating Topic Model on Asymmetric and Multi-Domain Financial Corpus |
2336 | Robert Vacareanu, Enrique Noriega-Atala, Gus Hahn-Powell, Marco A. Valenzuela-Escarcega and Mihai Surdeanu | Active Learning Design Choices for NER with Transformers |
2338 | Piroska Lendvai, Maarten van Gompel, Anna Jouravel, Elena Renje, Uwe Reichel, Achim Rabus and Eckhart Arnold | A Workflow for HTR-Postprocessing, Labeling and Classifying Diachronic and Regional Variation in Pre-Modern Slavic Texts |
2339 | Danial Kamali, Joseph D. Romain, Huiyi Liu, Wei Peng, Jingbo Meng and Parisa Kordjamshidi | Using Persuasive Writing Strategies to Explain and Detect Health Misinformation |
2342 | Minghan Li and Eric Gaussier | Domain Adaptation for Dense Retrieval and Conversational Dense Retrieval through Self-Supervision by Meticulous Pseudo-Relevance Labeling |
2343 | Sarah E. Finch, James D. Finch and Jinho D. Choi | Exploring the Impact of Human Evaluator Group on Chat-Oriented Dialogue Evaluation |
2346 | Patrick Littell, Darlene Stewart, Fineen Davis, Aidan Pine and Roland Kuhn | Gramble: A tabular programming language for collaborative linguistic modeling |
2348 | Valentino Frasnelli and Alessio Palmero Aprosio | There's Something New about the Italian Parliament: the IPSA Corpus |
2351 | Abhisek Tiwari, Shreyangshu Bera, Preeti Verma, Jaithra Varma Manthena, Sriparna Saha, Pushpak Bhattacharyya, Minakshi Dhar and Sarbajeet Tiwari | Seeing is believing! Towards Knowledge-Infused Multi-modal Medical Dialogue Generation |
2353 | Enrica Troiano and Piek T.J.M. Vossen | CLAUSE-ATLAS: A Corpus of Narrative Information to Scale Up Computational Literary Analysis |
2354 | Zhuo Chen, Zhao Zhang, Zixuan Li, Fei Wang, Yutao Zeng, Xiaolong Jin and Yongjun Xu | Self-Improvement Programming for Temporal Knowledge Graph Question Answering |
2356 | Yilun Zhu, Siyao Peng, Sameer Pradhan and Amir Zeldes | SPLICE: A Singleton-Enhanced PipeLIne for Coreference REsolution |
2357 | Siva Uday Sampreeth Chebolu, Franck Dernoncourt, Nedim Lipka and Thamar Solorio | OATS: A Challenge Dataset for Opinion Aspect Target Sentiment Joint Detection for Aspect-Based Sentiment Analysis |
2358 | Ann Bies, Jennifer Tracey, Ann O'Brien, Song Chen and Stephanie Strassel | Spanless Event Annotation for Corpus-Wide Complex Event Understanding |
2359 | Stephen Bothwell, Brian DuSell, David Chiang and Brian Krostenko | PILA: A Historical-Linguistic Dataset of Proto-Italic and Latin |
2360 | Konrad Wojtasik, Kacper Wołowiec, Vadim Shishkin, Arkadiusz Janz and Maciej Piasecki | BEIR-PL: Zero Shot Information Retrieval Benchmark for the Polish Language |
2361 | Rinalds Vīksna and Inguna Skadiņa | MultiLeg: Dataset for Text Sanitisation in Less-resourced Languages |
2363 | Arezoo Hatefi, Anton Eklund and Mona Forsman | PromptStream: Self-Supervised News Story Discovery Using Topic-Aware Article Representations |
2367 | Mohit Singh Tomar, Tulika Saha, Abhisek Tiwari and Sriparna Saha | Action and Reaction go hand in hand! A Multi-modal Dialogue Act aided Sarcasm Identification |
2368 | Andrey Kutuzov, Mariia Fedorova, Dominik Schlechtweg and Nikolay Arefyev | Enriching Word Usage Graphs with Cluster Definitions |
2375 | Oana Ignat, Longju Bai, Joan C. Nwatu and Rada Mihalcea | Annotations on a Budget: Leveraging Geo-Data Similarity to Balance Model Performance and Annotation Cost |
2377 | Kenneth Lai, Richard Brutti, Lucia Donatelli and James Pustejovsky | Encoding Gesture in Multimodal Dialogue: Creating a Corpus of Multimodal AMR |
2380 | Dana Dannélls, Richard Johansson and Lucy Yang Buhr | Transformer-based Swedish Semantic Role Labeling through Transfer Learning |
2382 | Nada Elsharawi and Alia El Bolock | C-Journal: A Journaling Application for Detecting and Classifying Cognitive Distortions using Deep-Learning based on a Crowd-sourced Dataset |
2389 | Liviu P. Dinu, Ana Sabina Uban, Ioan-Bogdan Iordache, Alina Maria Cristea, Simona Georgescu and Laurentiu Zoicas | Pater incertus? There is a Solution: Automatic Discrimination between Cognates and Borrowings for Romance Languages |
2390 | Vijeta Deshpande, Minhwa Lee, Zonghai Yao, Zihao Zhang, Jason Brian Gibbons and Hong Yu | LocalTweets to LocalHealth: A Mental Health Surveillance Framework Based on Twitter Data |
2395 | Kamila Górska, John Lawrence and Chris Reed | FORECAST2023: A Forecast and Reasoning Corpus of Argumentation Structures |
2402 | Rodolfo Joel Zevallos, John E. Ortega and Benjamin Irving | Related Work is All you Need |
2405 | Biswadip Mandal, Xiangci Li and Jessica Ouyang | Contextualizing Generated Citation Texts |
2406 | Benno Kruit, Yiming Xu and Jan-Christoph Kalo | Retrieval-based Question Answering with Passage Expansion using a Knowledge Graph |
2407 | Pedro P. V. Brum, Mariana O. Silva, Gabriel P. Oliveira, Lucas G. L. Costa, Anisio Lacerda and Gisele Pappa | Unsupervised Grouping of Public Procurement Similar Items: Which text representation should I use? |
2408 | Stefano Menini | Semantic Frame Extraction in Multilingual Olfactory Events |
2410 | Roberts Dargis, Arturs Znotins, Ilze Auzina, Baiba Saulite, Sanita Reinsone, Raivis Dejus, Antra Klavinska and Normunds Gruzitis | BalsuTalka.lv - Boosting the Common Voice Corpus for Low-Resource Languages |
2411 | Harm Lameris, Eva Szekely and Joakim Gustafson | The Role of Creaky Voice in Turn Taking and the Perception of Speaker Stance: Experiments Using Controllable TTS |
2412 | Severino Da Dalt, Joan Llop, Irene Baucells, Marc Pamies, Yishi Xu, Aitor Gonzalez-Agirre and Marta Villegas | FLOR: On the Effectiveness of Language Adaptation |
2413 | Ruth M. Holmes, Ellen Rushe and Anthony Ventresque | The Key Points: Using Feature Importance to Identify Shortcomings in Sign Language Recognition Models |
2416 | Eri Onami, Shuhei Kurita, Taiki Miyanishi and Taro Watanabe | JDocQA: Japanese Document Question Answering Dataset for Generative Language Models |
2417 | Mohsinul Kabir, Mohammed Saidul Islam, Md Tahmid Rahman Laskar, Mir Tafseer Nayeem, M Saiful Bari and Enamul Hoque | BenLLM-Eval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP |
2422 | Dominika Ďurišková, Daniela Jurášová, Matúš Žilinec, Eduard Šubert and Ondřej Bojar | Khan Academy Corpus: A multilingual corpus of Khan Academy lectures |
2424 | Kexuan Sun, Nicolaas Paul Jedema, Karishma Sharma, Ruben Janssen, Jay Pujara, Pedro Szekely and Alessandro Moschitti | Efficient and Accurate Contextual Re-Ranking for Knowledge Graph Question Answering |
2425 | Nhat Tran and Diane Litman | Enhancing Knowledge Retrieval with Topic Modeling for Knowledge-Grounded Dialogue |
2428 | Chih-Chen Chen, William Chen, Rodolfo Joel Zevallos and John E. Ortega | Evaluating Self-Supervised Speech Representations for Indigenous American Languages |
2431 | Claire Bonial and Harish Tayyar Madabushi | A Construction Grammar Corpus of Varying Schematicity: A Dataset for the Evaluation of Abstractions in Language Models |
2436 | Anu Singh and Esme Manandise | A Typology of Errors for User Utterances in Chatbots |
2439 | Christian Chiarcos, Ranka Stanković, Maxim Ionov and Gilles Sérasset | Bridging Computational Lexicography and Corpus Linguistics: A Query Extension for OntoLex-FrAC |
2440 | Salima Mdhaffar, Fethi Bougares, Renato De Mori, Salah Zaiem, Mirco Ravanelli and Yannick Estève | TARIC-SLU: A Tunisian Benchmark Dataset For Spoken Language Understanding |
2441 | Muhammad Morsy Elmallah, Mahmoud Reda, Kareem Darwish, Abdelrahman El-Sheikh, Ashraf Hatim Elneima, Murtadha Aljubran, Nouf Alsaeed, Reem Mohammed and Mohamed Al-Badrashiny | Arabic Diacritization Using Morphologically Informed Character-Level Model |
2442 | Dhia Elhak Goumri, Abhishek Agrawal, Mitja Nikolaus, Hong Duc Thang Vu, Kübra Bodur, Elias Emmar, Cassandre Armand, Chiara Mazzocconi, Shreejata Gupta, Laurent Prévot, Benoit Favre, Leonor Becerra-Bonache and Abdellah Fourtassi | CHICA: A Developmental Corpus of Child-Caregiver's Face-to-face vs. Video Call Conversations in Middle Childhood |
2443 | Franco Alberto Cardillo and Franca Debole | Italian Word Embeddings for the Medical Domain |
2446 | Nikita Martynov, Aleksei Goncharov, Gleb Kumichev, Evgeniy Egorov, Stanislav Vladimirovich Pavlov, Mikhail Sergeevich Durinov, Aleksandr Sergeevich Zuev and Egor Anatolievich Filimonov | On the Way to Lossless Compression of Language Transformers: Exploring Cross-Domain Properties of Quantization |
2448 | Josef Jon and Ondřej Bojar | GAATME: A Genetic Algorithm for Adversarial Translation Metrics Evaluation |
2451 | Lydia Nishimwe, Benoît Sagot and Rachel Bawden | Making Sentence Embeddings Robust to User-Generated Content |
2456 | Philipp Sadler, Sherzod Hakimov and David Schlangen | Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies |
2461 | Michael Andrew Orme, Yanchao Yu and Zhiyuan Tan | How Much do Robots Understand Rudeness? Challenges in Human-Robot Interaction |
2464 | Akash Ghosh, Venkata Sahith Bathini, Niloy Ganguly, Pawan Goyal and Mayank Singh | How Robust are the QA Models for Hybrid Scientific Tabular Data? A Study using Customized Dataset |
2465 | Zhijian Li, Stefan Larson and Kevin Leach | Generating Hard-Negative Out-of-Scope Data with ChatGPT for Intent Classification |
2466 | Chengzu Li, Chao Zhang, Simone Teufel, Rama Sanand Doddipatla and Svetlana Stoyanchev | Semantic Map-based Generation of Navigation Instructions |
2467 | Chloé Braud, Amir Zeldes, Laura Rivière, Yang Janet Liu, Philippe Muller, Damien Sileo and Tatsuya Aoyama | DISRPT: A Multilingual, Multi-domain, Cross-framework Benchmark for Discourse Processing |
2468 | Christoph Otto, Jonas Groschwitz, Alexander Koller, Xiulin Yang and Lucia Donatelli | A Corpus of German Abstract Meaning Representation (DeAMR) |
2470 | Tiberiu Sosea, Junyi Jessy Li and Cornelia Caragea | Sarcasm Detection in a Disaster Context |
2471 | Stefanie Dipper, Cora Haiber, Anna Maria Schröter, Alexandra Wiemann and Maike Brinkschulte | Universal Dependencies: Extensions for Modern and Historical German |
2473 | Sajad Ramezani, Mauzama Firdaus and Lili Mou | Claim-Centric And Sentiment Guided Graph Attention Network for Rumour Detection |
2475 | Dagmar Gromann, Hugo Goncalo Oliveira, Lucia Pitarch, Elena-Simona Apostol, Jordi Bernad, Eliot Bytyçi, Chiara Cantone, Sara Carvalho, Francesca Frontini, Radovan Garabik, Jorge Gracia, Letizia Granata, Fahad Khan, Timotej Knez, Penny Labropoulou, Chaya Liebeskind, Maria Pia di Buono, Ana Ostroški Anić, Sigita Rackevičienė, Ricardo Rodrigues, Gilles Sérasset, Linas Selmistraitis, Mahammadou Sidibé, Purificação Silvano, Blerina Spahiu, Enriketa Sogutlu, Ranka Stanković, Ciprian-Octavian Truică, Giedrė Valūnaitė Oleškevičienė, Slavko Zitnik and Katerina Zdravkova | MultiLexBATS: Multilingual Dataset of Lexical Semantic Relations |
2476 | Fangru Lin, Daniel Altshuler and Janet B. Pierrehumbert | Probing Large Language Models for Scalar Adjective Lexical Semantics and Scalar Diversity Pragmatics |
2478 | Navneet Agarwal, Kirill Milintsevich, Lucie Metivier, Maud Rotharmel, Gaël Dias and Sonia Dollfus | Analyzing Symptom-based Depression Level Estimation through the Prism of Psychiatric Expertise |
2480 | Flavio Petruzzellis, Alberto Testolin and Alessandro Sperduti | Benchmarking GPT-4 on Algorithmic Problems: A Systematic Evaluation of Prompting Strategies |
2485 | Hillary Dawkins, Isar Nejadgholi, Daniel Gillis and Judi McCuaig | Projective Methods for Mitigating Gender Bias in Pre-trained Language Models |
2488 | Ahmad Aljanaideh | New Evaluation Methodology for Qualitatively Comparing Classification Models |
2489 | Mina Valizadeh, Vera C. Kaelin, Mary A. Khetani and Natalie Parde | CareCorpus: A Corpus of Real-World Solution-Focused Caregiver Strategies for Personalized Pediatric Rehabilitation Service Design |
2492 | Frances Adriana Laureano De Leon, Harish Tayyar Madabushi and Mark Lee | Code-Mixed Probes Show How Pre-Trained Models Generalise On Code-Switched Text |
2493 | Ann-Sophie Gnehm and Simon Clematide | Mapping Work Task Descriptions from German Job Ads on the O*NET Work Activities Ontology |
2496 | Song Chen, Jennifer Tracey, Ann Bies and Stephanie Strassel | Schema Learning Corpus: Data and Annotation Focused on Complex Events |
2501 | Christy Doran and Deborah A. Dahl | It's Not Under the Lamppost: Expanding the Reach of Conversational AI |
2506 | Dirk Väth, Lindsey Vanderlyn and Ngoc Thang Vu | Towards a Zero-Data, Controllable, Adaptive Dialog System |
2507 | Valentin Barriere and Sebastian Cifuentes | Are Text Classifiers Xenophobic? A Country-Oriented Bias Detection Method With Least Confounding Variables |
2508 | Md Nishat Raihan, Sadiya Sayara Chowdhury Puspo, Md Shafkat Rahman Farabi, Ana-Maria Bucur, Tharindu Ranasinghe and Marcos Zampieri | MentalHelp: A Multi-Task Dataset for Mental Health in Social Media |
2510 | Abhinav Sukumar Rao, Atharva Roshan Naik, Sachin Vashistha, Somak Aditya and Monojit Choudhury | Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks |
2514 | T. Mark Ellison and Fahime Same | Experimental versus In-Corpus Variation in Referring Expression Choice |
2516 | Marta Milazzo and Giorgio Maria Di Nunzio | The Onomastic Repertoire of the Roman d'Alexandre (ORNARE). Designing an Integrated Digital Onomastic Tool for Medieval French Romance |
2518 | Steinunn Rut Friðriksdóttir and Hafsteinn Einarsson | Gendered Grammar or Ingrained Bias? Exploring Gender Bias in Icelandic Language Models |
2520 | Shinka Mori, Oana Ignat, Andrew Lee and Rada Mihalcea | Towards Algorithmic Fidelity: Mental Health Representation across Demographics in Synthetic vs. Human-generated Data |
2521 | Miguel Da Corte and Jorge Baptista | Enhancing Writing Proficiency Classification in Developmental Education: the Quest for Accuracy |
2523 | Yida Mu, Chun Dong, Kalina Bontcheva and Xingyi Song | Large Language Models Offer an Alternative to the Traditional Approach of Topic Modelling |
2525 | Maximilian Schmidt, Andrea Bartezzaghi and Ngoc Thang Vu | Prompting-based Synthetic Data Generation for Few-Shot Question Answering |
2526 | Julian Hough, Sina Zarrieß, Casey Kennington, David Schlangen and Massimo Poesio | Conceptual Pacts for Reference Resolution using Small, Dynamically Constructed Language Models: A Study in Puzzle Building Dialogues |
2530 | Gorka Urbizu, Muitze Zulaika, Xabier Saralegi and Ander Corral | How Well Can BERT Learn the Grammar of an Agglutinative and Flexible-Order Language? The Case of Basque |
2531 | Marie Tahon, Anthony Larcher, Martin Lebourdais, Fethi Bougares, Anna Silnova and Pablo Gimeno | ALLIES: a Speech Corpus for Segmentation, Speaker Diarization, Speech Recognition and Speaker Change Detection |
2537 | Christopher Thierauf, Mitchell Abrams and Matthias Scheutz | Automating Dataset Production Using Generative Text and Image Models |
2538 | Shucheng Zhu, Weikang Wang and Ying Liu | Quite Good, but Not Enough: Nationality Bias in Large Language Models - A Case Study of ChatGPT |
2539 | Ruitao Feng, Xudong Hong, Mayank Jobanputra, Mattes Warning and Vera Demberg | Retrieval-Augmented Modular Prompt Tuning for Low-Resource Data-to-Text Generation |
2542 | Hansi Hettiarachchi, Damith Premasiri, Lasitha Randunu Chandrakantha Uyangodage and Tharindu Ranasinghe | NSina: A News Corpus for Sinhala |
2543 | Samyak Jain, Parth Chhabra, Atula Tejaswi Neerkaje, Puneet Mathur, Ramit Sawhney, Shivam Agarwal, Preslav Nakov, Sudheer Chava and Dinesh Manocha | Saliency-Aware Interpolative Augmentation for Multimodal Financial Prediction |
2544 | Elaheh Baharlouei, Mahsa Shafaei, Yigeng Zhang, Hugo Jair Escalante and Thamar Solorio | Labeling Comic Mischief Content in Online Videos with a Multimodal Hierarchical-Cross-Attention Model |
2546 | Sina Ahmadi, Daban Jaff, Md Mahfuz Ibn Alam and Antonios Anastasopoulos | Language and Speech Technology for Central Kurdish Varieties |
2549 | Bruno Guillaume, Kim Gerdes, Kirian Guiller, Sylvain Kahane and Yixuan Li | Joint Annotation of Morphology and Syntax in Dependency Treebanks |
2553 | Thomas Gerald, Anne Vilnat, Sofiane Ettayeb, Louis Tamames and Patrick Paroubek | Introducing CQuAE: a New French Contextualised Question-Answering Corpus for the Education Domain |
2555 | Li Song and Ying Liu | Approaches and Challenges for Resolving Different Representations of Fictional Characters for Chinese Novels |
2558 | Noam K. Benkler, Scott Friedman, Sonja Schmer-Galunder, Drisana Marissa Mosaphir, Robert P. Goldman, Ruta Wheelock, Vasanth Sarathy, Pavan Kantharaju and Matthew D. McLure | Recognizing Value Resonance with Resonance-Tuned RoBERTa: Task Definition, Experimental Validation, and Robust Modeling |
2561 | Evgeniia Razumovskaia, Joshua Maynez, Annie Louis, Mirella Lapata and Shashi Narayan | Little Red Riding Hood Goes Around the Globe: Crosslingual Story Planning and Generation with Large Language Models |
2562 | Yumeng Yang | Exploring the Generalization of Cancer Clinical Trial Eligibility Classifiers Across Diseases |
2563 | Matthew J. Buchholz, Julia Bonn, Claire Benet Post, Andrew Cowell and Alexis Palmer | Bootstrapping UMR Annotations for Arapaho from Language Documentation Resources |
2564 | Daniel Vlantis, Iva Gornishka and Shuai Wang | Benchmarking the Simplification of Dutch Municipal Text |
2566 | Maciej Ogrodniczuk, Aleksandra Tomaszewska, Daniel Ziembicki, Sebastian Żurowski, Ryszard Tuora and Aleksandra Zwierzchowska | Polish Discourse Corpus (PDC): Corpus Design, ISO-Compliant Annotation, Data Highlights, and Parser Development |
2572 | Luciana Bencke, Francielle Vasconcellos Pereira, Moniele Kunrath Santos and Viviane Moreira | InferBR: a Natural Language Inference Dataset in Portuguese |
2573 | Ella Schad, Jacky Visser and Chris Reed | The RIP Corpus of Collaborative Hypothesis-Making |
2574 | Yiwen Chen and Simone Teufel | Scansion-based Lyrics Generation |
2580 | Kanishk Verma, Kolawole John Adebayo, Joachim Wagner, Megan Reynolds, Rebecca Umbach, Tijana Milosevic and Brian Davis | Beyond Binary: Towards Embracing Complexities in Cyberbullying Detection and Intervention - A Position Paper |
2583 | Ayush Agarwal, Janak Kapuriya, Shubham Agrawal, Akhil Vamshi Konam, Mansi Goel, Rishabh Gupta, Shrey Rastogi, Niharika Niharika and Ganesh Bagler | Deep Learning Based Named Entity Recognition Models for Recipes |
2586 | Niklas Kiehne, Alexander Ljapunov, Marc Bätje and Wolf-Tilo Balke | Analyzing Effects of Learning Downstream Tasks on Moral Bias in Large Language Models |
2587 | Mushaffa Rasyid Ridha and Sakriani Sakti | Refining rtMRI Landmark-Based Vocal Tract Contour Labels with FCN-Based Smoothing and Point-to-Curve Projection |
2591 | Marco Cognetta, Vilém Zouhar, Sangwhan Moon and Naoaki Okazaki | Two Counterexamples to Tokenization and the Noiseless Channel |
2593 | Ahmad Idrissi-Yaghir, Amin Dada, Henning Schäfer, Kamyar Arzideh, Giulia Baldini, Jan Trienes, Max Hasin, Jeanette Bewersdorff, Cynthia S. Schmidt, Marie Bauer, Kaleb E. Smith, Jiang Bian, Yonghui Wu, Jörg Schlötterer, Torsten Zesch, Peter A. Horn, Christin Seifert, Felix Nensa, Jens Kleesiek and Christoph M. Friedrich | Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding |
2596 | Hiroto Kaino, Soichiro Sugihara, Tomoyuki Kajiwara, Takashi Ninomiya, Joshua B. Tanner and Shonosuke Ishiwatari | Utilizing Longer Context than Speech Bubbles in Automated Manga Translation |
2597 | Juan Pablo Munoz, Yi Zheng and Nilesh Jain | EFTNAS: Searching for Efficient Language Models in First-Order Weight-Reordered Super-Networks |
2598 | Yujian Gan, Massimo Poesio and Juntao Yu | Assessing the Capabilities of Large Language Models in Coreference: An Evaluation |
2602 | Jin Wang, Liang-Chih Yu and Xuejie Zhang | SoftMCL: Soft Momentum Contrastive Learning for Fine-grained Sentiment-aware Pre-training |
2603 | Rajesh Titung and Cecilia Ovesdotter Alm | FUSE - FrUstration and Surprise Expressions: A Subtle Emotional Multimodal Language Corpus |
2604 | Merve Ünlü Menevşe, Yusufcan Manav, Ebru Arisoy and Arzucan Özgür | Dealing with Data Scarcity in Spoken Question Answering |
2605 | Ritwik Mishra, Pooja Desur, Rajiv Ratn Shah and Ponnurangam Kumaraguru | Multilingual Coreference Resolution in Low-resource South Asian Languages |
2606 | Alexandra O'Neil, Nils Hjortnaes, Francis Tyers, Zinhle Nkosi, Thulile Ndlovu, Zanele Mlondo and Ngami Phumzile Pewa | Developing a Benchmark for Pronunciation Feedback: Creation of a Phonemically Annotated Speech Corpus of isiZulu Language Learner Speech |
2608 | Taro Miyazaki, Hideya Mino and Hiroyuki Kaneko | Understanding How Positional Encodings Work in Transformer Model |
2609 | Shiming He, Yu Hong, Shuai Yang, Jianmin Yao and Guodong Zhou | Demonstration Retrieval-Augmented Generative Event Argument Extraction |
2611 | Dongsheng Zhu, Zhenyu Mao, Jinghui Lu, Rui Zhao and Fei Tan | SDA: Simple Discrete Augmentation for Contrastive Sentence Representation Learning |
2614 | Daichi Yamaguchi, Rei Miyata, Atsushi Fujita, Tomoyuki Kajiwara and Satoshi Sato | Automatic Decomposition of Text Editing Examples into Primitive Edit Operations: Toward Analytic Evaluation of Editing Systems |
2622 | Chenxi Sun, Hongzhi Zhang, Zijia Lin, Jingyuan Zhang, Fuzheng Zhang, Zhongyuan Wang, Bin Chen, Chengru Song, Di Zhang, Kun Gai and Deyi Xiong | Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs |
2627 | Jingxuan Tu, Timothy Obiso, Bingyang Ye, Kyeongmin Rim, Keer Xu, Liulu Yue, Susan Windisch Brown, Martha Palmer and James Pustejovsky | GLAMR: Augmenting AMR with GL-VerbNet Event Structure |
2633 | Anna Beatriz Dimas Furtado, Tharindu Ranasinghe, Frederic Blain and Ruslan Mitkov | DORE: A Dataset For Portuguese Definition Generation |
2636 | Gyeongeun Lee and Natalie Parde | AcnEmpathize: A Dataset for Understanding Empathy in Dermatology Conversations |
2639 | Prashant Krishnan, Zilong Wang, Yangkun Wang and Jingbo Shang | Towards Few-shot Entity Recognition in Document Images: A Graph Neural Network Approach Robust to Image Manipulation |
2640 | Yexin Wu, Zhuosheng Zhang and Hai Zhao | Mitigating Misleading Chain-of-Thought Reasoning with Selective Filtering |
2645 | Tyler A. Chang, Katrin Tomanek, Jessica Hoffmann, Nithum Thain, Erin MacMurray van Liemt, Kathleen Meier-Hellstern and Lucas Dixon | Detecting Hallucination and Coverage Errors in Retrieval Augmented Generation for Controversial Topics |
2647 | Rowan Hall Maudslay, Simone Teufel, Francis Bond and James Pustejovsky | ChainNet: Structured Metaphor and Metonymy in WordNet |
2648 | Arianna Muti, Federico Ruggeri, Cagri Toraman, Alberto Barrón-Cedeño, Samuel Algherini, Lorenzo Musetti, Silvia Ronchi, Gianmarco Saretto and Caterina Zapparoli | PejorativITy: Disambiguating Pejorative Epithets to Improve Misogyny Detection in Italian Tweets |
2652 | Fariz Ikhwantri, Hiroaki Yamada and Takenobu Tokunaga | Analyzing Interpretability of Summarization Model with Eye-gaze Information |
2654 | Neha Verma, Kenton Murray and Kevin Duh | Exploring Geometric Representational Disparities Between Multilingual and Bilingual Translation Models |
2655 | Neha Ramsurrun, Rolando Coto-Solano and Michael Gonzalez | Parsing for Mauritian Creole using Universal Dependencies |
2656 | Allison Claire Lahnala, Béla Neuendorf, Alexander Thomin, Charles Welch, Tina Stibane and Lucie Flek | Appraisal Framework for Clinical Empathy: A novel application to breaking bad news conversations |
2657 | Aquia Richburg, Calvin Bao and Marine Carpuat | Automatic Authorship Analysis in Human-AI Collaborative Writing |
2658 | Anna Kolos, Inez Okulska, Kinga Głąbińska, Agnieszka Karlinska, Emilia Wisnios, Paweł Ellerik and Andrzej Prałat | BAN-PL: a Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service |
2662 | Sanuj Kumar and Tuan Le | FoTo: Targeted Visual Topic Modeling for Focused Analysis of Short Texts |
2665 | Sameer Pradhan, Ronald A. Cole and Wayne H. Ward | My Science Tutor (MyST) - A Large Corpus of Children's Conversational Speech |
2666 | Nikolaos Nikolaidis, Jakub Piskorski and Nicolas Stefanovitch | Exploring the Usability of Persuasion Techniques for Downstream Misinformation-related Classification Tasks |
2669 | Ian Porada, Xiyuan Zou and Jackie Chi Kit Cheung | A Controlled Reevaluation of Coreference Resolution Models |
2671 | Namrata Shivagunde, Vladislav Lialin, Sherin Muckatira and Anna Rumshisky | Deconstructing In-Context Learning: Understanding Prompts via Corruption |
2672 | Maria Teleki, Xiangjue Dong and James Caverlee | Quantifying the Impact of Disfluency on Spoken Content Summarization |
2673 | Vikas Yadav, Hyuk joon Kwon, Vijay Srinivasan and Hongxia Jin | Explicit over Implicit: Explicit Diversity Conditions for Effective Question Answer Generation |
2678 | Jesin James, Rolando Coto-Solano, Sally Akevai Nicholas, Joshua Zhu, Bovey Yu, Fuki Babasaki, Jenny Tyler Wang and Nicholas Derby | Development of Community-Oriented Text-to-Speech Models for Māori 'Avaiki Nui (Cook Islands Māori) |
2680 | Zheng Fang, Yulan He and Rob Procter | CWTM: Leveraging Contextualized Word Embeddings from BERT for Neural Topic Modeling |
2681 | Juan Pablo Munoz, Jinjie Yuan, Yi Zheng and Nilesh Jain | LoNAS: Elastic Low-Rank Adapters for Efficient Large Language Models |
2682 | Zhaoyi Hou, Li Zhang and Chris Callison-Burch | Choice-75: A Dataset on Decision Branching in Script Learning |
2685 | Wilermine Previlon, Alice Rozet, Jotsna Gowda, Bill Dyer, Kevin Tang and Sarah Moeller | Leveraging syntactic dependencies in disambiguation: the case of African American English |
2687 | Takuma Tanigawa, Tomoyosi Akiba and Hajime Tsukada | Analysis on Unsupervised Acquisition Process of Bilingual Vocabulary through Iterative Back-Translation |
2688 | Avijit Mitra, Nalin Gupta, Chetan Naik, Abhinav Sethy, Kinsey Bice and Zeynab Raeesy | Generating Contextual Images for Long-Form Text |
2689 | Yuanyi Zhu, Maria Liakata and Giovanni Montana | A Multi-Task Transformer Model for Fine-grained Labelling of Chest X-Ray Reports |
2691 | Zhe NIU, Ronglai Zuo, Brian Mak and Fangyun Wei | A Hong Kong Sign Language Corpus Collected from Sign-interpreted TV News |
2692 | Alla Rozovskaya | Universal Dependencies for Learner Russian |
2693 | Iakovos Evdaimon, Hadi Abdine, Christos Xypolopoulos, Stamatis Outsios, Michalis Vazirgiannis and Giorgos Stamou | GreekBART: The First Pretrained Greek Sequence-to-Sequence Model |
2694 | Minzhao Guan, Zhixun Qiu, Fenghuan Li and Yun Xue | Semantics-Aware Dual Graph Convolutional Networks for Argument Pair Extraction |
2695 | Linyu Fan, Wu Wu Yiheng, Jun Xie, Junhui Li, Fang Kong and Guodong Zhou | Leveraging AMR Graph Structure for Better Sequence-to-Sequence AMR Parsing |
2699 | Yanyue Zhang, Yilong Lai, Zhenglin Wang, Pengfei Li, Deyu Zhou and Yulan He | Opinions Are Not Always Positive: Debiasing Opinion Summarization With Model-Specific and Model-Agnostic Methods |
2700 | Sofia Lee and Jelke Bloem | Impact of Task Adapting on Transformer Models for Targeted Sentiment Analysis in Croatian Headlines |
2701 | Chris Emmery, Marilù Miotto, Sergey Kramp and Bennett Kleinberg | SOBR: A Corpus for Stylometry, Obfuscation, and Bias on Reddit |
2704 | Haim Dubossarsky and Farheen Dairkee | Strengthening the WiC: New polysemy dataset in Hindi and lack of cross lingual transfer |
2705 | Shabnam Behzad, Omid Kashefi and Swapna Somasundaran | Assessing Online Writing Feedback Resources: Generative AI vs. Good Samaritans |
2707 | Kailin Zhao, Xiaolong Jin, Long Bai, Jiafeng Guo and Xueqi Cheng | Class-Incremental Few-Shot Event Detection |
2708 | Nan Chen, Xiangdong Su and Feilong Bao | Hyperbolic Representations for Prompt Learning |
2712 | Atnafu Lambebo Tonja, Israel Abebe Azime, Tadesse Destaw Belay, Mesay Gemeda Yigezu, Moges Ahmed Ah Mehamed, Abinew Ali Ayele, Ebrahim Chekol Jibril, Michael Melese Woldeyohannis, Olga Kolesnikova, Philipp Slusallek, Dietrich Klakow and Seid Muhie Yimam | EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation |
2713 | Gaurav Negi, Rajdeep Sarkar, Omnia Zayed and Paul Buitelaar | A Hybrid Approach To Aspect Based Sentiment Analysis Using Transfer Learning |
2715 | Gregorios Katsios, Ning Sa, Ankita Bhaumik and Tomek Strzalkowski | Uncovering Agendas: A Novel French & English Dataset for Agenda Detection on Social Media |
2716 | Nan Zhang, Connor Heaton, Sean Timothy Okonsky, Prasenjit Mitra and Hilal Ezgi Toraman | PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents |
2717 | Namu Park, Kevin Lybarger, Giridhar Kaushik Ramachandran, Spencer Lewis, Aashka Damani, Özlem Uzuner, Martin Gunn and Meliha Yetisgen | A Novel Corpus of Annotated Medical Imaging Reports and Information Extraction Results Using BERT-based Language Models |
2718 | Frederico Belcavello, Tiago Timponi Torrent, Ely E. Matos, Adriana S. Pagano, Maucha Gamonal, Natalia Sigiliano, Lívia Vicente Dutra, Helen de Andrade Abreu, Mairon Samagaio, Mariane Carvalho, Franciany Campos, Gabrielly Azalim, Bruna Mazzei, Mateus Fonseca de Oliveira, Ana Carolina Loçasso Luz, Lívia Pádua Ruiz, Júlia Bellei, Amanda Pestana, Josiane Costa, Iasmin Rabelo, Anna Beatriz Silva, Raquel Roza, Mariana Souza and Igor Oliveira | Frame2: a FrameNet-based multimodal dataset for tackling text-image interactions in video |
2722 | Tarun Raheja, Raunak Sinha, Advit Deepak, Will Healy, Jayanth Srinivasa, Myungjin Lee and Ramana KOMPELLA | Enhancing Large Language Models through Transforming Reasoning Problems into Classification Tasks |
2725 | Arif Shahriar and Denilson Barbosa | Improving Bengali and Hindi Large Language Models |
2727 | Hongfei Liu, Guohua Wang, Jiayuan Xie, Jiali Chen, Wenhao Fang and Yi Cai | Knowledge-Guided Cross-Topic Visual Question Generation |
2728 | Maliha Jahan, Helin Wang, Thomas Thebaud, Yinglun Sun, Giang Ha Le, Zsuzsanna Fagyal, Odette Scharenborg, Mark Hasegawa-Johnson, Laureano Moro Velazquez and Najim Dehak | Finding Spoken Identifications: Using GPT-4 Annotation For An Efficient And Fast Dataset Creation Pipeline |
2735 | Ruiyang Zhou, Lu Chen and Kai Yu | Is LLM a Reliable Reviewer? A Comprehensive Evaluation of LLM on Automatic Paper Reviewing Tasks |
2738 | Hang Zhang, Yeyun Gong, Dayiheng Liu, Shunyu Zhang, Xingwei He, Jiancheng Lv and Jian Guo | Knowledge Enhanced Pre-training for Cross-lingual Dense Retrieval |
2740 | Yuanhang Zheng, Peng Li, Wei Liu, Yang Liu, Jian Luan and Bin Wang | ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval |
2744 | Yue Zhou, Barbara Di Eugenio, Brian Ziebart, Lisa Sharp, Bing Liu and Nikolaos Agadakos | Modeling Low-Resource Health Coaching Dialogues via Neuro-Symbolic Goal Summarization and Text-Units-Text Generation |
2745 | Yancong Li, Xiaoming Zhang, Ying Cui and Shuai Ma | Hyperbolic Graph Neural Network for Temporal Knowledge Graph Completion |
2746 | Yinan Bao, Dou Hu, Lingwei Wei, Shuchong Wei, Wei Zhou and Songlin Hu | Multi-stream Information Fusion Framework for Emotional Support Conversation |
2749 | Wenjian Ding, Yao Zhang, Jun Wang, Adam Jatowt and Zhenglu Yang | Can We Learn Question, Answer, and Distractors All From An Image? A New Task For Multiple-choice Visual Question Answering |
2752 | Zezhong Xu, Wen Zhang, Peng Ye, Lei Liang and Huajun Chen | Prompt-fused framework for Inductive Logical Query Answering |
2754 | Dainis A. Boumber, Fatima Zahra Qachfar and Rakesh Verma | Domain-Agnostic Adapter Architecture for Deception Detection: Extensive Evaluations with the DIFrauD Benchmark |
2756 | Chenlong Zhang, Pengfei Cao, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun and Jun Zhao | Continual Few-shot Event Detection via Hierarchical Augmentation Networks |
2757 | Jiaying Gong and Hoda Eldardiry | Prompt-based Zero-shot Relation Extraction with Semantic Knowledge Augmentation |
2758 | Sai Koneru, Jian Wu and Sarah Rajtmajer | Can Large Language Models Discern Evidence for Scientific Hypotheses? Case Studies in the Social Sciences |
2759 | Julia Bonn, Matthew J. Buchholz, Jayeol Chun, Andrew Cowell, William Croft, Lukas Denk, Sijia Ge, Jan Hajič, Kenneth Lai, James H. Martin, Skatje Myers, Alexis Palmer, Martha Palmer, Claire Benet Post, James Pustejovsky, Kristine Stenzel, Haibo Sun, Zdeňka Urešová, Rosa Vallejos, Jens E. L. Van Gysel, Meagan Vigus, Nianwen Xue and Jin Zhao | Building an Infrastructure for Uniform Meaning Representations |
2760 | Tianxin Zhao, Yingxin Liu, Xiangdong Su, Jiang Li and Guanglai Gao | Exploring the Synergy of Dual-path Encoder and Alignment Module for Better Graph-to-Text Generation |
2761 | Christian Clark and William Schuler | Categorial Grammar Induction with Stochastic Category Selection |
2763 | Rustem Yeshpanov, Pavel Efimov, Leonid Boytsov, Ardak Shalkarbayuli and Pavel Braslavski | KazQAD: Kazakh Open-Domain Question Answering Dataset |
2764 | Qingfu Zhu, Xianzhen Luo, Fang Liu, Cuiyun Gao and Wanxiang Che | A Survey on Natural Language Processing for Programming |
2771 | Fan Xu, Kai Liu, Yifeng Yang and Keyu Yan | WW-CSL: A New Dataset for Word-Based Wearable Chinese Sign Language Detection |
2772 | Di Wang, Yuan Zhuang, Ellen Riloff and Marina Kogan | Recognizing Social Cues in Crisis Situations |
2773 | Zhuo Zhang, Jintao Huang, Xiangjing Hu, Jingyuan Zhang, Yating Zhang, Hui Wang, Yue Yu, Qifan Wang, Lizhen Qu and Zenglin Xu | Revisiting Data Reconstruction Attacks on Real-world Dataset for Federated Natural Language Understanding |
2775 | Smitha Muthya Sudheendra, Maral Abdollahi, Dongyeop Kang, Jisu Huh and Jaideep Srivastava | SkOTaPA: A dataset for Skepticism Detection in Online Text after Persuasion Attempt |
2778 | Michael Kranzlein, Nathan Schneider and Kevin Tobia | CuRIAM: Corpus re Interpretation and Metalanguage in U.S. Supreme Court Opinions |
2781 | Tao Chen, Ze Lin, Hui Li, Jiayi Ji, Yiyi Zhou, Guanbin Li and Rongrong Ji | MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization |
2782 | Hang Jiang, Doug Beeferman, Weiquan Mao and Deb Roy | Topic Detection and Tracking with Time-Aware Document Embeddings |
2783 | Maria Irena Szawerna, Simon Dobnik, Therese Lindström Tiedemann, Ricardo Muñoz Sánchez, Xuan-Son Vu and Elena Volodina | Pseudonymization Categories across Domain Boundaries |
2784 | Ankita Bhaumik, Ning Sa, Gregorios Katsios and Tomek Strzalkowski | Social Convos: Capturing Agendas and Emotions on Social Media |
2785 | Anisia Katinskaia and Roman Yangarber | GPT-3.5 for Grammatical Error Correction |
2787 | Maksym Taranukhin, Vered Shwartz and Evangelos Milios | Stance Reasoner: Zero-Shot Stance Detection on Social Media with Explicit Reasoning |
2788 | Tatsuya Aoyama, Chihiro Taguchi and Nathan Schneider | J-SNACS: Adposition and Case Supersenses for Japanese Joshi |
2791 | Maria Tepei and Jelke Bloem | Automatic Animacy Classification for Romanian Nouns |
2792 | Lauriane Aufrant and Lucie Chasseur | UkraiNER: A New Corpus and Annotation Scheme Towards Comprehensive Entity Recognition |
2794 | Shahla Farzana, Edoardo Stoppa, Alex Leow, Tamar Gollan, Raeanne Moore, David Salmon, Douglas Galasko, Erin Sundermann and Natalie Parde | SLaCAD: A Spoken Language Corpus for Early Alzheimer's Disease Detection |
2795 | Niama El Khbir, Nadi Tomeh and Thierry Charnois | Information Extraction with Differentiable Beam Search on Graph RNNs |
2796 | Chengcheng Han, Renyu Zhu, Jun Kuang, Fengjiao Chen, Xiang Li, Ming Gao, Xuezhi Cao and Yunsen Xian | Conjoin After Decompose: Improving Few-Shot Performance of Named Entity Recognition |
2797 | Hongxiao Zhang, Mingtong Liu, Chunyou Li, Yufeng Chen, Jinan Xu and Ming Zhou | A Reinforcement Learning Approach to Improve Low-Resource Machine Translation Leveraging Domain Monolingual Data |
2799 | Haoran Yang, Deng Cai, Huayang Li, Wei Bi, Wai Lam and Shuming Shi | A Frustratingly Simple Decoding Method for Neural Text Generation |
2800 | Irina Temnikova, Iva Marinova, Silvia Gargova, Ruslana Margova, Alexander Komarov, Tsvetelina Stefanova, Veneta Kireva, Dimana Vyatrova, Nevena Grigorova, Yordan Mandevski and Stefan Minkov | SM-FEEL-BG - The First Bulgarian Datasets and Classifiers for Detecting Feelings, Emotions, and Sentiments of Bulgarian Social Media Text |
2801 | Wenshuai Huo, Xiaocheng Feng, Yichong Huang, Chengpeng Fu, Hui Wang and Bing Qin | Gradient Consistency-based Parameter Allocation for Multilingual Neural Machine Translation |
2803 | Linzi Xing, Xinglu Wang, Yuxi Feng, Zhenan Fan, Jing Xiong, Zhijiang Guo, Xiaojin Fu, Rindra Ramamonjison, Mahdi Mostajabdaveh, Xiongwei Han, Zirui Zhou and Yong Zhang | Towards Human-aligned Evaluation for Linear Programming Word Problems |
2804 | Amirreza Payandeh, Dan Pluth, Jordan Hosier, Xuesu Xiao and Vijay K. Gurbani | How susceptible are LLMs to Logical Fallacies? |
2805 | Xufeng Zhao, Mengdi Li, Wenhao Lu, Cornelius Weber, Jae Hee Lee, Kun Chu and Stefan Wermter | Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic |
2810 | Lei Zeng, Ruifang He, Haowen Sun, Jing Xu, Chang Liu and Bo Wang | Global and Local Hierarchical Prompt Tuning Framework for Multi-level Implicit Discourse Relation Recognition |
2812 | Loryn Isaacs, Santiago Chambó and Pilar León-Araúz | Humanitarian Corpora for English, French and Spanish |
2817 | Zhao Tan, Xiping Liu, Qing Shu, Xi Li, Changxuan Wan, Dexi Liu, Qizhi Wan and Guoqiong Liao | Enhancing Text-to-SQL Capabilities of Large Language Models through Tailored Promptings |
2818 | Rohan Chaudhury, Maria Teleki, Xiangjue Dong and James Caverlee | DACL: Disfluency Augmented Curriculum Learning for Fluent Text Generation |
2819 | Zehan Li, Jianfei Zhang, Chuantao Yin, Yuanxin Ouyang and Wenge Rong | ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search |
2820 | Rui Li, Cheng Liu, Yu Tong and Jiang Dazhi | Feature Structure Matching for Multi-source Sentiment Analysis with Efficient Adaptive Tuning |
2821 | Youliang Yuan, Wenxuan Wang, Qingshuo Guo, Yiming Xiong, Chihao Shen and Pinjia He | Does ChatGPT Know that It Does Not Know? Evaluating the Black-Box Calibration of ChatGPT |
2822 | Ammon Shurtz, Lawry Sorenson and Stephen D. Richardson | The Effects of Pretraining in Video-Guided Machine Translation |
2825 | Anni Zou, Zhuosheng Zhang and Hai Zhao | AuRoRA: A One-for-all Platform for Augmented Reasoning and Refining with Task-Adaptive Chain-of-Thought Prompting |
2826 | Yingfa Chen, Zhengyan Zhang, Xu Han, Chaojun Xiao, Zhiyuan Liu, Chen Chen, Kuai Li, Tao Yang and Maosong Sun | Robust and Scalable Model Editing for Large Language Models |
2827 | Chris Chinenye Emezue, Ifeoma Okoh, Chinedu Emmanuel Mbonu, Chiamaka Chukwuneke, Daisy Monika Lal, Ignatius Ezeani, Paul Rayson, Ijemma Onwuzulike, Chukwuma Onyebuchi Okeke, Gerald Okey Nweya, Bright Ikechukwu Ogbonna, Chukwuebuka Uchenna ORAEGBUNAM, Esther Chidinma Awo-Ndubuisi and Akudo Amarachukwu Osuagwu | The IgboAPI Dataset: Empowering Igbo Language Technologies through Multi-dialectal Enrichment |
2833 | Peiyu Liu, Zikang Liu, Ze-Feng Gao, Dawei Gao, Wayne Xin Zhao, Yaliang Li, Bolin Ding and Ji-Rong Wen | Do Emergent Abilities Exist in Quantized Large Language Models: An Empirical Study |
2834 | Taishi Chika, Taro Okahisa, Takashi Kodama, Yin Jou Huang, Yugo Murawaki and Sadao Kurohashi | Domain Transferable Semantic Frames for Expert Interview Dialogues |
2836 | Wai-Chung Kwan, Huimin Wang, Hongru Wang, Zezhong WANG, Bin Liang, Xian Wu, Yefeng Zheng and Kam-Fai Wong | JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialogue Policy Learning |
2838 | Khai Jiet Liong, Hongqiu Wu and Hai Zhao | Unveiling Vulnerability of Self-Attention |
2839 | Cong Ma, Yaping Zhang, Zhiyang Zhang, Yupu Liang, Yang Zhao, Yu Zhou and Chengqing Zong | Born a BabyNet with Hierarchical Parental Supervision for End-to-End Text Image Machine Translation |
2842 | Yanchun Li, Senlin Deng, Dongsu Shen, Shujuan Tian and Saiqin Long | WkNER: Enhancing Named Entity Recognition with Word Segmentation Constraints and kNN Retrieval |
2843 | Yutian Zhao, Huimin Wang, Xian Wu and Yefeng Zheng | MKeCL: Medical Knowledge-Enhanced Contrastive Learning for Few-shot Disease Diagnosis |
2849 | Marcelo Viridiano, Arthur Lorenzi, Tiago Timponi Torrent, Ely E. Matos, Adriana S. Pagano, Natália Sathler Sigiliano, Maucha Gamonal, Helen de Andrade Abreu, Lívia Vicente Dutra, Mairon Samagaio, Mariane Carvalho, Franciany Campos, Gabrielly Azalim, Bruna Mazzei, Mateus Fonseca de Oliveira, Ana Carolina Luz, Livia Padua Ruiz, Júlia Bellei, Amanda Pestana, Josiane Costa, Iasmin Rabelo, Anna Beatriz Silva, Raquel Roza, Mariana Souza Mota, Igor Oliveira and Márcio Henrique Pelegrino de Freitas | Framed Multi30K: A Frame-Based Multimodal-Multilingual Dataset |
2851 | Shun Zhang, Jian Yang, Jiaqi Bai, Chaoran Yan, Tongliang Li, Zhao Yan and Zhoujun Li | New Intent Discovery with Attracting and Dispersing Prototype |
2852 | Haibo Sun and Nianwen Xue | Anchor and Broadcast: An Efficient Concept Alignment Approach for Evaluation of Semantic Graphs |
2853 | Ruizhe Huang, Mahsa Yarmohammadi, Jan Trmal, Jing Liu, Desh Raj, Leibny Paola Garcia, Alexei V. Ivanov, Patrick Ehlen, Mingzhi Yu, Dan Povey and Sanjeev Khudanpur | ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech Recognition |
2854 | Zihao Zheng, Zihan Zhang, Zexin Wang, Ruiji Fu, Ming Liu, Zhongyuan Wang and Bing Qin | Decompose, Prioritize, and Eliminate: Dynamically Integrating Diverse Representations for Multimodal Named Entity Recognition |
2855 | Nima Ebadi, Kellen Morgan, Adrian Tan, Billy Linares, Sheri Osborn, Emma Majors, Jeremy Davis and Anthony Rios | Extracting Biomedical Entities from Noisy Audio Transcripts |
2857 | Abraham Israeli, Aviv Naaman, Guy Maduel, Rawaa Makhoul, Dana Qaraeen, Amir Ejmail, Dina Lisnanskey, Julian Jubran, Shai Fine and Kfir Bar | DiaSet: An Annotated Dataset of Arabic Conversations |
2858 | Longhui Zhang, Dingkun Long, Meishan Zhang, Yanzhao Zhang, Pengjun Xie and Min Zhang | Chinese Sequence Labeling with Semi-Supervised Boundary-Aware Language Model Pre-training |
2859 | Pei Cheng, Xiayang Shi and Yinlin Li | Enhancing Translation Ability of Large Language Models by Leveraging Task-Related Layers |
2864 | Ryan Brate, Marieke van Erp and Antal van den Bosch | Re-evaluating the Tomes for the Times |
2866 | Amanda Kann | Massively Multilingual Token-Based Typology Using the Parallel Bible Corpus |
2877 | Ritesh Singh Soun, Atula Tejaswi Neerkaje, Ramit Sawhney, Nikolaos Aletras and Preslav Nakov | RISE: Robust Early-exiting Internal Classifiers for Suicide Risk Evaluation |
2878 | Shaoru Guo, Yubo Chen, Kang Liu, Ru Li and Jun Zhao | NutFrame: Frame-based Conceptual Structure Induction with LLMs |
2879 | Polina Bychkova, Alyaxey Yaskevich, Serafima Gyulasaryan and Ekaterina Rakhilina | Building a Database of Conversational Routines |
2881 | Norizo Sakaguchi, Yugo Murawaki, Chenhui Chu and Sadao Kurohashi | Identifying Source Language Expressions for Pre-editing in Machine Translation |
2885 | Xuan ZHANG and Wei Gao | Reinforcement Retrieval Leveraging Fine-grained Feedback for Fact Checking News Claims with Black-Box LLM |
2886 | Linyang He, Peili Chen, Ercong Nie, Yuanning Li and Jonathan R. Brennan | Decoding Probing: Revealing Internal Linguistic Structures in Neural Language Models using Minimal Pairs |
2887 | Cheng Jiayang, Lin Qiu, Chunkit Chan, Xin Liu, Yangqiu Song and Zheng Zhang | EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs |
2888 | Xiangping Zheng, Bo Wu, Alex X. Zhang and Wei Li | Improving Robustness of GNN-based Anomaly Detection by Graph Adversarial Training |
2890 | Zican Dong, Tianyi Tang, Junyi Li, Wayne Xin Zhao and Ji-Rong Wen | BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models |
2891 | Zheng Byron Yuan, Dorina de Jong, Ruitao Feng, Štefan Beňuš, Noël Nguyen, Róbert Sabo, Luciano Fadiga and Alessandro D'Ausilio | ART: The Alternating Reading Task Corpus for Speech Entrainment and Imitation |
2893 | Louis Clouatre, Amal Zouaq and Sarath Chandar | MVP: Minimal Viable Phrase for Long Text Understanding |
2894 | Yuan Chen and Xia Li | PLAES: Prompt-generalized and Level-aware Learning Framework for Cross-prompt Automated Essay Scoring |
2895 | Rui Zheng, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang and Xuanjing Huang | Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals |
2899 | Jing Lu, Keith Hall, Ji Ma and Jianmo Ni | HYRR: Hybrid Infused Reranking for Passage Retrieval |
2900 | Wangyue Li, Liangzhi Li, Tong Xiang, Xiao Liu, Wei Deng and Noa Garcia | Can multiple-choice questions really be useful in detecting the abilities of LLMs? |
2901 | Yufei Huang and Deyi Xiong | CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI Collaboration for Large Language Models |
2904 | Xiaomeng Jin and Heng Ji | Schema-based Data Augmentation for Event Extraction |
2906 | Taha Aksu and Nancy Chen | Granular Change Accuracy: A More Accurate Performance Metric for Dialogue State Tracking |
2907 | Jin Jiang, Xunjian Yin, Xiaojun Wan, Wei Peng, Rongjun Li, Jingyuan Yang and Yanquan Zhou | Contextual Modeling for Document-level ASR Error Correction |
2908 | Rajarshi Haldar and Julia Hockenmaier | Analyzing the Performance of Large Language Models on Code Summarization |
2911 | Yunlong Zhao, Kexin Wang, Qianqian Dong and Tom Ko | Parameter-Efficient Transfer Learning for End-to-end Speech Translation |
2912 | Khalid Ahmed and Jan Buys | Neural Machine Translation between Low-Resource Languages with Synthetic Pivoting |
2914 | Ruining Chong, Luming Lu, Liner Yang, Jinran Nie, Zhenghao Liu, Shuo Wang, Shuhan Zhou, Yaoxin Li and Erhong Yang | MCTS: A Multi-Reference Chinese Text Simplification Dataset |
2915 | Nailia Mirzakhmedova, Johannes Kiesel, Milad Alshomary, Maximilian Heinrich, Nicolas Handke, Xiaoni Cai, Valentin Barriere, Doratossadat Dastgheib, Omid Ghahroodi, MohammadAli SadraeiJavaheri, Ehsaneddin Asgari, Lea Kawaletz, Henning Wachsmuth and Benno Stein | The Touché23-ValueEval Dataset for Identifying Human Values behind Arguments |
2916 | David Doukhan, Christine Maertens, William Le Personnic, Ludovic Speroni and Reda Dehak | InaGVAD: a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation |
2920 | Yuto Harada and Yohei Oseki | Cognitive Information Bottleneck: Extracting Minimal Sufficient Cognitive Language Processing Signals |
2921 | Maram Hasanain, Fatema Ahmad and Firoj Alam | Can GPT-4 Identify Propaganda? Annotation and Detection of Propaganda Spans in News Articles |
2924 | Eva Fučíková, Cristina Fernández Alcaina, Jan Hajič and Zdeňka Urešová | Textual Coverage of Eventive Entries in Lexical Semantic Resources |
2925 | Tomáš Musil and David Mareček | Exploring Interpretability of Independent Components of Word Embeddings with Automated Word Intruder Test |
2929 | Amit Gajbhiye, Zied Bouraoui, Luis Espinosa Anke and Steven Schockaert | AMenDeD: Modelling Concepts by Aligning Mentions, Definitions and Decontextualised Embeddings |
2931 | Aleš Žagar, Matej Klemen, Marko Robnik-Šikonja and Iztok Kosem | SENTA: Sentence Simplification System for Slovene |
2935 | Linlin Zong, Zhenrong Xie, Chi Ma, Xinyue Liu, Xianchao Zhang and Bo Xu | RENN: A Rule Embedding Enhanced Neural Network Framework for Temporal Knowledge Graph Completion |
2936 | Zekun Wang, Jingchang Chen, Wangchunshu Zhou, Haichao Zhu, Jiafeng Liang, Liping Shan, Ming Liu, Dongliang Xu, Qing Yang and Bing Qin | SmartTrim: Adaptive Tokens and Attention Pruning for Efficient Vision-Language Models |
2943 | Xinyu Ning, Yutong Zhao, Yitong Liu and Hongwen Yang | DGoT: Dynamic Graph of Thoughts for Scientific Abstract Generation |
2944 | Juri Opitz | Schroedinger's Threshold: When the AUC doesn't predict Accuracy |
2945 | Shivani Kumar, Rishabh Gupta, Md. Shad Akhtar and Tanmoy Chakraborty | Adding SPICE to Life: Speaker Profiling in Multiparty Conversations |
2946 | Santiago Herrera, Caio Corro and Sylvain Kahane | Sparse Logistic Regression with High-order Features for Automatic Grammar Rule Extraction from Treebanks |
2948 | Yumin Kim, Heejae Suh, Mingi Kim, Dongyeon Won and Hwanhee Lee | KoCoSa: Korean Context-aware Sarcasm Detection Dataset |
2949 | Nabila Ayman, Md. Akram Hossain, Abdul Aziz, Rokan Uddin Faruqui and Abu Nowshed Chy | BengaliLCP: A Dataset for Lexical Complexity Prediction in the Bengali Texts |
2950 | Ju-Hyoung Lee, Joonghyuk Hahn, Hyeon-Tae Seo, Jiho Park and Yo-Sub Han | SuperST: Superficial Self-Training for Few-Shot Text Classification |
2953 | Júlia Falcão, Claudia Borg, Nora Aranberri and Kurt Abela | COMET for Low-Resource Machine Translation Evaluation: A Case Study of English-Maltese and Spanish-Basque |
2954 | Xiaoxue Cheng, Junyi Li, Wayne Xin Zhao and Ji-Rong Wen | ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting |
2955 | Yichun Zhao, Shuheng Zhou and Huijia Zhu | Probe then Retrieve and Reason: Distilling Probing and Reasoning Capabilities into Smaller Language Models |
2956 | Kossi Amouzouvi, Bowen Song, Sahar Vahdati and Jens Lehmann | Knowledge GeoGebra: Leveraging Geometry of Relation Embeddings in Knowledge Graph Completion |
2957 | Fabian Simonjetz, Jussi Laasonen, Yunus Cobanoglu, Alexander Fraser and Enrique Jiménez | Reconstruction of Cuneiform Literary Texts as Text Matching |
2958 | Jiawen Xie, Shaoting Zhang and Xiaofan Zhang | GECSum: Generative Evaluation-Driven Sequence Level Contrastive Learning for Abstractive Summarization |
2959 | Alistair Plum, Tharindu Ranasinghe and Christoph Purschke | Guided Distant Supervision for Multilingual Relation Extraction Data: Adapting to a New Language |
2961 | Jiangming Liu | Soft Well-Formed Semantic Parsing with Score-Based Selection |
2962 | Janine Siewert and Jack Rueter | The Low Saxon LSDC Dataset at Universal Dependencies |
2963 | Zhiheng Zhang, Daojian Zeng and Xue Bai | Improving Continual Few-shot Relation Extraction through Relational Knowledge Distillation and Prototype Augmentation |
2964 | Amirhossein Abaskohi, Sara Baruni, Mostafa Masoudi, Nesa Abbasi, Mohammad Hadi Babalou, Ali Edalat, Sepehr Kamahi, Samin Mahdizadeh Sani, Nikoo Naghavian, Danial Namazifard, Pouya Sadeghi and Yadollah Yaghoobzadeh | Benchmarking Large Language Models for Persian: A Preliminary Study Focusing on ChatGPT |
2966 | Paul Landes and Barbara Di Eugenio | CALAMR: Component ALignment for Abstract Meaning Representation |
2971 | Qiang Gao, Bobo Li, Zixiang Meng, Yunlong Li, Jun Zhou, Fei Li, Chong Teng and Donghong Ji | Enhancing Cross-Document Event Coreference Resolution by Discourse Structure and Semantic Information |
2972 | Hemanta Baruah, Sanasam Ranbir Singh and Priyankoo Sarmah | AssameseBackTranslit: Back Transliteration of Romanized Assamese Social Media Text |
2973 | Zi Yun Yang, Ziqing Zhang and Yisong Miao | The ELCo Dataset: Bridging Emoji and Lexical Composition |
2974 | Junyue Song, Xin Wu and Yi Cai | Step Feasibility-Aware and Error-Correctable Entailment Tree Generation |
2975 | Wei Du, Tianjie Ju, Ge Ren, GaoLei Li and Gongshen Liu | Backdoor NLP Models via AI-Generated Text |
2976 | Leonie Weissweiler, Nina Böbel, Kirian Guiller, Santiago Herrera, Wesley Samuel Scivetti, Arthur Lorenzi, Nurit Melnik, Archna Bhatia, Hinrich Schütze, Lori Levin, Amir Zeldes, Joakim Nivre, William Croft and Nathan Schneider | UCxn: Typologically-Informed Annotation of Constructions Atop Universal Dependencies |
2980 | Wei-Fan Chen, Milad Alshomary, Maja Stahl, Khalid Al Khatib, Benno Stein and Henning Wachsmuth | Reference-guided Style-Consistent Content Transfer |
2981 | Ines Rehbein, Josef Ruppenhofer, Annelen Brunner and Simone Paolo Ponzetto | Out of the mouths of MPs: Speaker Attribution in Parliamentary Debates |
2983 | Yuri Bizzoni, Pascale Feldkamp Moreira, Ida Marie S. Lassen, Mads Rosendahl Thomsen and Kristoffer Nielbo | A Matter of Perspective: Building a Multi-Perspective Annotated Dataset for the Study of Literary Quality |
2984 | Weihong Guan, Shi Feng, Daling Wang, Faliang Huang, Yifei Zhang and Yuan Cui | Improving Role-Oriented Dialogue Summarization with Interaction-Aware Contrastive Learning |
2987 | Mateusz Klimaszewski, Piotr Andruszkiewicz and Alexandra Birch | Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation |
2988 | Chenhui Dou, Chen Gong, Zhenghua Li, Zhefeng Wang, Baoxing Huai and Min Zhang | Improving Chinese Named Entity Recognition with Multi-grained Words and Part-of-Speech Tags via Joint Modeling |
2990 | Yan Wang, Bo Wang, Yachao Zhao, Dongming Zhao, Xiaojia Jin, Jijun Zhang, Ruifang He and Yuexian Hou | Emotion Recognition in Conversation via Dynamic Personality |
2991 | MohammadAli SadraeiJavaheri, Ehsaneddin Asgari and Hamid Reza Rabiee | Transformers for Bridging Persian Dialects: Transliteration Model for Tajiki and Iranian Scripts |
2992 | Yuchen Shi, Deqing Yang, Jingping Liu, Yanghua Xiao, Zongyu Wang and Huimin Xu | Negation Triplet Extraction with Syntactic Dependency and Semantic Consistency |
2994 | Masato Fujitake | LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding |
2996 | Hao Yu, Kaiyu Huang, Anqi Zhao, Junpeng Liu and Degen Huang | Context-Aware Non-Autoregressive Document-Level Translation with Sentence-Aligned Connectionist Temporal Classification |
2997 | Ai Kubota, Takuma Sato, Takayuki Amamoto, Ryota Akiyoshi and Koji Mineshima | Annotation of Japanese Discourse Relations Focusing on Concessive Inferences |
2998 | Katerina Korre, Arianna Muti and Alberto Barrón-Cedeño | The Challenges of Creating a Parallel Multilingual Hate Speech Corpus: An Exploration |
3001 | Anastasios Toumazatos, John Pavlopoulos, Ion Androutsopoulos and Stavros Vassos | Still all Greeklish to me: Greeklish to Greek Transliteration |
3005 | Kunhang Li and Yansong Feng | Motion Generation from Fine-grained Textual Descriptions |
3006 | Andraž Pelicon, Mladen Karan, Ravi Shekhar, Matthew Purver and Senja Pollak | Denoising Labeled Data for Comment Moderation Using Active Learning |
3011 | Ashish Chouhan and Michael Gertz | LexDrafter: Terminology Drafting for Legislative Documents using Retrieval Augmented Generation |
3012 | Pei Wang, Keqing He, Yejie Wang, Xiaoshuai Song, Yutao Mou, Jingang Wang, Yunsen Xian, Xunliang Cai and Weiran Xu | Beyond the Known: Investigating LLMs Performance on Out-of-Domain Intent Detection |
3014 | Hongjin Kim, Jai-Eun Kim and Harksoo Kim | Title-based Extractive Summarization via MRC Framework |
3016 | Tonmoy Rajkhowa, Amartya Roy Chowdhury, Hrishikesh Ravindra Karande and S. R. Mahadeva Prasanna | Evaluating the Efficacy of Large Acoustic Model for Documenting Non-Orthographic Tribal Languages in India |
3020 | Dhrubajyoti Pathak, Sukumar Nandi and Priyankoo Sarmah | Evaluating Performance of Pre-trained Word Embeddings on Assamese, a Low-resource Language |
3028 | Go Inoue, Akihiko Kato, Masato Mita, Ukyo Honda and Peinan Zhang | CAMERA³: An Evaluation Dataset for Controllable Ad Text Generation in Japanese |
3030 | Amit Kumar Chaudhary, Kurt Micallef and Claudia Borg | Topic Classification and Headline Generation for Maltese using a Public News Corpus |
3035 | Solene Virginie Evain, Solange Rossato and François Portet | Unraveling Spontaneous Speech Dimensions for Cross-Corpus ASR System Evaluation for French |
3036 | Ji-Eun Han, Jun-Seok Koh, Hyeon-Tae Seo, Du-Seong Chang and Kyung-Ah Sohn | PSYDIAL: Personality-based Synthetic Dialogue Generation using Large Language Models |
3037 | Anthony James Hughes and Xingyi Song | Identifying and Aligning Medical Claims Made on Social Media with Medical Evidence |
3040 | Haopeng Ren, Yushi Zeng, Yi Cai, Zhenqi Ye, Li Yuan and Pinli Zhu | Grounded Multimodal Procedural Entity Recognition for Procedural Documents: A New Dataset and Baseline |
3041 | Priya Rani, Theodorus Fransen, John P. McCrae and Gaurav Negi | MaCmS: Magahi Code-mixed Dataset for Sentiment Analysis |
3043 | Nobuyuki Iokawa and Hitomi Yanaka | Visual-Textual Entailment with Quantities Using Model Checking and Knowledge Injection |
3044 | Jakub Piskorski, Michał Marcińczuk and Roman Yangarber | Cross-lingual Named Entity Corpus for Slavic Languages |
3052 | Maria Becker, Kanyao Han, Antonina Werthmann, Rezvaneh Rezapour, Haejin Lee and Jana Diesner | Detecting Impact Relevant Sections in Scientific Research |
3054 | MohanRaj Chanthran, Lay-Ki Soon, Huey Fang Ong and Bhawani Selvaretnam | Malaysian English News Decoded: A Linguistic Resource for Named Entity and Relation Extraction |
3055 | Senbao Shi, Zhenran Xu, Baotian Hu and Min Zhang | Generative Multimodal Entity Linking |
3059 | Vera Danilova and Sara Stymne | Relation between Cross-Genre and Cross-Topic Transfer in Dependency Parsing |
3060 | Songhua Yang, Xinke Jiang, Hanjie Zhao, Wenxuan Zeng, Hongde Liu and Yuxiang Jia | FaiMA: Feature-aware In-context Learning for Multi-domain Aspect-based Sentiment Analysis |
3063 | Zhi Li, Yicheng Li, Hequan Ye and Yin Zhang | Towards Autonomous Tool Utilization in Language Models: A Unified, Efficient and Scalable Framework |
3064 | Masayuki Kawarada, Tatsuya Ishigaki and Hiroya Takamura | Prompting for Numerical Sequences: A Case Study on Market Comment Generation |
3066 | Zichen Wu, Hsiu-Yuan Huang, Fanyi Qu and Yunfang Wu | Mixture-of-Prompt-Experts for Multi-modal Semantic Understanding |
3067 | Yuetian Chen and Mei Si | Reflections & Resonance: Two-Agent Partnership for Advancing LLM-based Story Annotation |
3068 | John Dougrez-Lewis, Elena Kochkina, Maria Liakata and Yulan He | Knowledge Graphs for Real-World Rumour Verification |
3069 | Longxuan Ma, Changxin Ke, Shuhan Zhou, Churui Sun, Wei-Nan Zhang and Ting Liu | A Self-verified Method for Exploring Simile Knowledge from Pre-trained Language Models |
3070 | Callum William Booth, Alan Thomas and Robert Gaizauskas | BLN600: A Parallel Corpus of Machine/Human Transcribed Nineteenth Century Newspaper Texts |
3073 | Dongyang Li, Taolin Zhang, Jiali Deng, Longtao Huang, Chengyu Wang, Xiaofeng He and Hui Xue | UniPSDA: Unsupervised Pseudo Semantic Data Augmentation for Zero-Shot Cross-Lingual Natural Language Understanding |
3078 | Martins Kronis, Askars Salimbajevs and Mārcis Pinnis | Code-Mixed Text Augmentation for Latvian ASR |
3079 | Yong Guan, Xiaozhi Wang, Lei Hou, Juanzi Li, Jeff Z. Pan, Jiaoyan Chen and Freddy Lecue | TacoERE: Cluster-aware Compression for Event Relation Extraction |
3083 | Thomas Haider | A Large Annotated Reference Corpus of New High German Poetry |
3084 | Fahmida Alam, Md Asiful Islam, Robert Vacareanu and Mihai Surdeanu | Towards Realistic Few-Shot Relation Extraction: A New Meta Dataset and Evaluation |
3085 | Qingyan Zhao, Ruifang He, Jinpeng Zhang, Chang Liu and Bo Wang | Representation Degeneration Problem in Prompt-based Models for Natural Language Understanding |
3086 | Marko Pranjić, Marko Robnik-Šikonja and Senja Pollak | LLMSegm: Surface-level Morphological Segmentation Using Large Language Model |
3088 | Mengna Zhu, Zijie Xu, Kaisheng Zeng, Kaiming Xiao, Mao Wang, Wenjun Ke and Hongbin Huang | CMNEE: A Large-Scale Document-Level Event Extraction Dataset based on Open-Source Chinese Military News |
3089 | Sangah Lee, Sungjoo Byun, Jean Seo and Minha Kang | ManNER & ManPOS: Pioneering NLP for Endangered Manchu Language |
3090 | Botond Barta, Dorina Lakatos, Attila Nagy, Milán Konor Nyist and Judit Ács | From News to Summaries: Building a Hungarian Corpus for Extractive and Abstractive Summarization |
3091 | Viktor Moskvoretskii, Alexander Panchenko and Irina Nikishina | Are Large Language Models Good at Lexical Semantics? A Case of Taxonomy Learning |
3094 | Jinfeng Huang, Qiaoqiao She, Wenbin Jiang, Hua Wu, Yang Hao, Tong Xu and Feng Wu | QDMR-based Planning-and-Solving Prompting for Complex Reasoning Tasks |
3096 | Ho-Seung Kim, YongHoon Kang and Jee-Hyong Lee | STAGE: Simple Text Data Augmentation by Graph Exploration |
3097 | Yingxiu Zhao, Bowen Yu, Binyuan Hui, Haiyang Yu, Minghao Li, Fei Huang, Nevin L. Zhang and Yongbin Li | Tree-Instruct: A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment |
3099 | Zhigang Kan, Liwen Peng, Linbo Qiao and Dongsheng Li | Emancipating Event Extraction from the Constraints of Long-Tailed Distribution Data Utilizing Large Language Models |
3101 | Todd Morrill, Zhaoyuan Deng, Yanda Chen, Amith Ananthram, Colin Wayne Leach and Kathleen McKeown | Social Orientation: A New Feature for Dialogue Analysis |
3105 | Junda Chen and Jianting Liu | S3Prompt: Instructing the Model with Self-calibration, Self-recall and Self-aggregation to Improve In-context Learning |
3106 | Xincan Feng and Akifumi Yoshimoto | Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness |
3109 | Lei Chen, Bobo Li, Li Zheng, Haining Wang, Zixiang Meng, Runfeng Shi, Hao Fei, Jun Zhou, Fei Li, Chong Teng and Donghong Ji | What Factors Influence LLMs' Judgments? A Case Study on Question Answering |
3110 | Ying Zhou, Ben He and Le Sun | Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack |
3111 | Linhao Yu, Qun Liu and Deyi Xiong | LFED: A Literary Fiction Evaluation Dataset for Large Language Models |
3112 | Ahmadou Wagne, Julia Neidhardt and Thomas Elmar Kolb | PopAut: An Annotated Corpus for Populism Detection in Austrian News Comments |
3115 | Yao Dong, Qingchao Kong, Lei Wang and Yin Luo | Dual Complex Number Knowledge Graph Embeddings |
3118 | Yanggan Gu, Yang Hou, Zhefeng Wang, Xinyu Duan and Zhenghua Li | High-order Joint Constituency and Dependency Parsing |
3119 | David R. Mortensen, Valentina Izrailevitch, Yunze Xiao, Hinrich Schütze and Leonie Weissweiler | Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs |
3120 | Lu Fan, Jiashu Pu, Rongsheng Zhang and Xiao-Ming Wu | LANID: LLM-assisted New Intent Discovery |
3123 | Dongyang Li, Taolin Zhang, Longtao Huang, Chengyu Wang, Xiaofeng He and Hui Xue | KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning |
3124 | Ondřej Herman and Miloš Jakubíček | ShadowSense: a Multi-annotated Dataset for Evaluating Word Sense Induction |
3125 | Keren Tan, Kangyang Luo, Yunshi Lan, Zheng Yuan and Jinlong Shu | An LLM-Enhanced Adversarial Editing System for Lexical Simplification |
3126 | Yukun Zhao, Lingyong Yan, Weiwei Sun, Guoliang Xing, Shuaiqiang Wang, Chong Meng, Zhicong Cheng, Zhaochun Ren and Dawei Yin | Improving the Robustness of Large Language Models via Consistency Alignment |
3128 | Frédéric Rayar | FrenchFacts: A French Dataset of Fact-Checked Claims |
3129 | Ahmad Shallouf, Hanna Herasimchyk, Mikhail Salnikov, Rudy Alexandro Garrido Veliz, Natia Mestvirishvili, Alexander Panchenko, Chris Biemann and Irina Nikishina | CAM 2.0: End-to-End Open Domain Comparative Question Answering System |
3130 | Yida Mu, Ben P. Wu, William Thorne, Ambrose Robinson, Nikolaos Aletras, Carolina Scarton, Kalina Bontcheva and Xingyi Song | Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science |
3133 | Bocheng Li, Zhujin Gao, Yongxin Zhu, Kun Yin, Haoyu Cao, Deqiang Jiang and Linli Xu | Few-shot Temporal Pruning Accelerates Diffusion Models for Text Generation |
3135 | Libo Sun, Siyuan Wang, Meng Han, Ruofei Lai, Xinyu Zhang, Xuanjing Huang and Zhongyu Wei | Multi-Objective Forward Reasoning and Multi-Reward Backward Refinement for Product Review Summarization |
3138 | Guy Mor-Lan, Effi Levi, Tamir Sheafer and Shaul R. Shenhav | IsraParlTweet: The Israeli Parliamentary and Twitter Resource |
3139 | Mingming Li, Songlin Hu, Fuqing Zhu and Qiannan Zhu | Few-Shot Learning for Cold-Start Recommendation |
3143 | Saedeh Tahery, Sahar Kianian and Saeed Farzi | Cross-Lingual NLU: Mitigating Language-Specific Impact in Embeddings Leveraging Adversarial Learning |
3147 | Giuseppe Abrami, Mevlüt Bagci and Alexander Mehler | German Parliamentary Corpus (GerParCor) Reloaded |
3148 | David R. Reich, Shuwen Deng, Marina Björnsdóttir, Lena Jäger and Nora Hollenstein | Reading Does Not Equal Reading: Comparing, Simulating and Exploiting Reading Behavior Across Populations |
3151 | Haris Riaz, Razvan Gabriel Dumitru and Mihai Surdeanu | ELLEN: Extremely Lightly Supervised Learning For Efficient Named Entity Recognition |
3152 | Ziyang Xu, Keqin Peng, Liang Ding, Dacheng Tao and Xiliang Lu | Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge Extraction |
3157 | Falwah Alhamed, Julia Ive and Lucia Specia | Classifying Social Media Users Before and After Depression Diagnosis via their Language Usage: A Dataset and Study |
3161 | Atsumoto Ohashi, Ryu Hirai, Shinya Iizuka and Ryuichiro Higashinaka | JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset |
3162 | Jinman Zhao and Gerald Penn | A Generative Model For Lambek Categorial Sequents |
3163 | Elena Callegari, Iris Edda Nowenstein, Ingunn Jóhanna Kristjánsdóttir and Anton Karl Ingason | Automatic Extraction of Language-Specific Biomarkers of Healthy Aging In Icelandic |
3167 | Taiga Someya, Ryo Yoshida and Yohei Oseki | Targeted Syntactic Evaluation on the Chomsky Hierarchy |
3168 | Vamshi Krishna Bonagiri, Sreeram Vennam, Priyanshul Govil, Ponnurangam Kumaraguru and Manas Gaur | SaGE: Evaluating Moral Consistency in Large Language Models |
3169 | Yifei Yang, Hongqiu Wu and Hai Zhao | Attack Named Entity Recognition by Entity Boundary Interference |
3171 | Iwona Christop | nEMO: Dataset of Emotional Speech in Polish |
3173 | Alexandra (Sandra) Vella, Sarah Agius, Aiden Williams and Claudia Borg | Towards a corpus of spoken Maltese: Korpus tal-Malti Mitkellem, KMM |
3178 | Xiang Li, Shizhu He, Jiayu Wu, Zhao Yang, Yao Xu, Yang Jun Jun, Haifeng Liu, Kang Liu and Jun Zhao | MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts |
3179 | Fengbin Zhu, Chao Wang, Fuli Feng, Zifeng Ren, Moxin Li and Tat-Seng Chua | Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents via Semantic-Oriented Hierarchical Graphs |
3185 | Sijia Ge, Zilong Li, Alvin Po-Chun Chen and Guanchao Wang | Annotate Chinese Aspect with UMR: A Case Study on The Little Prince |
3186 | Shaojuan Wu, Jitong Li, Xiaowang Zhang and Zhiyong Feng | An Event-based Abductive Learning for Hard Time-sensitive Question Answering |
3187 | Furkan Akkurt, Onur Gungor, Büşra Marşan, Tunga Gungor, Balkiz Ozturk Basaran, Arzucan Özgür and Susan Uskudarli | Evaluating the quality of a corpus annotation scheme using pretrained language models |
3189 | Christina Tånnander, Jens Edlund and Joakim Gustafson | Revisiting Three Text-to-Speech Synthesis Experiments with a Web-Based Audience Response System |
3190 | Daijun Ding, Li Dong, Zhichao Huang, Guangning Xu, Xu Huang, Bo Liu, Liwen Jing and Bowen Zhang | EDDA: An Encoder-Decoder Data Augmentation Framework for Zero-Shot Stance Detection |
3193 | Yachao Zhao, Bo Wang, Yan Wang, Dongming Zhao, Xiaojia Jin, Jijun Zhang, Ruifang He and Yuexian Hou | A Comparative Study of Explicit and Implicit Gender Biases in Large Language Models via Self-evaluation |
3195 | Yufei Huang and Deyi Xiong | IT2ACL: Learning Easy-to-Hard Instructions via 2-phase Automated Curriculum Learning for Large Language Models |
3197 | Yuiko Tsunomori and Ryuichiro Higashinaka | I Remember You!: SUI Corpus for Remembering and Utilizing Users' Information in Chat-oriented Dialogue Systems |
3199 | Yejin Yoon, Jungyeon Lee, Kangsan Kim, Chanhee Park and Taeuk Kim | BlendX: Complex Multi-Intent Detection with Blended Patterns |
3200 | Ritesh Kumar, Ojaswee Bhalla, Madhu Vanthi, Shehlat Maknoon Wani and Siddharth Singh | HarmPot: An Annotation Framework for Evaluating Offline Harm Potential of Social Media Text |
3210 | Cagri Toraman, Oguzhan Ozcelik, Furkan Sahinuc and Fazli Can | MiDe22: An Annotated Multi-Event Tweet Dataset for Misinformation Detection |
3213 | Geunyeong Jeong, Seokwon Jeong, Juoh Sun and Harksoo Kim | Bridging the Code Gap: A Joint Learning Framework Across Medical Coding Systems |
3214 | Pablo Weingart, Thiemo Wambsganss and Matthias Soellner | Modelling Argumentation for a User Opinion Aggregation Tool |
3215 | Anmol Singhal, Chirag Jain, Preethu Rose Anish, Arkajyoti Chakraborty and Smita Ghaisas | Generating Clarification Questions for Disambiguating Contracts |
3220 | Kaan Büyükdemirci, Izzet Emre Kucukkaya, Eren Ölmez and Cagri Toraman | JL-Hate: An Annotated Dataset for Joint Learning of Hate Speech and Target Detection |
3227 | Tianjie Ju, Weiwei Sun, Wei Du, Xinwei Yuan, Zhaochun Ren and Gongshen Liu | How Large Language Models Encode Context Knowledge? A Layer-Wise Probing Study |
3231 | Zeyuan Zeng, Zefeng Li, Liang Yang and Hongfei Lin | Leveraging Social Context for Humor Recognition and Sense of Humor Evaluation in Social Media with a New Chinese Humor Corpus - HumorWB |
3235 | Weicheng Ren, Zixuan Li, Xiaolong Jin, Long Bai, Miao Su, Yantao Liu, Saiping Guan, Jiafeng Guo and Xueqi Cheng | Nested Event Extraction upon Pivot Element Recognition |
3237 | Jian Yang, Hongcheng Guo, Yuwei Yin, Jiaqi Bai, Bing Wang, Jiaheng Liu, Xinnian Liang, LinZheng Chai, Liqun Yang and Zhoujun Li | m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt |
3239 | Jaeyoung Lee, Joonwon Jang and Misuk Kim | Hierarchical Graph Convolutional Network Approach for Detecting Low-Quality Documents |
3240 | Jia Cheng Hu, Roberto Cavicchioli, Giulia Berardinelli and Alessandro Capotondi | Learning From Wrong Predictions in Low-Resource Neural Machine Translation |
3241 | Xurui Li, Kaisong Song, Tianqianjing Lin, Yangyang Kang, Fubang Zhao, Changlong Sun and Xiaozhong Liu | PDAMeta: Meta-Learning Framework with Progressive Data Augmentation for Few-Shot Text Classification |
3242 | Tsunehiro Arimoto, Hiroaki Sugiyama, Hiromi Narimatsu and Masahiro Mizukami | Comparison of the Intimacy Process between Real and Acting-based Long-term Text Chats |
3244 | Kun Wu, Xinyi Mou, Lanqing Xue, Zhenzhe Ying, Weiqiang Wang, Qi Zhang, Xuanjing Huang and Zhongyu Wei | PASUM: A Pre-training Architecture for Social Media User Modeling based on Text Graph |
3245 | Deokhyung Kang, Baikjin Jung, Yunsu Kim and Gary Geunbae Lee | Denoising Table-Text Retrieval for Open-Domain Question Answering |
3248 | Rumeng Li, Xun Wang and Hong Yu | LlamaCare: an Instruction Fine-Tuned Large Language Model for Clinical NLP |
3249 | Wolfgang S. Schmeisser-Nieto, Pol Pastells, Simona Frenda and Mariona Taule | Human vs. Machine Perceptions on Immigration Stereotypes |
3250 | Michael Peechatt, Cecilia Ovesdotter Alm and Reynold Bailey | MULTICOLLAB: A Multimodal Corpus of Dialogues for Analyzing Collaboration and Frustration in Language |
3251 | Watheq Ahmad Mansour, Salam Albatarni, Sohaila Eltanbouly and Tamer Elsayed | Can Large Language Models Automatically Score Proficiency of Written Essays? |
3252 | Utkarsh Agarwal, Kumar Tanmay, Aditi Khandelwal and Monojit Choudhury | Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in |
3253 | Yanan Zhang, Xiaoling Bai and Tianhua Zhou | Event-enhanced Retrieval in Real-time Search |
3254 | Xiao Pu, Mingqi Gao and Xiaojun Wan | Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks |
3257 | Yixuan Wang, Baoxin Wang, Yijun Liu, Dayong Wu and Wanxiang Che | LM-Combiner: A Contextual Rewriting Model for Chinese Grammatical Error Correction |
3258 | Aditya Bhargava, Timothy A. D. Fowler and Gerald Penn | LCGbank: A Corpus of Syntactic Analyses Based on Proof Nets |
3260 | Felermino Dario Mario Ali, Henrique Lopes Cardoso and Rui Sousa-Silva | Detecting Loanwords in Emakhuwa: An Extremely Low-Resource Bantu Language Exhibiting Significant Borrowing From Portuguese |
3261 | Zhanghao Hu, Yijun Yang, Junjie Xu, Yifu Qiu and Pinzhen Chen | EEE-QA: Exploring Effective and Efficient Question-Answer Representations |
3263 | Philipp Heinrich, Andreas Blombach, Bao Minh Doan Dang, Leonardo Zilio, Linda Havenstein, Nathan Dykes, Stephanie Evert and Fabian Schäfer | Automatic Identification of COVID-19-related Conspiracy Narratives in German Telegram Channels and Chats |
3264 | Gitanjali Kumari, Dibyanayan Bandyopadhyay, Asif Ekbal and Vinutha B. NarayanaMurthy | CM-Off-Meme: Code-Mixed Hindi-English Offensive Meme Detection with Multi-Task Learning by Leveraging Contextual Knowledge |
3266 | Sourya Dipta Das and Yash A. Vadi | Transformer-based Joint Modelling for Automatic Essay Scoring and Off-Topic Detection |
3267 | Tong Sun, Biao Fu, Cong Hu, Liang Zhang, Ruiquan Zhang, Xiaodong Shi, Jinsong Su and Yidong Chen | Adaptive Simultaneous Sign Language Translation with Confident Translation Length Estimation |
3271 | Yukiko Ishizuki, Tatsuki Kuribayashi, Yuichiroh Matsubayashi, Ryohei Sasano and Kentaro Inui | To Drop or Not to Drop? Predicting Argument Ellipsis Judgments: A Case Study in Japanese |
3272 | Xulin Zhou, Takuma Ichikawa and Ryuichiro Higashinaka | Collecting and Analyzing Dialogues in a Tagline Co-Writing Task |
3273 | Mengfei Du, Binhao Wu, Jiwen Zhang, Zhihao Fan, Zejun Li, Ruipu Luo, Xuanjing Huang and Zhongyu Wei | DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning |
3276 | Juraj Vladika, Phillip Schneider and Florian Matthes | HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking |
3278 | Cristina Garcia Holgado and Marianne Vergez-Couret | Empowering Low-Resource Regional Languages with Lexicons: A Comparative Study of NLP tools for Morphosyntactic Analysis |
3280 | Michelle YoungJin Kim, Junghwan Kim and Kristen Johnson | ABLE: Agency-BeLiefs Embedding to Address Stereotypical Bias Through Awareness Instead of Obliviousness |
3282 | Puneet Mathur, Zhe Liu, Ke Li, Yingyi Ma, Gil Karen, Zeeshan Ahmed, Dinesh Manocha and Xuedong Zhang | DOC-RAG: ASR Language Model Personalization with Domain-Distributed Co-occurrence Retrieval Augmentation |
3283 | Iñigo Morcillo, Igor Leturia, Ander Corral, Xabier Sarasola, Michaël Barret, Aure Séguier and Benaset Dazéas | Automatic Speech Recognition for Gascon and Languedocian Variants of Occitan |
3285 | Jing Zhang, Hui Gao, Peng Zhang, Boda Feng, Wenmin Deng and Yuexian Hou | LA-UCL: LLM-Augmented Unsupervised Contrastive Learning Framework for Few-Shot Text Classification |
3289 | Ali Faheem, Faizad Ullah, Muhammad Sohaib Ayub and Asim Karim | UrduMASD: a Multimodal Abstractive Summarization Dataset for Urdu |
3291 | Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li and Weiming Hu | Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training |
3293 | Kazuhiro Wada, Masaya Tsunokake and Shigeki Matsubara | On an Intermediate Task for Classifying URL Citations on Scholarly Papers |
3294 | Yuting Shi, Naoya Inoue, Houjing Wei, Yufeng Zhao and Tao Jin | Find-the-Common: A Benchmark for Explaining Visual Patterns from Images |
3295 | Ayush Maheshwari, Ashim Gupta, Amrith Krishna, Atul Kumar Singh, Ganesh Ramakrishnan, Anil Kumar Gourishetty and Jitin Singla | Samayik: A Benchmark and Dataset for English-Sanskrit translation |
3296 | Xunjian Yin, Xinyu Hu, Jin Jiang and Xiaojun Wan | Error-Robust Retrieval for Chinese Spelling Check |
3298 | Shijia Zhou, Leonie Weissweiler, Taiqi He, Hinrich Schütze, David R. Mortensen and Lori Levin | Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons |
3299 | A Pranav, Yan Cong, Emmanuele Chersoni, Yu-Yin Hsu and Alessandro Lenci | Comparing Static and Contextual Distributional Semantic Models on Intrinsic Tasks: An Evaluation on Mandarin Chinese Datasets |
3303 | Yichen Huang and Ekaterina Kochmar | REFeREE: A REference-FREE Model-Based Metric for Text Simplification |
3307 | Yimin Ou and Ping Jian | Effective Integration of Text Diffusion and Pre-Trained Language Models with Linguistic Easy-First Schedule |
3308 | Dalma Galambos and Pal Zsamboki | Training BERT Models to Carry Over a Coding System Developed on One Corpus to Another |
3312 | Yuhao Zhou, Wenxiang Chen, Rui Zheng, Zhiheng Xi, Tao Gui, Qi Zhang and Xuanjing Huang | ORTicket: Let One Robust BERT Ticket Transfer across Different Tasks |
3317 | Sandeep Kumar, Guneet Singh Kohli, Tirthankar Ghosal and Asif Ekbal | Longform Multimodal Lay Summarization of Scientific Papers: Towards Automatically Generating Science Blogs from Research Articles |
3320 | Yi-Pei Chen, Noriki Nishida, Hideki Nakayama and Yuji Matsumoto | Recent Trends in Personalized Dialogue Generation: A Review of Datasets, Methodologies, and Evaluations |
3322 | Andrea Zugarini, Kamyar Zeinalipour, Surya Sai Kadali, Marco Maggini, Marco Gori and Leonardo Rigutini | Clue-Instruct: Text-Based Clue Generation for Educational Crossword Puzzles |
3323 | Muhammad ElNokrashy, Badr AlKhamissi and Mona Diab | Depth-Wise Attention (DWAtt): A Layer Fusion Method for Data-Efficient Classification |
3327 | Hind Saddiki, Samantha Wray and Daisy Li | LexiVault: A repository for psycholinguistic lexicons of lesser-studied languages |
3328 | Hao An, Zhihong Zhu, Xuxin Cheng, Zhiqi Huang and Yuexian Zou | Knowledge-enhanced Prompt Tuning for Dialogue-based Relation Extraction with Trigger and Label Semantic |
3330 | Liuwen Cao, Yi Cai, Jiexin Wang, Hongkui He and Hailin Huang | Beyond Code: Evaluate Thought Steps for Complex Code Generation |
3333 | Zhongni Hou, Xiaolong Jin, Zixuan Li, Long Bai, Jiafeng Guo and Xueqi Cheng | Selective Temporal Knowledge Graph Reasoning |
3337 | Omama Hamad, Khaled Shaban and Ali Hamdi | ASEM: Enhancing Empathy in Chatbot through Attention-based Sentiment and Emotion Modeling |
3338 | Daniil Kosakin, Sergei Obiedkov, Ivan Smirnov, Ekaterina Rakhilina, Anastasia Vyrenkova and Ekaterina Zalivina | Russian Learner Corpus: Towards Error-Cause Annotation for L2 Russian |
3341 | Dongjun Jang, Sungjoo Byun, Hyemi Jo and Hyopil Shin | KIT-19: A Comprehensive Korean Instruction Toolkit on 19 Tasks for Fine-Tuning Korean Large Language Models |
3343 | Akiyo Fukatsu, Yuto Harada and Yohei Oseki | Learning Bidirectional Morphological Inflection Like Humans |
3344 | Tiziano Labruna and Bernardo Magnini | Towards Cost-effective Multi-style Conversations: A Pilot Study in Task-oriented Dialogue Generation |
3348 | Jun Sen Yee, Mario Giulianelli and Arabella J. Sinclair | Efficiency and Effectiveness in Task-Oriented Dialogue: On Construction Repetition, Information Rate, and Task Success |
3352 | Shuohao Lin, Wei Chen, Yunpeng Gao, Zhishu Jiang, Mengqi Liao, Zhiyu Zhang, Shuyuan Zhao and Huaiyu Wan | KPatch: Knowledge Patch to Pre-trained Language Model for Zero-Shot Stance Detection on Social Media |
3353 | Yujie Shao, Xinrong Yao, Xingwei Qu, Chenghua Lin, Shi Wang, Wenhao Huang, Ge Zhang and Jie Fu | CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation |
3354 | Hyunwook Yu, Suhyeon Shin, Junku Heo, Hyuntaek Shin, Hyosu Kim and Mucheol Kim | Action-Concentrated Embedding Framework: This is your captain sign-tokening |
3356 | Mingxu Tao, Quzhe Huang, Kun Xu, Liwei Chen, Yansong Feng and Dongyan Zhao | Probing Multimodal Large Language Models for Global and Local Semantic Representations |
3360 | Nivedita Sethiya, Saanvi Nair and Chandresh Maurya | Indic-TEDST: Datasets and Baselines for Low-Resource Speech to Text Translation |
3361 | Wenjie Zhong, Jason Naradowsky, Hiroya Takamura, Ichiro Kobayashi and Yusuke Miyao | Who Said What: Formalization and Benchmarks for the Task of Quote Attribution |
3362 | Viktoria Ondrejova and Marek Suppa | SlovakSum: A Large Scale Slovak Summarization Dataset |
3363 | Hongzhi Xu, Jingxia Lin, Sameer Pradhan, Mitchell Marcus and Ming Liu | Annotating Chinese Word Senses with English WordNet: A Practice on OntoNotes Chinese Sense Inventories |
3364 | Hongyu Guo, Wenbo Shang, Xueyao Zhang and Binyang Li | MUCH: A Multimodal Corpus Construction for Conversational Humor Recognition Based on Chinese Sitcom |
3368 | Rémi Cardon, Trang Tran Hanh Pham, Julien Zakhia Doueihi and Thomas François | Contribution of Move Structure to Automatic Genre Identification: an Annotated Corpus of French Tourism Websites |
3372 | Hao Niu, Maoyi Wang, Yun Xiong, Biao Yang, Xing Jia and Zhonglei Guo | Linking Adaptive Structure Induction and Neuron Filtering: A Spectral Perspective for Aspect-based Sentiment Analysis |
3373 | Tianwen Tang, Tong Zhu, Haodong Liu, Yin Bai, Jia Cheng and Wenliang Chen | MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking |
3374 | Wenlong Zhao, Debanjan Mondal, Niket Tandon, Danica Dillion, Kurt Gray and Yuling Gu | VALUEALIGN: A Large-scale Dataset for Multi-Cultural Human Value Alignment |
3375 | Yuhan Song and Houfeng Wang | Would You Like to Make a Donation? A Dialogue System to Persuade You to Donate |
3378 | Fauzan Nayeem Farooqui, Thanmay Jayakumar, Pulkit Mathur and Mansi A. Radke | Leveraging Linguistically Enhanced Embeddings for Open Information Extraction |
3381 | Yi Fung, Anoop Kumar, Aram Galstyan, Heng Ji and Prem Natarajan | Agenda-Driven Question Generation: A Case Study in the Courtroom Domain |
3387 | Gopichand Kanumolu, Lokesh Madasu, Nirmal Surange and Manish Shrivastava | TeClass: A Human-Annotated Relevance-based Headline Classification and Generation Dataset for Telugu |
3388 | Xiaoyu Hu, Xu Zhang, Zexu Lin and Deyu Zhou | Reduce Redundancy then Rerank: Enhancing Code Summarization with a Novel Pipeline Framework |
3392 | Andrea Bacciu, Cesare Campagnano, Giovanni Trappolini and Fabrizio Silvestri | DanteLLM: Let's Push Italian LLM Research Forward! |
3393 | Matiss Rikters, Rinalds Vīksna and Edison Marrese-Taylor | Annotations for Exploring Food Tweets From Multiple Aspects |
3401 | Tolulope Ogunremi, Kọ́lá Túbọ̀sún, Anuoluwapo Aremu, Iroro Orife and David Ifeoluwa Adelani | ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus |
3418 | Yiming Zhang, Hantao Yang, Haobo Wang and Jake Zhao | Fast Adaptation via Prompted Data: An Efficient Cross-Domain Fine-tuning Method for Large Language Models |
3421 | Iulia Petrariu and Sergiu Nisioi | A Multilingual Parallel Corpus for Aromanian |
3422 | Sung-Min Lee, Eunhwan Park, DongHyeon Jeon, Inho Kang and Seung-Hoon Na | RADCoT: Retrieval-Augmented Distillation to Specialization Models for Generating Chain-of-Thoughts in Query Expansion |
3425 | Gerardo Ocampo Diaz and Jessica Ouyang | Measuring Cross-Text Cohesion for Segmentation Similarity Scoring |
3429 | Xiaojun Ye, Junhao Chen, Xiang Li, Haidong Xin, Chao Li, Sheng Zhou and Jiajun Bu | MMAD: Multi-modal Movie Audio Description |
3436 | Somnath Banerjee, Maulindu Sarkar, Punyajoy Saha, Binny Mathew and Animesh Mukherjee | InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks |
3439 | Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li and Weiming Hu | Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval |
3452 | Xiangping Zheng, Bo Wu, Alex X. Zhang and Wei Li | Hypergraph-Based Session Modeling: A Multi-Collaborative Self-Supervised Approach for Enhanced Recommender Systems |
3470 | Špela Arhar Holdt, Jaka Čibej, Kaja Dobrovoljc, Tomaž Erjavec, Polona Gantar, Simon Krek, Tina Munda, Nejc Robida, Luka Terčon and Slavko Zitnik | SUK 1.0: A New Training Corpus for Linguistic Annotation of Modern Standard Slovene |
3471 | Zofia Malisz, Jan Foremski and Małgorzata Kul | PRODIS - a speech database and a phoneme-based language model for the study of predictability effects in Polish |