| 08:00-09:00 | Registration all day | | | |
| 09:00 - 10:40 | Opening Session (Room Auditorium G. Agnelli + Broadcast to other rooms) | | | |
| 10:40 - 11:00 | Coffee break | | | |
| 11:00 - 11:20 | D1-S2-R1 - Corpora and Annotation I (Chair: Siyao Peng) - Room: Auditorium G. Agnelli | 39 | Geographically-Informed Language Identification | Jonathan Dunn and Lane Edwards-Brown |
| 11:20 - 11:40 | | 72 | Emotags: Computer-Assisted Verbal Labelling of Expressive Audiovisual Utterances for Expressive Multimodal TTS | Gérard Bailly, Romain Legrand, Martin Lenglet, Frédéric Elisei, Maëva Hueber and Olivier Perrotin |
| 11:40 - 12:00 | | 136 | Data Collection Pipeline for Low-Resource Languages: A Case Study on Constructing a Tetun Text Corpus | Gabriel de Jesus and Sérgio Sobral Nunes |
| 12:00 - 12:20 | | 274 | GlotScript: A Resource and Tool for Low Resource Writing System Identification | Amir Hossein Kargaran, François Yvon and Hinrich Schütze |
| 12:20 - 12:40 | | 363 | Collecting Linguistic Resources for Assessing Children's Pronunciation of Nordic Languages | Anne Marte Haug Olstad, Anna Smolander, Sofia Strömbergsson, Sari Ylinen, Minna Lehtonen, Mikko Kurimo, Yaroslav Getman, Tamás Grósz, Xinwei Cao, Torbjørn Svendsen and Giampiero Salvi |
| 11:00 - 11:20 | D1-S2-R2 - Applications Involving LRs and Evaluation I (Chair: David Adelani) Room: 500 | 44 | Resolving Legalese: A Multilingual Exploration of Negation Scope Resolution in Legal Documents | Ramona Christen, Anastassia Shaitarova, Matthias Stürmer and Joel Niklaus |
| 11:20 - 11:40 | | 853 | Qsnail: A Questionnaire Dataset for Sequential Question Generation | Yan Lei, Liang Pang, Yuanzhuo Wang, Huawei Shen and Xueqi Cheng |
| 11:40 - 12:00 | | 1720 | Self-reported demographics and discourse dynamics in a persuasive online forum | Agnieszka Falenska, Eva Maria Vecchi and Gabriella Lapesa |
| 12:00 - 12:20 | | 1969 | Detecting Critical Errors Considering Cross-Cultural Factors in English-Korean Translation | Sugyeong Eo, Jungwoo Lim, Chanjun Park, DaHyun Jung, Seonmin Koo, Hyeonseok Moon, Jaehyung Seo and Heuiseok Lim |
| 12:20 - 12:40 | | 2002 | OpenMSD: Towards Multilingual Scientific Documents Similarity Measurement | Yang Gao, Ji Ma, Ivan Korotkov, Keith Hall, Dana Alon and Donald Metzler |
| 11:00 - 11:20 | D1-S2-R3 - Natural Language Generation, Summarization and Simplification I (Chair: David Traum) Room: Londra | 21 | SciNews: From Scholarly Complexities to Public Narratives -- A Dataset for Scientific News Report Generation | Dongqi Pu, Yifan Wang, Jia E. Loy and Vera Demberg |
| 11:20 - 11:40 | | 176 | LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation | Jennifer A. Bishop, Sophia Ananiadou and Qianqian Xie |
| 11:40 - 12:00 | | 279 | Enhancing Court View Generation with Knowledge Injection and Guidance | Ang Li, Yiquan Wu, Yifei Liu, Kun Kuang, Fei Wu and Ming Cai |
| 12:00 - 12:20 | | 470 | Diversifying Question Generation over Knowledge Base via External Natural Questions | Shasha Guo, Jing Zhang, Xirui Ke, Cuiping Li and Hong Chen |
| 12:20 - 12:40 | | 1017 | EROS: Entity-Driven Controlled Policy Document Summarization | Joykirat Singh, Sehban Fazili, Rohan Jain and Md. Shad Akhtar |
| 11:00 - 11:20 | D1-S2-R4 - Knowledge Discovery / Representation II (Chair: Ruochen Zhang) Room: Istanbul | 1815 | Few-shot Link Prediction on Hyper-relational Facts | Jiyao Wei, Saiping Guan, Xiaolong Jin, Jiafeng Guo and Xueqi Cheng |
| 11:20 - 11:40 | | 2887 | EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs | Cheng Jiayang, Lin Qiu, Chunkit Chan, Xin Liu, Yangqiu Song and Zheng Zhang |
| 11:40 - 12:00 | | 2935 | RENN: A Rule Embedding Enhanced Neural Network Framework for Temporal Knowledge Graph Completion | Linlin Zong, Zhenrong Xie, Chi Ma, Xinyue Liu, Xianchao Zhang and Bo Xu |
| 12:00 - 12:20 | | 3088 | CMNEE: A Large-Scale Document-Level Event Extraction Dataset based on Open-Source Chinese Military News | Mengna Zhu, Zijie Xu, Kaisheng Zeng, Kaiming Xiao, Mao Wang, Wenjun Ke and Hongbin Huang |
| 12:20 - 12:40 | | 1743 | From Linguistic Linked Data to Big Data | Dimitar Trajanov, Elena Apostol, Radovan Garabík, Katerina Gkirtzou, Dagmar Gromann, Chaya Liebeskind, Cosimo Palma, Michael Rosner, Alexia Sampri, Gilles Sérasset, Blerina Spahiu, Ciprian-Octavian Truică and Giedre Valunaite Oleskeviciene |
| 11:00 - 11:20 | D1-S2-R5 - Multilinguality, Machine Translation, and Translation Aids I (Chair: Chenhui Chu) Room: Madrid | 252 | TAeKD: Teacher Assistant Enhanced Knowledge Distillation for Closed-Source Multilingual Neural Machine Translation | Bo Lv, Xin Liu, Kaiwen Wei, Ping Luo and Yue Yu |
| 11:20 - 11:40 | | 312 | Teaching Large Language Models to Translate on Low-resource Languages with Textbook Prompting | Ping Guo, Yubing Ren, Yue Hu, Yunpeng Li, Jiarui Zhang, Xingsheng Zhang and Heyan Huang |
| 11:40 - 12:00 | | 394 | CORI: CJKV Benchmark with Romanization Integration - A step towards Cross-lingual Transfer Beyond Textual Scripts | Hoang Nguyen, Chenwei Zhang, Ye Liu, Natalie Parde, Eugene Rohrbaugh and Philip S. Yu |
| 12:00 - 12:20 | | 1133 | Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems | Bo-Han Lu, Yi-Hsuan Lin, Annie Lee and Richard Tzong-Han Tsai |
| 12:20 - 12:40 | | 1453 | Humanistic Buddhism Corpus: A Challenging Domain-Specific Dataset of English Translations for Classical and Modern Chinese | Youheng W. Wong, Natalie Parde and Erdem Koyuncu |
| 11:00 - 11:20 | D1-S2-R6 - Offensive and Harmful Language Detection and Analysis (Chair: Preslav Nakov) Room: Berlino | 429 | Leveraging Pre-existing Resources for Data-Efficient Counter-Narrative Generation in Korean | Seungyoon Lee, Chanjun Park, DaHyun Jung, Hyeonseok Moon, Jaehyung Seo, Sugyeong Eo and Heuiseok Lim |
| 11:20 - 11:40 | | 1195 | Enhance Robustness of Language Models Against Variation Attack through Graph Integration | Zi Xiong, Lizhi Qing, Yangyang Kang, Jiawei Liu, Hongsong Li, Changlong Sun, Xiaozhong Liu and Wei Lu |
| 11:40 - 12:00 | | 1348 | Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes | Isar Nejadgholi, Kathleen C. Fraser, Anna Kerkhof and Svetlana Kiritchenko |
| 12:00 - 12:20 | | 1787 | Humans Need Context, What About Machines? Investigating Conversational Context in Abusive Language Detection | Tom Bourgeade, Zongmin Li, Farah Benamara, Véronique Moriceau, Jian Su and Aixin Sun |
| 12:20 - 12:40 | | 2648 | PejorativITy: Disambiguating Pejorative Epithets to Improve Misogyny Detection in Italian Tweets | Arianna Muti, Federico Ruggeri, Cagri Toraman, Alberto Barrón-Cedeño, Samuel Algherini, Lorenzo Musetti, Silvia Ronchi, Gianmarco Saretto and Caterina Zapparoli |
| 11:00-12:40 | D1-S2-P1 - CL and Linguistic Theories, Cognitive Modeling and Psycholinguistics I (Chair: Sara Tonelli) Room: Poster Area I (Pavilion 1 - Lingotto Fiere) | 1 | Evaluating Prompting Strategies for Grammatical Error Correction Based on Language Proficiency | Min Zeng, Jiexin Kuang, Mengyang Qiu, Jayoung Song and Jungyeul Park |
| | | 170 | Principal Component Analysis as a Sanity Check for Bayesian Phylolinguistic Reconstruction | Yugo Murawaki |
| | | 360 | An Argument for Symmetric Coordination from Dependency Length Minimization: A Replication Study | Adam Przepiórkowski, Magdalena Borysiak and Adam Głowacki |
| | | 650 | The Influence of Automatic Speech Recognition on Linguistic Features and Automatic Alzheimer's Disease Detection from Spontaneous Speech | Jonathan Heitz, Gerold Schneider and Nicolas Langer |
| | | 685 | Multimodal Language Models Show Evidence of Embodied Simulation | Cameron R. Jones and Sean Trott |
| | | 1062 | Task-Oriented Paraphrase Analytics | Marcel Gohsen, Matthias Hagen, Martin Potthast and Benno Stein |
| | | 1085 | Do Neural Language Models Inferentially Compose Concepts the Way Humans Can? | Amilleah Rodriguez, Shaonan Wang and Liina Pylkkänen |
| | | 1207 | The Contextual Variability of English Nouns: The Impact of Categorical Specificity beyond Conceptual Concreteness | Giulia Rambelli and Marianna Bolognesi |
| | | 1454 | Towards Comprehensive Language Analysis for Clinically Enriched Spontaneous Dialogue | Baris Karacan, Ankit Aich, Avery Quynh, Amy Pinkham, Philip Harvey, Colin Depp and Natalie Parde |
| | | 1565 | Context Shapes Emergent Communication about Concepts at Different Levels of Abstraction | Kristina Kobrock, Xenia Isabel Ohmer, Elia Bruni and Nicole Gotzner |
| | | 1566 | A Study on How Attention Scores in the BERT Model are Aware of Lexical Categories in Syntactic and Semantic Tasks on the GLUE Benchmark | Dongjun Jang, Sungjoo Byun and Hyopil Shin |
| | | 1664 | Fine-grained Classification of Circumstantial Meanings within the Prague Dependency Treebank Annotation Scheme | Marie Mikulova |
| | | 2382 | C-Journal: A Journaling Application for Detecting and Classifying Cognitive Distortions using Deep-Learning based on a Crowd-sourced Dataset | Nada Elsharawi and Alia El Bolock |
| 11:00-12:40 | D1-S2-P1 - Digital Humanities and Cultural Heritage I (Chair: Sara Tonelli) Room: Poster Area I (Pavilion 1 - Lingotto Fiere) | 1749 (D) | Towards Building the LEMI Readability Platform for Children's Literature in the Romanian Language | Madalina Chitez, Mihai Dascalu, Aura Cristina Udrea, Cosmin Strilețchi, Karla Csürös, Roxana Rogobete and Alexandru Oravițan |
| | | 2864 | Re-evaluating the Tomes for the Times | Ryan Brate, Marieke van Erp and Antal van den Bosch |
| | | 145 | Restoring Ancient Ideograph: A Multimodal Multitask Neural Network Approach | Siyu Duan, Jun Wang and Qi Su |
| | | 675 | The Swedish Parliament Corpus 1867 – 2022 | Väinö Aleksi Yrjänäinen, Fredrik Mohammadi Norén, Robert Borges, Johan Jarlbrink, Lotta Åberg Brorsson, Anders P. Olsson, Pelle Snickars and Måns Magnusson |
| | | 1097 | Introducing a Parsed Corpus of Historical High German | Christopher D. Sapp, Elliott Evans, Rex Sprouse and Daniel Dakota |
| | | 1208 | A Computational Analysis of the Dehumanisation of Migrants from Syria and Ukraine in Slovene News Media | Jaya Caporusso, Damar Hoogland, Mojca Brglez, Boshko Koloski, Matthew Purver and Senja Pollak |
| | | 1220 | Lemmatisation of Medieval Greek: Against the Limits of Transformer's Capabilities? | Colin Swaelens, Ilse de Vos and Els Lefever |
| | | 1323 | Revisiting The Classics: A Study on Identifying and Rectifying Gender Stereotypes in Rhymes and Poems | Aditya Narayan Sankaran, Vigneshwaran Shankaran, Sampath Lonka and Rajesh Sharma |
| | | 1359 | Mapping the Past: Geographically Linking an Early 20th Century Swedish Encyclopedia with Wikidata | Axel Ahlin, Alfred Myrne Blåder and Pierre Nugues |
| | | 1898 | A Dataset for Named Entity Recognition and Entity Linking in Chinese Historical Newspapers | Baptiste Blouin, Cécile Armand and Christian Henriot |
| | | 2186 | Using Bibliodata LODification to Create Metadata-Enriched Literary Corpora in Line with FAIR Principles | Agnieszka Karlinska, Cezary Rosiński, Marek Kubis, Patryk Hubar and Jan Wieczorek |
| | | 2983 | A Matter of Perspective: Building a Multi-Perspective Annotated Dataset for the Study of Literary Quality | Yuri Bizzoni, Pascale Feldkamp Moreira, Ida Marie S. Lassen, Mads Rosendahl Thomsen and Kristoffer Nielbo |
| | | 1619 | Deciphering Emotional Landscapes in the Iliad: A Novel French-Annotated Dataset for Emotion Recognition | Davide Picca and John Pavlopoulos |
| | | 1163 | Linguistic Survey of India and Polyglotta Africana: Two Retrostandardized Digital Editions of Large Historical Collections of Multilingual Wordlists | Robert Forkel, Johann-Mattis List, Christoph Rzymski and Guillaume Segerer |
| 11:00-12:40 | D1-S2-P1 - Discourse and Pragmatics (Chair: Sara Tonelli) Room: Poster Area I (Pavilion 1 - Lingotto Fiere) | 2148 | How to do politics with words: Investigating speech acts in parliamentary debates | Ines Reinig, Ines Rehbein and Simone Paolo Ponzetto |
| | | 638 | Annotating Customer-Oriented Behaviour in Call Centre Sales Dialogues | Jutta Stock, Volha Petukhova and Dietrich Klakow |
| | | 763 | Developing a Rhetorical Structure Theory Treebank for Czech | Lucie Poláková, Jiří Mírovský, Šárka Zikánová and Eva Hajičová |
| | | 1082 | Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles | Abhijnan Nath, Huma Jamil, Shafiuddin Rehan Ahmed, George Arthur Baker, Rahul Ghosh, James H. Martin, Nathaniel Blanchard and Nikhil Krishnaswamy |
| | | 1141 | Announcing the Prague Discourse Treebank 3.0 | Pavlína Synková, Jiří Mírovský, Lucie Poláková and Magdaléna Rysová |
| | | 1147 | An Empirical Study of Synthetic Data Generation for Implicit Discourse Relation Recognition | Kazumasa Omura, Fei Cheng and Sadao Kurohashi |
| | | 1200 | Enhancing Unrestricted Cross-Document Event Coreference with Graph Reconstruction Networks | Loic de Langhe, Orphee De Clercq and Veronique Hoste |
| | | 1321 | Intention and Face in Dialog | Adil Soubki and Owen Rambow |
| | | 1568 | Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue: An Empirical Study | Yaxin Fan, Feng Jiang, Peifeng Li and Haizhou Li |
| | | 1591 | Cost-Effective Discourse Annotation in the Prague Czech–English Dependency Treebank | Jiří Mírovský, Pavlína Synková, Lucie Poláková and Marie Paclíková |
| | | 1727 | SIGA: A Naturalistic NLI Dataset of English Scalar Implicatures with Gradable Adjectives | Rashid Nizamani, Sebastian Schuster and Vera Demberg |
| | | 1781 | How Diplomats Dispute: The UN Security Council Conflict Corpus | Karolina Zaczynska, Peter Bourgonje and Manfred Stede |
| | | 2061 | J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution | Nobuhiro Ueda, Hideko Habe, Akishige Yuguchi, Seiya Kawano, Yasutomo Kawanishi, Sadao Kurohashi and Koichiro Yoshino |
| | | 2093 | Universal Anaphora: The First Three Years | Massimo Poesio, Maciej Ogrodniczuk, Vincent Ng, Sameer Pradhan, Juntao Yu, Nafise Sadat Moosavi, Silviu Paun, Amir Zeldes, Anna Nedoluzhko, Michal Novák, Martin Popel, Zdeněk Žabokrtský and Daniel Zeman |
| | | 2214 | QCAW 1.0: Building a Qatari Corpus of Student Argumentative Writing | Wajdi Zaghouani, Abdelhamid Ahmed, Xiao Zhang and Lameya Rezk |
| | | 2304 | DiscoGeM 2.0: A Parallel Corpus of English, German, French and Czech Implicit Discourse Relations | Frances Yung, Merel Scholman, Šárka Zikánová and Vera Demberg |
| | | 2514 | Experimental versus In-Corpus Variation in Referring Expression Choice | T. Mark Ellison and Fahime Same |
| | | 2605 | Multilingual Coreference Resolution in Low-resource South Asian Languages | Ritwik Mishra, Pooja Desur, Rajiv Ratn Shah and Ponnurangam Kumaraguru |
| | | 2879 | Building a Database of Conversational Routines | Polina Bychkova, Alyaxey Yaskevich, Serafima Gyulasaryan and Ekaterina Rakhilina |
| | | 2566 | Polish Discourse Corpus (PDC): Corpus Design, ISO-Compliant Annotation, Data Highlights, and Parser Development | Maciej Ogrodniczuk, Aleksandra Tomaszewska, Daniel Ziembicki, Sebastian Żurowski, Ryszard Tuora and Aleksandra Zwierzchowska |
| | | 2526 | Conceptual Pacts for Reference Resolution using Small, Dynamically Constructed Language Models: A Study in Puzzle Building Dialogues | Julian Hough, Sina Zarrieß, Casey Kennington, David Schlangen and Massimo Poesio |
| 11:00-12:40 | D1-S2-P1 - Policy issues, Ethics, Legal Issues, Bias Analysis (Chair: Sara Tonelli) Room: Poster Area I (Pavilion 1 - Lingotto Fiere) | 239 | To Share or Not to Share: What Risks Would Laypeople Accept to Give Sensitive Data to Differentially-Private NLP Systems? | Christopher Weiss, Frauke Kreuter and Ivan Habernal |
| | | 284 | Evidence-guided Inference for Neutralized Zero-shot Transfer | Xiaotong Feng, Meng-Fen Chiang, Wang-Chien Lee and Zixin Kuang |
| | | 478 | A Canonical Form for Flexible Multiword Expressions | Jan Odijk |
| | | 1398 | Do Large Language Models Understand Mansplaining? Well, actually... | Carla Perez Almendros and Jose Camacho-Collados |
| | | 1687 | RuBia: A Russian Language Bias Detection Dataset | Veronika Grigoreva, Anastasiia Ivanova, Ilseyar Alimova and Ekaterina Artemova |
| | | 1764 | European Language Grid: One Year After | Georg Rehm, Stelios Piperidis, Dimitris Galanis, Penny Labropoulou, Maria Giagkou, Miltos Deligiannis, Leon Voukoutis, Martin Courtois, Julian Moreno-Schneider and Katrin Marheinecke |
| | | 1798 | Is Gender Reference Gender-specific? Studies in a Polar Domain | Manfred Klenner and Dylan Massey |
| | | 2044 | Curation of Benchmark Templates for Measuring Gender Bias in Named Entity Recognition Models | Ana Cimitan, Ana Alves Pinto and Michaela Geierhos |
| | | 2158 | LinguaMeta: Unified Metadata for Thousands of Languages | Sandy Ritchie, Daan van Esch, Uche Okonkwo, Shikhar Vashishth and Emily Drummond |
| | | 2538 | Quite Good, but Not Enough: Nationality Bias in Large Language Models - A Case Study of ChatGPT | Shucheng Zhu, Weikang Wang and Ying Liu |
| | | 2783 | Pseudonymization Categories across Domain Boundaries | Maria Irena Szawerna, Simon Dobnik, Therese Lindström Tiedemann, Ricardo Muñoz Sánchez, Xuan-Son Vu and Elena Volodina |
| | | 3252 | Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in | Utkarsh Agarwal, Kumar Tanmay, Aditi Khandelwal and Monojit Choudhury |
| | | 3280 | ABLE: Agency-BeLiefs Embedding to Address Stereotypical Bias Through Awareness Instead of Obliviousness | Michelle YoungJin Kim, Junghwan Kim and Kristen Johnson |
| | | 119 | Language Technologies as if People Mattered: Centering Communities in Language Technology Development | Nina Markl, Lauren Hall-Lew and Catherine Lai |
| | | 354 | Your Stereotypical Mileage may Vary: Practical Challenges of Evaluating Biases in Multiple Languages and Cultural Contexts | Karen Fort, Laura Alonso Alemany, Luciana Benotti, Julien Bezançon, Claudia Borg, Marthese Borg, Yongjian Chen, Fanny Ducel, Yoann Dupont, Guido Ivetta, Zhijian Li, Margot Mieskes, Marco Naguib, Yuyan Qian, Matteo Radaelli, Wolfgang S. Schmeisser-Nieto, Emma Raimundo Schulz, Thiziri Saci, Sarah Saidi, Javier Torroba Marchante, Shilin Xie, Sergio E. Zanotto and Aurélie Névéol |
| | | 906 | Large Language Models are Echo Chambers | Jan Nehring, Aleksandra Gabryszak, Pascal Jürgens, Aljoscha Burchardt, Stefan Schaffer, Matthias Spielkamp and Birgit Stark |
| | | 933 | Common European Language Data Space | Georg Rehm, Stelios Piperidis, Khalid Choukri, Andrejs Vasiļjevs, Katrin Marheinecke, Victoria Arranz, Aivars Bērziņš, Miltos Deligiannis, Dimitris Galanis, Maria Giagkou, Katerina Gkirtzou, Dimitris Gkoumas, Annika Grützner-Zahn, Athanasia Kolovou, Penny Labropoulou, Andis Lagzdiņš, Elena Leitner, Valérie Mapelli, Hélène Mazo, Simon Ostermann, Stefania Racioppa, Mickaël Rigault and Leon Voukoutis |
| | | 1038 (D) | A Luxembourgish corpus as a Gender Bias Evaluation Testset | Dimitra Anastasiou, Carole Blond-Hanten and Marie Gallais |
| | | 2226 | Impoverished Language Technology: The Lack of (Social) Class in NLP | Amanda Cercas Curry, Zeerak Talat and Dirk Hovy |
| | | 2507 | Are Text Classifiers Xenophobic? A Country-Oriented Bias Detection Method With Least Confounding Variables | Valentin Barriere and Sebastian Cifuentes |
| 11:00-12:40 | D1-S2-P1 - Speech Resources and Processing I (Chair: Sara Tonelli) Room: Poster Area I (Pavilion 1 - Lingotto Fiere) | 380 | PWESuite: Phonetic Word Embeddings and Tasks They Facilitate | Vilém Zouhar, Kalvin Chang, Chenxuan Cui, Nate B. Carlson, Nathaniel Romney Robinson, Mrinmaya Sachan and David R. Mortensen |
| | | 874 | CoANZSE Audio: Creation of an Online Corpus for Linguistic and Phonetic Analysis of Australian and New Zealand Englishes | Steven Coats |
| | | 946 | Samrómur Milljón: An ASR Corpus of One Million Verified Read Prompts in Icelandic | Carlos Daniel Hernandez Mena, Þorsteinn Daði Gunnarsson and Jon Gudnason |
| | | 1014 | Becoming a High-Resource Language in Speech: The Catalan Case in the Common Voice Corpus | Carme Armentano-Oller, Montserrat Marimon and Marta Villegas |
| | | 1030 | TunArTTS: Tunisian Arabic Text-To-Speech Corpus | Imen Laouirine, Rami Kammoun and Fethi Bougares |
| | | 1543 | Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview | Heyang Liu, Yanfeng Wang and Yu Wang |
| | | 2006 | Correcting Pronoun Homophones with Subtle Semantics in Chinese Speech Recognition | Zhaobo Zhang, Rui Gan, Pingpeng Yuan and Hai Jin |
| | | 2040 | Evaluating Workflows for Creating Orthographic Transcripts for Oral Corpora by Transcribing from Scratch or Correcting ASR-Output | Jan Gorisch and Thomas Schmidt |
| | | 2045 | Exploring Pathological Speech Quality Assessment with ASR-Powered Wav2Vec2 in Data-Scarce Context | Tuan Nguyen, Corinne Fredouille, Alain Ghio, Mathieu Balaguer and Virginie Woisard |
| | | 2207 | Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants | Chloe Sekkat, Fanny Leroy, Salima Mdhaffar, Blake Perry Smith, Yannick Estève, Joseph Dureau and Alice Coucke |
| | | 2440 | TARIC-SLU: A Tunisian Benchmark Dataset For Spoken Language Understanding | Salima Mdhaffar, Fethi Bougares, Renato De Mori, Salah Zaiem, Mirco Ravanelli and Yannick Estève |
| | | 2531 | ALLIES: a Speech Corpus for Segmentation, Speaker Diarization, Speech Recognition and Speaker Change detection | Marie Tahon, Anthony Larcher, Martin Lebourdais, Fethi Bougares, Anna Silnova and Pablo Gimeno |
| | | 2891 (D) | ART: The Alternating Reading Task Corpus for Speech Entrainment and Imitation | Zheng Byron Yuan, Dorina de Jong, Ruitao Feng, Štefan Beňuš, Noël Nguyen, Róbert Sabo, Luciano Fadiga and Alessandro D'Ausilio |
| | | 3078 | Code-Mixed Text Augmentation for Latvian ASR | Martins Kronis, Askars Salimbajevs and Mārcis Pinnis |
| | | 3106 | Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness | Xincan Feng and Akifumi Yoshimoto |
| | | 1639 | Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition | Yash Jain, David M. Chan, Pranav Dheram, Aparna Khare, Olabanji Shonibare, Venkatesh Ravichandran and Shalini Ghosh |
| 12:40-13:20 | D1-S1-RE1 - Applications Involving LRs and Evaluation I (Chair: *TBD*) Zoom: Link01 - Virtual Room1 | 258 | IndicFinNLP: Financial Natural Language Processing for Indian Languages | Sohom Ghosh, Arnab Maji, Aswartha Narayana and Sudip Kumar Naskar |
| | | 489 | User Guide for KOTE: Korean Online That-gul Emotions Dataset | Duyoung Jeon, Junho Lee and Cheongtag Kim |
| | | 1498 | Positive and Risky Message Assessment for Music Products | Yigeng Zhang, Mahsa Shafaei, Fabio Gonzalez and Thamar Solorio |
| | | 1499 | Interpreting Themes from Educational Stories | Yigeng Zhang, Fabio Gonzalez and Thamar Solorio |
| 12:40-13:20 | D1-S1-RE1 - Applications Involving LRs and Evaluation II (Chair: *TBD*) Zoom: Link01 - Virtual Room2 | 1563 | Towards More Realistic Chinese Spell Checking with New Benchmark and Specialized Expert Model | Yue Wang, Zilong Zheng, Juntao Li, Zhihui Liu, Jinxiong Chang, Qishen Zhang, Zhongyi Liu, Guannan Zhang and Min Zhang |
| | | 1870 | AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports | Lukas Lange, Marc Müller, Ghazaleh Haratinezhad Torbati, Dragan Milchevski, Patrick Grau, Subhash Chandra Pujari and Annemarie Friedrich |
| | | 1921 | Prompt-based Generation of Natural Language Explanations of Synthetic Lethality for Cancer Drug Discovery | Ke Zhang, Yimiao Feng and Jie Zheng |
| | | 2029 | Reference-less Analysis of Context Specificity in Translation with Personalised Language Models | Sebastian Vincent, Rowanne Sumner, Alice Dowek, Charlotte Prescott, Emily Preston, Chris Bayliss, Chris Oakley and Carolina Scarton |
| 12:40-13:20 | D1-S1-RE1 - Applications Involving LRs and Evaluation III (Chair: *TBD*) Zoom: Link01 - Virtual Room3 | 2288 | WordNet under Scrutiny: Dictionary Examples in the Era of Large Language Models | Fatemah Yousef Almeman, Steven Schockaert and Luis Espinosa Anke |
| | | 2339 | Using Persuasive Writing Strategies to Explain and Detect Health Misinformation | Danial Kamali, Joseph D. Romain, Huiyi Liu, Wei Peng, Jingbo Meng and Parisa Kordjamshidi |
| | | 2470 | Sarcasm Detection in a Disaster Context | Tiberiu Sosea, Junyi Jessy Li and Cornelia Caragea |
| | | 2843 | MKeCL: Medical Knowledge-Enhanced Contrastive Learning for Few-shot Disease Diagnosis | Yutian Zhao, Huimin Wang, Xian Wu and Yefeng Zheng |
| 12:40-13:20 | D1-S1-RE1 - Applications Involving LRs and Evaluation IV (Chair: *TBD*) Zoom: Link01 - Virtual Room4 | 2705 | Assessing Online Writing Feedback Resources: Generative AI vs. Good Samaritans | Shabnam Behzad, Omid Kashefi and Swapna Somasundaran |
| | | 2819 | ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search | Zehan Li, Jianfei Zhang, Chuantao Yin, Yuanxin Ouyang and Wenge Rong |
| | | 3097 | Tree-Instruct: A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment | Yingxiu Zhao, Bowen Yu, Binyuan Hui, Haiyang Yu, Minghao Li, Fei Huang, Nevin L. Zhang and Yongbin Li |
| | | 3179 | Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents via Semantic-Oriented Hierarchical Graphs | Fengbin Zhu, Chao Wang, Fuli Feng, Zifeng Ren, Moxin Li and Tat-Seng Chua |
| 12:40-13:20 | D1-S1-RE1 - Applications Involving LRs and Evaluation V (Chair: *TBD*) Zoom: Link01 - Virtual Room5 | 3266 | Transformer-based Joint Modelling for Automatic Essay Scoring and Off-Topic Detection | Sourya Dipta Das, Yash A. Vadi and Kuldeep Yadav |
| | | 3418 | Fast Adaptation via Prompted Data: An Efficient Cross-Domain Fine-tuning Method for Large Language Models | Yiming Zhang, Hantao Yang, Haobo Wang and Jake Zhao |
| | | 3016 | Evaluating the Efficacy of Large Acoustic Model for Documenting Non-Orthographic Tribal Languages in India | Tonmoy Rajkhowa, Amartya Roy Chowdhury, Hrishikesh Ravindra Karande and S. R. Mahadeva Prasanna |
| | | 3296 | Error-Robust Retrieval for Chinese Spelling Check | Xunjian Yin, Xinyu Hu, Jin Jiang and Xiaojun Wan |
| 12:40-13:20 | D1-S1-RE2 - CL and Linguistic Theories, Cognitive Modeling and Psycholinguistics I (Chair: *TBD*) Zoom: Link02 - Virtual Room1 | 656 | Which Sense Dominates Multisensory Semantic Understanding? A Brain Decoding Study | Dandan Huang, Lu Cao, Zhenting Li and Yue Zhang |
| | | 785 | Computational Modelling of Plurality and Definiteness in Chinese Noun Phrases | Yuqi Liu, Guanyi Chen and Kees van Deemter |
| | | 1266 | A Quantum-Inspired Matching Network with Linguistic Theories for Metaphor Detection | Wenbo Qiao, Peng Zhang and ZengLai Ma |
| | | 1347 | Phonotactic Complexity across Dialects | Ryan Soh-Eun Shim, Kalvin Chang and David R. Mortensen |
| 12:40-13:20 | D1-S1-RE2 - CL and Linguistic Theories, Cognitive Modeling and Psycholinguistics II (Chair: *TBD*) Zoom: Link02 - Virtual Room2 | 1701 | Error Analysis of NLP Models and Non-Native Speakers of English Identifying Sarcasm in Reddit Comments | Oliver Cakebread-Andrews, Le An Ha, Ingo Frommholz and Burcu Can |
| | | 2264 | Context Matters: Enhancing Metaphor Recognition in Proverbs | Gamze Goren and Carlo Strapparava |
| | | 2878 | NutFrame: Frame-based Conceptual Structure Induction with LLMs | Shaoru Guo, Yubo Chen, Kang Liu, Ru Li and Jun Zhao |
| | | 2886 | Decoding Probing: Revealing Internal Linguistic Structures in Neural Language Models using Minimal Pairs | Linyang He, Peili Chen, Ercong Nie, Yuanning Li and Jonathan R. Brennan |
| 12:40-13:20 | D1-S1-RE3 - Corpora and Annotation I (Chair: *TBD*) Zoom: Link03 - Virtual Room1 | 424 | Automatic Data Visualization Generation from Chinese Natural Language Questions | Yan Ge, Victor Junqiu Wei, Yuanfeng Song, Jason Chen Zhang and Raymond Chi-Wing Wong |
| | | 436 | Analyzing the Dynamics of Climate Change Discourse on Twitter: A New Annotated Corpus and Multi-Aspect Classification | Shuvam Shiwakoti, Surendrabikram Thapa, Kritesh Rauniyar, Akshyat Shah, Aashish Bhandari and Usman Naseem |
| | | 851 | A Corpus and Method for Chinese Named Entity Recognition in Manufacturing | Ruiting Li, Peiyan Wang, Libang Wang, Danqingxin Yang and Dongfeng Cai |
| | | 961 | Evaluation Dataset for Lexical Translation Consistency in Chinese-to-English Document-level Translation | Xiangyu Lei, Junhui Li, Shimin Tao and Hao Yang |
| 12:40-13:20 | D1-S1-RE3 - Corpora and Annotation II (Chair: *TBD*) Zoom: Link03 - Virtual Room2 | 1105 | Multi-Tiered Cantonese Word Segmentation | Charles Lam, Chaak-ming Lau and Jackson L. Lee |
| | | 1168 | MDS: A Fine-Grained Dataset for Multi-Modal Dialogue Summarization | Zhipeng Liu, Xiaoming Zhang, Litian Zhang and Zelong Yu |
| | | 1374 | MedQA-SWE - A Clinical Question & Answer Dataset for Swedish | Niclas Hertzberg and Anna Lokrantz |
| 12:40-13:20 | D1-S1-RE3 - Corpora and Annotation III (Chair: *TBD*) Zoom: Link03 - Virtual Room3 | 1521 | XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates | Haopeng Zhang, Hayate Iso, Sairam Gurajada and Nikita Bhutani |
| | | 2572 | InferBR: a Natural Language Inference Dataset in Portuguese | Luciana Bencke, Francielle Vasconcellos Pereira, Moniele Kunrath Santos and Viviane Moreira |
| | | 2175 | Limitations of Human Identification of Automatically Generated Text | Nadège Alavoine, Maximin Coavoux, Emmanuelle Esperanca-Rodier, Romane Gallienne, Carlos Gonzalez Gallardo, Jérôme Goulian, Jose G. Moreno, Aurélie Névéol, Didier Schwab, Vincent Segonne and Johanna Simoens |
| | | 2240 | UQA: Corpus for Urdu Question Answering | Samee Arif, Sualeha Farid, Awais Athar and Agha Ali Raza |
| 12:40-13:20 | D1-S1-RE3 - Corpora and Annotation IV (Chair: *TBD*) Zoom: Link03 - Virtual Room4 | 2314 | EPOQUE: An English-Persian Quality Estimation Dataset | Mohammed Hossein Jafari Harandi, Fatemeh Azadi, Mohammad Javad Dousti and Heshaam Faili |
| | | 2316 | GOLEM: GOld standard for Learning and Evaluation of Motifs | W. Victor Yarlott, Anurag Acharya, Diego Castro Estrada, Diana Gomez and Mark Finlayson |
| | | 2422 | Khan Academy Corpus: A multilingual corpus of Khan Academy lectures | Dominika Ďurišková, Daniela Jurášová, Matúš Žilinec, Eduard Šubert and Ondřej Bojar |
| | | 2716 | PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents | Nan Zhang, Connor Heaton, Sean Timothy Okonsky, Prasenjit Mitra and Hilal Ezgi Toraman |
| 12:40-13:20 | D1-S1-RE3 - Corpora and Annotation V (Chair: *TBD*) Zoom: Link03 - Virtual Room5 | 2857 | DiaSet: An Annotated Dataset of Arabic Conversations | Abraham Israeli, Aviv Naaman, Guy Maduel, Rawaa Makhoul, Dana Qaraeen, Amir Ejmail, Dina Lisnanskey, Julian Jubran, Shai Fine and Kfir Bar |
| | | 3067 | Reflections & Resonance: Two-Agent Partnership for Advancing LLM-based Story Annotation | Yuetian Chen and Mei Si |
| | | 3147 | German Parliamentary Corpus (GerParCor) Reloaded | Giuseppe Abrami, Mevlüt Bagci and Alexander Mehler |
| | | 1866 | Murre24: Dialect Identification of Finnish Internet Forum Messages | Olli Kuparinen |
| | | 3387 | TeClass: A Human-Annotated Relevance-based Headline Classification and Generation Dataset for Telugu | Gopichand Kanumolu, Lokesh Madasu, Nirmal Surange and Manish Shrivastava |
| 12:40-13:20 | D1-S1-RE3 - Corpora and Annotation VI (Chair: *TBD*) Zoom: Link03 - Virtual Room6 | 2692 | Universal Dependencies for Learner Russian | Alla Rozovskaya |
| | | 2665 | My Science Tutor (MyST)–a Large Corpus of Children's Conversational Speech | Sameer Pradhan, Ronald A. Cole and Wayne H. Ward |
| | | 379 | NAIST-SIC-Aligned: an Aligned English-Japanese Simultaneous Interpretation Corpus | Jinming Zhao, Katsuhito Sudoh, Satoshi Nakamura, Yuka Ko, Kosuke Doi and Ryo Fukuda |
| | | 2775 | SkOTaPA: A dataset for Skepticism Detection in Online Text after Persuasion Attempt | Smitha Muthya Sudheendra, Maral Abdollahi, Dongyeop Kang, Jisu Huh and Jaideep Srivastava |
| | | 1407 | Text Filtering Classifiers for Medium-Resource Languages | Jón Daðason and Hrafn Loftsson |
| | | 2603 | FUSE - FrUstration and Surprise Expressions: A Subtle Emotional Multimodal Language Corpus | Rajesh Titung and Cecilia Ovesdotter Alm |
| 12:40-13:20 | D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction I (Chair: *TBD*) Zoom: Link04 - Virtual Room1 | 1748 | CBT-LLM: A Chinese Large Language Model for Cognitive Behavioral Therapy-based Mental Health Question Answering | Hongbin Na |
| | | 214 | Deriving Entity-Specific Embeddings From Multi-Entity Sequences | Connor Heaton and Prasenjit Mitra |
| | | 231 | UniPCM: Universal Pre-trained Conversation Model with Task-aware Automatic Prompt | Yucheng Cai, Wentao Ma, Yuchuan Wu, Shuzheng Si, Yuan Shao, Zhijian Ou and Yongbin Li |
| | | 329 | Deep Reinforcement Learning with Hierarchical Action Exploration for Dialogue Generation | Itsugun Cho, Ryota Takahashi, Yusaku Yanase and Hiroaki Saito |
| 12:40-13:20 | D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction II (Chair: *TBD*) Zoom: Link04 - Virtual Room2 | 457 | Out-of-Domain Intent Detection Considering Multi-Turn Dialogue Contexts | Hao Lang, Yinhe Zheng, Binyuan Hui, Fei Huang and Yongbin Li |
| | | 588 | Detection, Diagnosis, and Explanation: A Benchmark for Chinese Medical Hallucination Evaluation | Chengfeng Dou, Ying Zhang, Yanyuan Chen, Zhi Jin, Wenpin Jiao, Haiyan Zhao and Yu Huang |
| | | 740 | A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation | Xiangci Li, Linfeng Song, Lifeng Jin, Haitao Mi, Jessica Ouyang and Dong Yu |
| | | 804 | Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models | Haoyu Gao, Ting-En Lin, Hangyu Li, Min Yang, Yuchuan Wu, Wentao Ma, Fei Huang and Yongbin Li |
| 12:40-13:20 | D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction III (Chair: *TBD*) Zoom: Link04 - Virtual Room3 | 1109 | CAMAL: A Novel Dataset for Multi-label Conversational Argument Move Analysis | Viet Dac Lai, Duy Ngoc Pham, Jonathan Steinberg, Jamie Mikeska and Thien Huu Nguyen |
| | | 1260 | BERT-BC: A Unified Alignment and Interaction Model over Hierarchical BERT for Response Selection | Zhenfei Yang, Beiming Yu, Yuan Cui, Shi Feng, Daling Wang and Yifei Zhang |
| | | 1472 | EmoTrans: Emotional Transition-based Model for Emotion Recognition in Conversation | Zhongquan Jian, Ante Wang, Jinsong Su, Junfeng Yao, Meihong Wang and Qingqiang Wu |
| | | 1850 | DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues | Xiang Luo, Zhiwen Tang, Jin Wang and Xuejie Zhang |
| 12:40-13:20 | D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction IV (Chair: *TBD*) Zoom: Link04 - Virtual Room4 | 1885 | EmpCRL: Controllable Empathetic Response Generation via In-Context Commonsense Reasoning and Reinforcement Learning | Mingxiu Cai, Daling Wang, Shi Feng and Yifei Zhang |
| | | 1931 | Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification | Hossam Zawbaa, Wael Rashwan, Sourav Dutta and Haytham Assem |
| | | 2043 | CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment | Feiteng Fang, Liang Zhu, Xi Feng, Jinchang Hou, Qixuan Zhao, Chengming Li, Xiping Hu, Ruifeng Xu and Min Yang |
| | | 2351 | Seeing is believing! Towards Knowledge-Infused Multi-modal Medical Dialogue Generation | Abhisek Tiwari, Shreyangshu Bera, Preeti Verma, Jaithra Varma Manthena, Sriparna Saha, Pushpak Bhattacharyya, Minakshi Dhar and Sarbajeet Tiwari |
| 12:40-13:20 | D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction V (Chair: *TBD*) Zoom: Link04 - Virtual Room5 | 2804 | How susceptible are LLMs to Logical Fallacies? | Amirreza Payandeh, Dan Pluth, Jordan Hosier, Xuesu Xiao and Vijay K. Gurbani |
| | | 2817 | Enhancing Text-to-SQL Capabilities of Large Language Models through Tailored Promptings | Zhao Tan, Xiping Liu, Qing Shu, Xi Li, Changxuan Wan, Dexi Liu, Qizhi Wan and Guoqiong Liao |
| | | 2851 | New Intent Discovery with Attracting and Dispersing Prototype | Shun Zhang, Jian Yang, Jiaqi Bai, Chaoran Yan, Tongliang Li, Zhao Yan and Zhoujun Li |
| | | 2906 | Granular Change Accuracy: A More Accurate Performance Metric for Dialogue State Tracking | Taha Aksu and Nancy Chen |
| 12:40-13:20 | D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction VI (Chair: *TBD*) Zoom: Link04 - Virtual Room6 | 2945 | Adding SPICE to Life: Speaker Profiling in Multiparty Conversations | Shivani Kumar, Rishabh Gupta, Md. Shad Akhtar and Tanmoy Chakraborty |
| | | 3012 | Beyond the Known: Investigating LLMs Performance on Out-of-Domain Intent Detection | Pei Wang, Keqing He, Yejie Wang, Xiaoshuai Song, Yutao Mou, Jingang Wang, Yunsen Xian, Xunliang Cai and Weiran Xu |
| | | 3105 | S3Prompt: Instructing the Model with Self-calibration, Self-recall and Self-aggregation to Improve In-context Learning | Junda Chen and Jianting Liu |
| | | 3337 | ASEM: Enhancing Empathy in Chatbot through Attention-based Sentiment and Emotion Modeling | Omama Hamad, Khaled Shaban and Ali Hamdi |
| 12:40-13:20 | D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction VII (Chair: *TBD*) Zoom: Link04 - Virtual Room7 | 3373 | MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking | Tianwen Tang, Tong Zhu, Haodong Liu, Yin Bai, Jia Cheng and Wenliang Chen |
| | | 1476 | What are the implications of your question? Non-Information Seeking Question-Type Identification in CNN Transcripts | Yao Sun, Anastasiia Tatlubaeva, Zhihan Li and Chester Palen-Michel |
| | | 1718 | BootTOD: Bootstrap Task-oriented Dialogue Representations by Aligning Diverse Responses | Weihao Zeng, Keqing He, Yejie Wang, Dayuan Fu and Weiran Xu |
| | | 2343 | Exploring the Impact of Human Evaluator Group on Chat-Oriented Dialogue Evaluation | Sarah E. Finch, James D. Finch and Jinho D. Choi |
| 12:40-13:20 | D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction VIII (Chair: *TBD*) Zoom: Link04 - Virtual Room8 | 3120 | LANID: LLM-assisted New Intent Discovery | Lu Fan, Jiashu Pu, Rongsheng Zhang and Xiao-Ming Wu |
| | | 1999 | Beyond Linguistic Cues: Fine-grained Conversational Emotion Recognition via Belief-Desire Modelling | Bo Xu, Longjiao Li, Wei Luo, Mehdi Naseriparsa, Zhehuan Zhao, Hongfei Lin and Feng Xia |
| | | 1988 | Automatic Coding of Contingency in Child-Caregiver Conversations | Abhishek Agrawal, Mitja Nikolaus, Benoit Favre and Abdellah Fourtassi |
| | | 1782 | Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems | Songbo Hu, Ivan Vulić, Fangyu Liu and Anna Korhonen |
| | | 942 | Combining Discourse Coherence with Large Language Models for More Inclusive, Equitable, and Robust Task-Oriented Dialogue | Katherine Atwell, Mert Inan, Anthony B. Sicilia and Malihe Alikhani |
| 12:40-13:20 | D1-S1-RE5 - Digital Humanities and Cultural Heritage (Chair: *TBD*) Zoom: Link05 - Virtual Room1 | 949 | An Unsupervised Framework for Adaptive Context-aware Simplified-Traditional Chinese Character Conversion | Wei Li, Shutan Huang and Yanqiu Shao |
| | | 1514 | Detecting Sexual Content at the Sentence Level in First Millennium Latin Texts | Thibault Clerice |
| | | 3381 | Agenda-Driven Question Generation: A Case Study in the Courtroom Domain | Yi Fung, Anoop Kumar, Aram Galstyan, Heng Ji and Prem Natarajan |
| | | 2095 | Producing a Parallel Universal Dependencies Treebank of Ancient Hebrew and Ancient Greek via Cross-Lingual Projection | Daniel G. Swanson, Bryce D. Bussert and Francis Tyers |
| 12:40-13:20 | D1-S1-RE6 - Discourse and Pragmatics (Chair: *TBD*) Zoom: Link06 - Virtual Room1 | 2367 | Action and Reaction go hand in hand! A Multi-modal Dialogue Act aided Sarcasm Identification | Mohit Singh Tomar, Tulika Saha, Abhisek Tiwari and Sriparna Saha |
| | | 2810 | Global and Local Hierarchical Prompt Tuning Framework for Multi-level Implicit Discourse Relation Recognition | Lei Zeng, Ruifang He, Haowen Sun, Jing Xu, Chang Liu and Bo Wang |
| 12:40-13:20 | D1-S1-RE7 - Document Classification, Information Retrieval and Cross-lingual Retrieval I (Chair: *TBD*) Zoom: Link07 - Virtual Room1 | 85 | Enhanced Facet Generation with LLM Editing | Joosung Lee and Jinhong Kim |
| | | 104 | Logic Rules as Explanations for Legal Case Retrieval | ZhongXiang Sun, Kepu Zhang, Weijie Yu, Haoyu Wang and Jun Xu |
| | | 142 | NER-guided Comprehensive Hierarchy-aware Prompt Tuning for Hierarchical Text Classification | Fuhan Cai, Duo Liu, Zhongqiang Zhang, Ge Liu, Xiaozhe Yang and Xiangzhong Fang |
| | | 538 | Well Begun is Half Done: An Implicitly Augmented Generative Framework with Distribution Modification for Hierarchical Text Classification | Huawen Feng, Jingsong Yan, Junlong Liu, Junhao Zheng and Qianli Ma |
| | | 2885 | Reinforcement Retrieval Leveraging Fine-grained Feedback for Fact Checking News Claims with Black-Box LLM | Xuan Zhang and Wei Gao |
| 12:40-13:20 | D1-S1-RE7 - Document Classification, Information Retrieval and Cross-lingual Retrieval II (Chair: *TBD*) Zoom: Link07 - Virtual Room2 | 726 | Coarse-Tuning for Ad-hoc Document Retrieval Using Pre-trained Language Models | Atsushi Keyaki and Ribeka Keyaki |
| | | 771 | M3: A Multi-Task Mixed-Objective Learning Framework for Open-Domain Multi-Hop Dense Sentence Retrieval | Yang Bai, Anthony Colas, Christan Grant and Zhe Wang |
| | | 860 | KnowVrDU: A Unified Knowledge-aware Prompt-Tuning Framework for Visually-rich Document Understanding | Yunqi Zhang, Yubo Chen, Jingzhe Zhu, Jinyu Xu, Shuai Yang, Zhaoliang Wu, Liang Huang, Yongfeng Huang and Shuai Chen |
| | | 1199 | Multimodal Cross-lingual Phrase Retrieval | Chuanqi Dong, Wenjie Zhou, Xiangyu Duan, Yuqi Zhang and Min Zhang |
| 12:40-13:20 | D1-S1-RE7 - Document Classification, Information Retrieval and Cross-lingual Retrieval III (Chair: *TBD*) Zoom: Link07 - Virtual Room3 | 1236 | IDC: Boost Text-to-image Retrieval via Indirect and Direct Connections | Guowei Ge, Kuangrong Hao and Lingguang Hao |
| | | 1246 | Pre-training Cross-Modal Retrieval by Expansive Lexicon-Patch Alignment | Yang Yiyuan, Guodong Long, Michael Blumenstein, Xiubo Geng, Chongyang Tao, Tao Shen and Daxin Jiang |
| | | 1458 | Tackling Long Code Search with Splitting, Encoding, and Aggregating | Fan Hu, Yanlin Wang, Lun Du, Hongyu Zhang, Dongmei Zhang and Xirong Li |
| | | 1479 | ILCiteR: Evidence-grounded Interpretable Local Citation Recommendation | Sayar Ghosh Roy and Jiawei Han |
| 12:40-13:20 | D1-S1-RE7 - Document Classification, Information Retrieval and Cross-lingual Retrieval IV (Chair: *TBD*) Zoom: Link07 - Virtual Room4 | 1493 | Recommending Missed Citations Identified by Reviewers: A New Task, Dataset and Baselines | Kehan Long, Shasha Li, Pancheng Wang, Chenlong Bao, Jintao Tang and Ting Wang |
| | | 1837 | Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets | Ivan Sedykh, Nikita Sorokin, Dmitry Abulkhanov, Sergey I. Nikolenko and Valentin Malykh |
| | | 2740 | ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval | Yuanhang Zheng, Peng Li, Wei Liu, Yang Liu, Jian Luan and Bin Wang |
| | | 2657 | Automatic Authorship Analysis in Human-AI Collaborative Writing | Aquia Richburg, Calvin Bao and Marine Carpuat |
| 12:40-13:20 | D1-S1-RE7 - Document Classification, Information Retrieval and Cross-lingual Retrieval V (Chair: *TBD*) Zoom: Link07 - Virtual Room5 | 2738 | Knowledge Enhanced Pre-training for Cross-lingual Dense Retrieval | Hang Zhang, Yeyun Gong, Dayiheng Liu, Shunyu Zhang, Xingwei He, Jiancheng Lv and Jian Guo |
| | | 2894 | PLAES: Prompt-generalized and Level-aware Learning Framework for Cross-prompt Automated Essay Scoring | Yuan Chen and Xia Li |
| | | 2899 | HYRR: Hybrid Infused Reranking for Passage Retrieval | Jing Lu, Keith Hall, Ji Ma and Jianmo Ni |
| | | 3253 | Event-enhanced Retrieval in Real-time Search | Yanan Zhang, Xiaoling Bai and Tianhua Zhou |
| | | 1078 | IR2: Information Regularization for Information Retrieval | Jianyou Wang, Kaicheng Wang, Xiaoyue Wang, Weili Cao, Ramamohan Paturi and Leon Bergen |
| 12:40-13:20 | D1-S1-RE8 - Evaluation and Validation Methodologies I (Chair: *TBD*) Zoom: Link08 - Virtual Room1 | 581 | ChatGPT Rates Natural Language Explanation Quality Like Humans: But on Which Scales? | Fan Huang, Haewoon Kwak, Kunwoo Park and Jisun An |
| | | 649 | Evaluation of Really Good Grammatical Error Correction | Robert Östling, Katarina Gillholm, Murathan Kurfalı, Marie Mattson and Mats Wirén |
| | | 867 | SignBLEU: Automatic Evaluation of Multi-channel Sign Language Translation | Jung-Ho Kim, Mathew John Huerta-Enochian, Changyong Ko and Du Hui Lee |
| | | 1026 | Keyphrase Generation: Lessons from a Reproducibility Study | Edwin Thomas and Sowmya Vajjala |
| 12:40-13:20 | D1-S1-RE8 - Evaluation and Validation Methodologies II (Chair: *TBD*) Zoom: Link08 - Virtual Room2 | 1254 | Prompting Large Language Models for Counterfactual Generation: An Empirical Study | Yongqi Li, Mayi Xu, Xin Miao, Shen Zhou and Tieyun Qian |
| | | 1299 | How Good Are LLMs at Out-of-Distribution Detection? | Bo Liu, Li-Ming Zhan, Zexin Lu, Yujie Feng, Lei Xue and Xiao-Ming Wu |
| | | 2735 | Is LLM a Reliable Reviewer? A Comprehensive Evaluation of LLM on Automatic Paper Reviewing Tasks | Ruiyang Zhou, Lu Chen and Kai Yu |
| | | 2900 | Can multiple-choice questions really be useful in detecting the abilities of LLMs? | Wangyue Li, Liangzhi Li, Tong Xiang, Xiao Liu, Wei Deng and Noa Garcia |
| 12:40-13:20 | D1-S1-RE8 - Evaluation and Validation Methodologies III (Chair: *TBD*) Zoom: Link08 - Virtual Room3 | 2964 | Benchmarking Large Language Models for Persian: A Preliminary Study Focusing on ChatGPT | Amirhossein Abaskohi, Sara Baruni, Mostafa Masoudi, Nesa Abbasi, Mohammad Hadi Babalou, Ali Edalat, Sepehr Kamahi, Samin Mahdizadeh Sani, Nikoo Naghavian, Danial Namazifard, Pouya Sadeghi and Yadollah Yaghoobzadeh |
| | | 3111 | LFED: A Literary Fiction Evaluation Dataset for Large Language Models | Linhao Yu, Qun Liu and Deyi Xiong |
| | | 3303 | REFeREE: A REference-FREE Model-Based Metric for Text Simplification | Yichen Huang and Ekaterina Kochmar |
| | | 3361 | Who Said What: Formalization and Benchmarks for the Task of Quote Attribution | Wenjie Zhong, Jason Naradowsky, Hiroya Takamura, Ichiro Kobayashi and Yusuke Miyao |
| 12:40-13:20 | D1-S1-RE8 - Evaluation and Validation Methodologies IV (Chair: *TBD*) Zoom: Link08 - Virtual Room4 | 3425 | Measuring Cross-Text Cohesion for Segmentation Similarity Scoring | Gerardo Ocampo Diaz and Jessica Ouyang |
| | | 28 | Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem | YuHong Sun, Zhangyue Yin, Qipeng Guo, Jiawen Wu, Xipeng Qiu and Hui Zhao |
| | | 1007 | Assessing the Efficacy of Grammar Error Correction: A Human Evaluation Approach in the Japanese Context | Qiao Wang and Zheng Yuan |
| | | 1548 | Transfer Fine-tuning for Quality Estimation of Text Simplification | Yuki Hironaka, Tomoyuki Kajiwara and Takashi Ninomiya |
| 12:40-13:20 | D1-S1-RE8 - Evaluation and Validation Methodologies V (Chair: *TBD*) Zoom: Link08 - Virtual Room5 | 1873 | Meta-Cognitive Analysis: Evaluating Declarative and Procedural Knowledge in Datasets and Large Language Models | Zhuoqun Li, Hongyu Lin, Yaojie Lu, Hao Xiang, Xianpei Han and Le Sun |
| | | 2436 | A Typology of Errors for User Utterances in Chatbots | Anu Singh and Esme Manandise |
| | | 2488 | New Evaluation Methodology for Qualitatively Comparing Classification Models | Ahmad Aljanaideh |
| | | 2803 | Towards Human-aligned Evaluation for Linear Programming Word Problems | Linzi Xing, Xinglu Wang, Yuxi Feng, Zhenan Fan, Jing Xiong, Zhijiang Guo, Xiaojin Fu, Rindra Ramamonjison, Mahdi Mostajabdaveh, Xiongwei Han, Zirui Zhou and Yong Zhang |
| 12:40-13:20 | D1-S1-RE9 - Inference, Reasoning, Question Answering I (Chair: *TBD*) Zoom: Link09 - Virtual Room1 | 361 | LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles | Shulin Huang, Shirong Ma, Yinghui Li, Mengzuo Huang, Wuhe Zou, Weidong Zhang and Haitao Zheng |
| | | 417 | No Need for Large-Scale Search: Exploring Large Language Models in Complex Knowledge Base Question Answering | Shouhui Wang and Biao Qin |
| | | 634 | PRIMO: Progressive Induction for Multi-hop Open Rule Generation | Jianyu Liu, Sheng Bi and Guilin Qi |
| | | 782 | Towards Graph-hop Retrieval and Reasoning in Complex Question Answering over Textual Database | Minjun Zhu, Yixuan Weng, Shizhu He, Kang Liu, Haifeng Liu, Yang Jun Jun and Jun Zhao |
| 12:40-13:20 | D1-S1-RE9 - Inference, Reasoning, Question Answering II (Chair: *TBD*) Zoom: Link09 - Virtual Room2 | 992 | KET-QA: A Dataset for Knowledge Enhanced Table Question Answering | Mengkang Hu, Haoyu Dong, Ping Luo, Shi Han and Dongmei Zhang |
| | | 1046 | APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning | Jiashuo Sun, Hang Zhang, Chen Lin, Xiangdong Su, Yeyun Gong and Jian Guo |
| | | 1181 | Can Language Models Learn Embeddings of Propositional Logic Assertions? | Nurul Fajrin Ariyani, Zied Bouraoui, Richard Booth and Steven Schockaert |
| | | 1197 | Prompting Explicit and Implicit Knowledge for Multi-hop Question Answering Based on Human Reading Process | Guangming Huang, Yunfei Long, Cunjin Luo, Jiaxing Shen and Xia Sun |
| 12:40-13:20 | D1-S1-RE9 - Inference, Reasoning, Question Answering III (Chair: *TBD*) Zoom: Link09 - Virtual Room3 | 1478 | KC-GenRe: A Knowledge-constrained Generative Re-ranking Method Based on Large Language Models for Knowledge Graph Completion | Yilin Wang, Minghao Hu, Zhen Huang, Dongsheng Li, Dong Yang and Xicheng Lu |
| | | 1662 | Empowering Tree-structured Entailment Reasoning: Rhetorical Perception and LLM-driven Interpretability | Longyin Zhang, Bowei Zou and Ai Ti Aw |
| | | 1805 | RankPrompt: Step-by-Step Comparisons Make Language Models Better Reasoners | Chi Hu, Yuan Ge, Xiangnan Ma, Hang Cao, Qiang Li, Yonghua Yang, Tong Xiao and Jingbo Zhu |
| | | 2171 | Leros: Learning Explicit Reasoning on Synthesized Data for Commonsense Question Answering | Chenhao Wang, Pengfei Cao, Jiachun Li, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Li Qiuxia and Jun Zhao |
| 12:40-13:20 | D1-S1-RE9 - Inference, Reasoning, Question Answering IV (Chair: *TBD*) Zoom: Link09 - Virtual Room4 | 2722 | Enhancing Large Language Models through Transforming Reasoning Problems into Classification Tasks | Tarun Raheja, Raunak Sinha, Advit Deepak, Will Healy, Jayanth Srinivasa, Myungjin Lee and Ramana Kompella |
| | | 2826 | Robust and Scalable Model Editing for Large Language Models | Yingfa Chen, Zhengyan Zhang, Xu Han, Chaojun Xiao, Zhiyuan Liu, Chen Chen, Kuai Li, Tao Yang and Maosong Sun |
| | | 2974 | Step Feasibility-Aware and Error-Correctable Entailment Tree Generation | Junyue Song, Xin Wu and Yi Cai |
| | | 3094 | QDMR-based Planning-and-Solving Prompting for Complex Reasoning Tasks | Jinfeng Huang, Qiaoqiao She, Wenbin Jiang, Hua Wu, Yang Hao, Tong Xu and Feng Wu |
| 12:40-13:20 | D1-S1-RE9 - Inference, Reasoning, Question Answering V (Chair: *TBD*) Zoom: Link09 - Virtual Room5 | 3109 | What Factors Influence LLMs' Judgments? A Case Study on Question Answering | Lei Chen, Bobo Li, Li Zheng, Haining Wang, Zixiang Meng, Runfeng Shi, Hao Fei, Jun Zhou, Fei Li, Chong Teng and Donghong Ji |
| | | 3186 | An Event-based Abductive Learning for Hard Time-sensitive Question Answering | Shaojuan Wu, Jitong Li, Xiaowang Zhang and Zhiyong Feng |
| | | 337 | ControversialQA: Exploring Controversy in Question Answering | Zhen Wang, Peide Zhu and Jie Yang |
| | | 452 | Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives | Qiushi Sun, Chengcheng Han, Nuo Chen, Renyu Zhu, Jingyang Gong, Xiang Li and Ming Gao |
| 12:40-13:20 | D1-S1-RE9 - Inference, Reasoning, Question Answering VI (Chair: *TBD*) Zoom: Link09 - Virtual Room6 | 1448 | Abstract-level Deductive Reasoning for Pre-trained Language Models | Xin Wu, Yi Cai and Ho-fung Leung |
| | | 1558 | Biomedical Entity Linking as Multiple Choice Question Answering | Zhenxi Lin, Ziheng Zhang, Xian Wu and Yefeng Zheng |
| | | 2955 | Probe then Retrieve and Reason: Distilling Probing and Reasoning Capabilities into Smaller Language Models | Yichun Zhao, Shuheng Zhou and Huijia Zhu |
| | | 2604 | Dealing with Data Scarcity in Spoken Question Answering | Merve Ünlü Menevşe, Yusufcan Manav, Ebru Arisoy and Arzucan Özgür |
| 12:40-13:20 | D1-S1-RE9 - Inference, Reasoning, Question Answering VII (Chair: *TBD*) Zoom: Link09 - Virtual Room7 | 1777 | MinT: Boosting Generalization in Mathematical Reasoning via Multi-view Fine-tuning | Zhenwen Liang, Dian Yu, Xiaoman Pan, Wenlin Yao, Qingkai Zeng, Xiangliang Zhang and Dong Yu |
| | | 1608 | Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought | Jooyoung Lee, Fan Yang, Thanh Tran, Qian Hu, Emre Barut and Kai-Wei Chang |
| | | 3294 | Find-the-Common: A Benchmark for Explaining Visual Patterns from Images | Yuting Shi, Naoya Inoue, Houjing Wei, Yufeng Zhao and Tao Jin |
| | | 2354 | Self-Improvement Programming for Temporal Knowledge Graph Question Answering | Zhuo Chen, Zhao Zhang, Zixuan Li, Fei Wang, Yutao Zeng, Xiaolong Jin and Yongjun Xu |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining I (Chair: *TBD*) Zoom: Link10 - Virtual Room1 | 94 | CARE: Co-Attention Network for Joint Entity and Relation Extraction | Wenjun Kong and Yamei Xia |
| | | 287 | Know-Adapter: Towards Knowledge-Aware Parameter-Efficient Transfer Learning for Few-shot Named Entity Recognition | Binling Nie, Yiming Shao and Yigang Wang |
| | | 347 | Event Representation Learning with Multi-Grained Contrastive Learning and Triple-Mixture of Experts | Tianqi Hu, Lishuang Li, Xueyang Qin and Yubo Feng |
| | | 431 | Federated Document-Level Biomedical Relation Extraction with Localized Context Contrast | Yan Xiao, Yaochu Jin and Kuangrong Hao |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining II (Chair: *TBD*) Zoom: Link10 - Virtual Room2 | 503 | ChatEL: Entity Linking with Chatbots | Yifan Ding, Qingkai Zeng and Tim Weninger |
| | | 512 | Relation Classification via Bidirectional Prompt Learning with Data Augmentation by Large Language Model | Yizhi Jiang, Jinlong Li and Huanhuan Chen |
| | | 589 | MCIL: Multimodal Counterfactual Instance Learning for Low-resource Entity-based Multimodal Information Extraction | Baohang Zhou, Ying Zhang, Kehui Song, Hongru Wang, Yu Zhao, Xuhui Sui and Xiaojie Yuan |
| | | 636 | Extracting Financial Events from Raw Texts via Matrix Chunking | Yusheng Huang, Ning Hu, Kunping Li, Nan Wang and Zhouhan Lin |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining III (Chair: *TBD*) Zoom: Link10 - Virtual Room3 | 786 | Prompt Tuning for Few-shot Relation Extraction via Modeling Global and Local Graphs | Zirui Zhang, Yiyu Yang and Benhui Chen |
| | | 909 | A Regularization-based Transfer Learning Method for Information Extraction via Instructed Graph Decoder | Kedi Chen, Jie Zhou, Qin Chen, Shunyu Liu and Liang He |
| | | 935 | On Leveraging Encoder-only Pre-trained Language Models for Effective Keyphrase Generation | Di Wu, Wasi U. Ahmad and Kai-Wei Chang |
| | | 950 | A Streamlined Span-based Factorization Method for Few Shot Named Entity Recognition | Wenjie Xu, Yidan Chen and Jianquan Ouyang |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining IV (Chair: *TBD*) Zoom: Link10 - Virtual Room4 | 994 | Making Pre-trained Language Models Better Continual Few-Shot Relation Extractors | Shengkun Ma, Jiale Han, Yi Liang and Bo Cheng |
| | | 1107 | Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches | Deepak Gupta, Kush Attal and Dina Demner-Fushman |
| | | 1222 | KCL: Few-shot Named Entity Recognition with Knowledge Graph and Contrastive Learning | Shan Zhang, Bin Cao and Jing Fan |
| | | 1264 | TECA: A Two-stage Approach with Controllable Attention Soft Prompt for Few-shot Nested Named Entity Recognition | Yuanyuan Xu, Linhai Zhang and Deyu Zhou |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining V (Chair: *TBD*) Zoom: Link10 - Virtual Room5 | 1277 | MRC-based Nested Medical NER with Co-prediction and Adaptive Pre-training | Xiaojing Du, Hanjie Zhao, Danyan Xing, Yuxiang Jia and Hongying Zan |
| | | 1343 | Extracting Social Determinants of Health from Pediatric Patient Notes Using Large Language Models: Novel Corpus and Methods | Yujuan Fu, Giridhar Kaushik Ramachandran, Nicholas J. Dobbins, Namu Park, Michael Leu, Abby R. Rosenberg, Kevin Lybarger, Fei Xia, Özlem Uzuner and Meliha Yetisgen |
| | | 1422 | Hierarchical Selection of Important Context for Generative Event Causality Identification with Optimal Transports | Hieu Man, Chien Van Nguyen, Nghia Trung Ngo, Linh Ngo, Franck Dernoncourt and Thien Huu Nguyen |
| | | 1438 | Document-Level Event Extraction via Information Interaction Based on Event Relation and Argument Correlation | Bangze Pan, Yang Li, Suge Wang, Xiaoli Li, Deyu Li, Jian Liao and Jianxing Zheng |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining VI (Chair: *TBD*) Zoom: Link10 - Virtual Room6 | 1443 | Enhancing Phrase Representation by Information Bottleneck Guided Text Diffusion Process for Keyphrase Extraction | Yuanzhen Luo, Qingyu Zhou and Feng Zhou |
| | | 1511 | Few-Shot Relation Extraction with Hybrid Visual Evidence | Jiaying Gong and Hoda Eldardiry |
| | | 1518 | ESCP: Enhancing Emotion Recognition in Conversation with Speech and Contextual Prefixes | Xiujuan Xu, Xiaoxiao Shi, Zhehuan Zhao and Yu Liu |
| | | 1559 | HS-GC: Holistic Semantic Embedding and Global Contrast for Effective Text Clustering | Chen Yang, Bin Cao and Jing Fan |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining VII (Chair: *TBD*) Zoom: Link10 - Virtual Room7 | 1564 | DocScript: Document-level Script Event Prediction | Puneet Mathur, Vlad I. Morariu, Aparna Garimella, Franck Dernoncourt, Jiuxiang Gu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha and Rajiv Jain |
| | | 1894 | Improving multi-view document clustering: leveraging multi-structure processor and hybrid ensemble clustering module | Ruina Bai and Qi Bai |
| | | 1914 | Hierarchical Topic Modeling via Contrastive Learning and Hyperbolic Embedding | Zhicheng Lin, HeGang Chen, Yuyin Lu, Yanghui Rao, Hao Xu and Hanjiang Lai |
| | | 2010 | On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction | Jianwei Wang, Tianyin Wang and Ziqian Zeng |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining VIII (Chair: *TBD*) Zoom: Link10 - Virtual Room8 | 2046 | Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents | Hao Wang, Tang Li, Chenhui Chu, Rui Wang and Pinpin Zhu |
| | | 2078 | MixRED: A Mix-lingual Relation Extraction Dataset | Lingxing Kong, Yougang Chu, Zheng Ma, Jianbing Zhang, Liang He and Jiajun Chen |
| | | 2255 | Distilling Causal Effect of Data in Continual Few-shot Relation Learning | Weihang Ye, Peng Zhang, Jing Zhang, Hui Gao and Moyao Wang |
| | | 2312 | A Closer Look at Clustering Bilingual Comparable Corpora | Anna Laskina, Eric Gaussier and Gaelle Calvary |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining IX (Chair: *TBD*) Zoom: Link10 - Virtual Room9 | 2277 | Enhancing Knowledge Selection via Multi-level Document Semantic Graph | Haoran Zhang and Tan Yongmei |
| | | 2424 | Efficient and Accurate Contextual Re-Ranking for Knowledge Graph Question Answering | Kexuan Sun, Nicolaas Paul Jedema, Karishma Sharma, Ruben Janssen, Jay Pujara, Pedro Szekely and Alessandro Moschitti |
| | | 2680 | CWTM: Leveraging Contextualized Word Embeddings from BERT for Neural Topic Modeling | Zheng Fang, Yulan He and Rob Procter |
| | | 2707 | Class-Incremental Few-Shot Event Detection | Kailin Zhao, Xiaolong Jin, Long Bai, Jiafeng Guo and Xueqi Cheng |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining X (Chair: *TBD*) Zoom: Link10 - Virtual Room10 | 2749 | Can We Learn Question, Answer, and Distractors All From An Image? A New Task For Multiple-choice Visual Question Answering | Wenjian Ding, Yao Zhang, Jun Wang, Adam Jatowt and Zhenglu Yang |
| | | 2756 | Continual Few-shot Event Detection via Hierarchical Augmentation Networks | Chenlong Zhang, Pengfei Cao, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun and Jun Zhao |
| | | 2757 | Prompt-based Zero-shot Relation Extraction with Semantic Knowledge Augmentation | Jiaying Gong and Hoda Eldardiry |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining XI (Chair: *TBD*) Zoom: Link10 - Virtual Room11 | 2842 | WkNER: Enhancing Named Entity Recognition with Word Segmentation Constraints and kNN Retrieval | Yanchun Li, Senlin Deng, Dongsu Shen, Shujuan Tian and Saiqin Long |
| | | 2971 | Enhancing Cross-Document Event Coreference Resolution by Discourse Structure and Semantic Information | Qiang Gao, Bobo Li, Zixiang Meng, Yunlong Li, Jun Zhou, Fei Li, Chong Teng and Donghong Ji |
| | | 2639 | Towards Few-shot Entity Recognition in Document Images: A Graph Neural Network Approach Robust to Image Manipulation | Prashant Krishnan, Zilong Wang, Yangkun Wang and Jingbo Shang |
| | | 3079 | TacoERE: Cluster-aware Compression for Event Relation Extraction | Yong Guan, Xiaozhi Wang, Lei Hou, Juanzi Li, Jeff Z. Pan, Jiaoyan Chen and Freddy Lecue |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining XII (Chair: *TBD*) Zoom: Link10 - Virtual Room12 | 3099 | Emancipating Event Extraction from the Constraints of Long-Tailed Distribution Data Utilizing Large Language Models | Zhigang Kan, Liwen Peng, Linbo Qiao and Dongsheng Li |
| | | 3152 | Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge Extraction | Ziyang Xu, Keqin Peng, Liang Ding, Dacheng Tao and Xiliang Lu |
| | | 3285 | LA-UCL: LLM-Augmented Unsupervised Contrastive Learning Framework for Few-Shot Text Classification | Jing Zhang, Hui Gao, Peng Zhang, Boda Feng, Wenmin Deng and Yuexian Hou |
| | | 3328 | Knowledge-enhanced Prompt Tuning for Dialogue-based Relation Extraction with Trigger and Label Semantic | Hao An, Zhihong Zhu, Xuxin Cheng, Zhiqi Huang and Yuexian Zou |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining XIII (Chair: *TBD*) Zoom: Link10 - Virtual Room13 | 3378 | Leveraging Linguistically Enhanced Embeddings for Open Information Extraction | Fauzan Nayeem Farooqui, Thanmay Jayakumar, Pulkit Mathur and Mansi A. Radke |
| | | 42 | ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models | Jun Xu, Mengshu Sun, Zhiqiang Zhang and Jun Zhou |
| | | 584 | Enhancing Distantly Supervised Named Entity Recognition with Strong Label Guided Lottery Training | Zhiyuan Ma, Jintao Du, Changhua Meng and Weiqiang Wang |
| | | 846 | CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction | Xiang Wei, Yufeng Chen, Ning Cheng, Xingyu Cui, Jinan Xu and Wenjuan Han |
| 12:40-13:20 | D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining XIV (Chair: *TBD*) Zoom: Link10 - Virtual Room14 | 1641 | BKEE: Pioneering Event Extraction in the Vietnamese Language | Thi-Nhung Nguyen, Bang Tien Tran, Trong-Nghia Luu, Thien Huu Nguyen and Kiem-Hieu Nguyen |
| | | 1997 | Zero-shot Event Detection using a Textual Entailment Model as an Enhanced Annotator | Ziqian Zeng, Runyu Wu, Yuxiang Xiao, Xiaoda Zhong, Hanlin Wang, Zhengdong Lu and Huiping Zhuang |
| | | 2308 | Analyzing Large Language Models' Capability in Location Prediction | Zhaomin Xiao, Eduardo Blanco and Yan Huang |
| | | 2609 | Demonstration Retrieval-Augmented Generative Event Argument Extraction | Shiming He, Yu Hong, Shuai Yang, Jianmin Yao and Guodong Zhou |
| 12:40-13:20 | D1-S1-RE11 - Integrated Systems and Applications I (Chair: *TBD*) Zoom: Link11 - Virtual Room1 | 148 | Knowledge-aware Attention Network for Medication Effectiveness Prediction | Yingying Zhang, Xian Wu, Yu Zhang and Yefeng Zheng |
| | | 551 | Continuous Relational Diffusion driven Topic Model with Multi-grained Text for Microblog | Chenhao Wu, Ruifang He, Chang Liu and Bo Wang |
| | | 623 | TransCoder: Towards Unified Transferable Code Representation Learning Inspired by Human Skills | Qiushi Sun, Nuo Chen, Jianing Wang, Ming Gao and Xiang Li |
| | | 690 | CoBaLD Annotation: the Enrichment of the Enhanced Universal Dependencies with the Semantical Pattern | Maria Andreevna Petrova, Alexandra M. Ivoylova and Anastasia Tishchenkova |
| 12:40-13:20 | D1-S1-RE11 - Integrated Systems and Applications II (Chair: *TBD*) Zoom: Link11 - Virtual Room2 | 2063 | AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework | Xiang Li, Zhenyu Li, Chen Shi, Yong Xu, Qing Du, Mingkui Tan and Jun Huang |
| | | 2198 | A Trusted Multi-View Evidential Fusion Framework for Commonsense Reasoning | Shuo Yang |
| | | 3257 | LM-Combiner: A Contextual Rewriting Model for Chinese Grammatical Error Correction | Yixuan Wang, Baoxin Wang, Yijun Liu, Dayong Wu and Wanxiang Che |
| | | 1202 | First Steps Towards the Integration of Resources on Historical Glossing Traditions in the History of Chinese: A Collection of Standardized Fǎnqiè Spellings from the Guǎngyùn | Michele Pulini and Johann-Mattis List |
| | | 2825 | AuRoRA: A One-for-all Platform for Augmented Reasoning and Refining with Task-Adaptive Chain-of-Thought Prompting | Anni Zou, Zhuosheng Zhang and Hai Zhao |
| 12:40-13:20 | D1-S1-RE12 - Knowledge Discovery / Representation I (Chair: *TBD*) Zoom: Link12 - Virtual Room1 | 180 | Deep Reinforcement Learning-based Dialogue Policy with Graph Convolutional Q-network | Kai Xu, Zhengyu Wang, Yuxuan Long and Qiaona Zhao |
| | | 387 | A Decade of Scholarly Research on Open Knowledge Graphs | Houcemeddine Turki, Abraham Toluwase Owodunni, Mohamed Ali Hadj Taieb, René Fabrice Bile and Mohamed Ben Aouicha |
| | | 462 | Unleashing the Power of Imbalanced Modality Information for Multi-modal Knowledge Graph Completion | Yichi Zhang, Zhuo Chen, Lei Liang, Huajun Chen and Wen Zhang |
| | | 801 | Bring Invariant To Variant: A Contrastive Prompt-based Framework for Temporal Knowledge Graph Forecasting | Ying Zhang, Xinying Qian, Yu Zhao, Baohang Zhou, Kehui Song and Xiaojie Yuan |
| 12:40-13:20 | D1-S1-RE12 - Knowledge Discovery / Representation II (Chair: *TBD*) Zoom: Link12 - Virtual Room2 | 1307 | Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models | Derong Xu, Ziheng Zhang, Zhenxi Lin, Xian Wu, Zhihong Zhu, Tong Xu, Xiangyu Zhao, Yefeng Zheng and Enhong Chen |
| | | 1410 | Construction of Paired Knowledge Graph - Text Datasets Informed by Cyclic Evaluation | Ali Mousavi, Xin Zhan, He Bai, Peng Shi, Theodoros Rekatsinas, Benjamin Han, Yunyao Li, Jeffrey Pound, Joshua M. Susskind, Natalie Schluter, Ihab F. Ilyas and Navdeep Jaitly |
| | | 1597 | DET: A Dual-Encoding Transformer for Relational Graph Embedding | Lingbing Guo, Zhuo Chen, Jiaoyan Chen, Qiang Zhang and Huajun Chen |
| | | 1601 | Prior Relational Schema Assists Effective Contrastive Learning for Inductive Knowledge Graph Completion | Ruilin Luo, Jiayi Li, Jianghangfan Zhang, Jing Xiao and Yujiu Yang |
| 12:40-13:20 | D1-S1-RE12 - Knowledge Discovery / Representation III (Chair: *TBD*) Zoom: Link12 - Virtual Room3 | 1859 | Self-Knowledge Distillation for Knowledge Graph Embedding | Haotian Xu, Yuhua Wang and Jiahui Fan |
| | | 2745 | Hyperbolic Graph Neural Network for Temporal Knowledge Graph Completion | Yancong Li, Xiaoming Zhang, Ying Cui and Shuai Ma |
| | | 2752 | Prompt-fused framework for Inductive Logical Query Answering | Zezhong Xu, Wen Zhang, Peng Ye, Lei Liang and Huajun Chen |
| | | 3452 | Hypergraph-Based Session Modeling: A Multi-Collaborative Self-Supervised Approach for Enhanced Recommender Systems | Xiangping Zheng, Bo Wu, Alex X. Zhang and Wei Li |
| | | 1117 | Access control framework for language collections | Ben Foley, Peter Sefton, Simon Musgrave and Moises Sacal Bonequi |
| 12:40-13:20 | D1-S1-RE13 - Language Modeling I (Chair: *TBD*) Zoom: Link13 - Virtual Room1 | 93 | TRELM: Towards Robust and Efficient Pre-training for Knowledge-Enhanced Language Models | Junbing Yan, Chengyu Wang, Taolin Zhang, Xiaofeng He, Jun Huang, Wei Zhang, Longtao Huang and Hui Xue |
| | | 223 | Enhancing Parameter-efficient Fine-tuning with Simple Calibration based on Stable Rank | Peiyu Liu, Ze-Feng Gao, Xiao Zhang, Wayne Xin Zhao and Ji-Rong Wen |
| | | 335 | Sinkhorn Distance Minimization for Knowledge Distillation | Xiao Cui, Yulei Qin, Yuting Gao, Enwei Zhang, Zihan Xu, Tong Wu, Ke Li, Xing Sun, Wengang Zhou and Houqiang Li |
| | | 408 | EpiGEN: An Efficient Multi-Api Code GENeration Framework under Enterprise Scenario | Sijie Li, Sha Li, Hao Zhang, Shuyang Li, Kai Chen, Jianyong Yuan, Yi Cao and Lvqing Yang |
| 12:40-13:20 | D1-S1-RE13 - Language Modeling II (Chair: *TBD*) Zoom: Link13 - Virtual Room2 | 451 | MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property | Shiwen Ni, Minghuan Tan, Yuelin Bai, Fuqiang Niu, Min Yang, Bowen Zhang, Ruifeng Xu, Xiaojun Chen, Chengming Li and Xiping Hu |
| | | 516 | Look before You Leap: Dual Logical Verification for Knowledge-based Visual Question Generation | Xumeng Liu, Wenya Guo, Ying Zhang, Xubo Liu, Yu Zhao, Shenglong Yu and Xiaojie Yuan |
| | | 929 | Exploring and Mitigating Shortcut Learning for Generative Large Language Models | Zechen Sun, Yisheng Xiao, Juntao Li, Yixin Ji, Wenliang Chen and Min Zhang |
| | | 932 | Token-length Bias in Minimal-pair Paradigm Datasets | Naoya Ueda, Masato Mita, Teruaki Oka and Mamoru Komachi |
| 12:40-13:20 | D1-S1-RE13 - Language Modeling III (Chair: *TBD*) Zoom: Link13 - Virtual Room3 | 962 | Mixture-of-LoRAs: An Efficient Multitask Tuning Method for Large Language Models | Wenfeng Feng, Chuzhan Hao, Yuewei Zhang, Yu Han and Hao Wang |
| | | 1023 | Structure-aware Fine-tuning for Code Pre-trained Models | Jiayi Wu, Renyu Zhu, Nuo Chen, Qiushi Sun, Xiang Li and Ming Gao |
| | | 1250 | Enhancing Hindi Feature Representation Through Fusion of Dual-Script Word Embeddings | Lianxi Wang, Yujia Tian and Zhuowei Chen |
| | | 1903 | GPT-SW3: An Autoregressive Language Model for the Scandinavian Languages | Ariel Ekgren, Amaru Cuba Gyllensten, Felix Stollenwerk, Joey Öhman, Tim Isbister, Evangelia Gogoulou, Fredrik Carlsson, Judit Casademont and Magnus Sahlgren |
| 12:40-13:20 | D1-S1-RE13 - Language Modeling IV (Chair: *TBD*) Zoom: Link13 - Virtual Room4 | 2305 | Analyzing Occupational Distribution Representation in Japanese Language Models | Katsumi Ibaraki, Winston Wu, Lu Wang and Rada Mihalcea |
| | | 2725 | Improving Bengali and Hindi Large Language Models | Arif Shahriar and Denilson Barbosa |
| | | 2833 | Do Emergent Abilities Exist in Quantized Large Language Models: An Empirical Study | Peiyu Liu, Zikang Liu, Ze-Feng Gao, Dawei Gao, Wayne Xin Zhao, Yaliang Li, Bolin Ding and Ji-Rong Wen |
| | | 3085 | Representation Degeneration Problem in Prompt-based Models for Natural Language Understanding | Qingyan Zhao, Ruifang He, Jinpeng Zhang, Chang Liu and Bo Wang |
| | | 2269 | Sequence Reducible Holdout Loss for Language Model Pretraining | Raghuveer Thirukovalluru, Nicholas Monath, Bhuwan Dhingra and Sam Wiseman |
| 13:20 - 14:40 | Lunch | | | |
| | | | | |
| | | | | |
| | | | | |
| 14:40 - 15:40 | Keynote Speaker 1: Roger Levy (Room Auditorium G. Agnelli + Broadcast to other rooms) - Chair: Veronique Hoste | | | |
| | | | | |
| | | | | |
| | | | | |
| 15:50 - 16:10 | D1-S3-R1 - Multimodal Applications, Grounded Language Acquisition, and HRI I (Chair: Nikhil Krishnaswamy) Room: Auditorium G. Agnelli | 212 | Seeing Eye-to-Eye: Cross-Modal Coherence Relations Inform Eye-gaze Patterns During Comprehension & Production | Mert Inan and Malihe Alikhani |
| 16:10 - 16:30 | | 663 | Select and Reorder: A Novel Approach for Neural Sign Language Production | Harry Walsh, Ben Saunders and Richard Bowden |
| 16:30 - 16:50 | | 1304 | MM-IGLU: Multi-Modal Interactive Grounded Language Understanding | Claudiu Daniel Hromei, Daniele Margiotta, Danilo Croce and Roberto Basili |
| 16:50 - 17:10 | | 1344 | A Tool for Determining Distances and Overlaps between Multimodal Annotations | Camila Antonio Barros, Jorge Francisco Ciprián-Sánchez and Saulo Mendes Santos |
| 15:50 - 16:10 | D1-S3-R2 - Applications Involving LRs and Evaluation II (Chair: Leonardo Ranaldi) Room: 500 | 2176 | Zero-shot learning for multilingual discourse relation classification | Eleni Metheniti, Philippe Muller, Chloé Braud and Margarita Hernández Casas |
| 16:10 - 16:30 | | 2242 | LoSST-AD: A Longitudinal Corpus for Tracking Alzheimer's Disease Related Changes in Spontaneous Speech | Ulla Petti and Anna Korhonen |
| 16:30 - 16:50 | | 2407 | Unsupervised Grouping of Public Procurement Similar Items: Which text representation should I use? | Pedro P. V. Brum, Mariana O. Silva, Gabriel P. Oliveira, Lucas G. L. Costa, Anisio Lacerda and Gisele Pappa |
| 16:50 - 17:10 | | 2562 | Exploring the Generalization of Cancer Clinical Trial Eligibility Classifiers Across Diseases | Yumeng Yang |
| 15:50 - 16:10 | D1-S3-R3 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction II (Chair: Larry Heck) Room: Londra | 514 | COMICORDA: Dialogue Act Recognition in Comic Books | Jiri Martinek, Pavel Kral, Ladislav Lenc and Josef Baloun |
| 16:10 - 16:30 | | 403 | Towards Multi-modal Sarcasm Detection via Disentangled Multi-grained Multi-modal Distilling | Zhihong Zhu, Xuxin Cheng, Guimin Hu, Yaowei Li, Zhiqi Huang and Yuexian Zou |
| 16:30 - 16:50 | | 924 | Multilingual Turn-taking Prediction Using Voice Activity Projection | Koji Inoue, Bing'er Jiang, Erik Ekstedt, Tatsuya Kawahara and Gabriel Skantze |
| 16:50 - 17:10 | | 3161 | JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset | Atsumoto Ohashi, Ryu Hirai, Shinya Iizuka and Ryuichiro Higashinaka |
| 15:50 - 16:10 | D1-S3-R4 - Information Extraction, Knowledge Extraction, and Text Mining I (Chair: Ayla Rigouts Terryn) Room: Istanbul | 698 | Building a Japanese Document-Level Relation Extraction Dataset Assisted by Cross-Lingual Transfer | Youmi Ma, An Wang and Naoaki Okazaki |
| 16:10 - 16:30 | | 697 | Auxiliary Knowledge-Induced Learning for Automatic Multi-Label Medical Document Classification | Xindi Wang, Robert E. Mercer and Frank Rudzicz |
| 16:30 - 16:50 | | 190 | MNER-MI: A Multi-image Dataset for Multimodal Named Entity Recognition in Social Media | Shizhou Huang, Bo Xu, Changqun Li, Jiabo Ye and Xin Lin |
| 16:50 - 17:10 | | 989 | TED-EL: A Corpus for Speech Entity Linking | Silin Li, Ruoyu Song, Tianwei Lan, Zeming Liu and Yuhang Guo |
| 15:50 - 16:10 | D1-S3-R5 - Inference, Reasoning, Question Answering I (Chair: Bernardo Magnini) Room: Madrid | 353 | ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models | Ning Bian, Xianpei Han, Le Sun, Hongyu Lin, Yaojie Lu, Ben He, Shanshan Jiang and Bin Dong |
| 16:10 - 16:30 | | 985 | Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open Domain Multi-Hop Question Answering | Yuan Gao, Yiheng Zhu, Yuanbin Cao, Yinzhi Zhou, Zhen Wu, Yujie Chen, Shenglan Wu, Haoyuan Hu and Xinyu Dai |
| 16:30 - 16:50 | | 1753 | Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models | Yantao Liu, Zijun Yao, Xin Lv, Yuchen Fan, Shulin Cao, Jifan Yu, Lei Hou and Juanzi Li |
| 16:50 - 17:10 | | 1983 | JEMHopQA: Dataset for Japanese Explainable Multi-Hop Question Answering | Ai Ishii, Naoya Inoue, Hisami Suzuki and Satoshi Sekine |
| 15:50 - 16:10 | D1-S3-R6 - Document Classification, Information Retrieval and Cross-lingual Retrieval II (Chair: Liana Ermakova) Room: Berlino | 2112 | Scalable Patent Classification with Aggregated Multi-View Ranking | Dan Li, Vikrant Yadav, Zi Long Zhu, Maziar Moradi Fard, Zubair Afzal and George Tsatsaronis |
| 16:10 - 16:30 | | 2312 | A Closer Look at Clustering Bilingual Comparable Corpora | Anna Laskina, Eric Gaussier and Gaelle Calvary |
| 16:30 - 16:50 | | 2363 | PromptStream: Self-Supervised News Story Discovery Using Topic-Aware Article Representations | Arezoo Hatefi, Anton Eklund and Mona Forsman |
| 16:50 - 17:10 | | 2704 | Strengthening the WiC: New polysemy dataset in Hindi and lack of cross lingual transfer | Haim Dubossarsky and Farheen Dairkee |
| 15:50-17:10 | D1-S3-P2 - Digital Humanities and Cultural Heritage II (Chair: Eva Maria Vecchi) Room: Poster Area II (Pavilion 1 - Lingotto Fiere) | 3070 | BLN600: A Parallel Corpus of Machine/Human Transcribed Nineteenth Century Newspaper Texts | Callum William Booth, Alan Thomas and Robert Gaizauskas |
| | | 3308 | Training BERT Models to Carry Over a Coding System Developed on One Corpus to Another | Dalma Galambos and Pal Zsamboki |
| | | 374 | Linking Named Entities in Diderot's Encyclopédie to Wikidata | Pierre Nugues |
| | | 391 | Development and Evaluation of Pre-trained Language Models for Historical Danish and Norwegian Literary Texts | Ali Al-Laith, Alexander Conroy, Jens Bjerring-Hansen and Daniel Hershcovich |
| | | 864 | Converting legacy data to CLDF: A FAIR exit strategy for linguistic web apps | Robert Forkel, Daniel G. Swanson and Steven Moran |
| | | 1252 | HoLM: Analyzing the Linguistic Unexpectedness in Homeric Poetry | John Pavlopoulos, Ryan Sandell, Maria Konstantinidou and Chiara Bozzone |
| | | 1255 | Exploring Neural Topic Modeling on a Classical Latin Corpus | Ginevra Martinelli, Paola Impicciché, Elisabetta Fersini, Francesco Mambrini and Marco Passarotti |
| | | 1373 | Reflecting the Male Gaze: Quantifying Female Objectification in 19th and 20th Century Novels | Kexin Luo, Yue Mao, Bei Zhang and Sophie Hao |
| | | 1616 | GENTRAC: A Tool for Tracing Trauma in Genocide and Mass Atrocity Court Transcripts | Miriam Schirmer, Christian Brechenmacher and Juergen Pfeffer |
| | | 2516 (D) | The Onomastic Repertoire of the Roman d'Alexandre (ORNARE). Designing an Integrated Digital Onomastic Tool for Medieval French Romance | Marta Milazzo and Giorgio Maria Di Nunzio |
| | | 3083 (D) | A Large Annotated Reference Corpus of New High German Poetry | Thomas Haider |
| 15:50-17:10 | D1-S3-P2 - Evaluation and Validation Methodologies I (Chair: Eva Maria Vecchi) Room: Poster Area II (Pavilion 1 - Lingotto Fiere) | 4 | From Technology to Market. Bilingual Corpus on the Evaluation of Technology Opportunity Discovery | Amir Hazem, Kazuyuki Motohashi and Chen Zhu |
| | | 45 | Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets | Yida Mu, Xingyi Song, Kalina Bontcheva and Nikolaos Aletras |
| | | 59 | HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models | Guijin Son, Hanwool Lee, Suwan Kim, Huiseo Kim, Jae cheol Lee, Je Won Yeom, Jihyu Jung, Jung woo Kim and Songseong Kim |
| | | 248 | SilverAlign: MT-Based Silver Data Algorithm For Evaluating Word Alignment | Abdullatif Koksal, Silvia Severini and Hinrich Schütze |
| | | 286 | An Untold Story of Preprocessing Task Evaluation: An Alignment-based Joint Evaluation Approach | Eunkyul Leah Jo, Angela Yoonseo Park, Grace Tianjiao Zhang, Izia Xiaoxiao Wang, Junrui Wang, MingJia Mao and Jungyeul Park |
| | | 507 | Tug-of-War Between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models | Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Li Qiuxia and Jun Zhao |
| | | 578 | Distribution Aware Metrics for Conditional Natural Language Generation | David M. Chan, Yiming Ni, David Ross, Sudheendra Vijayanarasimhan, Austin Myers and John Canny |
| | | 673 | Automatic Speech Recognition System-Independent Word Error Rate Estimation | Chanho Park, Mingjie Chen and Thomas Hain |
| | | 789 | Multilingual Generation in Abstractive Summarization: A Comparative Study | Jinpeng Li, Jiaze Chen, Huadong Chen, Dongyan Zhao and Rui Yan |
| | | 816 | When Cohesion Lies in the Embedding Space: Embedding-Based Reference-Free Metrics for Topic Segmentation | Iacopo Ghinassi, Lin Wang, Chris Newell and Matthew Purver |
| | | 1022 | EsCoLA: Spanish Corpus of Linguistic Acceptability | Nuria Bel, Marta Punsola and Valle Ruíz-Fernández |
| | | 1036 | BiVert: Bidirectional Vocabulary Evaluation using Relations for Machine Translation | Carinne Cherf and Yuval Pinter |
| | | 1298 | Meta-Evaluation of Sentence Simplification Metrics | Noof Abdullah Alfear, Dimitar Kazakov and Hend Al-Khalifa |
| | | 1436 | SimLex-999 for Dutch | Lizzy Brans and Jelke Bloem |
| | | 1522 | GPTEval: A Survey on Assessments of ChatGPT and GPT-4 | Rui Mao, Guanyi Chen, Xulang Zhang, Frank Guerin and Erik Cambria |
| 15:50-17:10 | D1-S3-P2 - Integrated Systems and Applications (Chair: Eva Maria Vecchi) Room: Poster Area II (Pavilion 1 - Lingotto Fiere) | 110 | Difficulty-Focused Contrastive Learning for Knowledge Tracing with a Large Language Model-Based Difficulty Prediction | Unggi Lee, Sungjun Yoon, Joon Seo Yun, Kyoungsoo Park, YoungHoon Jung, Damji Stratton and Hyeoncheol Kim |
| | | 443 | Estimating Lexical Complexity from Document-Level Distributions | Sondre Wold, Petter Mæhlum and Oddbjørn Hove |
| | | 931 | A Community-Driven Data-to-Text Platform for Football Match Summaries | Pedro Fernandes, Sérgio Nunes and Luís Santos |
| | | 1486 | Improved Neural Protoform Reconstruction via Reflex Prediction | Liang Lu, Jingzhi Wang and David R. Mortensen |
| | | 1578 | Linking Judgement Text to Court Hearing Videos: UK Supreme Court as a Case Study | Hadeel Saadany, Constantin Orasan, Sophie Walker and Catherine Breslin |
| | | 1620 | INMT-Lite: Accelerating Low-Resource Language Data Collection via Offline Interactive Neural Machine Translation | Harshita Diddee, Anurag Shukla, Tanuja Ganu, Vivek Seshadri, Sandipan Dandapat, Monojit Choudhury and Kalika Bali |
| | | 1645 | Knowledge-augmented Graph Neural Networks with Concept-aware Attention for Adverse Drug Event Detection | Ya Gao, Shaoxiong Ji and Pekka Marttinen |
| | | 1697 | MHGRL: An Effective Representation Learning Model for Electronic Health Records | Feiyan Liu, Liangzhi Li, Xiaoli Wang, Feng Luo, Chang Liu, Jinsong Su and Yiming Qian |
| | | 1770 | text2story: A Python Toolkit to Extract and Visualize Story Components of Narrative Text | Evelin Amorim, Ricardo Campos, Alipio Jorge, Pedro Mota and Rúben Almeida |
| | | 684 | LexAbSumm: Aspect-based Summarization of Legal Decisions | Santosh T.Y.S.S., Mahmoud Aly and Matthias Grabmair |
| | | 1218 (D) | Extending the Discourse Analysis Tool Suite with Whiteboards for Visual Qualitative Analysis | Tim Fischer, Florian Schneider, Fynn Petersen-Frey, Anja Silvia Mollah Haque, Isabel Eiser, Gertraud Koch and Chris Biemann |
| | | 1635 | Bridging Textual and Tabular Worlds for Fact Verification: A Lightweight, Attention-Based Model | Shirin Dabbaghi Varnosfaderani, Canasai Kruengkrai, Ramin Yahyapour and Junichi Yamagishi |
| | | 1237 | tasksource: A Large Collection of NLP tasks with a Structured Dataset Preprocessing Framework | Damien Sileo |
| 15:50-17:10 | D1-S3-P2 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation I (Chair: Eva Maria Vecchi) Room: Poster Area II (Pavilion 1 - Lingotto Fiere) | 22 | Hybrid of Spans and Table-Filling for Aspect-Level Sentiment Triplet Extraction | Minghua Nuo and Chaofan Guo |
| | | 237 | Good or Bad News? Exploring GPT-4 for Sentiment Analysis for Faroese on a Public News Corpora | Iben Nyholm Debess, Annika Simonsen and Hafsteinn Einarsson |
| | | 434 | Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training | Chaojun Xiao, Yutao Sun, Yuan Yao, Xu Han, Wenbin Zhang, Zhiyuan Liu and Maosong Sun |
| | | 442 (D) | DMON: A Simple yet Effective Approach for Argument Structure Learning | Sun Wei, Mingxiao Li, Jingyuan Sun, Jesse Davis and Marie-Francine Moens |
| | | 472 | EmoProgress: Cumulated Emotion Progression Analysis in Dreams and Customer Service Dialogues | Eileen Wemmer, Sofie Labat and Roman Klinger |
| | | 568 | Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions | Flor Miriam Plaza-del-Arco, Alba A. Cercas Curry, Amanda Cercas Curry and Dirk Hovy |
| | | 637 | Czech Dataset for Complex Aspect-Based Sentiment Analysis Tasks | Jakub Šmíd, Pavel Přibáň and Ondrej Prazak |
| | | 810 | A Two-Stage Framework with Self-Supervised Distillation for Cross-Domain Text Classification | Yunlong Feng, Bohan Li, Libo Qin, Xiao Xu and Wanxiang Che |
| | | 1012 | Source-free Domain Adaptation for Aspect-based Sentiment Analysis | Zishuo Zhao, Ziyang Ma, Zhenzhou Lin, Jingyou Xie, Yinghui Li and Ying Shen |
| | | 1153 | Autonomous Aspect-Image Instruction A2II: Q-Former Guided Multimodal Sentiment Classification | Junjia Feng, Mingqian Lin, Lin Shang and Xiaoying Gao |
| | | 1185 | The ParlaSent Multilingual Training Dataset for Sentiment Identification in Parliamentary Proceedings | Michal Mochtak, Peter Rupnik and Nikola Ljubešić |
| | | 1399 | STEntConv: Predicting Disagreement between Reddit Users with Stance Detection and a Signed Graph Convolutional Network | Isabelle Lorge, Li Zhang, Xiaowen Dong and Janet Pierrehumbert |
| | | 1408 | Investigating the Robustness of Modelling Decisions for Few-Shot Cross-Topic Stance Detection: A Preregistered Study | Myrthe Reuver, Suzan Verberne and Antske Fokkens |
| | | 1667 | Stories and personal experiences in the COVID-19 Discourse | Neele Falk and Gabriella Lapesa |
| | | 1773 | "Barking Up the Right Tree", a GAN-Based Pun Generation Model through Semantic Pruning | JingJie Zeng, Liang Yang, Jiahao Kang, Yufeng Diao, Zhihao Yang and Hongfei Lin |
| | | 2208 | In-Context Example Retrieval from Multi-Perspectives for Few-Shot Aspect-Based Sentiment Analysis | Qianlong Wang, Hongling Xu, Keyang Ding, Bin Liang and Ruifeng Xu |
| | | 1413 | Argument Quality Assessment in the Age of Instruction-Following Large Language Models | Henning Wachsmuth, Gabriella Lapesa, Elena Cabrio, Anne Lauscher, Joonsuk Park, Eva Maria Vecchi, Serena Villata and Timon Ziegenbein |
| 15:50-17:10 | D1-S3-P2 - Speech Resources and Processing II (Chair: Eva Maria Vecchi) Room: Poster Area II (Pavilion 1 - Lingotto Fiere) | 3199 | BlendX: Complex Multi-Intent Detection with Blended Patterns | Yejin Yoon, Jungyeon Lee, Kangsan Kim, Chanhee Park and Taeuk Kim |
| | | 1104 | Leveraging the Interplay Between Syntactic and Acoustic Cues for Optimizing Korean TTS Pause Formation | Yejin Jeon, Yunsu Kim and Gary Geunbae Lee |
| | | 1317 | Annotation of Transition-Relevance Places and Interruptions for the Description of Turn-Taking in conversations in French Media Content | Rémi Uro, Marie Tahon, Jane Wottawa, David Doukhan, Albert Rilliard and Antoine Laurent |
| | | 1380 | Audiocite.net: A Large Spoken Read Dataset in French | Soline Felice, Solene Virginie Evain, Solange Rossato and François Portet |
| | | 1719 | NB Uttale: A Norwegian Pronunciation Lexicon with Dialect Variation | Marie Iversdatter Røsok and Ingerid Løyning Dale |
| | | 2220 | Gos 2: A New Reference Corpus of Spoken Slovenian | Darinka Verdonik, Kaja Dobrovoljc, Tomaž Erjavec and Nikola Ljubešić |
| | | 2281 | Is Spoken Hungarian Low-resource?: A Quantitative Survey of Hungarian Speech Data Sets | Peter Mihajlik, Katalin Mády, Anna Kohári, Fruzsina Sára Fruzsina, Gábor Kiss, Tekla Etelka Gráczi and A. Seza Doğruöz |
| | | 2307 | Ensembles of Hybrid and End-to-End Speech Recognition | Aditya Kamlesh Parikh, Louis ten Bosch and Henk van den Heuvel |
| | | 2411 | The Role of Creaky Voice in Turn Taking and the Perception of Speaker Stance: Experiments Using Controllable TTS | Harm Lameris, Eva Szekely and Joakim Gustafson |
| | | 2428 | Evaluating Self-Supervised Speech Representations for Indigenous American Languages | Chih-Chen Chen, William Chen, Rodolfo Joel Zevallos and John E. Ortega |
| | | 2853 | ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech Recognition | Ruizhe Huang, Mahsa Yarmohammadi, Jan Trmal, Jing Liu, Desh Raj, Leibny Paola Garcia, Alexei V. Ivanov, Patrick Ehlen, Mingzhi Yu, Dan Povey and Sanjeev Khudanpur |
| | | 3171 | nEMO: Dataset of Emotional Speech in Polish | Iwona Christop |
| | | 3401 | ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus | Tolulope Ogunremi, Kọ́lá Túbọ̀sún, Anuoluwapo Aremu, Iroro Orife and David Ifeoluwa Adelani |
| | | 3471 | PRODIS - a speech database and a phoneme-based language model for the study of predictability effects in Polish | Zofia Malisz, Jan Foremski and Małgorzata Kul |
| 17:10 - 17:30 | Coffee break | | | |
| 17:30 - 17:50 | D1-S4-R1 - Corpora and Annotation II (Chair: Serge Sharoff) - Room: Auditorium G. Agnelli | 406 | Why Voice Biomarkers of Psychiatric Disorders are not used in Clinical Practice? Deconstructing the Myth of the Need for Objective Diagnosis | Vincent P. Martin and Jean-Luc Rouas |
| 17:50 - 18:10 | | 572 | Annotate the Way You Think: An Incremental Note Generation Framework for the Summarization of Medical Conversations | Longxiang Zhang, Caleb D. Hart, Susanne Burger and Thomas Schaaf |
| 18:10 - 18:30 | | 1060 | Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data | Siyao Peng, Zihang Sun, Huangyan Shan, Marie Kolm, Verena Blaschke, Ekaterina Artemova and Barbara Plank |
| 18:30 - 18:50 | | 1166 | KGConv, a Conversational Corpus grounded in Wikidata | Quentin Brabant, Lina M. Rojas Barahona, Gwénolé Lecorvé and Claire Gardent |
| 18:50 - 19:10 | | 1340 | EcoVerse: An Annotated Twitter Dataset for Eco-Relevance Classification, Environmental Impact Analysis, and Stance Detection | Francesca Grasso, Stefano Locci, Giovanni Siragusa and Luigi Di Caro |
| 17:30 - 17:50 | D1-S4-R2 - Evaluation and Validation Methodologies I (Chair: Alessandra Zarcone) - Room: 500 | 1842 | HuLU: Hungarian Language Understanding Benchmark Kit | Noémi Ligeti-Nagy, Gergő Ferenczi, Enikő Héja, László János Laki, Noémi Vadász, Zijian Győző Yang and Tamás Váradi |
| 17:50 - 18:10 | | 1573 | KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark | Seongbo Jang, Seonghyeon Lee and Hwanjo Yu |
| 18:10 - 18:30 | | 2821 | Does ChatGPT Know that It Does Not Know? Evaluating the Black-Box Calibration of ChatGPT | Youliang Yuan, Wenxuan Wang, Qingshuo Guo, Yiming Xiong, Chihao Shen and Pinjia He |
| 18:30 - 18:50 | | 3254 | Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks | Xiao Pu, Mingqi Gao and Xiaojun Wan |
| 18:50 - 19:10 | | 2118 | A Comparative Analysis of Word-Level Metric Differential Privacy: Benchmarking The Privacy-Utility Trade-off | Stephen Joseph Meisenbacher, Nihildev Nandakumar, Alexandra Klymenko and Florian Matthes |
| 17:30 - 17:50 | D1-S4-R3 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation I (Chair: Henning Wachsmuth) - Room: Londra | 831 | Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis | Md. Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker and Sheak Rashed Haider Noori |
| 17:50 - 18:10 | | 893 | Learning Intrinsic Dimension via Information Bottleneck for Explainable Aspect-based Sentiment Analysis | Zhenxiao Cheng, Jie Zhou, Wen Wu, Qin Chen and Liang He |
| 18:10 - 18:30 | | 898 | DEEM: Dynamic Experienced Expert Modeling for Stance Detection | Xiaolong Wang, Yile Wang, Sijie Cheng, Peng Li and Yang Liu |
| 18:30 - 18:50 | | 947 | Domain Generalization via Causal Adjustment for Cross-Domain Sentiment Analysis | Siyin Wang, Jie Zhou, Qin Chen, Qi Zhang, Tao Gui and Xuanjing Huang |
| 18:50 - 19:10 | | 1006 | EmoPrompt-ECPE: Emotion knowledge-aware Prompt-tuning for Emotion-Cause Pair Extraction | Xue Gu, Zhihan Zhou, Ziyao Meng, Jian Li, Tiago Gomes, Adriano Tavares and Hao Xu |
| 17:30 - 17:50 | D1-S4-R4 - Speech Resources and Processing II (Chair: Jan Odijk) - Room: Istanbul | 1817 | Corpus Creation and Automatic Alignment of Historical Dutch Dialect Speech | Martijn Bentum, Eric Sanders, Antal P.J. van den Bosch, Douwe Zeldenrust and Henk van den Heuvel |
| 17:50 - 18:10 | | 2008 | Speech Analysis of Language Varieties in Italy | Moreno La Quatra, Alkis Koudounas, Elena Baralis and Sabato Marco Siniscalchi |
| 18:10 - 18:30 | | 2145 | Phonetic Segmentation of the UCLA Phonetics Lab Archive | Eleanor Chodroff, Blaž Pažon, Annie Baker and Steven Moran |
| 18:30 - 18:50 | | 2174 | myMediCon: End-to-End Burmese Automatic Speech Recognition for Medical Conversations | Hay Man Htun, Ye Kyaw Thu, Hutchatai Chanlekha, Kotaro Funakoshi and Thepchai Supnithi |
| 18:50 - 19:10 | | 2252 | Evaluating Text-to-Speech Synthesis from a Large Discrete Token-based Speech Language Model | Siyang Wang and Eva Szekely |
| 17:30 - 17:50 | D1-S4-R5 - Discourse and Pragmatics (Chair: Maciej Ogrodniczuk) - Room: Madrid | 2467 | DISRPT: A Multilingual, Multi-domain, Cross-framework Benchmark for Discourse Processing | Chloé Braud, Amir Zeldes, Laura Rivière, Yang Janet Liu, Philippe Muller, Damien Sileo and Tatsuya Aoyama |
| 17:50 - 18:10 | | 1803 | Discourse Structure for the Minecraft Corpus | Kate Thompson, Julie Hunter and Nicholas Asher |
| 18:10 - 18:30 | | 928 | Linear Cross-document Event Coreference Resolution with X-AMR | Shafiuddin Rehan Ahmed, George Arthur Baker, Evi Judge, Michael Reagan, Kristin Wright-Bettner, Martha Palmer and James H. Martin |
| 18:30 - 18:50 | | 2356 | SPLICE: A Singleton-Enhanced PipeLIne for Coreference REsolution | Yilun Zhu, Siyao Peng, Sameer Pradhan and Amir Zeldes |
| 18:50 - 19:10 | | 3271 | To Drop or Not to Drop? Predicting Argument Ellipsis Judgments: A Case Study in Japanese | Yukiko Ishizuki, Tatsuki Kuribayashi, Yuichiroh Matsubayashi, Ryohei Sasano and Kentaro Inui |
| 17:30 - 17:50 | D1-S4-R6 - Integrated Systems and Applications (Chair: Chris Biemann) - Room: Berlino | 1589 | To Err is Human, How About Medical Large Language Models? Comparing Pre-trained Language Models for Medical Assessment Errors and Reliability | Wen-wai Yim, Yujuan Fu, Asma Ben Abacha and Meliha Yetisgen |
| 17:50 - 18:10 | | 1615 | MedMT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain | Iker García-Ferrero, Rodrigo Agerri, Aitziber Atutxa Salazar, Elena Cabrio, Iker de la Iglesia, Alberto Lavelli, Bernardo Magnini, Benjamin Molinet, Johana Ramirez-Romero, German Rigau, Jose Maria Villa-Gonzalez, Serena Villata and Andrea Zaninello |
| 18:10 - 18:30 | | 715 | Mind Your Neighbours: Leveraging Analogous Instances for Rhetorical Role Labeling for Legal Documents | Santosh T.Y.S.S., Hassan Sarwat, Ahmed Mohamed Abdelaal Abdou and Matthias Grabmair |
| 18:30 - 18:50 | | 1071 | Distractor Generation Using Generative and Discriminative Capabilities of Transformer-based Models | Shiva Taslimipoor, Luca Benedetto, Mariano Felice and Paula Buttery |
| 18:50 - 19:10 | | 3063 | Towards Autonomous Tool Utilization in Language Models: A Unified, Efficient and Scalable Framework | Zhi Li, Yicheng Li, Hequan Ye and Yin Zhang |
| 17:30-19:10 | D1-S3-P3 - Document Classification, Information Retrieval and Cross-lingual Retrieval (Chair: François Yvon) Room: Poster Area I (Pavilion 1 - Lingotto Fiere) | 206 | FaGANet: An Evidence-Based Fact-Checking Model with Integrated Encoder Leveraging Contextual Information | Weiyao Luo, Junfeng Ran, Zailong Tian, Sujian Li and Zhifang Sui |
| | | 495 | PIRB: A Comprehensive Benchmark of Polish Dense and Hybrid Text Retrieval Methods | Slawomir Dadas, Michał Perełkiewicz and Rafał Poświata |
| | | 559 | Enhancing Few-Shot Topic Classification with Verbalizers. A Study on Automatic Verbalizer and Ensemble Methods | Quang Anh Nguyen, Nadi Tomeh, Mustapha Lebbah, Thierry Charnois, Hanene Azzag and Santiago Cordoba Muñoz |
| | | 654 | Incorporating Word-level Phonemic Decoding into Readability Assessment | Christine Pinney, Casey Kennington, Maria Soledad Pera, Katherine Landau Wright and Jerry Alan Fails |
| | | 732 | Document Set Expansion with Positive-Unlabeled Learning Using Intractable Density Estimation | Haiyang Zhang, Qiuyi Chen, Yanjie Zou, Jia Wang, Yushan Pan and Mark Stevenson |
| | | 775 | UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval | Hongru Wang, Boyang Xue, Baohang Zhou, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang and Kam-Fai Wong |
| | | 872 | Large Language Models for Generative Recommendation: A Survey and Visionary Discussions | Lei Li, Yongfeng Zhang, Dugang Liu and Li Chen |
| | | 900 | From Graph to Word Bag: Introducing Domain Knowledge to Confusing Charge Prediction | Ang Li, Qiangchao Chen, Yiquan Wu, Xiang Zhou, Kun Kuang, Fei Wu and Ming Cai |
| | | 902 | Fusion-in-T5: Unifying Variant Signals for Simple and Effective Document Ranking with Attention Fusion | Shi Yu, Chenghao Fan, Chenyan Xiong, David Jin, Zhiyuan Liu and Zhenghao Liu |
| | | 979 | Enhancing Effectiveness and Robustness in a Low-Resource Regime via Decision-Boundary-aware Data Augmentation | Kyohoon Jin, Junho Lee, Juhwan Choi, Sangmin Song and Youngbin Kim |
| | | 1125 | JLBert: Japanese Light BERT for Cross-Domain Short Text Classification | Chandrai Kayal, Sayantan Chattopadhyay, Aryan Gupta, Satyen Abrol and Archie Gugol |
| | | 1188 (D) | CLASSLA-web: Comparable Web Corpora of South Slavic Languages Enriched with Linguistic and Genre Annotation | Nikola Ljubešić and Taja Kuzman |
| | | 1309 | SPICED: News Similarity Detection Dataset with Multiple Topics and Complexity Levels | Elena Shushkevich, Long Thanh Mai, Manuel V. Loureiro, Steven Derby and Tri Kurniawan Wijaya |
| | | 1362 | An LCF-IDF Document Representation Model Applied to Long Document Classification | Renzo Arturo Alva Principe, Nicola Chiarini and Marco Viviani |
| | | 1439 | Lessons from Deploying the First Bilingual Peruvian Sign Language - Spanish Online Dictionary | Joe Huamani-Malca, Miguel Rodriguez Mondoñedo, Francisco Cerna-Herrera, Gissella Bejarano, Carlos Vásquez Roque, Cesar Augusto Ramos Cantu and Sabina Oporto Pérez |
| | | 1495 | Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data | Parth Patwa, Simone Filice, Zhiyu Chen, Giuseppe Castellucci, Oleg Rokhlenko and Shervin Malmasi |
| 17:30 - 19:10 | D1-S3-P3 - Inference, Reasoning, Question Answering II (Chair: François Yvon) Room: Poster Area I (Pavilion 1 - Lingotto Fiere) | 2640 | Mitigating Misleading Chain-of-Thought Reasoning with Selective Filtering | Yexin Wu, Zhuosheng Zhang and Hai Zhao |
| | | 2954 | ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting | Xiaoxue Cheng, Junyi Li, Wayne Xin Zhao and Ji-Rong Wen |
| | | 3043 | Visual-Textual Entailment with Quantities Using Model Checking and Knowledge Injection | Nobuyuki Iokawa and Hitomi Yanaka |
| | | 3178 | MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts | Xiang Li, Shizhu He, Jiayu Wu, Zhao Yang, Yao Xu, Yang Jun Jun, Haifeng Liu, Kang Liu and Jun Zhao |
| | | 378 | SGCM: Salience-Guided Context Modeling for Question Generation | Chuyao Ding, Yu Hong and Jianmin Yao |
| | | 435 | TAPASGO: Transfer Learning towards a German-Language Tabular Question Answering Model | Dominik Andreas Kowieski, Michael Hellwig and Thomas Feilhauer |
| | | 583 | Non-Essential is NEcessary: Order-agnostic Multi-hop Question Generation | Kyungho Kim, Seongmin Park, Junseo Lee and Jihwa Lee |
| | | 971 | Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer | Xinshuo Hu, Dongfang Li, Xiaoguang Li, Yuxiang Wu, Lifeng Shang and Baotian Hu |
| | | 1045 | Generating multiple-choice questions for medical question answering with distractors and cue-masking | Damien Sileo, Kanimozhi Uma and Marie-Francine Moens |
| | | 2464 | How Robust are the QA Models for Hybrid Scientific Tabular Data? A Study using Customized Dataset | Akash Ghosh, Venkata Sahith Bathini, Niloy Ganguly, Pawan Goyal and Mayank Singh |
| | | 2682 | Choice-75: A Dataset on Decision Branching in Script Learning | Zhaoyi Hou, Li Zhang and Chris Callison-Burch |
| | | 3245 | Denoising Table-Text Retrieval for Open-Domain Question Answering | Deokhyung Kang, Baikjin Jung, Yunsu Kim and Gary Geunbae Lee |
| | | 3261 | EEE-QA: Exploring Effective and Efficient Question-Answer Representations | Zhanghao Hu, Yijun Yang, Junjie Xu, Yifu Qiu and Pinzhen Chen |
| 17:30 - 19:10 | D1-S3-P3 - Language Modeling (Chair: François Yvon) Room: Poster Area I (Pavilion 1 - Lingotto Fiere) | 260 | Code Defect Detection using Pre-trained Language Models with Encoder-Decoder via Line-Level Defect Localization | Jimin An, YunSeok Choi and Jee-Hyong Lee |
| | | 800 | JCoLA: Japanese Corpus of Linguistic Acceptability | Taiga Someya, Yushi Sugimoto and Yohei Oseki |
| | | 803 | NGLUEni: Benchmarking and Adapting Pretrained Language Models for Nguni Languages | Francois Meyer, Haiyue Song, Abhisek Chakrabarty, Jan Buys, Raj Dabre and Hideki Tanaka |
| | | 915 | How Important Is Tokenization in French Medical Masked Language Models? | Yanis Labrak, Adrien Bazoge, Béatrice Daille, Mickael Rouvier and Richard Dufour |
| | | 1176 | On the Relationship between Skill Neurons and Robustness in Prompt Tuning | Leon Ackermann and Xenia Isabel Ohmer |
| | | 1221 | Jargon: A Suite of Language Models and Evaluation Tasks for French Specialized Domains | Vincent Segonne, Aidan Mannion, Laura Cristina Alonzo Canul, Alexandre Daniel Audibert, Xingyu Liu, Cécile Macaire, Adrien Pupier, Yongxin Zhou, Mathilde Aguiar, Felix E. Herron, Magali Norré, Massih R Amini, Pierrette Bouillon, Iris Eshkol-Taravella, Emmanuelle Esperança-Rodier, Thomas François, Lorraine Goeuriot, Jérôme Goulian, Mathieu Lafourcade, Benjamin Lecouteux, François Portet, Fabien Ringeval, Vincent Vandeghinste, Maximin Coavoux, Marco Dinarelli and Didier Schwab |
| | | 1301 | CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context | Yangruibo Ding, Zijian Wang, Wasi U. Ahmad, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth and Bing Xiang |
| | | 1722 | IAD: In-Context Learning Ability Decoupler of Large Language Models in Meta-Training | Yuhan Liu, Xiuying Chen, Gao Xing, Ji Zhang and Rui Yan |
| | | 1821 | Question Answering over Tabular Data with DataBench: A Large-Scale Empirical Evaluation of LLMs | Jorge Osés Grijalba, L. Alfonso Ureña-López, Eugenio Martínez Cámara and Jose Camacho-Collados |
| | | 1908 | NLoPT: N-gram Enhanced Low-Rank Task Adaptive Pre-training for Efficient Language Model Adaption | Hao Gu, Jiangyan Yi, Zheng Lian, Jianhua Tao and Xinrui Yan |
| | | 1934 | Linguistic Knowledge Can Enhance Encoder-Decoder Models (If You Let It) | Alessio Miaschi, Felice Dell'Orletta and Giulia Venturi |
| | | 2144 | Linguistic Rule Induction Improves Adversarial and OOD Robustness in Large Language Models | Shuoran Jiang, Qingcai Chen, Yang Xiang, Youcheng Pan and Yukang Lin |
| | | 2412 | FLOR: On the Effectiveness of Language Adaptation | Severino Da Dalt, Joan Llop, Irene Baucells, Marc Pamies, Yishi Xu, Aitor Gonzalez-Agirre and Marta Villegas |
| | | 2593 | Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding | Ahmad Idrissi-Yaghir, Amin Dada, Henning Schäfer, Kamyar Arzideh, Giulia Baldini, Jan Trienes, Max Hasin, Jeanette Bewersdorff, Cynthia S. Schmidt, Marie Bauer, Kaleb E. Smith, Jiang Bian, Yonghui Wu, Jörg Schlötterer, Torsten Zesch, Peter A. Horn, Christin Seifert, Felix Nensa, Jens Kleesiek and Christoph M. Friedrich |
| | | 2671 | Deconstructing In-Context Learning: Understanding Prompts via Corruption | Namrata Shivagunde, Vladislav Lialin, Sherin Muckatira and Anna Rumshisky |
| | | 3126 | Improving the Robustness of Large Language Models via Consistency Alignment | Yukun Zhao, Lingyong Yan, Weiwei Sun, Guoliang Xing, Shuaiqiang Wang, Chong Meng, Zhicong Cheng, Zhaochun Ren and Dawei Yin |
| | | 3248 | LlamaCare: an Instruction Fine-Tuned Large Language Model for Clinical NLP | Rumeng Li, Xun Wang and Hong Yu |
| | | 2332 | Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models | Oana Ignat, Zhijing Jin, Artem Abzaliev, Laura Biester, Santiago Castro, Naihao Deng, Xinyi Gao, Aylin Ece Gunal, Jacky He, Ashkan Kazemi, Muhammad Khalifa, Namho Koh, Andrew Lee, Siyang Liu, Do June Min, Shinka Mori, Joan C. Nwatu, Veronica Perez-Rosas, Siqi Shen, Zekun Wang, Winston Wu and Rada Mihalcea |
| | | 310 | Disambiguating homographs and homophones simultaneously: a regrouping method for Japanese | Yo Sato |
| | | 1063 | Release of Pre-Trained Models for the Japanese Language | Kei Sawada, Tianyu Zhao, Makoto Shing, Kentaro Mitsui, Akio Kaga, Yukiya Hono, Toshiaki Wakatsuki and Koh Mitsuda |
| | | 1420 | Agent-based Modeling of Language Change in a Small-world Network | Dalmo Buzato and Evandro Cunha |
| | | 2114 | mALBERT: Is a Compact Multilingual BERT Model Still Worth It? | Christophe Servan, Sahar Ghannay and Sophie Rosset |
| | | 1674 | Retentive or Forgetful? Diving into the Knowledge Memorizing Mechanism of Language Models | Boxi Cao, Qiaoyu Tang, Hongyu Lin, Shanshan Jiang, Bin Dong, Xianpei Han, Jiawei Chen, Tianshu Wang and Le Sun |
| 17:30 - 19:10 | D1-S3-P3 - Less-Resourced/Endangered/Less-studied Languages I (Chair: François Yvon) Room: Poster Area I (Pavilion 1 - Lingotto Fiere) | 603 | POS Tagging for the Endangered Dagur Language | Joanna Dolińska and Delphine Bernhard |
| | | 988 | The ParCoLab Parallel Corpus and its Extension to Four Regional Languages of France | Dejan Stosic, Saša Marjanović, Delphine Bernhard, Myriam Bras, Laurent Kevers, Stella Retali-Medori, Marianne Vergez-Couret and Carole Werner |
| | | 1068 | Towards Equitable Natural Language Understanding Systems for Dialectal Cohorts: Debiasing Training Data | Khadige Abboud and Gokmen Oz |
| | | 1368 | Mitigating Translationese in Low-resource Languages: The Storyboard Approach | Garry Kuwanto, Eno-Abasi E. Urua, Priscilla Amondi Amuok, Shamsuddeen Hassan Muhammad, Anuoluwapo Aremu, Verrah Otiende, Loice Emma Nanyanga, Teresiah W. Nyoike, Aniefon D. Akpan, Nsima Ab Udouboh, Idongesit Udeme Archibong, Idara Effiong Moses, Ifeoluwatayo A. Ige, Benjamin Ajibade, Olumide Benjamin Awokoya, Idris Abdulmumin, Saminu Mohammad Aliyu, Ruqayya Nasir Iro, Ibrahim Said Ahmad, Deontae Smith, Praise-EL Michaels, David Ifeoluwa Adelani, Derry Tanti Wijaya and Anietie Andy |
| | | 1922 | Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants | Miriam Winkler, Virginija Juozapaityte, Rob van der Goot and Barbara Plank |
| | | 2039 | Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information | Chihiro Taguchi, Jefferson Saransig, Dayana Velásquez and David Chiang |
| | | 2084 | Speech Recognition Corpus of the Khinalug Language for Documenting Endangered Languages | Zhaolin Li, Monika Rind-Pawlowski and Jan Niehues |
| | | 2136 | Extending AZee with Non-manual Gesture Rules for French Sign Language | Camille Challant and Michael Filhol |
| | | 2201 | Agettivu, Aggitivu o Aghjettivu? POS Tagging Corsican Dialects | Alice Millour, Lorenza Brasile, Alberto Ghia and Laurent Kevers |
| | | 2338 | A Workflow for HTR-Postprocessing, Labeling and Classifying Diachronic and Regional Variation in Pre-Modern Slavic Texts | Piroska Lendvai, Maarten van Gompel, Anna Jouravel, Elena Renje, Uwe Reichel, Achim Rabus and Eckhart Arnold |
| | | 2563 | Bootstrapping UMR Annotations for Arapaho from Language Documentation Resources | Matthew J. Buchholz, Julia Bonn, Claire Benet Post, Andrew Cowell and Alexis Palmer |
| | | 2678 | Development of Community-Oriented Text-to-Speech Models for Māori 'Avaiki Nui (Cook Islands Māori) | Jesin James, Rolando Coto-Solano, Sally Akevai Nicholas, Joshua Zhu, Bovey Yu, Fuki Babasaki, Jenny Tyler Wang and Nicholas Derby |
| | | 2712 | EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation | Atnafu Lambebo Tonja, Israel Abebe Azime, Tadesse Destaw Belay, Mesay Gemeda Yigezu, Moges Ahmed Ah Mehamed, Abinew Ali Ayele, Ebrahim Chekol Jibril, Michael Melese Woldeyohannis, Olga Kolesnikova, Philipp Slusallek, Dietrich Klakow and Seid Muhie Yimam |
| | | 3020 | Evaluating Performance of Pre-trained Word Embeddings on Assamese, a Low-resource Language | Dhrubajyoti Pathak, Sukumar Nandi and Priyankoo Sarmah |
| | | 3054 | Malaysian English News Decoded: A Linguistic Resource for Named Entity and Relation Extraction | MohanRaj Chanthran, Lay-Ki Soon, Huey Fang Ong and Bhawani Selvaretnam |
| | | 3240 | Learning From Wrong Predictions in Low-Resource Neural Machine Translation | Jia Cheng Hu, Roberto Cavicchioli, Giulia Berardinelli and Alessandro Capotondi |
| | | 624 | Transferring BERT Capabilities from High-Resource to Low-Resource Languages Using Vocabulary Matching | Piotr Rybak |
| | | 2542 | NSina: A News Corpus for Sinhala | Hansi Hettiarachchi, Damith Premasiri, Lasitha Randunu Chandrakantha Uyangodage and Tharindu Ranasinghe |
| 17:30 - 19:10 | D1-S3-P3 - Machine Learning Models and Techniques for CL/NLP I (Chair: François Yvon) Room: Poster Area I (Pavilion 1 - Lingotto Fiere) | 101 | CrossTune: Black-Box Few-Shot Classification with Label Enhancement | Danqing Luo, Chen Zhang, Yan Zhang and Haizhou Li |
| | | 313 | Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation | Heegon Jin, Seonil Son, Jemin Park, Youngseok Kim, Hyungjong Noh and Yeonsoo Lee |
| | | 439 | Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind | Hongchuan Zeng, Hongshen Xu, Lu Chen and Kai Yu |
| | | 450 | Semantic Role Labeling Guided Out-of-distribution Detection | Jinan Zou, Maihao Guo, Yu Tian, Yuhao Lin, Haiyao Cao, Lingqiao Liu, Ehsan Abbasnejad and Javen Qinfeng Shi |
| | | 537 | Evaluating Code-Switching Translation with Large Language Models | Muhammad Huzaifah, Weihua Zheng, Nattapol Chanpaisit and Kui Wu |
| | | 592 | Task-agnostic Distillation of Encoder-Decoder Language Models | Chen Zhang, Yang Yang, Qiuchi Li, Jingang Wang and Dawei Song |
| | | 655 | DDxGym: Online Transformer Policies in a Knowledge Graph Based Natural Language Environment | Benjamin Winter, Alexei Gustavo Figueroa Rosero, Alexander Loeser, Felix Alexander Gers, Nancy Katerina Figueroa Rosero and Ralf Krestel |
| | | 822 | TaiChi: Improving the Robustness of NLP Models by Seeking Common Ground While Reserving Differences | Huimin Chen, Chengyu Wang, Yanhao Wang, Cen Chen and Yinggui Wang |
| | | 891 | Rebalancing Label Distribution while Eliminating Inherent Waiting Time in Multi Label Active Learning applied to Transformers | Maxime Arens, Lucile Callebert, Mohand Boughanem and Jose G. Moreno |
| | | 917 | FCDS: Fusing Constituency and Dependency Syntax into Document-Level Relation Extraction | Xudong Zhu, Zhao Kang and Bei Hui |
| | | 1011 | LightVLP: A Lightweight Vision-Language Pre-training via Gated Interactive Masked AutoEncoders | Xingwu Sun, Zhen Yang, Ruobing Xie, Fengzong Lian, Zhanhui Kang and Chengzhong Xu |
| | | 1110 | Data-Informed Global Sparseness in Attention Mechanisms for Deep Neural Networks | Ileana Rugina, Rumen Dangovski, Li Jing, Preslav Nakov and Marin Soljacic |
| | | 1283 | Article Classification with Graph Neural Networks and Multigraphs | Khang Ly, Yury Kashnitsky, Savvas Chamezopoulos and Valeria Krzhizhanovskaya |
| | | 1426 | Predictive and distinctive linguistic features in Schizophrenia-Bipolar Spectrum Disorders | Martina Katalin Szabó, Veronika Vincze, Bernadett Dam, Csenge Guba, Anita Bagi and István Szendi |
| | | 1492 | Towards Understanding the Relationship between In-context Learning and Compositional Generalization | Sungjun Han and Sebastian Padó |
| | | 1621 | Multi-Channel Spatio-Temporal Transformer for Sign Language Production | Xiaohan Ma, Rize Jin and Tae-Sun Chung |
| | | 1684 | Using Pre-Trained Language Models in an End-to-End Pipeline for Antithesis Detection | Ramona Kühn, Khouloud Saadi, Jelena Mitrović and Michael Granitzer |
| | | 1758 | Evolving Knowledge Distillation with Large Language Models and Active Learning | Chengyuan Liu, Fubang Zhao, Kun Kuang, Yangyang Kang, Zhuoren Jiang, Changlong Sun and Fei Wu |
| | | 1847 | How Speculative Can Speculative Decoding Be? | Zhuorui Liu, Chen Zhang and Dawei Song |
| | | 1852 | Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization | Shuo Yang and Gjergji Kasneci |
| 19:10 - 19:50 | ELRA Members Meeting Room: 500 | | | |
| | | | | |
| | | | | |
| 20:00 - 22:00 | Welcome Reception | | | |