AstroConcepts: A Large-Scale Multi-Label Classification Corpus for Astrophysics Atilla Kaan Alkan, Felix Grezes, Sergi Blanco-Cuaresma, Jennifer Lynn Bartlett, Daniel Chivvis, Anna Kelbert, Kelly Lockhart and Alberto Accomazzi
Benchmarking LLMs for ARR Area Assignment: Evidence and Implications for Assignment Strategies Eileen Bingert, Diego Alves and Stefania Degaetano-Ortlieb
Benchmarking Retrieval-Augmented Generation for Scientific Knowledge QA in European Portuguese Jose Matos, Catarina Silva and Hugo Goncalo Oliveira
Beyond Abstracts: A Biomedical MeSH Indexing Corpus Incorporating Summarized Methods Sections Sujoy Datta, Robert E. Mercer and Xindi Wang
Challenges and Opportunities for NSLP in Scientific Publishing–A Case Study Thomas Kleinbauer, Michael Didas and Michael Wagner
ClimateCheck 2026 Task 2: Comparing Hierarchical Approaches for Fine-Grained Climate Disinformation Narrative Classification Arthur Hilbert, Nils Feldhus, Jing Yang and Vera Schmitt
ClimateCheck 2026: Scientific Fact-Checking and Disinformation Narrative Classification of Climate-related Claims Raia Abu Ahmad, Max Upravitelev, Aida Usmanova, Veronika Solopova and Georg Rehm
ClimateSense at ClimateCheck 2026 Thibault Ehrhart, Gregoire Burel and Raphael Troncy
Comparing LLM-Based Knowledge Graph Extraction Approaches on Literary Studies in Spanish: A Case Study on Orbis Tertius Federico Cortes
Contrastive SciBERT for Cross-Document Software Coreference Mahmoud Hassan and Dipendra Yadav
Demystifying Funding: Reconstructing a Unified Dataset of the UK Funding Lifecycle William Thorne, Rupert Shepherd and Diana Maynard
Do Lexical and Contextual Coreference Resolution Systems Degrade Differently under Mention Noise? An Empirical Study on Scientific Software Mentions Atilla Kaan Alkan, Felix Grezes, Jennifer Lynn Bartlett, Anna Kelbert, Kelly Lockhart and Alberto Accomazzi
Do We Need Bigger Models for Science? Task-Aware Retrieval with Small Language Models Florian Kelber, Matthias Jobst, Yuni Susanti and Michael Färber
EarlySciRev: A Dataset of Early-Stage Scientific Revisions Extracted from LaTeX Writing Traces Léane Jourdan, Julien Aubert-Béduchaud, Yannis Chupin, Marah Baccari and Florian Boudin
Enhancing Factuality and Transparency in Generative Models for Biomedical Question Answering Ankita Behura, Siting Liang and Daniel Sonntag
Enhancing Scholarly Knowledge Graphs via Domain-Specific Entity Detection and Linking Nicolau Duran-Silva, César A. Parra-Rojas, Pablo Accuosto, Julian Moreno-Schneider and Georg Rehm
Evaluating Generative Large Language Models for Portuguese Scientific Information Extraction Tomás Pinto, Catarina Silva and Hugo Goncalo Oliveira
From Slides to Chatbots: Enhancing Large Language Models with University Course Materials Tu Anh Dinh, Philipp Nicolas Schumacher and Jan Niehues
Generating Research Data Metadata from Their Accompanying README Files Kotaro Sekido, Yu Watanabe, Koichiro Ito and Shigeki Matsubara
Identifying Implicit Research Data References in Paper Citations Koshi Motegi, Koichiro Ito and Shigeki Matsubara
Improving Completeness in Deep Research Agents through Targeted Enrichment Jesse Wonnink, Jakub Zavrel and Paul Groth
MioFFAn: an Annotation Software for Formula Formalization with LLM Automation Capabilities Nicolas Sibuet Ruiz, Horacio Saggion and Riccardo Rossi
Normalizing section names and structure of scientific articles Nicolau Duran-Silva, Julian Moreno-Schneider, César A. Parra-Rojas and Georg Rehm
Retrieval-Augmented LLMs and Encoder Models for Multi-Label Climate Disinformation Narrative Classification Neda Foroutan, Alexandra Tsiakalou and Vera Schmitt
The Linguist’s Lie Detector: Benchmarking Linguistic Veracity in Large Language Models Lucía Catalán Gris, Kim Gerdes and John S. Y. Lee
The Software Mention Detection and Coreference Resolution Shared Task 2026 Sharmila Upadhyaya, Wolfgang Otto, Julia Matela, Frank Krüger and Stefan Dietze
Towards Efficient Self-Explainable Climate-Related Claim Verification with Generative Models Siting Liang, Omar Adjali and Daniel Sonntag
Transferring Scientific English Pre-Trained Language Models to Multiple Languages Using Cross-Lingual Transfer Nikolas Ching-Pu Rauscher, Fabio Barth and Georg Rehm
UniCite: A Dataset and Unified Hierarchical Taxonomy for Multi-Dimensional Citation Analysis Amina Mourky, Elena Leitner, Julian Moreno-Schneider, Raia Abu Ahmad, Ekaterina Borisova and Georg Rehm