Accepted System Demonstrations Papers

  • OpenRLHF: A Ray-based Easy-to-use, Scalable and High-performance RLHF Framework
    Bin Chen, Hao Chen, Haoran Wang, Haotian Xu, Jason Klein Liu, Jian Hu, Songlin Jiang, Wei Shen, Weixun Wang, Wenkai Fang, Xianyu, Xibin Wu, Yu Cao, LiuYiming
  • EasyDistill: A Comprehensive Toolkit for Effective Knowledge Distillation of Large Language Models
    Chengyu Wang, Jun Huang, Junbing Yan, Wenrui Cai, Yuanhao Yue
  • CafGa: Customizing Feature Attributions to Explain Language Models
    Alan David Boyle, Furui Cheng, Mennatallah El-Assady, Vilém Zouhar
  • SLACKAGENTS: SCALABLE COLLABORATION OF MULTIPLE AI AGENTS IN WORKSPACES
    Caiming Xiong, Huan Wang, Jianguo Zhang, Juntao Tan, Shelby Heinecke, Silvio Savarese, Weiran Yao, Zhiwei Liu, Zuxin Liu, Frank Wang
  • InTriage: Intelligent Telephone Triage in Pre-Hospital Emergency Care
    Dehan Hong, Hao Fei, Kai He, Marcus Eng Hock Ong, Mengling Feng, Qika Lin, Eng Siong Chng
  • EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models
    Guozhou Zheng, Haoming Xu, Huajun Chen, Mengru Wang, Ningyu Zhang, Shuxun Wang, Xinle Deng, Yunzhi Yao, Ziwen Xu, kewei Xu
  • AERA Chat: An Interactive Platform for Automated Explainable Student Answer Assessment
    Artem Bobrov, Cesare Aloisi, Jiazheng Li, Runcong Zhao, Yulan He
  • Tau-Eval: A Unified Evaluation Framework for Useful and Private Text Anonymization
    Damien Riquet, Gabriel Loiseau, Marc Tommasi, Maxime Meyer, Damien Sileo
  • SpiritRAG: A Q&A System for Religion and Spirituality in the United Nations Archive
    Anastassia Shaitarova, Fabian Winiger, Gerold Schneider, Nianlong Gu, Patrick Montjourides, Yingqiang Gao
  • KMatrix-2: A Comprehensive Heterogeneous Knowledge Collaborative Enhancement Toolkit for Large Language Model
    Jun Zhao, Kang Liu, Kun Luo, Shizhu He, Wangtao Sun, XueYou Zhang, Ziyang Huang, di wu, shun wu, yuanxiaowei
  • EvoAgentX: An Automated Framework for Evolving Agentic Workflows
    Jinyuan Fang, Siwei Liu, Yingxu Wang, Zaiqiao Meng
  • ResearStudio: A Human-intervenable Framework for Building Controllable Deep Research Agents
    Linyi Yang, Yixuan Weng
  • PresentAgent: Multimodal Agent for Presentation Video Generation
    Biao Wu, Jingwei Shi, Ling Chen, Meng Fang, Yang Zhao, Yanjie Liang, Zeyu Zhang
  • PDFMathTranslate: Scientific Document Translation Preserving Layouts
    Chang Chu, Rongxin Ouyang, Xiangyao Ma, Zhikuang Xin
  • Open Political Corpora: Structuring, Searching, and Analyzing Political Text Collections with PoliCorp
    Muhammad Ahsan Shahid, Nina Smirnova, Philipp Mayr
  • LAD: LoRA-Adapted Diffusion
    Ayoub Bagheri, Ruurd Jan Anthonius Kuiper, Bram van Es, Lars de Groot
  • UnityAI Guard: Pioneering Toxicity Detection Across Low-Resource Indian Languages
    Adithya Ananth, Birudugadda Srivibhav, Daksh Jain, Eshwar Dhande, Himanshu Beniwal, Kuldeep, Mayank Singh, Pavan Deekshith Doddi, Reddybathuni Venkat, Rohit Kumar
  • CrowdAgent: Multi-Agent Managed Multi-Source Annotation System
    Changjie Fan, Haobo Wang, Junbo Zhao, Maosheng Qin, Mingxuan Xia, Minmin Lin, Renyu Zhu, Runze Wu, Zhen Zhu, Chenchenkai, Lu Xu
  • LionGuard 2: Building Lightweight, Data-Efficient & Localised Multilingual Content Moderators
    Gabriel Chua, Leanne Tan, Roy Ka-Wei Lee, Ziyu Ge
  • Pico: A Modular Framework for Hypothesis-Driven Small Language Model Research
    David Demitri Africa, Paula Buttery, Richard Diehl Martinez, Ryan Daniels, Suchir Salhan, Yuval Weiss
  • DistaLs: a Comprehensive Collection of Language Distance Measures
    Esther Ploeger, Rob van der Goot, Tanja Samardzic, Verena Blaschke
  • MASA: A Modular Framework for LLM-Driven Multi-Agent Systems for Autoformalization
    Andre Freitas, Lan Zhang, Marco Valentino
  • SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains
    Daniel Smolyak, Krithika Ramesh, Margrét V. Bjarnadóttir, Nupoor Gandhi, Ritu Agarwal, Zihao Zhao, Anjalie Field
  • Open-Theatre: An Open-Source Toolkit for LLM-based Interactive Drama
    Hongqiu Wu, Tianyang Xu, Weiqi Wu, hai zhao
  • End-to-End Multilingual Automatic Dubbing via Duration-based Translation with Large Language Models
    Hyun-Sik Won, DongJin Jeong, Hyunkyu Choi, Jinwon Kim
  • Sanskrit Voyager: Unified Web Platform for Interactive Reading and Linguistic Analysis of Sanskrit Texts
    Danilo Croce, Giacomo De Luca, Roberto Basili
  • AutoIntent: AutoML for Text Classification
    Darina Rustamova, Denis Kuznetsov, Ilya Alekseev, Roman Solomatin
  • PledgeTracker: A System for Monitoring the Fulfilment of Pledges
    Andreas Vlachos, David Corney, Michael Sejr Schlichtkrull, Yulong Chen, Zhenyun Deng, Andrew Dudfield, Nasim Asl
  • Bratly: A Python Extension for BRAT Functionalities
    Christian Lovis, Jamil Zaghir, Jean-Philippe Goldman, Mina Bjelogrlic, Nikola Bjelogrlic
  • SciClaims: An End-to-End Generative System for Biomedical Claim Analysis
    Jose Manuel Gomez-Perez, Raúl Ortega
  • LingConv: An Interactive Toolkit for Controlled Paraphrase Generation with Linguistic Attribute Control
    Hadi Amiri, Mohamed Elgaar
  • BAREC Demo: Resources and Tools for Sentence-level Arabic Readability Assessment
    Hanada Taha, Khalid Elmadani, Kinda Altarbouch, Nizar Habash, Ossama Obeid
  • TRACE: Training and Inference-Time Interpretability Analysis for Language Models
    Andre Freitas, Danilo Carvalho, Nura Aljaafari
  • Synthetic Data for Evaluation: Supporting LLM-as-a-Judge Workflows with EvalAssist
    Elizabeth M. Daly, Erik Miehling, Hyo Jin Do, Jasmina Gajcin, Martín Santillán Cooper, Michael Desmond, Qian Pan, Werner Geyer, Zahra Ashktorab
  • TruthTorchLM: A Comprehensive Library for Predicting Truthfulness in LLM Outputs
    Alfy Samuel, Alperen Öziş, Anoop Kumar, Daben Liu, Duygu Nur Yaldiz, Hayrettin Eren Yildiz, Mitash Ashish Shah, Sai Praneeth Karimireddy, Amir Avestimehr, Sungmin Kang, Yavuz Faruk Bakman, Zhiqi Huang
  • Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents
    Jingyuan Wang, Qiyu Sun, Richong Zhang, Shiqi Li, Yaowei Zheng, Yuchen Gong, Ziyang Miao
  • The iRead4Skills Intelligent Complexity Analyzer
    Alejandro Catala, David Antunes, Elodie Vanzeveren, Eugénio Ribeiro, Jorge Baptista, Marcos Garcia, Raquel Amaro, Thibault Bañeras-Roux, Thomas François, Wafa Aissa, Mario Izquierdo-Álvarez
  • TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability
    Ameya Godbole, James Flemings, Johnny Wei, Krishna P. Gummadi, Mohammad Aflah Khan, Robin Jia, Willie Neiswanger, Ryan Wang
  • AM4DSP: Argumentation Mining in Structured Decentralized Discussion Platforms for Deliberative Democracy
    Anna De Liddo, Elena Cabrio, Lucas Anastasiou, Serena Villata, Sofiane Elguendouze, Erwan Hain
  • GLiNER2: Schema-Driven Multi-Task Learning for Structured Information Extraction
    Gil Pasternak, Urchade Zaratiana
  • Automated Evidence Extraction and Scoring for Corporate Climate Policy Engagement: A Multilingual RAG Approach
    Chiara Colesanti Senni, Imene kolli, Markus Leippold, Saeid Vaghefi, Shantam Raj
  • LaTeXMT: Machine Translation for LaTeX Documents
    Calvin Hoy, Georg Moser, Samuel Frontull
  • MALLM: Multi-Agent Large Language Models Framework
    Bela Gipp, Jan Philip Wahle, Jonas Becker, Lars Benedikt Kaesberg, Niklas Bauer, Terry Ruas
  • RadEval: A framework for radiology text evaluation
    Dave Van Veen, David W Eyre, Fatemeh Haghighi, Javid Abderezaei, Jean-Benoit Delbrouck, Justin Xu, Xi Zhang, Zaiqiao Meng, Ali Ganjizadeh, Eric Brattain, Julie Bauml, Roger Boodoo
  • Manipulating LLM Web Agents with Indirect Prompt Injection Attack via HTML Accessibility Tree
    Sam Johnson, Thai Le, Viet Pham
  • LangVAE and LangSpace: Building and Probing for Language Model VAEs
    Andre Freitas, Danilo Carvalho, Harriet Unsworth, Yingji Zhang
  • AgentMaster: A Multi-Agent Conversational Framework Using A2A and MCP Protocols for Multimodal Information Retrieval and Analysis
    Callie C. Liao, Duoduo Liao, Sai Surya Gadiraju
  • ROBOTO2: An Interactive System and Dataset for LLM-assisted Clinical Trial Risk of Bias Assessment
    Anthony Hevia, Lucy Lu Wang, Nguyen Thanh Tam, Sanjana Chintalapati, Terry P Klassen, Veronica Ka Wai Lai
  • GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models
    Ananda Sreenidhi, Bhavani Sai Praneeth Varma Mantina, Fei Yuan, Hengyu Luo, Joseph Attieh, Jörg Tiedemann, Mengjie Wang, Ona de Gibert, Peiqin Lin, Raúl Vázquez, Samea Yusofi, Sawal Devkota, Shaoxiong Ji, Xu Huang, Zihao Li
  • AIPOM: Agent-aware Interactive Planning for Multi-Agent Systems
    Chen Shen, Dan Zhang, Estevam Hruschka, Hannah Kim, Kushan Mitra
  • Co-DETECT: Collaborative Discovery of Edge Cases in Text Classification
    Chenfei Xiong, Dirk Hovy, Donya Rooein, Elliott Ash, Jingwei Ni, Lorena Calvo-Bartolomé, Markus Leippold, Mennatallah El-Assady, Vilém Zouhar, Yu Fan, Alexander Hoyle, MRINMAYA SACHAN
  • SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks
    Adamenko Pavel, Aidar Valeev, Alena Fenogenova, Dmitrii Babaev, Ivan Lopatin, Ivanov Mikhail, Pavel Zadorozhny, Rodion Levichev, Valentin Malykh
  • AgentDiagnose: An Open Toolkit for Diagnosing LLM Agent Trajectories
    Apurva Gandhi, Graham Neubig, Tianyue Ou, Wanyao Guo, Xiang Yue
  • BioGraphia: A LLM-Assisted Biological Pathway Graph Annotation Platform
    Adam Officer, Angela Chen, Lei Li, Sumin Jo, Xi Xu, Yufei Huang
  • From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens
    Daniel Wei, Eric Haoran Huang, Haoyue Shi, Hala Sheta, Ilia Alenabi, Jiajun Hong, Jialin Yang, Jiawei Zhou, Ruoxi Ning, Ryker Lin, Shuyu Wu, Ziqiao Ma
  • MedTutor: A Retrieval-Augmented LLM System for Case-Based Medical Education
    Arman Cohan, Dongsuk Jang, Sophie Chheang, Ziyao Shangguan, Anurag Gupta, Jan T Czerminski, Kyle Tegtmeyer
  • AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning
    Chi Chen, Chongyi Wang, Haotian Chen, Maosong Sun, Shenzhi Yang, Wang Xu, Xin Cong, Yankai Lin, Yaxi Lu, Yesai Wu, Yuan Yao, Yupeng Huo, Zhiyuan Liu, Zhong Zhang, Zhongwu Zhai, 司函, Yikun Fu
  • MCPEval: Automatic MCP-based Deep Evaluation for AI Agent Models
    Caiming Xiong, Haolin Chen, Huan Wang, Jianguo Zhang, Jielin Qiu, Shelby Heinecke, Shiyu Wang, Weiran Yao, Zhiwei Liu, Zuxin Liu, silvio savarese, Roshan Ram
  • PromptSuite: A Task-Agnostic Framework for Multi-Prompt Generation
    Eliya Habba, Gabriel Stanovsky, Gili Lior, Noam Dahan
  • ViDove: A Translation Agent System with Multimodal Context and Memory-Augmented Reasoning
    Ao Sun, Ching Wing Kwok, Jianyuan Zhan, Takatomo Saito, Wei Dai, Xudong Xiao, Yian Wang, Yichen Lu, Zongheng Wu, Jiaen LIU, SICHENG LAI, Sheng fu
  • Quest2DataAgent: Automating End-to-End Scientific Data Collection
    Ethan A. Brown, Sobin Alosious, Tengfei Luo, Tianyu Yang, Xiangliang Zhang, Yuhan Liu, Jason R. Rohr
  • Marcel: A Lightweight and Open-Source Conversational Agent for University Student Support
    Anastasiia Derzhanskaia, Christin Seifert, Jan Trienes, Jörg Schlötterer, Markus Mühling, Roland Schwarzkopf
  • GraDeT-HTR: A Resource-Efficient Bengali Handwritten Text Recognition System utilizing Grapheme-based Tokenizer and Decoder-only Transformer
    Ahmed Nesar Tahsin Choudhury, Mahmudul Hasan, Md. Mahmudul Hasan, Md Mosaddek Khan
  • LearnLens: LLM-Enabled Personalised, Curriculum-Grounded Feedback with Educators in the Loop
    Artem Bobrov, Cesare Aloisi, Jiazheng Li, Runcong Zhao, Yulan He
  • GraphMind: Interactive Novelty Assessment System for Accelerating Scientific Discovery
    Italo Luis da Silva, Lin Gui, Yulan He, hanqi yan
  • SciSketch: An Open-source Framework for Automated Schematic Diagram Generation in Scientific Papers
    Arman Cohan, Chen Zhao, Kaiyan Zhang, Manasi Patwardhan, Yilun Zhao, Zihang Wang
  • Alpha-GPT: Human-AI Interactive Alpha Mining for Quantitative Investment
    Hang Yuan, Heung-Yeung Shum, Jian Guo, Leon Zhou, Lionel Ni, Saizhuo Wang
  • o-MEGA: Optimized Methods for Explanation Generation and Analysis
    Andrej Ridzik, Jaroslav Kopčan, Marcel Veselý, Martin Tamajka, Qiwei Peng, Ľuboš Kriš
  • Interactive Training: Feedback-Driven Neural Network Optimization
    Wentao Zhang, Yang Young Lu, Yuntian Deng
  • OpenS2S: Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model
    Chen Wang, Chengqing Zong, Jiajun Zhang, Jinqiao Wang, Jun Lin, Lingxiang Wu, Tianyu Peng, Wen Yang
  • TinyScientist: An Interactive, Extensible, and Controllable Framework for Building Research Agents
    Fenghai Li, Haofei Yu, Jiaxun Zhang, Keyang Xuan, Kunlun Zhu, Kyle Richardson, Ziheng Qi, Zijie Lei, Jiaxuan You
  • ConfReady: A RAG based Assistant and Dataset for Conference Checklist Responses
    Agam Shah, Kosha Bheda, Michael Galarnyk, Prasun Banerjee, Rutwik Routu, Sudheer Chava, Vidhyakshaya Kannan
  • LLM$\times$MapReduce-V3: Enabling Interactive In-Depth Survey Generation through a MCP-Driven Hierarchically Modular Agent System
    Haoyu Wang, Jie Zhou, Maosong Sun, Shuo Wang, Siyu Lin, Yu Chao, Zhiyuan Liu, Zhu Zhang, Zihan Zhou, xiaorong wang
  • DVAGen: Dynamic Vocabulary Augmented Generation
    Jiahao Kuang, Jie Wang, Nuowei Liu, Tao Ji, Wei Du, Xiaoling Wang, Yuanbin Wu
  • PromptSculptor: Multi-Agent Based Text-to-Image Prompt Optimization
    Dawei Xiang, Kexin Chu, Tianqi Ding, Wei Zhang, Wenyan Xu, Zixu Shen
  • Metamo: Empowering Large Language Models with Psychological Distortion Detection for Cognition-aware Coaching
    HAJIME HOTTA, Manh-Cuong Phan, Minh-Tien Nguyen, Le Huu Loi
  • MathBuddy: A Multimodal System for Affective Math Tutoring
    Dacia Braca, Debanjana Kar, Leopold Böss, Nina Christine Hubig, Yufang Hou, Philipp Wintersberger, Sebastian Maximilian Dennerlein