Workshop Name: The 2nd International Workshop on Data-driven AI (DATAI 2025)
Address: London, United Kingdom - September 1, 2025 (afternoon)
Website: https://data-drivenai.github.io/2025/
Overview
High-quality data and advanced AI models thrive in a mutually reinforcing cycle: better data yield better models, and smarter models unlock cleaner, richer data. DATAI 2025 gathers researchers and practitioners to explore this synergy, covering the discovery, cleaning, labeling, integration, and lifecycle management of data tailored for deep learning and large language models (LLMs). The workshop aims to foster a comprehensive understanding of the relationship between AI technologies and the data they depend on, focusing on the development of high-quality data for AI, with particular emphasis on large-scale models. By engaging researchers, developers, and practitioners in rigorous discussion, it seeks to explore sustained advancements, design innovations, and practical applications of data construction techniques that drive AI progress.
Call for Papers
The goal is to advance understanding of how data quality and AI techniques co-evolve. Topics of interest include, but are not limited to:
· Data discovery for AI and LLM-driven data discovery
· Data cleaning & integration for AI
· LLM-based data extraction, labeling, and transformation
· Data quality for AI in time-series and multimodal data
· Data selection for pre-training & SFT of LLMs
· Lifecycle data management for AI models
· Labeling quality vs. AI performance trade-offs
· Data-efficient AI and small-data learning
· AI for data systems & feedback-driven data improvement
Important Dates
Submission deadline: June 1, 2025
Author notification: June 20, 2025
Camera ready: July 1, 2025
Contact: wangzh@hit.edu.cn; nantang@hkust-gz.edu.cn