WaC13 2026 : 13th Web-as-Corpus Workshop

posted by organizer: yvesscherrer || 406 views || tracked by 2 users: [display]

WaC13 2026 : 13th Web-as-Corpus Workshop

When	Oct 29, 2026 - Oct 29, 2026
Where	Budapest, Hungary
Submission Deadline	Aug 7, 2026
Notification Due	Sep 5, 2026
Final Version Due	Sep 12, 2026

Call For Papers

The World Wide Web has evolved from a resource for building linguistic corpora into the central data infrastructure powering modern natural language processing and Large Language Models (LLMs). As web-scale data increasingly shapes AI systems’ knowledge and capabilities, understanding its quality, representativeness, and ethical implications has become critical.

At the same time, the “more is better” paradigm is being challenged by issues such as machine-generated content, data toxicity, limited metadata, and the under-representation of many languages and domains. These challenges call for a shift toward Data-Centric AI, focusing on the curation, analysis, and responsible use of web-derived data.

The 13th Web-as-Corpus (WaC-13) workshop provides a multidisciplinary forum for research addressing the full lifecycle of web data. We invite submissions on methods, resources, and applications related to web corpora, with special emphasis on multilingual data and less-resourced languages.

Topics of interest include (but are not limited to):

* Creation and evaluation of high-quality datasets for foundation models (e.g., data collection, filtering, enrichment, language identification)
* Use of web data in empirical linguistic research
* Analysis of web-scale corpora for quality, representativeness, and societal insights
* Ethical and legal aspects of collecting, sharing, and using web data

By bringing together researchers from NLP, linguistics, and the social sciences, WaC aims to advance best practices for one of the field’s most influential data sources.

Related Resources

The Web 2026 WWW 2026 : The Web Conference

AS 2026 22nd International Conference on Applied Statistics

Cyber-AI 2026 The 2nd IEEE 2026 International Conference on Cybersecurity and AI-Based Systems (Scopus)

ICVISP 2026 2026 10th International Conference on Vision, Image and Signal Processing (ICVISP 2026)

CACML 2026 2026 5th Asia Conference on Algorithms, Computing and Machine Learning (CACML 2026)

SCALABILITY 2026 The Third International Conference on Systems Scalability and Expandability

IJCSITCE 2026 The International Journal of Computational Science, Information Technology and Control Engineering

IMETI 2026 The 15th International Multi-Conference on Engineering and Technology Innovation 2026

IJNSA 2026 International Journal of Network Security & Its Applications - ERA Indexed, H Index - 52

IJESA 2026 International Journal of Embedded Systems and Applications