Skip to Main Navigation

Development Data Group

Select a EDS Sub navigation page selecting option, leaving this page

AI for Data – Data for AI

 

Overview

The Development Data Group’s AI for Data – Data for AI  program is advancing the strategic use of artificial intelligence (AI) to improve the quality, usability, and impact of development data. Our mission has two complementary objectives:

  • AI for Data: Applying AI to improve data and metadata quality, data discoverability and dissemination, monitoring of data use, and user experience in producing and accessing development datasets.
  • Data for AI: Ensuring development data—such as indicators, microdata, and geospatial datasets—is structured, documented, and made available in ways that enable effective and trustworthy use by AI systems. This includes producing AI-ready data and metadata, establishing and adapting standards and protocols to improve interoperability and AI-accessibility such as Model Context Protocol (MCP), and monitoring data quality so that AI models and tools can be built, evaluated, and deployed responsibly in development contexts.

We are spearheading the application of AI, along with the development of methodologies, tools, and models to drive improvements and efficiencies across the development data lifecycle. Our work ranges from applying generative AI to enhance metadata quality to building low-resource AI models that support semantic search, data dissemination, and knowledge discovery—particularly for underserved contexts and users.

By innovating across the data lifecycle—from curation to dissemination to monitoring its use—we aim to ensure that development data becomes AI-ready: more accessible, useful, and actionable for both people and machines, while fostering responsible AI use across global development applications.

 

Vision

The AI for Data – Data for AI program reflects the World Bank’s commitment to harnessing frontier technologies for development impact. By aligning technical innovation with the principles of openness, quality, and inclusion, we are building a future where development data is not only easier to find—but more meaningful, more relevant, and more empowering for everyone.

 

Workstreams

We leverage generative models to improve and scale metadata generation and refinement, including:

  • Automated rewriting of indicator names and definitions for clarity and consistency.
  • LLM-based validation of metadata completeness, coherence, and alignment with global standards.
  • Multi-agent systems for iterative metadata improvement and enrichment.

 

Partnerships

While supporting programs across the Development Data Group, we also collaborate with World Bank units and external partners to advance the responsible use of AI in development data.

We work with:

  • World Bank's Operational teams to advise and develop AI methods to support projects in countries, mainly on AI-readiness of data.
  • National Statistical Offices (NSOs) and data producers to support AI-assisted metadata curation and catalog integration.
  • Global AI communities, contributing to open-source tools and standards that enable fair, energy-efficient, and multilingual AI.
AI for Data, Data for AI_Blog series banner

Explore our work through our Data Blog series.

Read our blog series

Core Team