Introducing datasources

Datasets are the absolute foundation of machine learning development. I built a data-discovery and routing layer that lets AI agents know which real-world data source to use, how to access it, whether it is usable, and what it can be joined to. Existing public-data directories are human/topic-centric, the aim of this repo is to be agent-centric. It asks: given a research question, which dataset should an agent reach for, how does it authenticate, what can it join on, and is there an API/MCP/tooling path? ...

22 June 2026 · 2 min · Stephanie Rebecca