Introducing datasources
Datasets are the foundation of machine learning development. I built a data-discovery and routing layer that tells AI agents which real-world data source to use, how to access it, whether it is usable, and what it can be joined to. Existing public-data directories are organized for humans browsing by topic; this repo aims to be agent-centric. It asks: given a research question, which dataset should an agent reach for, how does it authenticate, what can it join on, and is there an API, MCP, or other tooling path? ...
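To make those questions concrete, here is a minimal sketch of what an agent-facing catalog entry and lookup could look like. Everything in it is an assumption for illustration: the `DataSource` fields, the `route` and `joinable` helpers, and the placeholder entries are hypothetical and do not describe the repo's actual schema or routing logic.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class DataSource:
    """Hypothetical catalog entry; field names are illustrative only."""
    name: str
    topics: frozenset[str]          # keywords the source is relevant to
    auth: str                       # e.g. "none", "api_key", "oauth"
    join_keys: frozenset[str]       # columns other sources can join on
    access_paths: tuple[str, ...]   # e.g. ("api", "mcp", "bulk_download")
    usable: bool = True             # False if the source is known to be broken


def route(question: str, catalog: list[DataSource]) -> list[DataSource]:
    """Rank usable sources by how many topic keywords appear in the question."""
    words = set(question.lower().split())
    scored = [(len(src.topics & words), src) for src in catalog if src.usable]
    matches = [(score, src) for score, src in scored if score > 0]
    # Highest score first; ties broken by name so the order is deterministic.
    matches.sort(key=lambda pair: (-pair[0], pair[1].name))
    return [src for _, src in matches]


def joinable(a: DataSource, b: DataSource) -> frozenset[str]:
    """Return the join keys two sources have in common."""
    return a.join_keys & b.join_keys


# Placeholder entries, not real datasets.
CATALOG = [
    DataSource("example_population", frozenset({"population", "county"}),
               "api_key", frozenset({"county_fips", "year"}), ("api",)),
    DataSource("example_air_quality", frozenset({"air", "pollution", "county"}),
               "none", frozenset({"county_fips", "date"}), ("api", "mcp")),
    DataSource("example_retired", frozenset({"pollution"}),
               "none", frozenset({"county_fips"}), ("bulk_download",),
               usable=False),
]

if __name__ == "__main__":
    for src in route("air pollution by county", CATALOG):
        print(src.name, src.auth, src.access_paths)
```

Keyword overlap stands in for whatever matching the real layer uses; the point is only that each entry carries authentication, join, and access-path metadata in a form an agent can query directly rather than browse.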