SourceDomainKindLicenseCostAuthFreqBulkMCPMCP commandJoin keys
AACT (Aggregate Analysis of ClinicalTrials.gov)Public PostgreSQL mirror of ClinicalTrials.gov, refreshed daily by CTTI at Duke, with all protocol and results data normalised into ~50 relational tables for SQL analysis.clinical-biotechregistryAACT-CTTI-Attributionfree-with-registrationaccount-requireddailyyesmcp-existsNCT_IDEUDRACT_NUMBEREU_CT_NUMBERPMIDMESH_TERM
Alpha VantageRealtime and historical market data API covering global equities, FX, crypto, commodities, technical indicators, fundamentals, and macroeconomic series, accessed by ticker symbol with a single query endpoint.finance-marketstime-seriesAlpha-Vantage-Proprietaryfreemiumapi-key-freeintraday (1/5/15/30/60 min), daily, weekly, monthly depending on endpointnomcp-existsTICKERISO_3DATE
AlphaFold Protein Structure DatabaseOpen database of AlphaFold-predicted 3D protein structures for 214M+ UniProt sequences, run by EMBL-EBI with Google DeepMind, served per-accession via JSON API and in bulk per-proteome on Google Cloud Storage and the EBI FTP.bio-genomicsregistryCC-BY-4.0freenoneperiodic, tracking UniProt releases; synced to UniProt release 2025_03yesmcp-needed-high-valueUNIPROT_ACCESSIONPDB_ID
American Gut ProjectCrowdfunded citizen-science microbiome dataset of stool, skin, and oral 16S rRNA samples paired with dietary and lifestyle questionnaires, run by the Knight Lab at UC San Diego.bio-genomicscorpusBSD-3-Clausefreenoneirregularyesmcp-needed-low-valueDOIPMID
ArrayExpressEMBL-EBI archive of high-throughput functional genomics experiments (microarray and sequencing), now a collection within BioStudies, with MAGE-TAB metadata and raw sequencing reads brokered to ENA.bio-genomicscorpusEMBL-EBI-Terms-of-Usefreenonecontinuous as submitters release studiesyesmcp-needed-low-valueNCBI_TAXON_IDPMID
arXivOpen-access preprint repository for physics, math, CS, quantitative biology, finance, statistics, EE, and economics, with a public metadata API and bulk full-text mirrors.academiccorpusCC0freenonedailyyesmcp-existsnpx -y @futurelab-studio/latest-science-mcp@latestARXIV_IDDOI
BindingDBPublic database of measured protein-small-molecule binding affinities (Ki, Kd, IC50, EC50) curated from literature, patents, PubChem bioassays, and ChEMBL.clinical-biotechregistryCC-BY-3.0freenoneupdated when new data are added, usually monthly (release 202606)yesmcp-needed-high-valueUNIPROT_ACCESSIONCHEMBL_IDINCHI_KEYPMID
BioImage ArchiveEMBL-EBI's open, cross-modality archive of life-sciences biological imaging data (light, EM, super-resolution, high-content, whole-slide) plus segmentations and AI training sets, built on BioStudies infrastructure with per-study S-BIAD accessions.bio-genomicscorpusvaries-per-studyfreenonecontinuous on submission; no fixed release cycleyesmcp-needed-low-valueNCBI_TAXON_ID
Bureau of Labor Statistics (BLS) Public Data APIUS Bureau of Labor Statistics Public Data API and bulk flat files covering employment, unemployment, wages, prices (CPI/PPI), productivity, job openings, and compensation across dozens of survey programs.government-open-datamixedUS-Government-Public-Domainfreeapi-key-freemonthly for most series; weekly/quarterly/annual underlying cadence varies by surveyyesmcp-existsDATEUS_STATE_CODEFIPS
Cancer Cell Line Encyclopedia (CCLE)Multi-omic characterisation of ~1,000 human cancer cell lines (RNA-seq, WES/WGS, copy number, methylation, proteomics, metabolomics, drug sensitivity), distributed through the Broad Institute's DepMap Portal.bio-genomicsmixedDepMap-Terms-Of-Usefree-non-commercialnoneDepMap releases bi-annually (e.g. 25Q2, 25Q4, 26Q1); historical CCLE Phase I-III drops are staticyesmcp-existsENSEMBL_IDGENE_SYMBOLENTREZ_GENE_IDDOIPMID
CBOE Volatility Index (VIX)Daily OHLC time series of the CBOE Volatility Index (VIX) from January 1990 to present, mirrored from the official CBOE CSV by DataHub's Core dataset pipeline.finance-marketstime-seriesODC-PDDL-1.0freenoneapproximately every 4 hours via DataHub Core pipeline; upstream CBOE file updates each US trading dayyesapi-direct-sufficientDATE
ChEMBLManually curated database of bioactive molecules with drug-like properties, integrating chemistry, bioactivity, and target/genomic data from medicinal chemistry literature and approved drugs.clinical-biotechregistryCC-BY-SA-3.0freenoneapproximately every 6-12 months (major releases); Release 37 May 2026yesmcp-needed-high-valueCHEMBL_IDUNIPROT_ACCESSIONDOIPMIDMESH_TERM
ClinGen (Clinical Genome Resource)NIH-funded expert-curated resource defining the clinical relevance of genes and variants, covering gene-disease validity, dosage sensitivity, variant pathogenicity, and clinical actionability, plus a canonical allele registry.bio-genomicsknowledge-graphCC0freenonecontinuous expert curation; downloads refreshed as curations are publishedyesmcp-needed-high-valueRSIDGENE_SYMBOLENTREZ_GENE_IDEFO_ID
ClinicalTrials.govUS federal registry and results database of publicly and privately supported clinical studies conducted globally, run by the National Library of Medicine.clinical-biotechregistryUS-Government-Public-Domainfreenonedailyyesmcp-existsuvx --from biomcp-python biomcp runNCT_IDEUDRACT_NUMBEREU_CT_NUMBERPMIDMESH_TERM
ClinVarNIH/NCBI public archive of human genetic variants and their clinically asserted relationships to diseases and drug responses, with submitter evidence and a review-status confidence rating.bio-genomicsregistryUS-Government-Public-Domainfreenoneweekly XML release; monthly XML/VCF/TSV archives; some TSV summaries dailyyesapi-direct-sufficientRSIDGENE_SYMBOLENTREZ_GENE_IDMESH_TERM
CommodityPriceAPICommercial REST API for real-time and historical prices on 130+ commodities (precious metals, energy, agricultural, soft commodities) with quote-currency conversion across 175+ currencies and daily history back to 1990.finance-marketstime-seriesCommodityPriceAPI-Proprietaryfreemiumapi-key-free60-second refresh on Plus and Premium; 10-minute delay on Lite. Monthly-only commodities deliver closing rate only.nomcp-needed-low-valueDATEISO_4217
Common CrawlFree open repository of monthly web crawls (300B+ pages, 15 years) with raw HTML, metadata, and extracted plaintext available as bulk WARC/WAT/WET files plus a CDX URL index.news-eventscorpusCommon-Crawl-Terms-of-Usefreenonemonthlyyesmcp-existsURLDATE
Companies House (UK)UK statutory register of incorporated companies with profiles, officers, filing history, and persons with significant control.corporate-registryregistryOGL-3.0freeapi-key-freereal-time for filings, monthly for bulk snapshotyesmcp-needed-high-valueCOMPANIES_HOUSE_NUMBERISO_3DATEURL
COREWorld's largest aggregator of open access research papers, harvesting metadata and full text from thousands of repositories and journals globally.academiccorpusODC-BYfreemiumapi-key-freecontinuousyesmcp-existsnpx -y @futurelab-studio/latest-science-mcp@latestDOIARXIV_IDPMIDISSNURL
CrossrefDOI registration agency and open metadata API for scholarly works, covering ~180M records deposited by member publishers across journals, books, conference proceedings, preprints, and datasets.academicregistryCC0freenonecontinuous (REST API); annual public data file snapshotyesmcp-existsDOIISSNORCIDROR
DailyMedNLM's authoritative repository of FDA-submitted Structured Product Labels (SPLs) for prescription, OTC, homeopathic, animal, and other regulated drug products in the US.clinical-biotechregistryUS-Government-Public-Domainfreenonedaily, weekly, and monthly delta archives; full snapshots refreshed periodicallyyesmcp-existsNDCRXNORM_CUI
DataCiteGlobal DOI registration agency and metadata graph for research datasets, software, preprints, and other non-article research outputs.academicregistryCC0freenonecontinuous (API); annual snapshot (Public Data File)yesmcp-existsDOIORCIDRORFUNDER_DOIURL
dbSNPNIH/NCBI public archive of short genetic variation (SNVs, short indels, microsatellites) keyed to RefSNP (rs) clusters, with mapped coordinates, allele frequencies, and cross-references to genes and clinical resources.bio-genomicsregistryUS-Government-Public-Domainfreenoneperiodic builds (Build 157 released 2025-03-18); ALFA allele-frequency updates quarterlyyesapi-direct-sufficientRSIDGENE_SYMBOLENTREZ_GENE_ID
DefiLlama StablecoinsOpen API tracking circulating supply, chain distribution, peg mechanism, and historical market cap of stablecoins across every supported blockchain, run as the stablecoins module of the DefiLlama TVL aggregator.finance-marketstime-seriesDefiLlama-Termsfreemiumnonecirculating supply refreshed hourly per chain; historical mcap and price series updated dailynomcp-needed-high-valueDATEISO_4217
DrugBankCurated drug knowledgebase covering chemistry, pharmacology, targets, interactions, and external identifiers for approved, investigational, and experimental small-molecule and biotech drugs.clinical-biotechregistryDrugBank-Custom-Licensefree-non-commercialaccount-requiredminor releases every few months; major versions ~yearlyyesmcp-existsCHEMBL_IDRXNORM_CUIBNF_CODEGENE_SYMBOLUNIPROT_ACCESSIONMESH_TERMWIKIDATA_QID
Drugs@FDAFDA's authoritative record of approved drug and biologic products, with full application, product, and submission/supplement approval history plus links to approval letters, labels, and review documents.clinical-biotechregistryCC0freeapi-key-freedaily (Monday-Friday) on the openFDA mirror; raw FDA data files refreshed regularlyyesmcp-needed-high-valueFDA_APPLICATION_NUMBERNDCUNII
ECMWF (European Centre for Medium-Range Weather Forecasts)Global numerical weather forecasts, ensemble products, and the ERA5 climate reanalysis from ECMWF, distributed via the free ECMWF Open Data feed, the Copernicus Climate Data Store API, and the licensed MARS archive.geospatialmixedCC-BY-4.0freemiumapi-key-freeoperational forecasts run 2-4x daily; ERA5 updated ~daily with lag; monthly and seasonal productsyesmcp-needed-low-valueDATE
EIA Open DataUS Energy Information Administration's REST API and bulk files covering energy production, consumption, prices, and forecasts across electricity, petroleum, natural gas, coal, nuclear, and international markets.finance-marketsmixedUS-Government-Public-Domainfreeapi-key-freetwice-daily for most series; weekly/monthly/annual underlying cadence varies by routeyesmcp-needed-low-valueDATEISO_3US_STATE_CODE
EMPIARPublic archive of raw image data underpinning 3D electron microscopy structures (cryo-EM single particle, cryo-electron tomography, volume EM), run by EMBL-EBI as a companion to EMDB.bio-genomicscorpusCC0-1.0freenonecontinuous deposition tied to EMDB submissions; new entries released on author-set datesyesmcp-needed-low-valuePDB_IDNCBI_TAXON_ID
EnsemblOpen genome annotation project from EMBL-EBI covering 4,700+ genomes with REST API, BioMart, FTP downloads, and MySQL access.bio-genomicsknowledge-graphApache-2.0freenonequarterlyyesmcp-existsENSEMBL_IDGENE_SYMBOLUNIPROT_ACCESSION
eQTL CatalogueUniformly reprocessed QTL summary statistics (eQTL, sQTL, and other molecular traits) across ~74 cell types and tissues, all on GRCh38.bio-genomicsmixedCC-BY-4.0freenoneirregular major releases; r8 pre-release Jan 2026, prior stable r7 Jun 2024yesmcp-needed-low-valueRSIDENSEMBL_IDEFO_IDGENE_SYMBOLUBERON_ID
EU CTIS (Clinical Trials Information System)EU and EEA single-entry-point registry for clinical trials of human medicines under Regulation 536/2014, run by the European Medicines Agency in cooperation with member states and the European Commission.clinical-biotechregistryEU-Commission-Reuse-Decision-2011-833freenonecontinuousnomcp-needed-high-valueEU_CT_NUMBEREUDRACT_NUMBERWHO_UTN
Europe PMCOpen archive of life-sciences literature covering ~48M articles, preprints, and full-text open-access biomedical papers, mirrored from PubMed Central with European deposit and funder linkage.academiccorpusCC-BY-4.0freenoneweeklyyesmcp-existsnpx -y @futurelab-studio/latest-science-mcp@latestDOIPMIDPMCIDORCIDRORISSNMESH_TERMCHEMBL_IDUNIPROT_ACCESSIONENSEMBL_ID
European Nucleotide Archive (ENA)EMBL-EBI's open archive of nucleotide sequence and raw read data; the European INSDC node mirroring NCBI SRA and DDBJ.bio-genomicscorpusEMBL-EBI-Terms-of-Usefreenonecontinuousyesmcp-needed-high-valueNCBI_TAXON_ID
Event RegistryCommercial news intelligence platform that ingests 150K+ global news sources in 60+ languages, clusters articles into events, and enriches with Wikidata-linked entities, sentiment, and categories.news-eventsevent-streamproprietary-no-redistributionfreemiumapi-key-freecontinuousnomcp-needed-low-valueWIKIDATA_QIDISO_2URLDATE
Expression Atlas (EMBL-EBI)Curated, uniformly re-analyzed gene and protein expression data across species, covering baseline (TPM by tissue/condition) and differential (log2FC, FDR) experiments, with a separate Single Cell Expression Atlas.bio-genomicsreference-tableCC-BY-4.0freenoneperiodic curated releases (data re-quantified against fixed Ensembl/Ensembl Genomes/EFO versions per release)yesmcp-needed-low-valueENSEMBL_IDGENE_SYMBOLEFO_IDNCBI_TAXON_IDPMID
FDA Medical Device DatabasesUS FDA medical-device authorizations and post-market safety data via openFDA device endpoints and the CDRH databases, covering 510(k) clearances, PMA approvals, De Novo, classification, recalls/enforcement, MAUDE adverse events, and UDI.clinical-biotechregistryCC0freeapi-key-freeweekly for most device endpoints; CDRH web databases refreshed on similar cadenceyesmcp-exists
FDA Orange BookFDA's Approved Drug Products with Therapeutic Equivalence Evaluations, the canonical US source for approved drug products, therapeutic equivalence (TE) codes, listed patents, and marketing exclusivity periods.clinical-biotechregistryUS-Government-Public-Domainfreenonemonthlyyesmcp-needed-high-valueNDC
FDA Purple BookFDA database of all US-licensed biological products (including biosimilars and interchangeables) with BLA numbers, reference-product relationships, exclusivity dates, and patent listings.clinical-biotechregistryUS-Government-Public-Domainfreenonemonthlyyesmcp-needed-low-value
Finnhub Stock APIMulti-asset market data API covering global equities, FX, crypto, company fundamentals, estimates, alternative data, and economic indicators behind a single REST endpoint plus WebSocket streaming, with a generous free tier the provider markets as real-time.finance-marketstime-seriesFinnhub-Terms-of-Servicefreemiumapi-key-freereal-time on US equities (free) and global equities on premium; intraday and daily candlesnomcp-existsTICKERISINCUSIPFIGICIKLEIISO_3ISO_4217DATE
FRED (Federal Reserve Economic Data)Aggregated economic and financial time-series database (~800K series) maintained by the Federal Reserve Bank of St. Louis, covering US macro indicators plus international data from BIS, OECD, World Bank, and others.finance-marketstime-seriesFRED-Mixed-Source-Termsfreeapi-key-freevaries by series (daily, weekly, monthly, quarterly, annual)yesmcp-existsDATEISO_3
GDELT (Global Database of Events, Language, and Tone)Open real-time event database extracting people, organizations, locations, themes, and tone from global news media in 100+ languages.news-eventsevent-streamGDELT-Open-Datafreenoneevery 15 minutesyesmcp-needed-high-valueISO_3DATEURLWIKIDATA_QID
Gene OntologyCanonical open ontology of gene functions (molecular function, biological process, cellular component) plus curated GO annotations linking genes and gene products to GO terms across hundreds of species.bio-genomicsknowledge-graphCC-BY-4.0freenonemonthlyyesmcp-existsGO_IDUNIPROT_ACCESSIONENSEMBL_IDGENE_SYMBOLENTREZ_GENE_IDNCBI_TAXON_IDPMIDDOI
Global Biodiversity Information Facility (GBIF)International network and integrated infrastructure aggregating species occurrence records, taxonomic backbone, datasets, and publishing organisations from thousands of natural-history collections, surveys, and citizen-science feeds, each occurrence carrying lat/lon, eventDate, taxonomic keys, and Darwin Core fields.geospatialmixedCC-BY-4.0freenoneContinuous ingestion from publishers; species backbone rebuilt periodically (months); occurrence index updated as datasets are re-crawledyesmcp-existsNCBI_TAXON_IDWIKIDATA_QIDDOIORCIDISO_3ISO_2DATE
gnomAD (Genome Aggregation Database)Population-scale reference of human allele frequencies aggregated from hundreds of thousands of exomes and genomes, used to gauge how common a variant is in the general population.bio-genomicsreference-tableMITfreenoneapproximately annual major releases; v4.1 released 2024-04-19yesmcp-needed-high-valueRSIDENSEMBL_IDGENE_SYMBOL
Google TrendsGoogle's public search-interest explorer, exposing normalised relative-search-volume time series and geographic breakdowns for any query, topic, or category back to 2004.consumer-signaltime-seriesGoogle-Terms-Of-Servicefreenonedailynofragile-unofficialDATEISO_2ISO_3WIKIDATA_QID
GTEx (Genotype-Tissue Expression)Reference atlas of tissue-specific gene expression and genetic regulation (eQTL/sQTL) across ~54 human tissues from ~1000 post-mortem donors.bio-genomicspanelGTEx-Open-Accessfreenoneper-releaseyesmcp-needed-low-valueGENE_SYMBOLENSEMBL_ID
GWAS CatalogNHGRI-EBI curated catalogue of published genome-wide association studies, variant-trait associations, study metadata, and full summary statistics where available.bio-genomicsregistryEMBL-EBI-Terms-of-Usefreenoneapproximately weekly catalogue release; summary-statistics ingest is continuousyesmcp-existsPMIDDOIGENE_SYMBOLENSEMBL_IDEFO_ID
Hacker News APIOfficial Hacker News API exposing every story, comment, job post, Ask HN, Show HN, poll, and user profile as JSON via Firebase, with live update feeds and walk-the-entire-corpus access from item 1.consumer-signalcorpusMITfreenonereal-timenomcp-existsURLDATE
Harmonized System CodesHierarchical lookup of the World Customs Organization Harmonized System (HS) classification, mirrored as public-domain CSV on DataHub from the UN Comtrade nomenclature endpoint.government-open-datareference-tableODC-PDDL-1.0freenoneirregularyesmcp-needed-low-value
Human Protein AtlasOpen-access atlas of human protein expression across tissues, single cells, brain regions, cancers, and blood, integrating antibody-based imaging with transcriptomics and proteomics.bio-genomicsregistryCC-BY-4.0freenoneannual major release (currently v25.1, May 2026)yesmcp-needed-low-valueENSEMBL_IDUNIPROT_ACCESSIONGENE_SYMBOL
IBAN Country CodesHTML lookup table of ISO 3166-1 country codes (alpha-2, alpha-3, numeric) keyed by country name, republished by iban.com.government-open-datareference-tableiban-com-termsfreenoneirregularnomcp-needed-low-valueISO_2ISO_3
IBAN Currency CodesHTML lookup table of ISO 4217 currency codes (alphabetic and numeric) keyed by country/territory, republished by iban.com.government-open-datareference-tableiban-com-termsfreenoneirregularnomcp-needed-low-valueISO_4217
IGSR (International Genome Sample Resource)EMBL-EBI hosted registry of open-consent human genome samples and population variant calls from the 1000 Genomes Project and successor collections (HGSVC, MAGE, ONT, Illumina Platinum).bio-genomicsregistryEMBL-EBI-Terms-of-Usefreenonecontinuousyesmcp-existsDOIPMID
Image Data Resource (IDR)Public repository of curated, richly annotated reference bioimaging datasets from published scientific studies, served on OMERO with a session-based JSON API, search engine, and bulk download.bio-genomicscorpusCC-BY-4.0freeaccount-requiredrolling; new studies curated and added on an ongoing basisyesmcp-needed-low-valueNCBI_TAXON_IDGENE_SYMBOLPDB_ID
InterProIntegrated database of protein families, domains, and functional sites maintained by EMBL-EBI, unifying signatures from 13 member databases (Pfam, CATH-Gene3D, PANTHER, PROSITE, SMART, and others) into a single resource with REST API and FTP bulk downloads.bio-genomicsregistryCC0freenoneapproximately every 8 weeks (~2 months); release 109.0 June 2026yesmcp-needed-high-valueINTERPRO_IDUNIPROT_ACCESSIONPDB_IDGO_IDNCBI_TAXON_ID
KEGG (Kyoto Encyclopedia of Genes and Genomes)Integrated database of pathways, genes, compounds, reactions, diseases, and drugs maintained by Kanehisa Laboratories at Kyoto University.bio-genomicsknowledge-graphKEGG-Customfreemiumnonemonthlyyesmcp-existsGENE_SYMBOLENTREZ_GENE_IDUNIPROT_ACCESSIONENSEMBL_IDCHEBI_IDPMIDNCBI_TAXON_IDPDB_ID
MusicBrainz APICommunity-maintained open music metadata database for artists, releases, recordings, works, labels, and places, exposed via REST API and full PostgreSQL bulk dumps.consumer-signalknowledge-graphCC0free-non-commercialnoneweeklyyesmcp-existsISNIISO_2WIKIDATA_QIDURL
Nasdaq Data LinkNasdaq-operated marketplace and REST API delivering financial, economic, and alternative time-series and tables data (formerly Quandl), spanning free public-domain datasets and ~400+ premium vendor databases.finance-marketsmixednasdaq-data-link-mixed-vendor-termsfreemiumapi-key-freevaries by dataset (intraday, daily, weekly, monthly, quarterly, annual)yesmcp-existsTICKERISINCUSIPISO_3DATE
Nasdaq Trader Symbol DirectoryAuthoritative directory of Nasdaq-listed and other US exchange-listed securities published daily as pipe-delimited files by Nasdaq Trader.finance-marketsregistrynasdaqtrader-symbol-directory-no-published-termsfreenonemultiple times per trading dayyesmcp-needed-low-valueTICKER
NCBI GeneNIH-maintained reference database of gene-specific information across all sequenced species, with nomenclature, sequences, maps, pathways, and cross-references.bio-genomicsregistryUS-Government-Public-Domainfreeapi-key-freedailyyesmcp-existsGENE_SYMBOLENSEMBL_IDPMIDMESH_TERM
NCBI GEONIH/NCBI Gene Expression Omnibus, public repository of microarray, RNA-seq, and other high-throughput functional genomics data with 286k+ series and 8.5M+ samples.bio-genomicsregistryUS-Government-Public-Domainfreenonedailyyesmcp-existsPMIDGENE_SYMBOLMESH_TERM
NCBI ProteinNIH-maintained protein sequence database aggregating records from RefSeq, GenBank/EMBL/DDBJ translated CDS, SwissProt, PIR, PRF, and PDB into one Entrez-queryable surface.bio-genomicsregistryUS-Government-Public-Domainfreeapi-key-freedailyyesmcp-existsUNIPROT_ACCESSIONPDB_IDPMIDNCBI_TAXON_ID
NCBI SRA (Sequence Read Archive)Largest public repository of high-throughput raw sequencing reads and alignment metadata, run by NCBI/NLM/NIH.bio-genomicsregistryUS-Government-Public-Domainfreeapi-key-freecontinuousyesmcp-existsPMIDPMCIDDOIMESH_TERM
NCBI TaxonomyNCBI-curated classification and nomenclature for every organism represented in the public sequence databases, with a stable integer taxon ID linking nucleotide, protein, gene, assembly, BioSample, and PubMed records across NCBI and downstream resources.bio-genomicsknowledge-graphUS-Government-Public-Domainfreeapi-key-freedailyyesmcp-existsNCBI_TAXON_IDPMID
NCI Genomic Data Commons (GDC)NCI's unified repository of harmonised cancer genomic, clinical, and biospecimen data spanning TCGA, TARGET, CPTAC, MMRF, HCMI, and 30+ other programs, with a single REST API and bulk transfer tool.bio-genomicsregistryUS-Government-Public-Domainfreedars-or-equivalentcontinuousyesmcp-needed-high-valueENSEMBL_IDGENE_SYMBOLENTREZ_GENE_IDICD_10DOI
NewsAPICommercial REST API aggregating current and historic news articles from 150,000+ global publisher sources across 55 countries and 14 languages.news-eventsevent-streamproprietary-newsapi-tosfreemiumapi-key-freecontinuousnomcp-needed-low-valueURLDATE
NHGRI DNA Sequencing CostsNHGRI-tracked annual cost-per-genome and cost-per-megabase data for human DNA sequencing at NIH-funded genome sequencing centers, 2001 to 2022.bio-genomicstime-seriesUS-Government-Public-Domainfreenoneirregular; NHGRI states plans to update on a regular basis. Most recent table dated May 2022, page last updated May 2023.yesapi-direct-sufficientDATE
NHS Prescription Cost Analysis (PCA)NHSBSA monthly count of items dispensed and net ingredient cost for every drug, dressing, appliance, and medical device reimbursed by the NHS in community pharmacies in England, broken down by BNF code and NHS region.healthcare-claimstime-seriesOGL-3.0freenonemonthlyyesmcp-needed-low-valueBNF_CODEDATE
OECD Data ExplorerOECD's flagship data portal exposing 1,500+ statistical dataflows across 38 member countries via a public SDMX 2.1 REST API, with CSV/Excel downloads and a browsable web UI.government-open-datamixedOECD-Terms-and-Conditionsfreenonevaries by dataflow (annual, quarterly, monthly)yesmcp-existsISO_3ISO_2ISO_4217
Open TargetsOpen-access drug target identification and prioritisation platform integrating genetics, genomics, transcriptomics, drugs, animal models, and scientific literature for target-disease association scoring.clinical-biotechknowledge-graphCC0-1.0freenonequarterlyyesmcp-existsENSEMBL_IDGENE_SYMBOLUNIPROT_ACCESSIONCHEMBL_IDEFO_IDDOIPMID
Open-MeteoFree weather, climate, marine, air-quality, and historical-reanalysis API aggregating 30+ national weather models (ECMWF, NOAA GFS, DWD ICON, JMA, MeteoFrance) at 1-11 km global resolution, queried by latitude/longitude with no API key.geospatialtime-seriesCC-BY-4.0free-non-commercialnoneforecast updated hourly to 6-hourly depending on model; ERA5 reanalysis updated daily with ~5 day lag; historical-forecast archive continuous since 2021nomcp-existsISO_2DATE
OpenAlexOpen scholarly knowledge graph covering papers, authors, institutions, concepts, topics, and citations across all of academia.academicknowledge-graphCC0freenonedailyyesmcp-existsnpx -y openalex-research-mcp (set OPENALEX_EMAIL)npx -y @futurelab-studio/latest-science-mcp@latestDOIPMIDPMCIDARXIV_IDORCIDRORISSNWIKIDATA_QIDMAG_ID
OpenCorporatesCross-jurisdiction database of legal entities sourced from 140+ official company registries worldwide.corporate-registryregistryODbL-1.0freemiumapi-key-freevaries by jurisdiction (real-time to monthly, mirrors source registry cadence)yesmcp-needed-high-valueCOMPANIES_HOUSE_NUMBERCIKISO_2ISO_3DATEURLWIKIDATA_QID
openFDAUS FDA Elasticsearch-backed public API and bulk downloads covering drug adverse events, product labeling, NDC directory, recalls, device safety reports, and food enforcement.clinical-biotechmixedCC0freeapi-key-freevaries by endpoint; many refreshed weekly, some dailyyesmcp-existsNDCRXNORM_CUIMEDDRA_TERMNCT_ID
OpenFIGIBloomberg-operated open mapping service that resolves third-party security identifiers (ISIN, CUSIP, SEDOL, ticker) to the Financial Instrument Global Identifier (FIGI) and back.finance-marketsregistryPublic-Domain-Dedicationfreeapi-key-freecontinuousnomcp-existsFIGIISINCUSIPSEDOLTICKER
OpenStreetMapCrowdsourced global geographic database of roads, buildings, points of interest, boundaries, and natural features, with multiple read/write APIs and weekly planet dumps.geospatialgeospatialODbL-1.0freenonePlanet dump weekly (Fridays); daily/hourly/minutely diffs; Overpass minutely; Nominatim dailyyesmcp-needed-high-valueOSM_IDWIKIDATA_QIDWIKIPEDIA_ARTICLEISO_3ISO_2
ORCIDGlobal registry of persistent researcher identifiers linking people to their works, affiliations, funding, and peer review activity.academicregistryCC0freenoneannualyesmcp-existsORCIDDOIRORWIKIDATA_QID
PeptideAtlasCompendium of peptides and proteins identified from uniformly reprocessed public mass-spectrometry proteomics data, with protein-existence evidence and genome mappings; the Human build is the flagship.bio-genomicsknowledge-graphISB-PeptideAtlas-Termsfreenoneannual builds per organismyesmcp-needed-low-valueUNIPROT_ACCESSIONENSEMBL_IDGENE_SYMBOL
Personal Genome ProjectOpen-consent global network of participant-driven projects publishing whole-genome sequences, genotyping arrays, phenotype surveys, microbiome, and health records under a CC0-equivalent public-domain dedication.bio-genomicsregistryCC0-1.0freenoneirregularyesmcp-needed-low-valueURL
Polygon.ioMulti-asset market data API covering US stocks, options, indices, forex, crypto, and futures with REST, WebSocket, and S3 flat-file delivery.finance-marketstime-seriesPolygon-Terms-of-Servicefreemiumapi-key-freereal-time on paid tiers; 15-min delayed on free tieryesmcp-existsTICKERFIGIISINCUSIPCIKDATE
PredScope Prediction Market DataFree, no-auth REST API exposing live and resolved Polymarket prediction-market odds, volumes, and outcomes, refreshed every 10 minutes.finance-marketsevent-streampredscope-free-any-use-attribution-requestedfreenone10 minutesnomcp-needed-low-valueDATEURL
Proceedings of Machine Learning Research (PMLR)Open-access proceedings of machine learning conferences and workshops (ICML, AISTATS, CoRL, UAI, and many workshops), published as per-volume PDFs plus openly available BibTeX metadata without a conventional publisher.academiccorpusPMLR-Author-Copyrightfreenonerolling; a new volume is posted as each conference or workshop completesyesmcp-needed-low-valueURLISSN
Project Baseline Health StudyDeep-phenotyped longitudinal cohort of ~2,500 US adults followed 2017-2023 with annual clinical assessments, multi-organ imaging, continuous Verily Study Watch wearable data, labs, functional tests, and participant-reported outcomes. Access is request-gated through Verily's platform.public-healthpanelcontrolled-access-data-use-agreementfree-with-registrationaccount-requiredclosed cohort; enrolment and active follow-up ran 2017-2023, no new participant data addednorequires-scrapingNCT_ID
PubChemOpen chemistry database from NIH/NLM aggregating compounds, substances, and bioassays with dense cross-references to drug, target, and literature resources.clinical-biotechknowledge-graphUS-Government-Public-Domainfreenonecompound and substance records updated continuously; FTP snapshots refreshed regularly (weekly for most subsets)yesmcp-existsINCHI_KEYCHEMBL_IDDRUGBANK_IDCHEBI_IDUNIIMESH_TERMPMIDDOIUNIPROT_ACCESSIONGENE_SYMBOLENTREZ_GENE_IDNCBI_TAXON_IDNDCATC_CODERXNORM_CUIWIKIDATA_QID
PubMed (NCBI E-utilities)NCBI's biomedical literature database (~37M citations from MEDLINE, life-science journals, and online books), exposed via the Entrez E-utilities REST API and annual XML bulk releases.academiccorpusUS-Government-Public-Domainfreeapi-key-freedailyyesmcp-existsuvx --from biomcp-python biomcp runnpx -y @futurelab-studio/latest-science-mcp@latestPMIDPMCIDDOIMESH_TERMISSN
RCSB Protein Data BankPublic repository of experimentally-determined 3D structures of proteins, nucleic acids, and complexes plus integrative and computed structure models, run by the RCSB consortium as the US member of the worldwide PDB.bio-genomicsregistryCC0-1.0freenoneweekly release every Wednesday (00:00 UTC)yesmcp-existsPDB_IDUNIPROT_ACCESSIONDOIPMIDNCBI_TAXON_ID
ReactomeFree, open-source, curated, and peer-reviewed pathway database covering human biology with orthology projections to model organisms.bio-genomicsknowledge-graphCC0freenonequarterlyyesmcp-existsUNIPROT_ACCESSIONENSEMBL_IDGENE_SYMBOLCHEMBL_IDPMIDDOI
Reddit Data APIOfficial Reddit Data API exposing posts, comments, subreddits, users, and votes across the public Reddit network via OAuth2-authenticated REST endpoints.consumer-signalcorpusReddit-Data-API-Termsfreemiumoauthreal-timenomcp-existsURLWIKIDATA_QID
RfamEMBL-EBI database of RNA families, each represented by a curated seed alignment, consensus secondary structure, and covariance model used for genome annotation.bio-genomicsknowledge-graphCC0freenoneannual or semi-annual major releasesyesmcp-needed-low-valuePDB_IDMIRBASE_IDNCBI_TAXON_IDGO_IDPMID
S&P 500 Companies FinancialsStatic CSV roster of the 500 S&P 500 constituents with sector, ticker, SEC filings link, and a snapshot of common per-share valuation metrics.finance-marketsreference-tableODC-PDDL-1.0freenonead-hoc (script-triggered; constituents refreshed Feb 2026, financials refreshed Jun 2026)yesapi-direct-sufficientTICKER
SEC EDGARUS SEC system of all corporate filings (10-K, 10-Q, 8-K, S-1, 13F, etc.) with company metadata and structured XBRL financial facts.corporate-registrymixedUS-Government-Public-Domainfreenonereal-time for filings dissemination; nightly bulk ZIP refresh (~3am ET)yesmcp-existsCIKTICKERDATEURL
Semantic ScholarAi2's scholarly knowledge graph covering 214M papers, 79M authors, and 2.49B citations across all disciplines, with paper-recommendation and bulk-snapshot endpoints.academicknowledge-graphCC-BY-NC-4.0free-non-commercialapi-key-freeweeklyyesmcp-existsDOIPMIDPMCIDARXIV_ID
Smithsonian Open AccessSmithsonian Institution's open metadata and media for 14M+ collection records across 19 museums, 9 research centers, libraries, archives, and the National Zoo, all CC0.government-open-datacorpusCC0-1.0freeapi-key-freeweeklyyesmcp-needed-low-valueWIKIDATA_QIDURL
Stack Exchange Data ExplorerWeb-based T-SQL query interface over a weekly read-only snapshot of every public Stack Exchange site (Stack Overflow plus 170+ communities), backed by the same data published as monthly Internet Archive dumps.consumer-signalcorpusCC-BY-SA-4.0freeaccount-requiredweeklyyesmcp-existsURLDATEWIKIDATA_QID
STRINGDatabase of known and predicted protein-protein interactions across ~12,500 organisms, covering ~59M proteins and 20B+ functional associations from experiments, co-expression, text-mining, and genomic context.bio-genomicsknowledge-graphCC-BY-4.0freenonemajor version every 1-2 years (v12.0 current)yesmcp-existsUNIPROT_ACCESSIONENSEMBL_IDGENE_SYMBOL
The Cancer Genome Atlas (TCGA, via Broad GDAC Firehose)Pan-cancer multi-omic atlas of ~11,000 tumour samples across 33 cancer types, processed and distributed by the Broad Institute GDAC Firehose pipeline as standardised level-3 datasets and analysis runs.bio-genomicsmixedTCGA-Data-Usefreenonefrozen; final stddata run 2016-01-28, final analyses run 2016-01-28 (released 2017), CPTAC AWG runs through 2020yesmcp-needed-high-valueGENE_SYMBOLENSEMBL_IDENTREZ_GENE_IDDOIPMID
Trading Economics CommoditiesLive and historical prices, forecasts, and metadata for ~100 commodities (energy, metals, agricultural, industrial, livestock, electricity, carbon) accessed via the Trading Economics markets API and web tables.finance-marketstime-seriesTrading-Economics-Proprietarypaidapi-key-paidintraday quotes during market hours; end-of-day snapshots; historical series go back decades for major contractsnomcp-existsDATEISO_4217
UCSC Genome BrowserUCSC Genomics Institute's reference genome browser with assemblies, annotation tracks, and pairwise/multiple alignments for human, mouse, and thousands of other species, exposed via REST API, public MySQL, FTP, and Table Browser.bio-genomicsmixedMITfreenonetrack-by-track; most reference tracks refresh on a months cadence, MySQL servers sync weekly Monday mornings Pacificyesmcp-existsGENE_SYMBOLENSEMBL_IDRSIDENTREZ_GENE_IDDOI
UK BiobankDeep-phenotyped longitudinal cohort of ~500,000 UK adults with whole-genome sequencing, imaging, biomarkers, lifestyle questionnaires, and linked NHS hospital, cancer, death, and primary-care records. Access is by approved application only.public-healthpanelUKB-Material-Transfer-Agreementpaiddars-or-equivalentrolling additions; major data refreshes (imaging, sequencing, linked EHR) on a multi-month cycleyesmcp-needed-low-valueICD_10OPCS_4BNF_CODEGENE_SYMBOLENSEMBL_IDRSID
UN/LOCODE (United Nations Code for Trade and Transport Locations)UNECE-maintained five-character code list identifying ports, airports, rail terminals, inland depots, and border crossings worldwide, used in trade and transport documentation.government-open-datareference-tableODC-PDDL-1.0freenonebiannual (July and December releases)yesmcp-needed-low-valueISO_2ISO_3
UniProtComprehensive, freely accessible protein sequence and functional annotation resource maintained by the UniProt Consortium (EMBL-EBI, SIB, PIR), with REST API, FTP bulk downloads, and rich cross-references to genomic, structural, and clinical databases.bio-genomicsregistryCC-BY-4.0freenone8-week release cycleyesmcp-existsUNIPROT_ACCESSIONGENE_SYMBOLENSEMBL_IDPMIDDOICHEMBL_ID
UnpaywallOpen database of free, legal open-access (OA) full-text locations for scholarly articles, keyed by DOI.academicregistryCC0freemiumnonecontinuous (API); periodic JSONL snapshotyesmcp-needed-low-valueDOIISSN
USPTO Open Data Portal (Patents)US Patent and Trademark Office's Open Data Portal, the successor to the PatentsView Search API and PEDS, covering all granted US patents and published applications with disambiguated assignee/inventor data, CPC/USPC classification, citations, and full prosecution (Patent File Wrapper) records.government-open-dataregistryUS-Government-Public-Domainfree-with-registrationapi-key-freedaily refresh for Patent File Wrapper; granted patents weekly (Tuesday grant cycle)yesmcp-needed-high-value
VC Deal Flow SignalQuarterly longitudinal panel of GitHub engineering-velocity metrics across ~55 venture-backed startups and ~20 sectors, hosted on Hugging Face as three CSV/Parquet configurations.finance-marketspanelCC-BY-4.0freenonequarterlyyesmcp-existsDATE
Visa Onchain Analytics DashboardFree public web dashboard, built by Visa with Allium Labs, charting fiat-backed stablecoin supply, transfers, addresses, and lending across 19+ public blockchains.finance-marketspanelvisa-onchain-dashboard-proprietary-display-onlyfreenonedaily (per Allium upstream aggregation; not stated on dashboard itself)norequires-scrapingDATEISO_4217
WHO FluNetWHO global influenza surveillance combining weekly reports from ~150 National Influenza Centres into a unified view of virological detections, subtype distribution, and positivity rates.public-healthtime-seriesCC-BY-NC-SA-3.0-IGOfreenoneweeklyyesmcp-needed-low-valueISO_3DATE
Wikidata Query ServicePublic SPARQL endpoint over Wikidata's CC0 knowledge graph of 120M+ items, hubbing identifiers from DOI and ORCID to ISBN, NCT_ID, CHEMBL_ID, OSM_ID and most other cross-domain keys.government-open-dataknowledge-graphCC0freenonelive mirror of Wikidata edits; full RDF dumps weeklyyesmcp-existsWIKIDATA_QIDWIKIPEDIA_ARTICLEDOIPMIDPMCIDARXIV_IDORCIDISNIRORISSNISBNNCT_IDCHEMBL_IDDRUGBANK_IDUNIIATC_CODEUNIPROT_ACCESSIONENSEMBL_IDENTREZ_GENE_IDNCBI_TAXON_IDCIKLEIISO_3ISO_2OSM_IDMESH_TERM
Wikipedia Pageviews APIWikimedia REST endpoints exposing per-article and aggregate pageview counts for all Wikimedia projects, daily/monthly granularity, back to July 2015.consumer-signaltime-seriesCC0freenonedailyyesmcp-needed-high-valueURLDATEISO_3WIKIDATA_QID
World Bank Commodity Prices (Pink Sheet)World Bank's monthly Pink Sheet of global commodity price benchmarks across energy, agriculture, fertilizers, metals, and precious metals, published as Excel workbooks with full monthly history back to 1960 and annual aggregates.finance-marketstime-seriesCC-BY-4.0freenonemonthlyyesmcp-needed-low-valueDATEISO_4217
World Bank Open DataWorld Bank's free catalogue of global development statistics covering 16,000+ indicators across 200+ economies, with REST API, bulk downloads, and an official MCP for the newer Data360 platform.government-open-datamixedCC-BY-4.0freenonevaries by indicator (annual, quarterly, monthly)yesmcp-existsISO_2ISO_3
X (Twitter) APIReal-time and full-archive access to X (Twitter) posts, users, search, trends, and news stories via the v2 REST API, exposed to agents through X's official hosted MCP server.consumer-signalevent-streamX-Developer-Agreementpaidoauthreal-timenomcp-existsURLISO_639_1
yfinanceUnofficial Python wrapper around Yahoo Finance's undocumented JSON endpoints for historical OHLCV prices, fundamentals, options chains, dividends, splits, and corporate actions.finance-marketstime-seriesApache-2.0freenonecontinuousnofragile-unofficialTICKERISIN