Agent-Native Flux APIs

Every Flux API you create on FoxNose is agent-native by default. The same URL prefix that serves your REST endpoints also exposes a Model Context Protocol server, a route catalog, and per-collection JSON Schema — discoverable at runtime, gated by the same permissions, scoped to the connected collections. No SDK to install. No glue layer to write. No second tool surface to maintain.

This page explains what "agent-native" means in practice, how the three introspection endpoints fit together, when to use MCP vs. a hand-written integration, and how to control which APIs are visible to agents.

What "agent-native" means here

Most platforms that integrate with AI agents take one of two paths:

SDK-based. You install a client library, instantiate a class, write retriever or tool wrappers around the SDK's methods, and ship them to your agent framework. Every new tool or schema change means another code update on the consumer side.
Custom MCP server. You build a small MCP server in front of an existing API. You define the tools, wire each to an HTTP call, handle auth, ship a binary, and keep the schema in sync as the upstream API evolves.

FoxNose collapses both into the Flux API itself. Every prefix already has:

https://{environment_key}.fxns.io/{api_prefix}/_mcp        # MCP server (Streamable HTTP)
https://{environment_key}.fxns.io/{api_prefix}/_router     # Flat route catalog
https://{environment_key}.fxns.io/{api_prefix}/{collection}/_schema   # JSON Schema per folder

…sitting next to the data routes (/{collection}, /{collection}/{key}, /{collection}/_search). Nothing to deploy, nothing to keep in sync — the introspection endpoints read the same configuration the REST endpoints already use.

The three introspection endpoints

Endpoint	Path	Purpose	Reference
MCP Server	`GET\|POST\|DELETE /{api_prefix}/_mcp`	Per-API Model Context Protocol server over Streamable HTTP. Publishes a fixed five-tool catalog — `discover_resources`, `describe_resource`, `get_record`, `query_records`, `search_records` — usable directly by Claude, ChatGPT, OpenAI Responses, Cursor, and any MCP-aware client.	MCP Server →
Router Introspection	`GET /{api_prefix}/_router`	Flat list of every reachable REST route under this prefix, with method, path, action (`get_many` / `get_one` / `search` / `schema`), scope (`collection` / `item`), path params, query params, request body contract, and response shape.	Router Introspection →
Schema Introspection	`GET /{api_prefix}/{collection_path}/_schema`	Live JSON Schema and route metadata for a connected collection, including `searchable_fields` and configured locales.	Schema Introspection →

The three are designed to compose: an agent can call /_router to discover what routes exist, /{collection}/_schema to learn what each one expects, and /_mcp to invoke them with proper input validation — or short-circuit the whole thing by using the MCP server, which already wraps all of the above into tools.

What makes this agent-native, not just MCP-enabled

Plenty of platforms now bolt MCP onto an existing API surface. The differences worth knowing about — what makes FoxNose specifically built for agents rather than retrofitted for them:

Hybrid search arrives as one tool. The search_records tool wraps FoxNose's hybrid retrieval — vector similarity, full-text matching, and structured filters in a single call. The agent gets state-of-the-art retrieval without you choosing modes or writing fan-out logic between separate search backends. A typical hand-rolled tool surface would expose three or four narrower search tools and let the model pick poorly between them; here, one tool does the right thing.
Constant tool count, regardless of content size. The five tools (discover_resources, describe_resource, get_record, query_records, search_records) work across every connected collection via a resource_id argument. Adding a tenth collection, or a hundredth, doesn't grow the catalog or confuse the model — it expands what the existing tools can reach. Hand-rolled MCP servers tend to inflate one-tool-per-resource and overflow the model's tool budget.
Schema-driven tool inputs, no drift. The describe_resource tool returns the live JSON Schema of any collection — including which fields are searchable, which are vectorized, and which locales are configured. The agent can validate its tool calls against the current schema before issuing them. Hand-written MCP servers maintain a separate copy of the schema in the tool definitions and drift the moment the upstream schema changes.
Localization-aware. Every collection's schema exposes its configured locales. An agent that talks to a multilingual knowledge base can ask describe_resource once, then call search_records/get_record with the right locales argument — no separate "EN tool" and "FR tool". This is unusual; most agent tool surfaces treat localization as the application's problem.
Versioning-aware. FoxNose supports schema versioning. You can pin an agent to a specific version of the schema, migrate the underlying schema, and only roll the agent forward when its prompt and reasoning are ready for the new shape. No more "schema change broke production agents at 3am" surprises.
One source of truth for permissions. The agent inherits exactly the access rights of its Flux API key. There is no separate "allowed tools" list to keep in sync, no second role system, no chance an agent sees something the equivalent REST request would be denied (see below).

These six properties are what we mean by "agent-native": the system is designed so that adding agents doesn't add a parallel stack to maintain.

Same auth, same permissions

There is no second access-control surface for agents. The introspection endpoints inherit the existing Flux API model:

is_auth_required on the API row controls whether a Flux API key is required for any traffic — including /_mcp, /_router, and /_schema. Public APIs accept anonymous traffic everywhere; private APIs require the Authorization: Secure <access_key>:<signature> header everywhere.
allowed_methods on the per-collection connection (get_one, get_many) controls which MCP tools the agent gets per collection. A collection connected with only get_many exposes query_records and search_records but not get_record — same as the REST surface.
Flux API key roles apply the same way. A read-only key on a private API works for all five MCP tools; a key without read on a specific collection can't see that collection via MCP either.

If a collection is read-only via REST, it is read-only via MCP. If a collection is invisible to a given key via REST, it is invisible via MCP. The model is the same model — that's the whole point.

Governing agent visibility per API

Not every knowledge surface should be discoverable by autonomous agents. Some content has legal or contractual constraints on automated access. Some prefixes are partner-only and shouldn't appear in any agent's tool catalog. Some you want to roll out to agents on staging first and only enable on production after validation. The Flux API gives you two governance toggles per prefix, set independently:

mcp_enabled (default true) — when false, /{api_prefix}/_mcp returns 404. The MCP tool surface for this prefix is invisible to all clients.
router_introspection_enabled (default true) — when false, /{api_prefix}/_router returns 404. The route catalog is invisible — agents cannot enumerate what's reachable under this prefix.

These are policy switches, not feature flags. They are independent: you can keep router introspection on while disabling the MCP server (e.g. for internal-only API surfaces that benefit from runtime discovery by your own apps but shouldn't be reachable to external agents). The data routes (/{collection}, /{collection}/{key}, /{collection}/_search) are never affected — REST consumers continue to work without disruption.

Toggle either flag through PATCH /v1/{env}/api/{api}/:

curl -X PATCH https://api.foxnose.net/v1/<ENVIRONMENT_KEY>/api/<API_KEY>/ \
  -H "Authorization: Bearer <JWT>" \
  -H "Content-Type: application/json" \
  -d '{"mcp_enabled": false, "router_introspection_enabled": false}'

Typical governance scenarios:

Scenario	Configuration
Phased rollout — validate agent integration on staging before exposing production	Set both to `false` on `prod-*` prefixes initially; flip to `true` once staging validates
Partner-only API — REST consumers are trusted partners under contract; agent discovery would broaden the audience inappropriately	`mcp_enabled: false`, `router_introspection_enabled: false`; partners use the documented REST routes
Compliance-restricted content — regulated knowledge surfaces that may not be exposed to autonomous LLMs	`mcp_enabled: false`; keep router introspection for internal tooling if needed
Internal-only API — used by your own apps for runtime discovery, but never by external agents	`router_introspection_enabled: true`, `mcp_enabled: false`

Schema introspection (/_schema) is not yet covered by a separate toggle — it's bound to the collection connection itself. If you want it hidden, don't connect the collection to that API.

When to use MCP vs. a hand-written integration

The MCP server is the lowest-effort path, but it isn't always the right one. A quick decision matrix:

Situation	Recommended approach
End user wants to talk to your content through Claude / ChatGPT / Cursor / any MCP client	MCP server. Zero code on your side. See Connect Claude and Connect ChatGPT and OpenAI.
You're building a custom agent on OpenAI's Responses API or Claude's API and want it to use FoxNose tools natively	MCP server via API. Pass `tools: [{type: "mcp", server_url: ".../_mcp", ...}]` (OpenAI) or `mcp_servers` (Claude). All five tools light up.
You're building a RAG pipeline where the LLM doesn't directly choose tools — your app orchestrates retrieval and prompt assembly	Direct API calls or the LangChain retriever. You control the search mode, ranking, and context formatting precisely.
You need a fine-grained, app-specific tool surface (e.g., a single tool that wraps three FoxNose calls and post-processes the result)	Custom tool wrapper around the SDK. Pattern 3 in LLM Integrations.
Your client speaks only stdio MCP (no HTTP transport)	MCP server + `mcp-proxy` as a bridge.
You're locked to OpenAI's Assistants API (not Responses API), which doesn't yet support MCP	Custom tool wrapper. Assistants don't speak MCP; wrap FoxNose calls as Assistants tools manually.

In practice most teams end up using both: MCP for the user-facing assistant experience, direct SDK calls inside the app-specific orchestration layer. The two are not mutually exclusive — the same Flux API serves both.

Client compatibility

Client	Transport	Works with FoxNose	Notes
Claude Code (CLI agent)	Streamable HTTP	✅ Direct	`claude mcp add --transport http --header "Authorization: Secure ..."`
Claude Desktop	stdio	✅ Public direct; private via `mcp-proxy`	Stdio-only client today; needs a small bridge for HTTP transports
Claude API (`mcp_servers`)	Streamable HTTP	✅ Direct	Any auth header passed through
OpenAI Responses API (`tools: [{type: "mcp"}]`)	Streamable HTTP	✅ Direct	Any auth header passed through
ChatGPT Custom Connectors (Plus / Team / Enterprise)	Streamable HTTP	✅ Public; ⚠️ Private requires OAuth	UI expects OAuth 2; FoxNose uses `Authorization: Secure` today. Use the Responses API, an OAuth proxy, or wait for native OAuth
Cursor, Cline, Continue, Codex CLI, Goose, Zed, others	Mixed	✅ Generally direct	Most modern MCP-aware clients accept HTTP + custom headers — config is similar in shape across clients

The MCP protocol version FoxNose targets is Streamable HTTP 2025-03-26 — see the MCP Server reference for protocol-level details.

What's next?

Connect your first client.

Connect Claude Connect ChatGPT and OpenAI

Read the protocol-level reference.

MCP Server Reference Router Introspection

Manage the per-API toggles.

Flux API Object reference

Understand rate limits and retrieval billing. Every MCP tools/call counts as one retrieval, and each Flux API enforces per-plan rate limits on agent traffic.

Flux API limits