1. Check /models only
Start with the lowest-risk reachability check. A successful response proves that the base URL, key loading, and account access are usable before any prompt content is sent.
Endpoint preflight
Validate the base URL, key handling, model discovery, selected model ID, and one non-streaming chat request before pointing an SDK, agent host, proxy, or UI at TKEN. Start small, save redacted evidence, then move into streaming, agents, or team traffic.
$env:TKEN_API_KEY="sk-your-tken-key"
$env:TKEN_BASE_URL="https://www.tken.shop/v1"
node tools/endpoint-tester.mjs `
--base-url $env:TKEN_BASE_URL `
--api-key-env TKEN_API_KEY `
--skip-chat `
--json
# Then choose a model returned by /models.
A useful preflight separates configuration errors from provider availability, quota, and app-level bugs. Keep the first request set small enough that a failure is easy to classify.
Start with the lowest-risk reachability check. A successful response proves that the base URL, key loading, and account access are usable before any prompt content is sent.
Do not copy a model name from marketing copy, another account, or a provider page. Model IDs can vary by account, channel, provider status, and release timing.
Use one small, non-sensitive prompt. Confirm the HTTP status, response shape, timeout, and short preview before testing streaming, JSON output, tool calls, embeddings, or agent loops.
Store the base URL, env var name, status codes, model count, selected model, timeout, and failure class. Do not store keys, account identifiers, order details, usage records, private prompts, or provider account screens.
Classify the first failure before changing app code. Most rollout problems come from a small set of endpoint, account, quota, model, timeout, or response-shape issues.
/models succeeds with the deployment key, the selected model ID was returned by /models, and one non-streaming chat completion succeeds with a non-sensitive prompt.
The app has request timeouts, retry caps, token caps, error logging, spend limits, owner alerts, and a documented rollback route before production traffic grows.
Logs, screenshots, issue reports, and handoff docs redact keys, account IDs, private prompts, usage records, provider account screens, and full sensitive response bodies.
Confirm that the base URL ends in /v1, the key is loaded from an environment variable, /models returns IDs, the selected model came from /models, and one small non-streaming chat request succeeds.
It prevents stale model names and account-specific availability mismatches. If /models does not return an ID, do not assume that ID is safe for the rollout.
Save status codes, model count, a few sample IDs, selected model, latency, timeout, and a short redacted preview. Do not save keys, account IDs, usage records, private prompts, or provider screens.
Use the GitHub playbook for command details, expected JSON shape, failure classes, and a safe evidence template. Read the endpoint preflight playbook.
Create a TKEN key, set https://www.tken.shop/v1 as the base URL, verify /models, then run one non-streaming chat before connecting real tools, agents, or team traffic.