Should I retry every failed API request?

No. Retry transient errors such as rate limits, timeouts and selected 5xx responses with backoff. Do not blindly replay non-idempotent business actions.

Can I use fallback models with TKEN?

Yes, if the fallback model is available in your TKEN account and your application accepts differences in latency, cost and output behavior.

Is TKEN an official provider router?

No. TKEN is an independent third-party OpenAI-compatible gateway. Test fallback behavior before production use.

Reliability pattern

Add retries and fallback routes to TKEN calls

Use bounded exponential backoff for transient failures and a deliberate fallback model list when your application can tolerate route changes.

Get API key Router config

JavaScript bounded retry

const retryable = new Set([408, 409, 429, 500, 502, 503, 504]);

async function withBackoff(run, attempts = 4) {
  let lastError;
  for (let i = 0; i < attempts; i += 1) {
    try {
      return await run();
    } catch (error) {
      lastError = error;
      const status = error.status ?? error.response?.status;
      if (!retryable.has(status) || i === attempts - 1) throw error;
      const delay = Math.min(8000, 500 * 2 ** i) + Math.random() * 250;
      await new Promise(resolve => setTimeout(resolve, delay));
    }
  }
  throw lastError;
}

Retry only transient failures

Use model fallbacks deliberately

Record non-sensitive error metadata

Fallback model loop

Keep the fallback list short and explicit. A lower-cost or faster fallback can be useful, but it may change output quality, supported features or latency.

TKEN base URL fallback candidates

const models = [
  process.env.TKEN_PRIMARY_MODEL,
  process.env.TKEN_FALLBACK_MODEL
].filter(Boolean);

for (const model of models) {
  try {
    const response = await withBackoff(() =>
      client.chat.completions.create({
        model,
        messages: [{ role: "user", content: "Summarize this request." }]
      })
    );
    return response.choices[0].message.content;
  } catch (error) {
    console.warn("route failed", { model, status: error.status });
  }
}

Reliability checklist

1. Cap retry attempts

Retries consume request budget and can make overload worse. Set a small maximum attempt count and use jittered backoff.

2. Separate user actions from API retries

Do not replay checkout, account or database writes just because a model request failed. Retry only the model call.

3. Monitor fallback quality

Track fallback frequency, response time, error class and user satisfaction before increasing fallback traffic.

Disclosure: TKEN is an independent third-party API gateway, not an official endpoint for OpenAI, LiteLLM or other model providers.

Make model calls resilient

Add bounded retries to https://www.tken.shop/v1 calls, then test fallback behavior with real prompts before production routing.

Create account Streaming setup