Tokonomics API Documentation

Tokonomics is a drop-in LLM cost metering proxy that tracks token usage and costs across 60+ models from 9 providers. It adds ~31ms overhead per request (3.6%), supports streaming, and works with any HTTP client in any language. Over 37 models have sub-cent per-thousand-token pricing synced weekly from official provider pages.

Last updated: July 2026

5-minute setup: Get your API key from the dashboard, change your base URL, and you're tracking costs instantly.

Quick Start

Tokonomics requires exactly one code change to start tracking costs: replace your provider's base URL with the Tokonomics proxy URL. Setup takes under 5 minutes and works with cURL, Python, JavaScript, Go, and any HTTP client.

bash

# Before: direct to OpenAI
# curl https://api.openai.com/v1/chat/completions

# After: through Tokonomics (one URL change)
curl https://tokonomics.ca/proxy/openai/chat/completions \
  -H "Authorization: Bearer mk_your_key_here" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

python

from openai import OpenAI

client = OpenAI(
    api_key="mk_your_key_here",
    base_url="https://tokonomics.ca/proxy/openai"  # only change
)

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

javascript

import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: 'mk_your_key_here',
  baseURL: 'https://tokonomics.ca/proxy/openai', // only change
});

const response = await client.chat.completions.create({
  model: 'gpt-4o',
  messages: [{ role: 'user', content: 'Hello!' }],
});
console.log(response.choices[0].message.content);

go

package main

import (
    "bytes"
    "encoding/json"
    "fmt"
    "net/http"
)

func main() {
    body, _ := json.Marshal(map[string]any{
        "model": "gpt-4o",
        "messages": []map[string]string{{"role": "user", "content": "Hello!"}},
    })
    req, _ := http.NewRequest("POST",
        "https://tokonomics.ca/proxy/openai/chat/completions",
        bytes.NewBuffer(body))
    req.Header.Set("Authorization", "Bearer mk_your_key_here")
    req.Header.Set("Content-Type", "application/json")
    resp, _ := http.DefaultClient.Do(req)
    defer resp.Body.Close()
    fmt.Println("Status:", resp.Status)
}

Authentication

Tokonomics authenticates every request using SHA-256 hashed API keys with the mk_ prefix format. Keys are never stored in plaintext. Pass your key as a Bearer token in the Authorization header.

http

Authorization: Bearer mk_your_key_here

Property	Details
`Format`	Prefix `mk_` followed by 48 hex characters
`Storage`	Only the SHA-256 hash is stored
`Scope`	Tied to your tenant
`Management`	Dashboard → API Keys

Your Tokonomics key (mk_...) is different from your provider key (sk-...). The Tokonomics key authenticates you to the proxy. Your provider key is stored encrypted in Provider Keys.

Base URL

text

https://tokonomics.ca

All endpoints are relative to this base URL. The proxy path follows the pattern:

text

https://tokonomics.ca/proxy/{provider}/{upstream-path}

Proxy Endpoint

POST /proxy/{provider}/{path}

Tokonomics forwards your request to the upstream LLM provider with ~31ms median overhead, records token usage with DECIMAL(12,8) precision, calculates cost using rates synced from 9 providers, and checks budget alerts. Responses are streamed back in real-time without buffering.

Path Parameters

Parameter	Required	Description
`provider`	Required	Provider slug: `openai`, `anthropic`, `deepseek`, `gemini`, `mistral`, `groq`, `together`, `xai`, `cohere`
`path`	Required	Upstream API path, e.g. `chat/completions` or `messages`

Headers

Header	Required	Description
`Authorization`	Required	Your Tokonomics key: `Bearer mk_...`
`Content-Type`	Required	`application/json`
`X-Metering-Tags`	Optional	JSON object for cost attribution, e.g. `{"team":"growth","feature":"chatbot"}`
`X-Provider-Key`	Optional	Override the stored provider key for this request only

Request Body

Pass the exact same body you would send directly to the provider. Tokonomics forwards it unchanged.

Response

The response is streamed back transparently. Tokonomics does not buffer or modify the response body.

Supported Providers

Use the provider slug in the proxy path. The upstream path mirrors the provider's own API.

OpenAI/proxy/openai/chat/completions

Anthropic/proxy/anthropic/messages

DeepSeek/proxy/deepseek/chat/completions

Gemini/proxy/gemini/...

Mistral/proxy/mistral/chat/completions

Groq/proxy/groq/chat/completions

Together AI/proxy/together/chat/completions

xAI (Grok)/proxy/xai/chat/completions

Cohere/proxy/cohere/chat

Any OpenAI-compatible API works out of the box via the openai provider slug.

Tagging Calls

Add the X-Metering-Tags header to attribute costs to specific teams, features, users, or environments. Tags are stored with every usage event and queryable via analytics.

bash

curl https://tokonomics.ca/proxy/openai/chat/completions \
  -H "Authorization: Bearer mk_your_key_here" \
  -H "Content-Type: application/json" \
  -H 'X-Metering-Tags: {"team":"growth","feature":"chatbot","env":"production"}' \
  -d '{"model":"gpt-4o","messages":[{"role":"user","content":"Hello!"}]}'

Common Tag Patterns

Pattern	Example	Use case
By feature	`{"feature":"support-bot"}`	Which feature costs the most?
By team	`{"team":"growth"}`	Per-team budget allocation
By tenant	`{"tenant":"acme-corp"}`	Multi-tenant SaaS cost isolation
By environment	`{"env":"staging"}`	Separate dev/prod costs
By user	`{"user_id":"usr_123"}`	Per-user cost tracking

Bring Your Own Key (BYOK)

Tokonomics never calls providers on your behalf. You supply your own API keys.

Option 1: Store in dashboard (recommended)

Go to Provider Keys and save your provider API key once. It is stored encrypted with AES-256.

Option 2: Per-request header

bash

curl https://tokonomics.ca/proxy/openai/chat/completions \
  -H "Authorization: Bearer mk_your_key_here" \
  -H "X-Provider-Key: sk-your-openai-key" \
  -H "Content-Type: application/json" \
  -d '{"model":"gpt-4o","messages":[{"role":"user","content":"Hello!"}]}'

Key resolution priority

Priority	Source
1 (highest)	`X-Provider-Key` header (per-request)
2	Stored key in dashboard (encrypted AES-256)
3	Error: no key found

Analytics: Summary

GET /analytics/summary

Returns current month spend, budget utilization percentage, and a per-model cost breakdown.

bash

curl https://tokonomics.ca/analytics/summary \
  -H "Authorization: Bearer mk_your_key_here"

json

{
  "current_spend_usd": 12.4831,
  "monthly_budget_usd": 100.00,
  "budget_used_pct": 12.48,
  "by_model": [
    { "model": "gpt-4o", "cost_usd": 9.21, "calls": 142 },
    { "model": "deepseek-chat", "cost_usd": 3.27, "calls": 891 }
  ]
}

Analytics: Daily Spend

GET /analytics/daily?days=30

Returns daily spend for the last N days. Maximum 90 days.

Parameter	Type	Default	Description
`days`	integer	`30`	Number of days to return. Max 90.

json

{
  "daily": [
    { "date": "2026-06-01", "cost_usd": 1.2340, "calls": 48 },
    { "date": "2026-06-02", "cost_usd": 0.8821, "calls": 31 }
  ]
}

Analytics: By Tag

GET /analytics/by-tag?key=team&period=month

Returns spend grouped by a custom tag key.

Parameter	Type	Required	Description
`key`	string	Required	Tag key to group by
`period`	string	Optional	`month` (default) or `all`

json

{
  "tag_key": "team",
  "period": "month",
  "breakdown": [
    { "value": "growth", "cost_usd": 8.44, "calls": 312 },
    { "value": "support", "cost_usd": 4.00, "calls": 189 }
  ]
}

Analytics: Events

GET /analytics/events?limit=50&offset=0

Returns paginated raw usage events. Maximum limit of 200 per request.

Parameter	Type	Default	Description
`limit`	integer	`50`	Number of events. Max 200.
`offset`	integer	`0`	Pagination offset.

json

{
  "events": [
    {
      "id": "uuid",
      "model": "gpt-4o",
      "provider": "openai",
      "input_tokens": 512,
      "output_tokens": 128,
      "cost_usd": 0.00384,
      "latency_ms": 342,
      "tags": { "team": "growth", "feature": "chatbot" },
      "created_at": "2026-06-05T14:32:00Z"
    }
  ],
  "total": 1042,
  "limit": 50,
  "offset": 0
}

Errors

All errors return a JSON body with an error field and an appropriate HTTP status code.

Status	Code	Cause
`401`	Unauthorized	Missing or invalid API key
`400`	Bad Request	Malformed request body
`500`	Proxy Error	Internal proxy failure
`502`	Provider Error	Upstream provider returned an error

json

{ "error": "Invalid or inactive API key" }

Rate Limits

Tokonomics enforces sliding-window rate limits via Redis. The Free plan allows 10 requests per minute; Pro allows 60. Rate limiting uses fail-open design.

Per-minute rate limits

Plan	Requests/minute	Monthly cap
Free	10	100 calls/month
Pro	60	1M calls/month (fair use)

Response headers

Header	Description
`X-RateLimit-Limit`	Max requests per minute for your plan
`X-RateLimit-Remaining`	Requests remaining in current window
`X-RateLimit-Reset`	Unix timestamp when the window resets
`Retry-After`	Seconds to wait (only on 429 responses)

When limits are exceeded

Requests return 429 Too Many Requests with a Retry-After header.

json

{
  "error": "Rate limit exceeded",
  "detail": "You have exceeded 10 requests/minute. Free plan limit.",
  "retry_after": 12
}

Health Check

GET /health — No authentication required.

json

{ "status": "ok", "ts": 1749139200 }

Tokonomics API Documentation

Quick Start

Authentication

Base URL

Proxy Endpoint

Path Parameters

Headers

Request Body

Response

Supported Providers

Tagging Calls

Common Tag Patterns

Bring Your Own Key (BYOK)

Option 1: Store in dashboard (recommended)

Option 2: Per-request header

Key resolution priority

Analytics: Summary

Analytics: Daily Spend

Analytics: By Tag

Analytics: Events

Errors

Rate Limits

Per-minute rate limits

Response headers

When limits are exceeded

Health Check

Product

Developers

Company