Getting Started
Welcome to the LumyxAI API documentation. Our API is compatible with both OpenAI and Anthropic SDKs, so you can use any existing SDK or HTTP client by simply changing the base URL and API key.
Quick Start
- Create an account at lumyx-ai.site/register
- Generate an API key from your dashboard
- Make your first API call using the examples below
Base URL: https://lumyx-ai.site/api/v1
Authentication
All API requests require a valid API key passed via the Authorization header using the Bearer scheme.
Authorization: Bearer lx-xxxxxxxxxxxxxxxxxxxx
Security note: Never expose your API key in client-side code. Always make API calls from your backend server.
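For reference, a minimal Python sketch of the header scheme using only the standard library (the endpoint and Bearer format come from this page; the helper names are our own):

```python
import json
import urllib.request

BASE_URL = "https://lumyx-ai.site/api/v1"

def auth_headers(api_key: str) -> dict:
    # Every request passes the key via the Bearer scheme.
    return {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }

def post_chat(api_key: str, body: dict) -> dict:
    # POST /v1/chat/completions with an authenticated JSON body.
    req = urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers=auth_headers(api_key),
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

In practice, load the key from an environment variable on your server rather than hard-coding it.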
Models
LumyxAI offers a range of models optimized for different use cases. Pass the model name in the model field of your request.
Explore the full catalog with filters on the models page.
Chat Completions
Create a chat completion by sending a POST request to:
POST https://lumyx-ai.site/api/v1/chat/completions
Request Body
| Parameter | Type | Required | Description |
|---|---|---|---|
| model | string | Yes | Model ID (e.g. "lumyx-plus_v1") |
| messages | array | Yes | Array of message objects with role and content |
| stream | boolean | No | Enable streaming responses (default: false) |
| temperature | number | No | Sampling temperature 0–2 (default: 0.7) |
| max_tokens | integer | No | Maximum number of tokens to generate |
| tools | array | No | List of tool/function definitions for the model to call |
| tool_choice | string or object | No | Controls tool use: "auto", "none", "required", or a specific function |
| response_format | object | No | Force JSON output: {"type":"json_object"} |
| stop | string or string[] | No | Stop sequences to halt generation |
| seed | integer | No | Seed for deterministic sampling (best effort) |
| top_p | number | No | Nucleus sampling (default: 1) |
| frequency_penalty | number | No | Penalize repeated tokens (-2.0 to 2.0) |
| reasoning | object | No | Enable reasoning/thinking tokens (see Reasoning section) |
| user | string | No | Unique user identifier for abuse monitoring |
Example Request
curl https://lumyx-ai.site/api/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "lumyx-plus_v1",
"messages": [
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "What is machine learning?"}
],
"temperature": 0.7,
"max_tokens": 512
}'
Example Response
{
"id": "chatcmpl-abc123",
"object": "chat.completion",
"created": 1710000000,
"model": "lumyx-plus_v1",
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": "Machine learning is a subset of artificial intelligence..."
},
"finish_reason": "stop"
}
],
"usage": {
"prompt_tokens": 25,
"completion_tokens": 150,
"total_tokens": 175
}
}
Anthropic Messages API
Drop-in compatible with the Anthropic SDK
LumyxAI fully supports the Anthropic Messages API format, including streaming with Anthropic SSE events. Switch from Anthropic to LumyxAI by changing just two lines — the base URL and API key.
Base URL
https://lumyx-ai.site/api/v1
Authentication
x-api-key: YOUR_API_KEY
POST https://lumyx-ai.site/api/v1/messages
Request Body
| Parameter | Type | Required | Description |
|---|---|---|---|
| model | string | Required | Model ID (e.g. "lumyx-plus_v1") |
| messages | array | Required | Array of message objects with role and content (supports content blocks) |
| max_tokens | integer | Required | Maximum number of tokens to generate |
| system | string | Optional | System prompt (top-level field, Anthropic style) |
| temperature | number | Optional | Sampling temperature 0–1 (default: 0.7) |
| top_p | number | Optional | Nucleus sampling (default: 1) |
| stream | boolean | Optional | Enable streaming with Anthropic SSE events |
| stop_sequences | string[] | Optional | Custom stop sequences |
| tools | array | Optional | Tool definitions with input_schema (Anthropic format) |
| tool_choice | object | Optional | Tool selection: {"type":"auto"}, {"type":"any"}, or {"type":"tool","name":"..."} |
| thinking | object | Optional | Enable extended thinking: {"type":"enabled","budget_tokens":1024} |
| top_k | integer | Optional | Top-K sampling parameter |
Example — Migrate from Anthropic
2-line migration: Replace api_key and base_url — everything else stays the same.
import anthropic
client = anthropic.Anthropic(
api_key="YOUR_API_KEY", # ← your LumyxAI key
base_url="https://lumyx-ai.site/api/v1", # ← LumyxAI endpoint
)
message = client.messages.create(
model="lumyx-plus_v1",
max_tokens=1024,
system="You are a helpful assistant.",
messages=[
{"role": "user", "content": "What is machine learning?"}
],
)
print(message.content[0].text)
Example Response
{
"id": "msg_abc123",
"type": "message",
"role": "assistant",
"model": "lumyx-plus_v1",
"content": [
{
"type": "text",
"text": "Machine learning is a subset of artificial intelligence..."
}
],
"stop_reason": "end_turn",
"stop_sequence": null,
"usage": {
"input_tokens": 25,
"output_tokens": 150
}
}
Supported Features
Tip: The same API key works for both OpenAI and Anthropic endpoints. Use whichever SDK you prefer — no separate configuration needed.
Tool Use / Function Calling
Let models call your functions
Define tools that the model can call during a conversation. Works with both OpenAI and Anthropic formats, including streaming.
curl https://lumyx-ai.site/api/v1/chat/completions \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "lumyx-plus_v1",
"messages": [{"role": "user", "content": "What is the weather in Paris?"}],
"tools": [{
"type": "function",
"function": {
"name": "get_weather",
"description": "Get weather for a city",
"parameters": {
"type": "object",
"properties": {
"city": {"type": "string", "description": "City name"}
},
"required": ["city"]
}
}
}],
"max_tokens": 512
}'
Tool Call Response
{
"choices": [{
"message": {
"role": "assistant",
"content": null,
"tool_calls": [{
"id": "call_abc123",
"type": "function",
"function": {
"name": "get_weather",
"arguments": "{\"city\": \"Paris\"}"
}
}]
},
"finish_reason": "tool_calls"
}]
}
Multi-turn: After receiving a tool call, send the result back with role: "tool" and tool_call_id to continue the conversation.
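A small Python sketch of building that follow-up messages array — the helper name and the shape of results (a map from tool_call id to your function's return value) are our own:

```python
import json

def tool_result_messages(messages: list, assistant_msg: dict, results: dict) -> list:
    """Build the follow-up messages array after a tool call.

    Appends the assistant message containing tool_calls, then one
    role:"tool" message per call, echoing its tool_call_id.
    """
    followup = list(messages)
    followup.append(assistant_msg)
    for call in assistant_msg["tool_calls"]:
        followup.append({
            "role": "tool",
            "tool_call_id": call["id"],
            "content": json.dumps(results[call["id"]]),
        })
    return followup
```

POST the returned array back to /v1/chat/completions (with the same tools definition) and the model will produce a final answer that incorporates the results.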
Vision / Images
Send images to vision-capable models
Send images as base64 data or URLs in your messages. Supported on vision-capable models like lumyx-vision.
curl https://lumyx-ai.site/api/v1/chat/completions \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "lumyx-vision",
"messages": [{
"role": "user",
"content": [
{"type": "text", "text": "What is in this image?"},
{"type": "image_url", "image_url": {
"url": "data:image/png;base64,iVBORw0KGgo..."
}}
]
}],
"max_tokens": 256
}'
Supported formats: PNG, JPEG, GIF, WebP. Both base64 data URIs and external URLs are supported.
Reasoning / Thinking
Extended thinking and chain-of-thought
Some models support reasoning tokens — the model "thinks" step-by-step before responding, improving accuracy on complex tasks like math, logic, and coding. LumyxAI supports reasoning via both OpenAI and Anthropic formats, including streaming.
Effort Levels
Control how much the model thinks before responding. Higher effort uses more tokens but produces more accurate results.
| Effort | When to Use | Token Usage |
|---|---|---|
| low | Simple tasks, quick answers, low-stakes queries | Minimal |
| medium | Balanced reasoning for general-purpose tasks | Moderate |
| high | Complex math, logic puzzles, multi-step coding tasks | High |
OpenAI Format (Chat Completions)
Use the reasoning object or the reasoning_effort shorthand on the /v1/chat/completions endpoint.
# Using reasoning object with effort level
curl https://lumyx-ai.site/v1/chat/completions \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "lumyx-plus_v1",
"messages": [{"role": "user", "content": "Solve: x^2 + 5x + 6 = 0"}],
"reasoning": {"effort": "high"},
"max_tokens": 1024
}'
# Or with max_tokens budget for reasoning
curl https://lumyx-ai.site/v1/chat/completions \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "lumyx-plus_v1",
"messages": [{"role": "user", "content": "What is 15 * 23?"}],
"reasoning": {"effort": "low", "max_tokens": 512},
"max_tokens": 256
}'
Anthropic Format (Messages)
Use the thinking object on the /v1/messages endpoint. Set type to "enabled" and optionally specify a budget_tokens limit.
curl https://lumyx-ai.site/v1/messages \
-H "x-api-key: YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "lumyx-plus_v1",
"max_tokens": 4096,
"messages": [{"role": "user", "content": "Solve: x^2 + 5x + 6 = 0"}],
"thinking": {
"type": "enabled",
"budget_tokens": 2048
}
}'
Response Format
When reasoning is enabled, the response includes thinking content before the final answer.
{
"id": "msg_abc123",
"type": "message",
"role": "assistant",
"model": "lumyx-plus_v1",
"content": [
{
"type": "thinking",
"thinking": "Let me solve x^2 + 5x + 6 = 0...\nI need to factor this quadratic..."
},
{
"type": "text",
"text": "The solutions are x = -2 and x = -3."
}
],
"stop_reason": "end_turn",
"usage": {
"input_tokens": 25,
"output_tokens": 150
}
}
All Reasoning Parameters
| Endpoint | Parameter | Type | Description |
|---|---|---|---|
| Chat | reasoning.effort | string | "low", "medium", or "high" |
| Chat | reasoning.max_tokens | integer | Max tokens the model can use for reasoning |
| Chat | reasoning.enabled | boolean | Explicitly enable or disable reasoning |
| Chat | reasoning.exclude | boolean | Exclude reasoning content from the response (still used internally) |
| Chat | reasoning_effort | string | Shorthand — same as reasoning.effort |
| Messages | thinking.type | string | "enabled" or "disabled" |
| Messages | thinking.budget_tokens | integer | Max tokens for thinking (converted to reasoning.max_tokens internally) |
Billing: Reasoning tokens are billed as output tokens. Higher effort levels consume more tokens and cost more credits. The thinking_used field in the Usage API tracks which requests used reasoning.
Compatibility: Not all models support reasoning. Models tagged with "reasoning" in the Models endpoint support this feature. Unsupported models will ignore reasoning parameters and respond normally.
Streaming
Set "stream": true to receive responses as Server-Sent Events (SSE). Each chunk contains a delta of the response.
curl https://lumyx-ai.site/api/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "lumyx-plus_v1",
"messages": [{"role": "user", "content": "Tell me a story"}],
"stream": true
}'
Stream Chunk Format
data: {"id":"chatcmpl-abc123","object":"chat.completion.chunk","choices":[{"index":0,"delta":{"content":"Hello"},"finish_reason":null}]}
data: {"id":"chatcmpl-abc123","object":"chat.completion.chunk","choices":[{"index":0,"delta":{"content":" world"},"finish_reason":null}]}
data: {"id":"chatcmpl-abc123","object":"chat.completion.chunk","choices":[{"index":0,"delta":{},"finish_reason":"stop"}]}
data: [DONE]
Credits & Billing
Pay-as-you-go pricing with credits
LumyxAI uses a credit-based billing system. Purchase credits upfront and they are consumed as you make API requests. Credits never expire. Each model has its own per-token pricing.
How It Works
Buy Credits
Purchase via Stripe from your dashboard ($2 min)
Make Requests
Use the API normally via OpenAI or Anthropic SDKs
Credits Deducted
Tokens used are billed at the model's per-token rate
Credit Packages
| Package | Credits | Price | Per Credit |
|---|---|---|---|
| Starter | 500 | $5 | $0.010 |
| Popular (best value) | 2,000 | $15 | $0.0075 |
| Pro | 5,000 | $30 | $0.006 |
| Power | 15,000 | $75 | $0.005 |
You can also enter a custom amount ($2 – $1,000) at checkout. Rate: 100 credits per $1.
Credit Deduction
Credits are deducted after each request based on the tokens consumed and the model's pricing:
cost = (input_tokens / 1,000,000) × input_price
     + (output_tokens / 1,000,000) × output_price
Note: If a model has no pricing configured (both input and output prices are 0), it is free to use and no credits are deducted. Requests to priced models will return a 402 error if your balance is 0.
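As a worked example of the formula in Python — the prices here are hypothetical, not a real model's rates:

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    # Prices are per 1M tokens, matching the formula above.
    return (input_tokens / 1_000_000) * input_price \
         + (output_tokens / 1_000_000) * output_price

# A request using 25 input and 150 output tokens on a hypothetical model
# priced at $0.50 input / $1.50 output per 1M tokens:
# request_cost(25, 150, 0.50, 1.50) -> 0.0002375
```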
Check Balance via API
View your credit balance and transaction history from the Credits dashboard.
Tip: Taxes are calculated automatically based on your location during Stripe checkout. All prices shown are before tax.
Usage
Track your API usage and costs
Query your API usage programmatically. Get aggregated totals, daily breakdowns, per-model stats, and recent request logs — all authenticated with your API key.
Get Usage
GET /v1/usage
curl https://lumyx-ai.site/v1/usage \
-H "Authorization: Bearer YOUR_API_KEY"Query Parameters
| Parameter | Type | Description |
|---|---|---|
| start_date | string | Filter from this date (ISO 8601, e.g. 2026-03-01) |
| end_date | string | Filter until this date (ISO 8601) |
| model | string | Filter by model ID |
| limit | integer | Number of recent logs to return (default: 50, max: 100) |
| page | integer | Page number for pagination (default: 1) |
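A sketch of calling the endpoint from Python with only the standard library; the helper that drops unset filters is our own convenience:

```python
import json
import urllib.parse
import urllib.request

def usage_params(start_date=None, end_date=None, model=None,
                 limit=None, page=None) -> dict:
    # Only include the query parameters that were actually set.
    params = {"start_date": start_date, "end_date": end_date,
              "model": model, "limit": limit, "page": page}
    return {k: v for k, v in params.items() if v is not None}

def get_usage(api_key: str, **filters) -> dict:
    query = urllib.parse.urlencode(usage_params(**filters))
    url = "https://lumyx-ai.site/v1/usage" + (f"?{query}" if query else "")
    req = urllib.request.Request(
        url, headers={"Authorization": f"Bearer {api_key}"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```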
Response Format
{
"object": "usage",
"account": {
"credits_remaining": 142.50,
"vip_tier": 0
},
"free_model_daily_limit": {
"limit": 50,
"used": 12,
"remaining": 38,
"resets_at": "2026-03-20T00:00:00.000Z",
"timezone": "America/New_York"
},
"summary": {
"total_requests": 1284,
"total_prompt_tokens": 512000,
"total_completion_tokens": 256000,
"total_tokens": 768000,
"total_cost": 3.840000,
"avg_latency_ms": 1250
},
"daily": [
{
"date": "2026-03-19",
"requests": 45,
"tokens": 32000,
"cost": 0.160000
}
],
"by_model": [
{
"model_id": "gpt-5.2",
"requests": 800,
"tokens": 500000,
"cost": 2.500000
}
],
"recent_activity": [
{
"id": "clx...",
"model_id": "gpt-5.2",
"prompt_tokens": 150,
"completion_tokens": 300,
"total_tokens": 450,
"cost": 0.002250,
"latency_ms": 1100,
"status_code": 200,
"thinking_used": false,
"created_at": "2026-03-19T14:30:00.000Z"
}
],
"pagination": {
"page": 1,
"limit": 50,
"total": 1284,
"total_pages": 26
}
}
Tip: The free_model_daily_limit object shows your remaining free model requests and when they reset (midnight in your timezone). The daily array always returns the last 30 days regardless of filters. Use start_date and end_date to filter the summary, model breakdown, and recent activity.
Rate Limits
Rate limits are applied per API key and vary by plan tier.
| Plan | Requests/min | Requests/day |
|---|---|---|
| Free | 10 | 1,000 |
| Pro | 60 | 50,000 |
| Enterprise | Unlimited | Unlimited |
Rate limit headers are included in every response: X-RateLimit-Limit, X-RateLimit-Remaining, X-RateLimit-Reset
Errors
The API uses standard HTTP status codes. Error responses include a JSON body with details.
| Status | Code | Description |
|---|---|---|
| 400 | invalid_request | Missing or invalid request parameters |
| 401 | unauthorized | Invalid or missing API key |
| 402 | insufficient_credits | Insufficient credits — top up your account |
| 404 | model_not_found | Model not found or not available |
| 429 | rate_limit_exceeded | Too many requests |
| 500 | internal_error | Internal server error |
| 503 | model_unavailable | Model temporarily unavailable |
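The status codes above split cleanly into retryable and caller-fixable groups; a sketch of that dispatch (the action names are our own):

```python
def next_action(status: int) -> str:
    """Suggest how a client should react to an API error status."""
    if status == 402:
        return "top_up"            # insufficient_credits: add credits
    if status == 429:
        return "retry_after_wait"  # rate_limit_exceeded: back off first
    if status in (500, 503):
        return "retry"             # transient server / model errors
    return "fix_request"           # 400, 401, 404: correct the request
```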
Error Response Format
{
"error": {
"message": "Invalid API key provided.",
"type": "unauthorized",
"code": 401
}
}
Integrations
LumyxAI is fully compatible with the OpenAI API format, so any tool, CLI, or SDK that supports a custom base URL will work out of the box. Below are step-by-step guides for the most popular integrations.
All integrations use the same base URL and your LumyxAI API key:
Base URL: https://lumyx-ai.site/v1
API Key: lx-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
OpenAI Python SDK
The official openai Python package works with LumyxAI by simply changing the base URL.
pip install openai
from openai import OpenAI
client = OpenAI(
base_url="https://lumyx-ai.site/v1",
api_key="lx-your-api-key"
)
response = client.chat.completions.create(
model="gpt-5.2",
messages=[
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Hello!"}
],
stream=True
)
for chunk in response:
if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")
OpenAI Node.js SDK
npm install openai
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://lumyx-ai.site/v1",
apiKey: "lx-your-api-key",
});
const response = await client.chat.completions.create({
model: "gpt-5.2",
messages: [{ role: "user", content: "Hello!" }],
stream: true,
});
for await (const chunk of response) {
process.stdout.write(chunk.choices[0]?.delta?.content || "");
}
Anthropic Python SDK
LumyxAI also supports the native Anthropic Messages format via /v1/messages.
pip install anthropic
from anthropic import Anthropic
client = Anthropic(
base_url="https://lumyx-ai.site/api/v1",
api_key="lx-your-api-key"
)
message = client.messages.create(
model="gpt-5.2",
max_tokens=1024,
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(message.content[0].text)
curl
curl https://lumyx-ai.site/v1/chat/completions \
-H "Authorization: Bearer lx-your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-5.2",
"messages": [{"role": "user", "content": "Hello!"}]
}'
Aider
Aider is an AI pair-programming tool that works in your terminal.
export OPENAI_API_BASE=https://lumyx-ai.site/v1
export OPENAI_API_KEY=lx-your-api-key
aider --model openai/gpt-5.2
Continue.dev (VS Code / JetBrains)
Add this to your ~/.continue/config.json:
{
"models": [
{
"title": "LumyxAI",
"provider": "openai",
"model": "gpt-5.2",
"apiBase": "https://lumyx-ai.site/v1",
"apiKey": "lx-your-api-key"
}
]
}
Cursor
In Cursor, go to Settings → Models → OpenAI API Key, then set:
| API Key | lx-your-api-key |
| Base URL | https://lumyx-ai.site/v1 |
Then select any LumyxAI model from the model picker.
LiteLLM
Use LiteLLM to route LumyxAI as a provider alongside others.
import litellm
response = litellm.completion(
model="openai/gpt-5.2",
messages=[{"role": "user", "content": "Hello!"}],
api_base="https://lumyx-ai.site/v1",
api_key="lx-your-api-key"
)
Claude Code (CLI)
To use LumyxAI as a provider with Claude Code, set these environment variables before launching. You can customize which models are used for each tier.
export ANTHROPIC_AUTH_TOKEN=lx-your-api-key
export ANTHROPIC_API_KEY=""
export ANTHROPIC_BASE_URL=https://lumyx-ai.site/api/
# Optional: customize which models Claude Code uses
export ANTHROPIC_DEFAULT_OPUS_MODEL=lumyxai/hunter-alpha
export ANTHROPIC_DEFAULT_SONNET_MODEL=glm-5-turbo
export ANTHROPIC_DEFAULT_HAIKU_MODEL=gpt-5.4-openai
export CLAUDE_CODE_SUBAGENT_MODEL=gpt-5.4-openai
claude
Note: Use ANTHROPIC_AUTH_TOKEN for the API key and set ANTHROPIC_API_KEY to an empty string. The base URL should be https://lumyx-ai.site/api/ (without v1). Replace the model names with any models available on LumyxAI.
Environment Variables (Universal)
Many tools automatically pick up these standard environment variables. Add them to your .bashrc, .zshrc, or .env file:
# Works with: aider, continue.dev, litellm, langchain, etc.
export OPENAI_API_BASE=https://lumyx-ai.site/v1
export OPENAI_API_KEY=lx-your-api-key
Available Endpoints
| Method | Endpoint | Description |
|---|---|---|
| GET | /v1/models | List all available models |
| GET | /v1/models/{id} | Get a specific model |
| POST | /v1/chat/completions | Chat completions (OpenAI format) |
| POST | /v1/completions | Text completions (legacy) |
| POST | /v1/messages | Messages (Anthropic format) |
| POST | /v1/embeddings | Generate embeddings |
| GET | /v1/usage | Get usage stats and history |
| GET | /v1/health | Health check |