1. Api
CheapAi Aggregation API
  • Api
    • CheapAI Aggregation API novice tutorial & calling documentation | Support online calling/CherryStudio and other open source software calling/code calling
    • Online debugging instructions
    • Make a request
    • CheapAI Aggregation API introduction and customer service customization
      • Introduction to AI interface aggregation management service
      • API Quick Start Guide
      • Group detailed table
      • Particularity of groupings and price differences
      • How to create a new token for a specified group
      • Agent interface calling address
      • API Key Quota Query Service
      • Free AI Chat Project Navigation
      • Contact customer service & enterprise university customization_WeChat
      • Invite friends to register and enjoy huge cashback
    • Common configuration for various plug-in software to call API
      • 1. Fill in the correct API interface address (it is recommended to test in order)
      • 2. Fill in the API key (token), usually starting with sk-
      • 3. Fill in the correct model name (including dialogue, pictures, music, video, etc.)
    • Various plug-in/software usage tutorials
      • CherryStudio configuration tutorial
      • CherryStudio calls CLUDE MCP
      • CherryStudio configures o4 inference level
      • Cherry Studio configuration banana raw image (4K resolution) tutorial
      • CherryStudio configures claude code, Gemini Cli, Opencode and other programming tools
      • chatgpt-web-mi (conversation, picture and video tool)
      • Claude Code installation and usage tutorial
      • ChatBox (supports mobile phones and computers)
      • SillyTavern configuration
      • Omate mobile terminal configuration tutorial
      • Dify configuration tutorial
      • n8n configuration tutorial
      • n8n workflow obtains local images and generates video examples
      • N8N workflow using transit API tutorial
      • Cursor configuration tutorial
      • cline configuration tutorial
      • immersive translation
      • Browser plug-in ChatGPT Sidebar
      • nextchat setup tutorial
      • CLAUDE DEV configuration tutorial
      • chatgpt-on-wechat configuration tutorial
      • aider configuration tutorial
      • lobechat setup tutorial
      • chatgpt GPT Academic academic optimization configuration gpt tutorial
      • zotero gpt configuration method
      • Open source gpt_academic
      • Gemini CLI transfer station configuration and usage tutorial
      • LangBot configuration tutorial
      • Codex configuration tutorial
      • RikkaHub configuration tutorial
      • Coze workflow using transit API tutorial
      • COZE workflow simple configuration from input to obtaining url
      • zed configuration tutorial
      • opencode configuration tutorial
      • OpenClaw latest version custom transfer station configuration tutorial
      • OpenClaw and CC switch custom transfer station configuration tutorial
    • Chat
      • ChatGpt interface
        • ChatGPT Audio (Audio)
          • GPT-4o-audio
          • Audio to text whisper-1
          • Audio to text gpt-4o-transcribe
          • Create speech gpt-4o-mini-tts
          • Create translation (not supported)
        • ChatGPT Chat (Chat)
          • Chat completion object
          • Chat completion block object
          • Create chat completion (streaming)
          • Create chat completion (non-streaming)
          • Create chat image recognition (streaming)
          • Create chat picture recognition (streaming) best64
          • Create chat image recognition (non-streaming)
          • Create a chat creation graph (non-streaming)
          • Official Function calling call
          • Official N test
          • List models
          • Create chat function call
          • Create structured output
          • Controlling inference model effort
          • Create chat completion qwen-mt-turbo
          • Create chat completion deepseek v3.1 thinking level (streaming)
          • deepseek-ocr recognition
        • ChatGPT automatic completion (Completions)
          • completion object
          • Creation completed
        • ChatGPT Embeddings
          • embedded object
          • Create embed
        • Web search
          • web search
      • Anthropic Claude interface
        • Chat completion object
        • Chat completion block object
        • native format
          • Create chat completion (streaming)
          • Create function calls (streaming)
          • Create formatted output
          • Create a Thought Chat
          • PDF support
          • PDF supports base64 format
          • Internet search
        • chat compatible format
          • Create a Thought Chat
          • Create chat completion (streaming)
          • Create chat completion (non-streaming)
          • Create chat image recognition (streaming)
          • Create chat image recognition (non-streaming)
      • Google Gemini interface
        • native format
          • text generation
          • text generation-stream
          • Text generation + thinking-flow
          • Image generation
          • Image generation gemini-2.5-flash-image
          • Image generation gemini-2.5-flash-image controls aspect ratio
          • Image generation gemini-3-pro-image-preview controls aspect ratio + clarity
          • Picture editing
          • Picture understanding
          • Formatted output
          • function call
          • Document understanding
          • URL context [native format]
          • code execution
          • google search
          • video understanding
          • URL context
          • audio understanding
          • Embeddings
          • TTS text to speech
          • Text generation gemini-3-pro-preview:generateContent
          • Imagen generates images
          • gemini-tts text-to-speech
          • text embedding
        • chat compatible format
          • gemini image creation interface [chat compatible format]
          • Chat interface [chat compatible format]
          • Chat interface-Thinking 1 [chat compatible format]
          • Chat interface-Thinking 2 [chat compatible format]
          • Image recognition interface [chat compatible format]
          • Chat + read file interface [chat compatible format]
          • Text embedding [chat compatible format]
    • Chat(Responses)
      • Comparison between Responses API and Chat API
      • Create model response
      • Create function call
      • Create model response (streaming return)
      • Create model responses (control think length)
      • Create a web search
      • Create model responsive to gpt-5 enabled think
      • Create function call Copy
    • painting model
      • README
      • image object
      • Midjourney
        • Upload pictures
        • Submit Imagine task
        • Query task status based on task ID
        • Query tasks based on ID list
        • Get the seed of the task image
        • Execute Action
        • Submit Blend task
        • Submit Describe task
        • Submit Modal
      • Ideogram
        • Generate 3.0 (Vincent Chart)Generate
        • Generate 3.0 (Picture Editing)Edit
        • Generate 3.0 (Picture Remix) Remix
        • Generate 3.0 (image reconstruction) Reframe
        • Generate 3.0 (replace background) Replace Background
        • ideogram
        • Remix
        • Upscale (enlarge HD)
        • Describe
      • GPT Image Series
        • Create gpt-image-1
        • Edit gpt-image-1
        • Mask gpt-image-1
        • Create gpt-image-1.5
        • Edit gpt-image-1.5
        • Mask gpt-image-1.5
      • Grok Image Series
        • Create Image
        • edit image
      • DALL·E 3
        • Create DALL·E 3
      • FLUX series
        • gpt compatible format
          • Flux creation (OpenAI dall-e-3 format)
          • Flux editing (OpenAI dall-e-3 format)
        • Replicate official format
          • Flux resolution
          • Access tutorial
          • Create task black-forest-labs/flux-kontext-dev
          • Query tasks
      • Bean bag series
        • Create image
      • Fal.ai Platform
        • status code
        • Get request results
        • /fal-ai/nano-banana Wen Sheng Tu
        • /fal-ai/nano-banana/edit Image editing
      • Tencent AIGC raw pictures
        • status code
        • Get request results
        • Create task
      • Qwen series
        • qwen-image-max
        • z-image-turbo
        • qwen-image-edit-2509
    • video model
      • veo video generation
        • Video unified format
          • status code
          • Create video
          • Create videos with pictures
          • Query tasks
          • Create video (reference image)
        • OpenAI video format
          • openai creates videos with pictures
          • openai query task
          • openai download video
      • luma video generation
        • Official API format
          • status code
          • Submit video generation task
          • Extended video
        • Query tasks
          • Query a single task
          • Get tasks in batches
      • Runway video generation
        • status code
        • Submit video generation task
        • Query video tasks (free)
      • conch video generation
        • status code
        • Submit video generation task
        • Video task status query
        • tusheng video
        • First and last frame video
      • Doubao video generation
        • Create task
        • Query a single task
        • Query video generation task list-default
        • Query video generation task list - search multiple task IDs
        • seedance-1-5-pro-first and last frames
        • seedance-1-5-pro
      • sora video generation
        • Unified video format
          • Query tasks
          • Create videos with pictures sora-2
          • Create video sora-2
          • Create video sora-2-pro
          • Create video (with Character)
          • Create a role
        • chat format
          • Create video
          • Create videos, with pictures
          • Continuously modify and generate videos
        • OpenAI official video format
          • Create video
          • Create a character from an uploaded video
          • openai edit video
          • openai download video
          • openai query task
      • grok video generation
        • Video unified format
          • status code
          • Create video
          • Query tasks
      • Tongyi Wanxiang video generation
        • Generate video
        • Video query
      • Tencent AIGC video generation
        • status code
        • Create task
        • Get request results
        • Special effects template creation task
    • GPTs related
      • Introduction
      • GPTs conversation
    • System API
      • Get a list of tokens
      • Add new token
      • Get token support model
      • Get account information
      • modify token
      • search token
      • Get token usage
      • delete token
    • Vincent Music Suno
      • illustrate
      • parameter
      • Task submission
        • Generate songs (inspiration mode)
        • Generate songs (custom mode)
        • Generate songs (continuation mode)
        • Generate songs (singer style)
        • Generate songs (upload songs for secondary creation)
        • Generate songs (splicing songs)
        • Generate lyrics
        • song splicing
        • Report upload completed
        • Query upload processing status
        • Initialize audio files
        • Request upload authorization
        • s3 upload example
        • Scenario 1: Inspiration mode
        • Scene 2: Customized lyrics and song title
        • Scene 3: Pure music. Customized
      • Query interface
        • Get tasks in batches
        • Query a single task
        • Get wav
        • Timing: lyrics, audio timeline
        • Get scene details
    • Python configuration method
      • Python uses speech to text
      • Python uses text to speech
      • Python uses Embeddings vectorization
      • python calls DALL·E
      • python simple call openai function-calling demo
      • python simple langchain calls openai demo
      • python llama_index configuration
      • Python basic conversation
      • Python uses gpt-4o to identify pictures-local pictures
      • Python uses gpt-4o to identify images
      • Python uses Claude to identify images
      • python library streaming output
      • gpt realtime model call
      • python request request streaming output demo
      • python uses gpt-image-1 to create and edit pictures
      • python openai official library (using AutoGPT, langchain, etc.)
      • python continuous dialogue
    • Rerank reranking model
      • Reorder
    • Kling platform
      • Callback protocol
      • Vincent Video
        • Vincent Video
        • Query task (single)
      • tusheng video
        • tusheng video
        • Query task (single)
      • Omni-Video
        • Omni-Video
        • Query task (single)
      • Multi-picture reference student video
        • Multi-picture reference student video
        • Query task (single)
      • Multimodal video editing
        • Initialize the video to be edited
        • Add video selection
        • Delete video selection
        • Preview selected video
        • multimodal video
        • Query task (single)
      • video extension
        • video extension
        • Query task (single)
      • video effects
        • video effects
        • Query task (single)
      • image generation
        • image generation
        • Query task (single)
      • Multiple reference pictures
        • Multiple reference pictures
        • Query task (single)
      • Omni-Image
        • Omni-Image
        • Query task (single)
      • Expand image
        • Expand image
        • Query task (single)
      • image recognition
        • image recognition
      • digital man
        • digital man
        • Query task (single)
      • Vincent sound effects
        • Vincent sound effects
        • Query task (single)
      • Video sound effects
        • Video sound effects
        • Query task (single)
      • speech synthesis
        • speech synthesis
      • virtual try on
        • virtual try on
        • Query task (single)
      • lip sync
        • face recognition
        • lip sync
        • Query task (single)
      • Custom sounds
        • Custom sounds
        • Query custom timbre (single)
        • Check official sounds
        • Delete a custom sound
      • motion control
        • motion control
        • Query task (single)
      • main body
        • main body
    • Replicate aggregation platform
      • Access tutorial
      • Flux resolution
      • Query tasks
      • Create task - pass model version (predictions)
      • Create task stability-ai/stable-diffusion
      • Create task stability-ai/sdxl
      • Create task stability-ai/stable-diffusion-inpainting
      • Create task stability-ai/stable-diffusion-img2img
      • Create task lucataco/flux-schnell-lora
      • Create task lucataco/flux-dev-lora
      • Create task andreasjansson/stable-diffusion-animation
      • Create task sujaykhandekar/object-removal
      • Create task cjwbw/rembg
      • Create task lucataco/animate-diff
      • Create task ideogram-ai/ideogram-v2-turbo
      • Create task black-forest-labs/flux-kontext-dev
      • Create task minimax/video-01-live
      • Create task minimax/video-01
      • Create task recraft-ai/recraft-v3
      • Create task recraft-ai/recraft-v3-svg
      • Create task black-forest-labs/flux-1.1-pro-ultra
      • Create task black-forest-labs/flux-kontext-pro
      • Create task black-forest-labs/flux-kontext-max
      • Create task flux-kontext-apps/multi-image-kontext-max
      • Create task flux-kontext-apps/multi-image-kontext-pro
      • Create task lucataco/remove-bg
      • Create task riffusion/riffusion
      • Create task black-forest-labs/flux-fill-dev
      • Create task black-forest-labs/flux-fill-pro
      • Create task google/imagen-4-fast
      • Create task google/imagen-4-ultra
      • Create task google/imagen-4
      • Create task prunaai/vace-14b
      • Create task bytedance/seedream-4
    • Conch Minimax Platform
      • Create asynchronous speech synthesis task V2
      • Synchronous speech synthesis V2
      • Vincent Tusheng video generation task
      • Upload forked audio
      • Upload sample audio
      • Generate video from first and last frames
      • Quick sound reproduction
      • Query video generation task status
      • Retrieval (for video download, asynchronous audio download)
      • text synthesis
      • Query the speech generation task status
      • sound design
    • Official VIDU video generation, picture generation, audio generation
      • status code
      • Create a Vincent video task
      • Create Tusheng video task
      • Create an image generation task
      • Create a Vincent audio task
      • speech synthesis
      • Create a reference student video task
      • Create a video task for generating first and last frames
      • Get request results
    • Fal-ai aggregation platform
      • Access tutorial
      • falai-veo3 video generation
        • /fal-ai/veo3
        • /fal-ai/veo3/fast/image-to-video
        • /fal-ai/veo3/fast
        • /fal-ai/veo3/requests/{request_id}
        • /fal-ai/veo3/image-to-video
      • Get request results
      • /fal-ai/flux-1/dev
      • /fal-ai/flux-1/dev/image-to-image
      • /fal-ai/flux-1/dev/redux
      • /fal-ai/flux-1/schnell/redux
      • /fal-ai/flux-pro/kontext
      • /fal-ai/flux-pro/kontext/text-to-image
      • /fal-ai/flux-pro/kontext/max
      • /fal-ai/flux-pro/kontext/max/multi
      • /fal-ai/wan/v2.2-a14b/image-to-image
      • /fal-ai/bytedance/seedream/v4/text-to-image
      • /fal-ai/bytedance/seedream/v4/edit
      • /fal-ai/vidu/reference-to-image
      • /fal-ai/imagen4/preview
      • /fal-ai/qwen-image-edit-lora
      • /fal-ai/qwen-image-edit-plus
      • /fal-ai/kling-video/v2.5-turbo/pro/text-to-video
      • /fal-ai/kling-video/v2.5-turbo/pro/image-to-video
      • /fal-ai/flux-lora
      • /fal-ai/flux-lora/image-to-image
      • /fal-ai/flux-lora/inpainting
      • /fal-ai/kling-video/v2.5-turbo/pro/text-to-video
      • /fal-ai/qwen-image-edit-lora
    • php configuration method
      • PHP image editing demo
    • nodejs configuration method
      • nodejs basic conversation
    • Help Center
      • AI return field: Thinking related
      • HTTP status codes and their meanings
      • Self-built image bed API
      • Upload pictures to picture bed
  1. Api

CheapAI Aggregation API novice tutorial & calling documentation | Support online calling/CherryStudio and other open source software calling/code calling

CheapAI API features 300+ of the latest global models, synchronized with official updates, offering the lowest prices across the network, no concurrency limits, and high-speed stability. Introduction to AI API Aggregation Management Service
Welcome major enterprises and universities to consult for customization, with large discounts, corporate bank transfers, and special VAT invoices available! (Please contact customer service in advance) Contact Customer Service & Enterprise/University Customization_WeChat
CheapAI Aggregation API update notification: https://cheapaiapi.store/categories/qing-yun-ju-he-api
Invite friends to register and enjoy huge cashback
Common domain: https://api.cheapaiapi.store
Note: All online tools are for testing purposes only, please save your data locally in a timely manner.

2. If you already have an API key, you can also use open-source software like CherryStudio on your computer or Chatbox on your phone for deployment and calling, to experience more powerful features. Configuration tutorials for common open-source software are as follows:#

Click to view: CherryStudio Configuration Tutorial (Powerful, recommended)
Click to view: Cherry Studio banana Image Generation (4K Resolution) Configuration Tutorial
Click to view: CherryStudio Configuration for Claude Code, Gemini CLI, Opencode, etc. Programming Tools
Click to view: ChatBox (Supports Mobile and PC)
Click to view: Dify Configuration Tutorial
Click to view: n8n Configuration Tutorial
Click to view: SillyTavern Configuration
Click to view: Omate Mobile Configuration Tutorial (Similar to SillyTavern)
Click to view: Immersive Translate
Click to view: Cline Configuration Tutorial (Recommended for programming)
Click to view: Claude Code Installation and Calling Tutorial
Click to view: Gemini CLI Relay Station Configuration and Usage Tutorial
Click to view: Codex Configuration Tutorial
Click to view: Zed Programming Configuration Tutorial
Click to view: Coze Workflow Using Relay API Tutorial
Click to view: Claude Code Installation and Usage Tutorial
Click to view: OpenClaw Latest Version Custom Relay Station Configuration Tutorial
For more application configuration tutorials, you can click "Various Plugins/Software Usage Tutorials" on the left.
Please note: API calling for all applications actually requires correctly filling out the common API interface configuration:
1. Fill in the correct API interface address (recommended to test sequentially)
Interface/Request Address: https://api.cheapaiapi.store
Interface/Request Address: https://api.cheapaiapi.store/v1
Route Request Address: https://api.cheapaiapi.store/v1/chat/completions
Request address has different names in various software, it might be called: Interface Address / Request Port / BaseURL / Key Address / Proxy Address
For using other image or video generation models, please check the calling interfaces and instance code in the left list.
2. Fill in the API Key (Token), usually starting with sk-
3. Fill in the correct Model Name (including dialogue, image, music, video generation, etc.).

3. Call via Code, please check the common document on the left#

The left side contains detailed calling methods for chat, image generation, video generation, as well as Python, Node.js, etc., code. Please search and click to view.
If you encounter code issues during code calling, you can throw our documentation and your requirements to Claude 4 or Gemini 2.5 Pro (which can be online directly called in your Console-Playground), and let them output the correct code for you.

4. AI Communication Group#

1. Welcome to log in to our newly created AI communication and learning website: CheapAI TOP https://cheapaiapi.store#

2. Welcome to join the official WeChat "CheapAI TOP - AI Communication Group":#

Join WeChat group chat QR code.png
Scan to add WeChat: CheapAI TOP - AI Communication Group

5. FAQ#

1. Prompts errors like unable to connect to API, invalid token, etc.
This situation is usually caused by an API configuration error. Please confirm whether you have selected the OpenAI specification, filled in the correct interface address, the correct API key, and model name, and try again. For specific filling methods, please check the tutorials above.
2. Prompts that this model cannot be used in the default group, especially Gemini models.
This situation is usually because there is no permission to use this model in your account API token. Please go to API Token - click Edit Key - Add Alternative Group. Add the group where the model you want to use is located into the alternative group to use it normally.
3. With so many groups, how should I use them?
Different groups correspond to different channels and application scenarios, and you can choose according to your needs:
When using Gemini models, please add the key to the "Premium Gemini Group".
When using Claude Code, please add the first key group as the Claude Code exclusive group.
The limited-time special offer group will give a 40% discount on the current price, supporting most models in Gemini, Qianwen, Deepseek, Doubao, etc. When using these models, you can add the "Limited Time Special Offer" group as the first one in the key.
Pay special attention when using different models. Please check the group where the model is located and adjust the key to the corresponding group. Different groups have different model source channels and different multipliers. The price is the default group's price multiplied by the current group's multiplier.
It is recommended to add multiple groups to the key, arranged from low to high multiplier, to avoid channel saturation or network fluctuations making output impossible, ensuring reliability of calling.
Models are constantly responding to official updates and changes, please pay attention to the 300+ model list and pricing https://api.cheapaiapi.store/pricing
4. Prompt indicates upstream saturation.
1.
It might be because the current model group has a small volume release, resulting in saturation and making the model unavailable. You can switch to other models, or try again later. You can also set other alternative groups in the key to try calling through other channels. The technical team will replenish the volume in time, so you can try again later.
2.
It might also be that there is a problem with your request body parameters. You can read the documentation here to solve it. You can also contact WeCom Customer Service to inquire about the reason for the error.
5. The model does not answer for a long time.
Usually, because your question is too long, or your context is too long, there is too much content to send to the large model, causing the large model to need to read and think, thereby prolonging the response time. Please try to open a new conversation and reduce the number of words in your question to try again.
6. What to do if code calling problems occur.
CheapAI Aggregation API provides rich code calling documentation which has been verified. You can click to view and refer to it directly at https://api.cheapaiapi.store/about
If you encounter code issues during your code calls, you can feed our documentation and your requirements to Claude 4.5 or Gemini 2.5 Pro (you can call them directly online in your Console-Playground), and let them output the correct code for you.
7. Prompts sensitive words or sensitive words
This is usually when sensitive words are detected in your conversation, causing the model to refuse to answer. Please check your system prompts, custom role prompts, question prompts, and context to see if there are any sensitive words. If so, please remove them or try to open a new conversation.
8. The large model cannot answer who it is
Using the API to call a large model defaults to a "raw" call. The model itself does not provide background information, so the model cannot answer data that it does not know. It cannot "introspect" its own version. When interacting with a large model through the official website, the backend system will pre-inject a "system prompt" containing the version number to the model, and the model merely answers according to this "script".
9. Why does the large model get dumber over time?
This phenomenon is usually caused by two dimensions: "context fatigue" within a single conversation and "model iteration and perception differences" during long-term use. You can resolve this by optimizing your prompts or checking relevant tutorials on platforms like Bilibili.
The following link will also update related tutorials from time to time, which can be watched and learned for free:
https://pan.quark.cn/s/d0f3910e647e
Modified at 2026-03-21 13:19:57
Next
Online debugging instructions
Built with