Documentation Index
Fetch the complete documentation index at: https://mintlify.com/danny-avila/librechat/llms.txt
Use this file to discover all available pages before exploring further.
AI Endpoint Configuration
LibreChat supports multiple AI providers and custom endpoints. Configure them using environment variables and the librechat.yaml file.
Endpoint Selection
Comma-separated list of enabled endpointsENDPOINTS=openAI,assistants,azureOpenAI,google,anthropic
OpenAI
OpenAI API key. Set to user_provided to require users to provide their own keys.OPENAI_API_KEY=sk-xxx
# Or require user-provided keys:
OPENAI_API_KEY=user_provided
Comma-separated list of available OpenAI modelsOPENAI_MODELS=gpt-5,gpt-5-codex,gpt-5-mini,gpt-5-nano,o3-pro,o3,o4-mini,gpt-4.1,gpt-4.1-mini,gpt-4.1-nano,o3-mini,o1-pro,o1,gpt-4o,gpt-4o-mini
Enable OpenAI debug logging
Enable automatic conversation title generation
OPENAI_TITLE_MODEL
string
default:"gpt-4o-mini"
Model to use for conversation title generationOPENAI_TITLE_MODEL=gpt-4o-mini
Enable conversation summarization
OPENAI_SUMMARY_MODEL
string
default:"gpt-4o-mini"
Model to use for summarizationOPENAI_SUMMARY_MODEL=gpt-4o-mini
Force system prompt inclusion
Reverse proxy URL for OpenAI APIOPENAI_REVERSE_PROXY=https://your-proxy.com/v1
OpenAI organization IDOPENAI_ORGANIZATION=org-xxx
Assistants API
API key for OpenAI AssistantsASSISTANTS_API_KEY=user_provided
# Or use specific key:
ASSISTANTS_API_KEY=sk-xxx
Custom base URL for Assistants APIASSISTANTS_BASE_URL=https://api.openai.com/v1
Comma-separated list of models available for AssistantsASSISTANTS_MODELS=gpt-4o,gpt-4o-mini,gpt-3.5-turbo-0125
Assistants Configuration (YAML)
endpoints:
assistants:
disableBuilder: false # Disable Assistants Builder Interface
pollIntervalMs: 3000 # Polling interval for updates
timeoutMs: 180000 # Timeout for operations
# Filter assistants by ID
supportedIds:
- "asst_supportedAssistantId1"
- "asst_supportedAssistantId2"
# OR exclude specific assistants
# excludedIds:
# - "asst_excludedAssistantId"
# Only show user-created assistants
# privateAssistants: false
# Models that support retrieval
retrievalModels:
- "gpt-4-turbo-preview"
# Available capabilities
capabilities:
- "code_interpreter"
- "retrieval"
- "actions"
- "tools"
- "image_vision"
Anthropic
Anthropic API keyANTHROPIC_API_KEY=user_provided
# Or use specific key:
ANTHROPIC_API_KEY=sk-ant-xxx
Comma-separated list of available Anthropic modelsANTHROPIC_MODELS=claude-sonnet-4-6,claude-opus-4-6,claude-opus-4-20250514,claude-sonnet-4-20250514,claude-3-7-sonnet-20250219,claude-3-5-sonnet-20241022,claude-3-5-haiku-20241022
Reverse proxy URL for Anthropic APIANTHROPIC_REVERSE_PROXY=https://your-proxy.com/v1
Anthropic via Google Vertex AI
Use Anthropic models through Google Vertex AIANTHROPIC_USE_VERTEX=true
Vertex AI region for Anthropic modelsANTHROPIC_VERTEX_REGION=us-east5
Anthropic Configuration (YAML)
endpoints:
anthropic:
streamRate: 20 # Stream rate limiting (ms)
titleModel: claude-3.5-haiku # Model for titles
# Vertex AI Configuration
vertex:
region: "us-east5" # Vertex region
# serviceKeyFile: "/path/to/service-account.json"
# projectId: "${VERTEX_PROJECT_ID}"
# Model mappings
models:
claude-opus-4.5:
deploymentName: claude-opus-4-5@20251101
claude-sonnet-4:
deploymentName: claude-sonnet-4-20250514
claude-3.7-sonnet:
deploymentName: claude-3-7-sonnet-20250219
Google
Google API key (for Gemini API / AI Studio)GOOGLE_KEY=user_provided
# Or use specific key:
GOOGLE_KEY=AIzaSyXXX
Reverse proxy URL for Google APIGOOGLE_REVERSE_PROXY=https://your-proxy.com/v1
Pass API key in Authorization header instead of X-goog-api-keyUse this if your reverse proxy doesn’t support X-goog-api-key header
Comma-separated list of available Google models# Gemini API (AI Studio):
GOOGLE_MODELS=gemini-3.1-pro-preview,gemini-3.1-pro-preview-customtools,gemini-2.5-pro,gemini-2.5-flash,gemini-2.5-flash-lite,gemini-2.0-flash,gemini-2.0-flash-lite
# Vertex AI:
GOOGLE_MODELS=gemini-3.1-pro-preview,gemini-2.5-pro,gemini-2.5-flash,gemini-2.0-flash-001,gemini-2.0-flash-lite-001
GOOGLE_TITLE_MODEL
string
default:"gemini-2.0-flash-lite-001"
Model to use for title generationGOOGLE_TITLE_MODEL=gemini-2.0-flash-lite-001
Vertex AI Configuration
GOOGLE_LOC
string
default:"us-central1"
Google Cloud region for Vertex AI
Alternative region for Gemini Image GenerationGOOGLE_CLOUD_LOCATION=global
Path to Google Cloud service account JSON fileGOOGLE_SERVICE_KEY_FILE=/path/to/service-account.json
Google Safety Settings
Safety settings for both Vertex AI and Gemini APIOptions: BLOCK_NONE, BLOCK_ONLY_HIGH, BLOCK_MEDIUM_AND_ABOVE, BLOCK_LOW_AND_ABOVEFor Vertex AI: BLOCK_NONE requires either allowlist access or monthly invoiced billingFor Gemini API: BLOCK_NONE is available by default
GOOGLE_SAFETY_SEXUALLY_EXPLICIT=BLOCK_ONLY_HIGH
GOOGLE_SAFETY_HATE_SPEECH=BLOCK_ONLY_HIGH
GOOGLE_SAFETY_HARASSMENT=BLOCK_ONLY_HIGH
GOOGLE_SAFETY_DANGEROUS_CONTENT=BLOCK_ONLY_HIGH
GOOGLE_SAFETY_CIVIC_INTEGRITY=BLOCK_ONLY_HIGH
Gemini Image Generation
Dedicated Gemini API key for image generation (falls back to GOOGLE_KEY)
GEMINI_IMAGE_MODEL
string
default:"gemini-2.5-flash-image"
Vertex AI model for image generationGEMINI_IMAGE_MODEL=gemini-2.5-flash-image
AWS Bedrock
BEDROCK_AWS_DEFAULT_REGION
Default AWS region for BedrockBEDROCK_AWS_DEFAULT_REGION=us-east-1
BEDROCK_AWS_ACCESS_KEY_ID
AWS access key IDBEDROCK_AWS_ACCESS_KEY_ID=AKIAXXXXX
BEDROCK_AWS_SECRET_ACCESS_KEY
AWS secret access keyBEDROCK_AWS_SECRET_ACCESS_KEY=xxxxx
BEDROCK_AWS_SESSION_TOKEN
AWS session token (for temporary credentials)BEDROCK_AWS_SESSION_TOKEN=xxxxx
Comma-separated list of Bedrock model IDsIf omitted, all supported model IDs will be included
BEDROCK_AWS_MODELS=anthropic.claude-sonnet-4-6,anthropic.claude-opus-4-6-v1,anthropic.claude-3-5-sonnet-20240620-v1:0,meta.llama3-1-8b-instruct-v1:0
# Cross-region inference:
BEDROCK_AWS_MODELS=us.anthropic.claude-sonnet-4-6,global.anthropic.claude-opus-4-6-v1
Bedrock Configuration (YAML)
endpoints:
bedrock:
# Model configuration
models:
- "anthropic.claude-3-7-sonnet-20250219-v1:0"
- "anthropic.claude-3-5-sonnet-20241022-v2:0"
# Inference profiles
inferenceProfiles:
"us.anthropic.claude-sonnet-4-20250514-v1:0": "${BEDROCK_INFERENCE_PROFILE_CLAUDE_SONNET}"
"anthropic.claude-3-7-sonnet-20250219-v1:0": "arn:aws:bedrock:us-west-2:123456789012:application-inference-profile/abc123"
# Guardrails
guardrailConfig:
guardrailIdentifier: "your-guardrail-id"
guardrailVersion: "1"
trace: "enabled" # enabled, enabled_full, or disabled
Unsupported Bedrock Models:
ai21.j2-mid-v1 (no streaming support)
ai21.j2-ultra-v1, cohere.command-text-v14, cohere.command-light-text-v14 (no conversation history support)
Azure OpenAI
Azure OpenAI environment variables are DEPRECATED. Use librechat.yaml configuration instead.
endpoints:
azureOpenAI:
# Configuration goes here
# See librechat.yaml documentation
Custom Endpoints
Define custom OpenAI-compatible endpoints in librechat.yaml:
Groq Example
endpoints:
custom:
- name: 'groq'
apiKey: '${GROQ_API_KEY}'
baseURL: 'https://api.groq.com/openai/v1/'
models:
default:
- 'llama3-70b-8192'
- 'llama3-8b-8192'
- 'mixtral-8x7b-32768'
- 'gemma-7b-it'
fetch: false
titleConvo: true
titleModel: 'mixtral-8x7b-32768'
modelDisplayLabel: 'groq'
Mistral AI Example
endpoints:
custom:
- name: 'Mistral'
apiKey: '${MISTRAL_API_KEY}'
baseURL: 'https://api.mistral.ai/v1'
models:
default: ['mistral-tiny', 'mistral-small', 'mistral-medium']
fetch: true
titleConvo: true
titleModel: 'mistral-tiny'
modelDisplayLabel: 'Mistral'
dropParams: ['stop', 'user', 'frequency_penalty', 'presence_penalty']
OpenRouter Example
endpoints:
custom:
- name: 'OpenRouter'
apiKey: '${OPENROUTER_KEY}'
baseURL: 'https://openrouter.ai/api/v1'
headers:
x-librechat-body-parentmessageid: '{{LIBRECHAT_BODY_PARENTMESSAGEID}}'
models:
default: ['meta-llama/llama-3-70b-instruct']
fetch: true
titleConvo: true
titleModel: 'meta-llama/llama-3-70b-instruct'
dropParams: ['stop']
modelDisplayLabel: 'OpenRouter'
Known Endpoints (API Keys Only)
These endpoints can be configured with just API keys:
Agents Endpoint
endpoints:
agents:
recursionLimit: 50 # Default recursion depth
maxRecursionLimit: 100 # Maximum recursion depth
disableBuilder: false # Disable builder interface
maxCitations: 30 # Max total citations
maxCitationsPerFile: 7 # Max citations per file
minRelevanceScore: 0.45 # Minimum relevance threshold
# Available capabilities
capabilities:
- "deferred_tools"
- "execute_code"
- "file_search"
- "actions"
- "tools"
Next Steps
librechat.yaml
Complete YAML configuration reference
Environment Variables
All environment variables
Authentication
Set up OAuth and SSO
File Storage
Configure file storage