mirror of https://github.com/danny-avila/LibreChat.git synced 2026-03-18 13:46:34 +01:00

Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active. https://librechat.ai/

ai anthropic artifacts aws azure chatgpt chatgpt-clone claude clone deepseek gemini google gpt-5 librechat mcp o1 openai responses-api vision webui

Find a file

Danny Avila 365c39c405 feat: Accurate Token Usage Tracking & Optional Balance (#1018 ) * refactor(Chains/llms): allow passing callbacks * refactor(BaseClient): accurately count completion tokens as generation only * refactor(OpenAIClient): remove unused getTokenCountForResponse, pass streaming var and callbacks in initializeLLM * wip: summary prompt tokens * refactor(summarizeMessages): new cut-off strategy that generates a better summary by adding context from beginning, truncating the middle, and providing the end wip: draft out relevant providers and variables for token tracing * refactor(createLLM): make streaming prop false by default * chore: remove use of getTokenCountForResponse * refactor(agents): use BufferMemory as ConversationSummaryBufferMemory token usage not easy to trace * chore: remove passing of streaming prop, also console log useful vars for tracing * feat: formatFromLangChain helper function to count tokens for ChatModelStart * refactor(initializeLLM): add role for LLM tracing * chore(formatFromLangChain): update JSDoc * feat(formatMessages): formats langChain messages into OpenAI payload format * chore: install openai-chat-tokens * refactor(formatMessage): optimize conditional langChain logic fix(formatFromLangChain): fix destructuring * feat: accurate prompt tokens for ChatModelStart before generation * refactor(handleChatModelStart): move to callbacks dir, use factory function * refactor(initializeLLM): rename 'role' to 'context' * feat(Balance/Transaction): new schema/models for tracking token spend refactor(Key): factor out model export to separate file * refactor(initializeClient): add req,res objects to client options * feat: add-balance script to add to an existing users' token balance refactor(Transaction): use multiplier map/function, return balance update * refactor(Tx): update enum for tokenType, return 1 for multiplier if no map match * refactor(Tx): add fair fallback value multiplier incase the config result is undefined * refactor(Balance): rename 'tokens' to 'tokenCredits' * feat: balance check, add tx.js for new tx-related methods and tests * chore(summaryPrompts): update prompt token count * refactor(callbacks): pass req, res wip: check balance * refactor(Tx): make convoId a String type, fix(calculateTokenValue) * refactor(BaseClient): add conversationId as client prop when assigned * feat(RunManager): track LLM runs with manager, track token spend from LLM, refactor(OpenAIClient): use RunManager to create callbacks, pass user prop to langchain api calls * feat(spendTokens): helper to spend prompt/completion tokens * feat(checkBalance): add helper to check, log, deny request if balance doesn't have enough funds refactor(Balance): static check method to return object instead of boolean now wip(OpenAIClient): implement use of checkBalance * refactor(initializeLLM): add token buffer to assure summary isn't generated when subsequent payload is too large refactor(OpenAIClient): add checkBalance refactor(createStartHandler): add checkBalance * chore: remove prompt and completion token logging from route handler * chore(spendTokens): add JSDoc * feat(logTokenCost): record transactions for basic api calls * chore(ask/edit): invoke getResponseSender only once per API call * refactor(ask/edit): pass promptTokens to getIds and include in abort data * refactor(getIds -> getReqData): rename function * refactor(Tx): increase value if incomplete message * feat: record tokenUsage when message is aborted * refactor: subtract tokens when payload includes function_call * refactor: add namespace for token_balance * fix(spendTokens): only execute if corresponding token type amounts are defined * refactor(checkBalance): throws Error if not enough token credits * refactor(runTitleChain): pass and use signal, spread object props in create helpers, and use 'call' instead of 'run' * fix(abortMiddleware): circular dependency, and default to empty string for completionTokens * fix: properly cancel title requests when there isn't enough tokens to generate * feat(predictNewSummary): custom chain for summaries to allow signal passing refactor(summaryBuffer): use new custom chain * feat(RunManager): add getRunByConversationId method, refactor: remove run and throw llm error on handleLLMError * refactor(createStartHandler): if summary, add error details to runs * fix(OpenAIClient): support aborting from summarization & showing error to user refactor(summarizeMessages): remove unnecessary operations counting summaryPromptTokens and note for alternative, pass signal to summaryBuffer * refactor(logTokenCost -> recordTokenUsage): rename * refactor(checkBalance): include promptTokens in errorMessage * refactor(checkBalance/spendTokens): move to models dir * fix(createLanguageChain): correctly pass config * refactor(initializeLLM/title): add tokenBuffer of 150 for balance check * refactor(openAPIPlugin): pass signal and memory, filter functions by the one being called * refactor(createStartHandler): add error to run if context is plugins as well * refactor(RunManager/handleLLMError): throw error immediately if plugins, don't remove run * refactor(PluginsClient): pass memory and signal to tools, cleanup error handling logic * chore: use absolute equality for addTitle condition * refactor(checkBalance): move checkBalance to execute after userMessage and tokenCounts are saved, also make conditional * style: icon changes to match official * fix(BaseClient): getTokenCountForResponse -> getTokenCount * fix(formatLangChainMessages): add kwargs as fallback prop from lc_kwargs, update JSDoc * refactor(Tx.create): does not update balance if CHECK_BALANCE is not enabled * fix(e2e/cleanUp): cleanup new collections, import all model methods from index * fix(config/add-balance): add uncaughtException listener * fix: circular dependency * refactor(initializeLLM/checkBalance): append new generations to errorMessage if cost exceeds balance * fix(handleResponseMessage): only record token usage in this method if not error and completion is not skipped * fix(createStartHandler): correct condition for generations * chore: bump postcss due to moderate severity vulnerability * chore: bump zod due to low severity vulnerability * chore: bump openai & data-provider version * feat(types): OpenAI Message types * chore: update bun lockfile * refactor(CodeBlock): add error block formatting * refactor(utils/Plugin): factor out formatJSON and cn to separate files (json.ts and cn.ts), add extractJSON * chore(logViolation): delete user_id after error is logged * refactor(getMessageError -> Error): change to React.FC, add token_balance handling, use extractJSON to determine JSON instead of regex * fix(DALL-E): use latest openai SDK * chore: reorganize imports, fix type issue * feat(server): add balance route * fix(api/models): add auth * feat(data-provider): /api/balance query * feat: show balance if checking is enabled, refetch on final message or error * chore: update docs, .env.example with token_usage info, add balance script command * fix(Balance): fallback to empty obj for balance query * style: slight adjustment of balance element * docs(token_usage): add PR notes		2023-10-05 18:34:10 -04:00
.devcontainer	fix: devcontainer image and networking (#891 )	2023-09-07 07:19:03 -04:00
.github	Update CONTRIBUTING.md	2023-09-26 11:43:57 -04:00
.husky	chore: move files out of root to declutter	2023-09-06 14:00:36 -04:00
api	feat: Accurate Token Usage Tracking & Optional Balance (#1018 )	2023-10-05 18:34:10 -04:00
client	feat: Accurate Token Usage Tracking & Optional Balance (#1018 )	2023-10-05 18:34:10 -04:00
config	feat: Accurate Token Usage Tracking & Optional Balance (#1018 )	2023-10-05 18:34:10 -04:00
docs	feat: Accurate Token Usage Tracking & Optional Balance (#1018 )	2023-10-05 18:34:10 -04:00
e2e	feat: Accurate Token Usage Tracking & Optional Balance (#1018 )	2023-10-05 18:34:10 -04:00
packages/data-provider	feat: Accurate Token Usage Tracking & Optional Balance (#1018 )	2023-10-05 18:34:10 -04:00
pyserver	feat: Add Code Interpreter Plugin (#837 )	2023-08-28 09:13:50 -04:00
.dockerignore	chore: Update docker, Minor Styling fix (#528 )	2023-06-17 11:38:48 -04:00
.env.example	feat: Accurate Token Usage Tracking & Optional Balance (#1018 )	2023-10-05 18:34:10 -04:00
.eslintrc.js	refactor(client): Refactors recent typescript changes for best practices (#763 )	2023-08-05 16:45:26 -04:00
.gitignore	feat: Message Rate Limiters, Violation Logging, & Ban System 🔨 (#903 )	2023-09-13 10:57:07 -04:00
bun.lockb	feat: Accurate Token Usage Tracking & Optional Balance (#1018 )	2023-10-05 18:34:10 -04:00
deploy-compose.yml	chore(docker-compose.yml): comment out meilisearch ports in docker-compose.yml (#807 )	2023-08-14 10:23:00 -04:00
docker-compose.yml	chore(docker-compose.yml): comment out meilisearch ports in docker-compose.yml (#807 )	2023-08-14 10:23:00 -04:00
Dockerfile	Add podman installation instructions. Update dockerfile to stub env (#819 )	2023-08-24 20:20:37 -04:00
Dockerfile.multi	chore(Dockerfile.multi): add data-provider package build and copy step	2023-07-30 11:50:24 -04:00
index.html	Update index.html to replace ChatGPT Clone with LibreChat (#724 )	2023-07-28 19:14:58 -04:00
mkdocs.yml	feat: Accurate Token Usage Tracking & Optional Balance (#1018 )	2023-10-05 18:34:10 -04:00
package-lock.json	feat: Accurate Token Usage Tracking & Optional Balance (#1018 )	2023-10-05 18:34:10 -04:00
package.json	feat: Accurate Token Usage Tracking & Optional Balance (#1018 )	2023-10-05 18:34:10 -04:00
prettier.config.js	refactor: Settings/Presets UI Restructure, convert many files to TS (#740 )	2023-08-04 13:56:44 -04:00
README.md	feat: Message Rate Limiters, Violation Logging, & Ban System 🔨 (#903 )	2023-09-13 10:57:07 -04:00

README.md

LibreChat

All-In-One AI Conversations with LibreChat

LibreChat brings together the future of assistant AIs with the revolutionary technology of OpenAI's ChatGPT. Celebrating the original styling, LibreChat gives you the ability to integrate multiple AI models. It also integrates and enhances original client features such as conversation and message search, prompt templates and plugins.

With LibreChat, you no longer need to opt for ChatGPT Plus and can instead use free or pay-per-call APIs. We welcome contributions, cloning, and forking to enhance the capabilities of this advanced chatbot platform.

Click on the thumbnail to open the video☝️

Features

Response streaming identical to ChatGPT through server-sent events
UI from original ChatGPT, including Dark mode
AI model selection: OpenAI API, BingAI, ChatGPT Browser, PaLM2, Anthropic (Claude), Plugins
Create, Save, & Share custom presets - More info on prompt presets here
Edit and Resubmit messages with conversation branching
Search all messages/conversations - More info here
Plugins now available (including web access, image generation and more)

⚠️ Breaking Changes ⚠️

Please read this before updating from a previous version

Changelog

Keep up with the latest updates by visiting the releases page - Releases

Getting Started

Installation
Configuration

General Information

Features

Cloud Deployment

Contributions

Star History

Contributors

Contributions and suggestions bug reports and fixes are welcome! Please read the documentation before you do!

For new features, components, or extensions, please open an issue and discuss before sending a PR.

Join the Discord community