⚡ refactor: Optimize & Standardize Tokenizer Usage (#10777) · 8bdc808074 - Andreas/LibreChat

mirror of https://github.com/danny-avila/LibreChat.git synced 2026-04-02 13:57:19 +02:00

⚡ refactor: Optimize & Standardize Tokenizer Usage (#10777)

Some checks are pending

Docker Dev Branch Images Build / build (Dockerfile, lc-dev, node) (push) Waiting to run

Details

Docker Dev Branch Images Build / build (Dockerfile.multi, lc-dev-api, api-build) (push) Waiting to run

Details

Docker Dev Images Build / build (Dockerfile, librechat-dev, node) (push) Waiting to run

Details

Docker Dev Images Build / build (Dockerfile.multi, librechat-dev-api, api-build) (push) Waiting to run

Details

Sync Locize Translations & Create Translation PR / Sync Translation Keys with Locize (push) Waiting to run

Details

Sync Locize Translations & Create Translation PR / Create Translation PR on Version Published (push) Blocked by required conditions

Details

* refactor: Token Limit Processing with Enhanced Efficiency

- Added a new test suite for `processTextWithTokenLimit`, ensuring comprehensive coverage of various scenarios including under, at, and exceeding token limits.
- Refactored the `processTextWithTokenLimit` function to utilize a ratio-based estimation method, significantly reducing the number of token counting function calls compared to the previous binary search approach.
- Improved handling of edge cases and variable token density, ensuring accurate truncation and performance across diverse text inputs.
- Included direct comparisons with the old implementation to validate correctness and efficiency improvements.

* refactor: Remove Tokenizer Route and Related References

- Deleted the tokenizer route from the server and removed its references from the routes index and server files, streamlining the API structure.
- This change simplifies the routing configuration by eliminating unused endpoints.

* refactor: Migrate countTokens Utility to API Module

- Removed the local countTokens utility and integrated it into the @librechat/api module for centralized access.
- Updated various files to reference the new countTokens import from the API module, ensuring consistent usage across the application.
- Cleaned up unused references and imports related to the previous countTokens implementation.

* refactor: Centralize escapeRegExp Utility in API Module

- Moved the escapeRegExp function from local utility files to the @librechat/api module for consistent usage across the application.
- Updated imports in various files to reference the new centralized escapeRegExp function, ensuring cleaner code and reducing redundancy.
- Removed duplicate implementations of escapeRegExp from multiple files, streamlining the codebase.

* refactor: Enhance Token Counting Flexibility in Text Processing

- Updated the `processTextWithTokenLimit` function to accept both synchronous and asynchronous token counting functions, improving its versatility.
- Introduced a new `TokenCountFn` type to define the token counting function signature.
- Added comprehensive tests to validate the behavior of `processTextWithTokenLimit` with both sync and async token counting functions, ensuring consistent results.
- Implemented a wrapper to track call counts for the `countTokens` function, optimizing performance and reducing unnecessary calls.
- Enhanced existing tests to compare the performance of the new implementation against the old one, demonstrating significant improvements in efficiency.

* chore: documentation for Truncation Safety Buffer in Token Processing

- Added a safety buffer multiplier to the character position estimates during text truncation to prevent overshooting token limits.
- Updated the `processTextWithTokenLimit` function to utilize the new `TRUNCATION_SAFETY_BUFFER` constant, enhancing the accuracy of token limit processing.
- Improved documentation to clarify the rationale behind the buffer and its impact on performance and efficiency in token counting.

This commit is contained in:

Danny Avila

2025-12-02 12:22:04 -05:00

• committed by

GitHub

parent b2387cc6fa

commit 8bdc808074

No known key found for this signature in database

GPG key ID: B5690EEEBB952194

19 changed files with 925 additions and 107 deletions

									
										3

api/server/routes/messages.js
									
										View file
										
				@ -1,7 +1,7 @@

				const express = require('express');

				const { unescapeLaTeX } = require('@librechat/api');

				const { logger } = require('@librechat/data-schemas');

				const { ContentTypes } = require('librechat-data-provider');

				const { unescapeLaTeX, countTokens } = require('@librechat/api');

				const {

				  saveConvo,

				  getMessage,

				@ -14,7 +14,6 @@ const { findAllArtifacts, replaceArtifactContent } = require('~/server/services/

				const { requireJwtAuth, validateMessageReq } = require('~/server/middleware');

				const { cleanUpPrimaryKeyValue } = require('~/lib/utils/misc');

				const { getConvosQueried } = require('~/models/Conversation');

				const { countTokens } = require('~/server/utils');

				const { Message } = require('~/db/models');

				const router = express.Router();

Rows
Columns

3 api/server/routes/messages.js Unescape Escape View file

3

api/server/routes/messages.js

View file