
feat: default max output tokens for autocomplete #5789


Closed

Conversation


@uinstinct uinstinct commented May 22, 2025

Description

Adds a lower default maximum output tokens limit for autocompletion.

Closes #5449

References #4448 (comment)
References #3994 (comment)

Checklist

  • [ ] I've read the contributing guide
  • [ ] The relevant docs, if any, have been updated or created
  • [ ] The relevant tests, if any, have been updated or created

Screenshots

[ For visual changes, include screenshots. Screen recordings are particularly helpful, and appreciated! ]

Tests

[ What tests were added or updated to ensure the changes work as expected? ]


Summary by cubic

Set a lower default max output tokens limit (256) for autocomplete to reduce unnecessary token usage.

  • New Features
    • Autocomplete now uses a smaller max tokens value by default unless overridden.
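
A minimal sketch of the proposed behavior (the constant names and the general-purpose default value are assumptions for illustration; only the 256 autocomplete limit and the "unless overridden" rule come from this PR):

```typescript
const DEFAULT_MAX_TOKENS = 4096; // assumed general-purpose default, illustrative only
const AUTOCOMPLETE_DEFAULT_MAX_TOKENS = 256; // the lower default proposed in this PR

function resolveMaxTokens(
  roles: string[] | undefined,
  userMaxTokens?: number,
): number {
  // A value set explicitly in the user's config always wins.
  if (userMaxTokens !== undefined) {
    return userMaxTokens;
  }
  // Autocomplete-only models get the smaller default.
  const autocompleteOnly = roles?.length === 1 && roles[0] === "autocomplete";
  return autocompleteOnly ? AUTOCOMPLETE_DEFAULT_MAX_TOKENS : DEFAULT_MAX_TOKENS;
}

// resolveMaxTokens(["autocomplete"])            === 256
// resolveMaxTokens(["chat", "autocomplete"])    === 4096
// resolveMaxTokens(["autocomplete"], 1024)      === 1024
```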

@uinstinct uinstinct requested a review from a team as a code owner May 22, 2025 06:45
@uinstinct uinstinct requested review from Patrick-Erichsen and removed request for a team May 22, 2025 06:45

netlify bot commented May 22, 2025

Deploy Preview for continuedev canceled.

🔨 Latest commit: 3ebd8f9
🔍 Latest deploy log: https://app.netlify.com/projects/continuedev/deploys/682ec80bbfd12600096dfb21

@dosubot dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label May 22, 2025
@@ -73,6 +74,11 @@ export class CompletionProvider {
llm.completionOptions.temperature = 0.01;
}

// (DOES NOT WORK) llm.completionOptions.maxTokens is already populated - need to detect whether the llm already had maxTokens set in the user's config
@uinstinct uinstinct (Contributor, Author) commented:

I need some help here. I want to know whether maxTokens was already set in the user's config; however, BaseLLM always populates the completion options' maxTokens, so by the time CompletionProvider sees it the field is filled either way.
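
One way this could be detected (a sketch, not part of this PR; the flag name, constructor shape, and 4096 default are hypothetical) is to record whether the field was present before BaseLLM applies its defaults:

```typescript
// Hypothetical: remember whether the user explicitly configured maxTokens
// before BaseLLM fills completionOptions with defaults.
interface LLMOptions {
  completionOptions?: { maxTokens?: number };
}

class BaseLLM {
  completionOptions: { maxTokens: number };
  readonly userSetMaxTokens: boolean; // hypothetical flag, not in the PR

  constructor(options: LLMOptions) {
    this.userSetMaxTokens =
      options.completionOptions?.maxTokens !== undefined;
    this.completionOptions = {
      // 4096 stands in for whatever default BaseLLM normally applies.
      maxTokens: options.completionOptions?.maxTokens ?? 4096,
    };
  }
}

// CompletionProvider could then lower the limit only when the user
// did not set one:
//   if (!llm.userSetMaxTokens) llm.completionOptions.maxTokens = 256;
```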

@Patrick-Erichsen Patrick-Erichsen (Collaborator) left a comment:

Hey @uinstinct , sorry for the delay in reviewing this!

I'm going to close this out because I think there's a good chance we create a new autocompleteOptions in the near term. Assuming we do that, we can just directly add a new maxTokens property there.

@@ -195,6 +196,12 @@ export abstract class BaseLLM implements ILLM {
const templateType =
options.template ?? autodetectTemplateType(options.model);

// If the model's only role is autocomplete, use a smaller default maxTokens
const defaultMaxTokens =
options.roles?.length === 1 && options.roles.at(0) === "autocomplete"
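
The hunk above is cut off mid-expression; a plausible completion of the ternary (the 256 value comes from this PR's summary, while DEFAULT_MAX_TOKENS is an assumed pre-existing fallback constant) would be:

```typescript
// Assumed completion of the truncated diff above, for readability only.
const defaultMaxTokens =
  options.roles?.length === 1 && options.roles.at(0) === "autocomplete"
    ? 256 // smaller default for autocomplete-only models (per this PR)
    : DEFAULT_MAX_TOKENS; // assumed pre-existing general default
```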
@Patrick-Erichsen Patrick-Erichsen (Collaborator) commented:
This solution makes sense but there are some models such as codestral that can be used for both autocomplete and chat.

We are considering creating a top-level autocompleteOptions, similar to the chatOptions and embedOptions that already exist. If we go that route, then we can just create an autocompleteOptions.maxTokens.
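
To illustrate that suggestion (a hypothetical config shape, not an existing API; everything beyond the block names mentioned in this thread is an assumption), a top-level autocompleteOptions could sit alongside the existing blocks:

```typescript
// Hypothetical config shape sketching the reviewer's suggestion.
interface ModelConfig {
  chatOptions?: { maxTokens?: number };  // exists today, per this thread
  embedOptions?: { maxTokens?: number }; // exists today, per this thread
  // Proposed: a dedicated block so dual-role models such as codestral
  // can use a small budget for autocomplete and a larger one for chat.
  autocompleteOptions?: { maxTokens?: number };
}

const codestral: ModelConfig = {
  chatOptions: { maxTokens: 4096 },        // illustrative value
  autocompleteOptions: { maxTokens: 256 }, // the lower autocomplete limit
};
```

This would also keep the role-sniffing logic out of BaseLLM entirely.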

@github-project-automation github-project-automation bot moved this from Todo to In Progress in Issues and PRs Jun 1, 2025
@github-project-automation github-project-automation bot moved this from In Progress to Done in Issues and PRs Jun 1, 2025
@github-actions github-actions bot locked and limited conversation to collaborators Jun 1, 2025
@uinstinct uinstinct deleted the autocomplete-max-tokens branch June 2, 2025 03:25