
VSCode llamafile Support Is Broken #5530


Open · 3 tasks done
ipaddicting opened this issue May 6, 2025 · 4 comments · May be fixed by #5726
Labels: ide:vscode, kind:bug, os:mac

Comments

ipaddicting commented May 6, 2025

Before submitting your bug report

Relevant environment info

- OS: macOS
- Continue version: v1.0.8
- IDE version: VSCode 1.99.3
- Model: llamafile v0.9.2
- config:
  
  "models": [
    {
      "title": "Qwen 2.5 Coder 32B",
      "model": "qwen2.5-coder:32b",
      "provider": "llamafile",
      "apiBase": "http://127.0.0.1:8001"
    }
  ],

Description

llamafile support in VS Code is broken.

I rolled back Continue to v1.0.1 and it works fine; the log output is as follows:

{"function":"log_server_request","level":"INFO","line":2842,"method":"POST","msg":"request","params":{},"path":"/completion","remote_addr":"127.0.0.1","remote_port":57485,"status":200,"tid":"5188741632","timestamp":1746515946}

To reproduce

  1. Open chat
  2. Type "who are you"
  3. Press Enter

Log output

Error message from Continue:

Unexpected token 'F', "File Not Found" is not valid JSON

And the error message from llamafile:

{"function":"log_server_request","level":"INFO","line":2842,"method":"POST","msg":"request","params":{},"path":"/completions","remote_addr":"127.0.0.1","remote_port":58247,"status":404,"tid":"5188744736","timestamp":1746516335}

It looks like the request path changed from "path":"/completion" to "path":"/completions".
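
For reference, the native endpoint can be exercised outside the extension. A minimal TypeScript sketch (assuming the llamafile server from the config above on http://127.0.0.1:8001; the prompt/n_predict fields follow the llama.cpp server README):

// Minimal sketch: POST to llamafile's native /completion endpoint.
async function complete(prompt: string): Promise<string> {
  const res = await fetch("http://127.0.0.1:8001/completion", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ prompt, n_predict: 64 }),
  });
  if (!res.ok) {
    // Against the broken "/completions" path this throws on a 404, and the
    // body is the plain-text "File Not Found" behind the JSON parse error above.
    throw new Error(`HTTP ${res.status}: ${await res.text()}`);
  }
  const data = (await res.json()) as { content: string };
  return data.content;
}

complete("who are you").then(console.log).catch(console.error);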

dosubot bot added the ide:vscode and kind:bug labels May 6, 2025

ipaddicting commented May 6, 2025

The bug was introduced by pull request "Update LlamaCpp.ts" #5030 in v1.0.6-vscode, because Llamafile extends LlamaCpp.
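
To illustrate why a change in LlamaCpp.ts reaches llamafile users, here is a hedged TypeScript sketch (class and method names are illustrative, not the actual Continue source):

// Illustrative sketch only -- not the real Continue code.
class LlamaCpp {
  constructor(protected apiBase: string) {}
  completionUrl(): string {
    // Effectively changed by #5030 from "completion" to "completions".
    return new URL("completions", this.apiBase).toString();
  }
}

class Llamafile extends LlamaCpp {} // no override, so the path is inherited

console.log(new Llamafile("http://127.0.0.1:8001/").completionUrl());
// -> http://127.0.0.1:8001/completions, which llamafile answers with 404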

dosubot bot added the os:mac label May 7, 2025

ipaddicting commented May 19, 2025

I just checked the updated documentation for llama.cpp; the URL remains /completion (see https://github.com/ggml-org/llama.cpp/blob/master/tools/server/README.md#post-completion-given-a-prompt-it-returns-the-predicted-completion).

/v1/completions is for OAI-compatible clients only, but there is no /v1 prefix in LlamaCpp.ts, and I don't think there should be.
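
To make the distinction concrete, a hedged sketch of the two routes a llama.cpp-style server actually exposes (paths per the README linked above; payload fields are assumptions for illustration):

const apiBase = "http://127.0.0.1:8001";

// Native llama.cpp endpoint -- what llamafile serves, and what Continue
// v1.0.1 called (the 200 in the first log above):
await fetch(`${apiBase}/completion`, {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({ prompt: "who are you", n_predict: 64 }),
});

// OpenAI-compatible route -- note the /v1 prefix:
await fetch(`${apiBase}/v1/completions`, {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({ model: "qwen2.5-coder:32b", prompt: "who are you" }),
});

// A bare "/completions" (what #5030 produced) matches neither route -> 404.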

RomneyDa (Collaborator) commented

@ipaddicting thanks for the quick turnaround on this

RomneyDa (Collaborator) commented

@sestinj assigning to you, since the related PR #5726 reverts #5030

RomneyDa assigned sestinj and unassigned RomneyDa May 26, 2025
Projects: Status: Todo