
Add OVHcloud as an inference provider #1303


Open · wants to merge 9 commits into main

Conversation

fabienric

What

Adds OVHcloud as an inference provider.

Test Plan

Added new tests for OVHcloud both with and without streaming.

What Should Reviewers Focus On?

I used the Cerebras PR as an example.

@julien-c julien-c added the inference-providers integration of a new or existing Inference Provider label Apr 2, 2025
@julien-c
Member

julien-c commented Apr 2, 2025

Hi @fabienric, we are currently finishing a refactoring of the Inference Providers integration code in #1315. It should be merged soon, but we will need to rewrite part of your implementation (it should be even simpler to integrate). We'll ping you again once it's merged.

@hanouticelina
Contributor

Hi @fabienric,
We've merged a refactoring of the Inference Provider integration into main, which should make adding new providers much easier.
Could you merge main into your branch and update your PR accordingly? It should be relatively straightforward with the new structure:
1 - Update the PROVIDERS mapping in inference/src/lib/getProviderHelper.ts#L49 to add ovhcloud (keeping the list in alphabetical order):

import * as OvhCloud from "../providers/ovhcloud";
...
export const PROVIDERS: Record<InferenceProvider, Partial<Record<InferenceTask, TaskProviderHelper>>> = {
	...
	"ovhcloud": {
		"conversational": new OvhCloud.OvhCloudConversationalTask(),
	},
	...

2 - Update packages/inference/src/providers/ovhcloud.ts to implement an OvhCloudConversationalTask that inherits from BaseConversationalTask:

import { BaseConversationalTask } from "./providerHelper";

export class OvhCloudConversationalTask extends BaseConversationalTask {
	constructor() {
		super("ovhcloud", "https://oai.endpoints.kepler.ai.cloud.ovh.net");
	}
}

And that's it :) Let us know if you need any help! You can find more details in the documentation: https://huggingface.co/docs/inference-providers/register-as-a-provider#2-js-client-integration.
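Once both steps are in place, calling the provider from the client should look roughly like this. A minimal sketch; the model ID is illustrative, not a confirmed OVHcloud offering:

import { InferenceClient } from "@huggingface/inference";

// Minimal usage sketch for the new provider. The model ID below is
// illustrative; any conversational model served by OVHcloud would work.
const client = new InferenceClient(process.env.HF_TOKEN);
const out = await client.chatCompletion({
	provider: "ovhcloud",
	model: "meta-llama/Llama-3.1-8B-Instruct",
	messages: [{ role: "user", content: "Hello!" }],
});
console.log(out.choices[0].message.content);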

@julien-c
Member

julien-c commented Apr 8, 2025

(sorry for the moving parts @fabienric – we can help move this PR over the finish line if needed)

Fabien Ric added 2 commits April 14, 2025 16:12
- ovhcloud inference provider: use new base tasks and provider helpers, fix issues with inference parameters, add support for text generation task
@fabienric
Author

fabienric commented Apr 14, 2025

Hi @hanouticelina and @julien-c,

Thank you for the feedback, refactoring and updated documentation.

I've implemented our provider; it required more work than I expected to get the payload right for an OpenAI-compatible endpoint (making sure that seed, max_tokens and the other generation parameters are correctly mapped: the base implementation puts them in a parameters dictionary, which is ignored on our end). Maybe other providers are affected by this issue?
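For reference, the kind of fix involved looks roughly like the sketch below, assuming a preparePayload hook on the base task (the hook name, the BodyParams type and its import path are assumptions about the helper API, not a confirmed interface):

import { BaseConversationalTask } from "./providerHelper";
import type { BodyParams } from "../types"; // import path is an assumption

export class OvhCloudConversationalTask extends BaseConversationalTask {
	constructor() {
		super("ovhcloud", "https://oai.endpoints.kepler.ai.cloud.ovh.net");
	}

	// Flatten generation parameters (seed, max_tokens, ...) to the top level:
	// an OpenAI-compatible endpoint ignores a nested `parameters` object.
	override preparePayload(params: BodyParams): Record<string, unknown> {
		const { parameters, ...rest } = params.args as {
			parameters?: Record<string, unknown>;
			[key: string]: unknown;
		};
		return { ...rest, ...parameters, model: params.model };
	}
}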

I've also implemented the text-generation task, but I've found that the streaming case isn't covered by the base task (its getResponse returns a Promise&lt;TextGenerationOutput&gt;, whereas we would need a way to return a TextGenerationStreamOutput). My test case passes using ChatCompletionStreamOutput, but that doesn't seem right. Maybe I missed something here.
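Roughly, the gap looks like this (the interface below is illustrative only, not the actual helper API):

import type { TextGenerationOutput, TextGenerationStreamOutput } from "@huggingface/tasks";

// Illustrative interface, not the real helper API.
interface StreamAwareTextGenerationHelper {
	// What the base task exposes today (single-shot only):
	getResponse(response: unknown): Promise<TextGenerationOutput>;
	// What streaming would additionally need, e.g. an async iterator of chunks:
	getStreamResponse(response: ReadableStream<Uint8Array>): AsyncGenerator<TextGenerationStreamOutput>;
}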

Available to discuss the matter further if required.

@Wauplin
Contributor

Wauplin commented Apr 22, 2025

Maybe other providers are impacted by this issue?

Actually, most (if not all) providers are implemented only for the conversational task, which is by far the most used. It covers both the text-generation and image-text-to-text pipelines, as long as the model is tagged as conversational (i.e. it has a chat template, etc.). That is why the text-generation implementation has been left aside a bit for now.

Let us know if the intention is indeed to cover text-generation AND conversational, in which case we can think about improvements for text-generation (streaming option, OpenAI-compatible API, etc.). I do think it's better to ship conversational-only for now to get the integration running, and then come back to "non-conversational text-generation".
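For illustration, a plain text-generation prompt can already be served through the conversational task as a single user message, streaming included. A rough sketch (the model ID is illustrative):

import { InferenceClient } from "@huggingface/inference";

// Sketch: a plain prompt served via the conversational task, with streaming.
// Requires a model that has a chat template; the model ID is illustrative.
const client = new InferenceClient(process.env.HF_TOKEN);
const stream = client.chatCompletionStream({
	provider: "ovhcloud",
	model: "meta-llama/Llama-3.1-8B-Instruct",
	messages: [{ role: "user", content: "Once upon a time" }],
});
for await (const chunk of stream) {
	process.stdout.write(chunk.choices[0]?.delta?.content ?? "");
}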

@fabienric
Author

fabienric commented Apr 22, 2025

Hi @Wauplin and thanks for your feedback.

Actually, my remark on the OpenAI-compatible parameters also applies to the conversational case; let me know if that's not what you observe.

I agree that the priority on our side is to get the conversational use case running. I can remove the text-generation code if needed, but I think we can leave it as is.

Let me know when you're ready to merge.
