[Story]: Allow for fallback models #1221

JAORMX · 2025-03-05T07:57:57Z

Description

The idea is to provide a way for CodeGate to fallback to another provider + model in case there's an issue with the original one CodeGate is proxying.

Additional Context

When actively working with a model, one might start hitting issues such as rate limits or actual downtime from the provider itself. In this case, I was trying Claude 3.7 and started hitting rate limit issues. I'd like to fallback to an equivalent model to seamlessly continue working.

JAORMX · 2025-03-05T07:59:03Z

@aponcedeleonch and I have been discussing about this and came up with the concept of "pseudo-providers" that is, fake provider endpoints within codegate that will encompass more complex logic. This could be a concept that we could use to implement fallback as requested in this issue. We could also start implementing A/B testing, or other more complex workflows using this.

aponcedeleonch · 2025-03-05T08:18:18Z

I've been thinking and probably would make sense to add fallback rules or conditions. That way we would only fallback if certain condition is met. At least these 2 come to mind:

Error: Fallback if an error occurred in the primary model. Like the rate limit example you pointed out above
Balance: A 50/50 split between the models. Would be useful for A/B testing for example

JAORMX · 2025-03-05T08:31:21Z

@aponcedeleonch adding rules makes sense; that's similar to the fallback mechanism we had in Minder for REST endpoints and data sources.

Balacing does not fall within the fallback provider implementation IMO and should be something else. It could use the same mechanism ("pseudo-providers"), but it would be another implementation.

Think of pseudo-providers as the abstract base class, and fallback as the implementation. Balancing and A/B testing would also be separate implementations.

JAORMX added the needs-triage label Mar 5, 2025

lukehinds removed the needs-triage label Mar 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Story]: Allow for fallback models #1221

[Story]: Allow for fallback models #1221

JAORMX commented Mar 5, 2025

JAORMX commented Mar 5, 2025

aponcedeleonch commented Mar 5, 2025

JAORMX commented Mar 5, 2025

[Story]: Allow for fallback models #1221

[Story]: Allow for fallback models #1221

Comments

JAORMX commented Mar 5, 2025

Description

Additional Context

JAORMX commented Mar 5, 2025

aponcedeleonch commented Mar 5, 2025

JAORMX commented Mar 5, 2025