-
Notifications
You must be signed in to change notification settings - Fork 75
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Story]: Allow for fallback models #1221
Comments
@aponcedeleonch and I have been discussing about this and came up with the concept of "pseudo-providers" that is, fake provider endpoints within codegate that will encompass more complex logic. This could be a concept that we could use to implement fallback as requested in this issue. We could also start implementing A/B testing, or other more complex workflows using this. |
I've been thinking and probably would make sense to add fallback rules or conditions. That way we would only fallback if certain condition is met. At least these 2 come to mind:
|
@aponcedeleonch adding rules makes sense; that's similar to the fallback mechanism we had in Minder for REST endpoints and data sources. Balacing does not fall within the fallback provider implementation IMO and should be something else. It could use the same mechanism ("pseudo-providers"), but it would be another implementation. Think of pseudo-providers as the abstract base class, and fallback as the implementation. Balancing and A/B testing would also be separate implementations. |
Description
The idea is to provide a way for CodeGate to fallback to another provider + model in case there's an issue with the original one CodeGate is proxying.
Additional Context
When actively working with a model, one might start hitting issues such as rate limits or actual downtime from the provider itself. In this case, I was trying Claude 3.7 and started hitting rate limit issues. I'd like to fallback to an equivalent model to seamlessly continue working.
The text was updated successfully, but these errors were encountered: