Skip to content

Issues: kubernetes-sigs/gateway-api-inference-extension

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Adaptive metrics probing periods
#667 opened Apr 8, 2025 by ahg-g
Add Envoy AI Gateway Guides
#651 opened Apr 5, 2025 by Xunzhuo
Add a metric to track InferenceModels ready to serve by the epp good first issue Denotes an issue ready for a new contributor, according to the "help wanted" guidelines. help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines.
#598 opened Mar 28, 2025 by ahg-g
lora-syncer should block on startup until the server is ready kind/bug Categorizes issue or PR as related to a bug.
#597 opened Mar 28, 2025 by ahg-g
move updating scheduling parameters from env variables to main from scheduling pkg good first issue Denotes an issue ready for a new contributor, according to the "help wanted" guidelines. help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines.
#586 opened Mar 27, 2025 by kaushikmitr
lora-syncer tool's error handling needs improvement kind/bug Categorizes issue or PR as related to a bug.
#584 opened Mar 26, 2025 by kfswain
EPP TLS support provides very minimal protection kind/feature Categorizes issue or PR as related to a new feature.
#582 opened Mar 26, 2025 by LiorLieberman
Metric showing latency to make a placement decision kind/bug Categorizes issue or PR as related to a bug.
#581 opened Mar 26, 2025 by smarterclayton
Include metadata metric
#579 opened Mar 26, 2025 by JeffLuoo
During hitless rollout testing, on average one early request to vLLM times out. kind/bug Categorizes issue or PR as related to a bug.
#557 opened Mar 21, 2025 by smarterclayton
Configure x-request-id support in the default ootb examples kind/bug Categorizes issue or PR as related to a bug.
#556 opened Mar 21, 2025 by smarterclayton
Improve metric capture on error
#554 opened Mar 20, 2025 by kfswain
We should encourage all InferencePool deployments to gracefully rollout and drain kind/bug Categorizes issue or PR as related to a bug.
#549 opened Mar 20, 2025 by smarterclayton
Remove k8s dependency from BBR
#535 opened Mar 19, 2025 by rramkumar1
Unable to know when gateway extension is scraped, no log line printed kind/bug Categorizes issue or PR as related to a bug.
#525 opened Mar 18, 2025 by smarterclayton
ProTip! Updated in the last three days: updated:>2025-04-05.