- Notifications
You must be signed in to change notification settings - Fork 1.2k
Pull requests: huggingface/text-generation-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Update Dockerfile to use devel image for compatibility
#2848 opened Dec 16, 2024 by YaserJaradeh Loading…
2 of 5 tasks
Fix CPU and memory affinity under external resource management
#3012 opened Feb 11, 2025 by askervin Loading…
Expose the real-time internal state of the batcher through SSE
#3065 opened Feb 27, 2025 by mfuntowicz • Draft
Set
uv UV_PYTHON_INSTALL_DIR explicitly #3197 opened Apr 27, 2025 by sebastianliebscher Loading…
1 of 5 tasks
display available cached versions in TGI server error message of Neuron backend
#3063 opened Feb 26, 2025 by jimburtoft Loading…
4 tasks
Kvrouter that will increase the kv-cache hits in case of multiple routing strategy
#2965 opened Jan 29, 2025 by Narsil Loading…
5 tasks
Fix flashinfer plan call to use positional arguments for #3165
#3166 opened Apr 11, 2025 by ruckc Loading…
2 of 5 tasks
feat: expose GPU energy consumption (mJ) in responses
#3315 opened Aug 28, 2025 by JulienDelavande Loading…
2 of 5 tasks
Remove
once_cell dependency from multiple Cargo.toml files and update usage in validation.rs to use std::sync::LazyLock instead of once_cell::sync::Lazy. #3334 opened Sep 28, 2025 by htiennv Loading…
5 tasks
Previous Next
ProTip! no:milestone will show everything without a milestone.