Skip to content

server : consolidate slot selection into get_available_slot#24755

Merged
ggerganov merged 1 commit into
masterfrom
gg/server-consolidate-slot-selection
Jun 19, 2026
Merged

server : consolidate slot selection into get_available_slot#24755
ggerganov merged 1 commit into
masterfrom
gg/server-consolidate-slot-selection

Conversation

@ggerganov

@ggerganov ggerganov commented Jun 18, 2026

Copy link
Copy Markdown
Member

Overview

fix #24746

Absorb get_slot_by_id logic into get_available_slot so slot selection is handled by a single function call. When a specific slot id is requested, the LCP similarity check still runs to enable proper prompt cache updates.

Requirements

Absorb get_slot_by_id logic into get_available_slot so slot selection
is handled by a single function call. When a specific slot id is
requested, the LCP similarity check still runs to enable proper
prompt cache updates.

Assisted-by: pi:llama.cpp/Qwen3.6-27B
@ggerganov ggerganov marked this pull request as ready for review June 19, 2026 06:21
@ggerganov ggerganov requested a review from a team as a code owner June 19, 2026 06:21
@ggerganov ggerganov merged commit 80452d6 into master Jun 19, 2026
23 of 25 checks passed
@ggerganov ggerganov deleted the gg/server-consolidate-slot-selection branch June 19, 2026 06:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Misc. bug: Explicit Slot Requests Bypass Prompt Cache Restore

1 participant