feat: Update Qwen3 to Qwen3.5 by r-dh · Pull Request #181 · superlinear-ai/raglite

r-dh · 2026-03-06T13:47:38Z

Summary

Upgrade default local LLM from Qwen 3 (8B/4B) to Qwen 3.5 (9B/4B)
Bump llama-cpp-python optional dep from >=0.3.9 to >=0.3.16
Fix division by zero in _limit_chunkspans when context budget is exhausted

Blocker

Depends on abetlen/llama-cpp-python#2133, which adds Qwen 3.5 GDN (Gated Delta Network) support to llama-cpp-python. That PR also fixes a prefix-caching bug affecting all hybrid architecture models. Until it is merged and released as >=0.3.16, this PR cannot be merged.

Known issue

test_self_query fails: Qwen 3.5 returns {'topic': ['Physics']} instead of {} for an off-topic query ("What is the price of a Bugatti Chiron?"). The model applies a metadata filter even when the query is unrelated to the dataset. This is a behavioral regression compared to Qwen 3 and should be addressed separately, either by tuning the self-query prompt or by accepting the looser behavior.

Test notes

Tests use n_ctx=6144 instead of the default 8192 because the GDN model plus the embedding model together exceed Metal GPU memory at 8192
All other non-slow tests pass (31 passed, 1 pre-existing OpenAI API failure unrelated to this change)
All slow function-calling tests pass (16 passed, 2 skipped for PostgreSQL)

r-dh added 2 commits March 6, 2026 14:51

feat: upgrade default LLM from Qwen 3 to Qwen 3.5

b9bda63

fix: prevent division by zero in _limit_chunkspans

c0b4a01

r-dh force-pushed the rd-qwen3.5 branch from c5cd5cc to c0b4a01 Compare March 6, 2026 13:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Update Qwen3 to Qwen3.5#181

feat: Update Qwen3 to Qwen3.5#181
r-dh wants to merge 2 commits intomainfrom
rd-qwen3.5

r-dh commented Mar 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

r-dh commented Mar 6, 2026

Summary

Blocker

Known issue

Test notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant