Fix push/pop scopes leaking learned clauses into subsequent proofs by Copilot · Pull Request #9221 · Z3Prover/z3

Copilot · 2026-04-03T17:55:26Z

After push()/check-sat()/pop(), learned clauses involving only outer-scope variables survive gc_vars() and persist in the clause database, polluting watch lists and biasing VSIDS activity scores for subsequent proofs outside the scope.

Approach

Track the user scope level at clause creation using a new 2-bit m_scope_lim field in the clause struct (uses 2 previously unused bits in the existing packed bitfield). During user_pop(), after gc_vars() removes clauses containing pop'd variables, learned clauses with scope_lim > new_scope_level are also removed.

The field saturates at 3, so the cleanup is precise for up to 3 levels of nested push/pop; popping to scope level ≥3 cannot distinguish level-N from level-N+1 clauses, but the common cases (1–2 nesting levels) are handled exactly.

The cleanup is controlled by the new sat.gc.learned_pop parameter (default true), which can be set to false to disable the feature for testing or profiling.

Changes

sat_clause.h: Add m_scope_lim:2 bitfield and scope_lim()/set_scope_lim() accessors to clause.
sat_clause.cpp: Initialize m_scope_lim = 0 in the constructor; copy it in copy_clause() so memory defragmentation preserves scope information.
sat_solver.cpp (mk_nary_clause): Stamp each learned clause with the current user scope depth on creation.
sat_solver.cpp (user_pop): After gc_vars(), sweep m_learned and detach/delete any clause whose scope_lim exceeds the new scope level, gated on m_config.m_gc_learned_pop.
sat_params.pyg: Add gc.learned_pop (bool, default true) parameter.
sat_config.h / sat_config.cpp: Add m_gc_learned_pop field and read it from params.
src/test/sat_gc.cpp: New benchmark (tst_sat_gc) that uses a near-phase-transition 3-SAT formula (200 variables, ratio 4.5) with 3 push/pop cycles to demonstrate that gc.learned_pop=true removes ~6000 stale learned clauses that would otherwise accumulate. Runs in ~0.1 s.

Add a 2-bit scope_lim field to the SAT clause struct to track the user scope level at which each learned clause was created. When user_pop is called, learned clauses that were added during the popped user scopes are removed. The root cause was that learned clauses accumulated during inner push/pop scopes were persisting after pop, polluting the clause database and biasing the search for subsequent proofs. - sat_clause.h: Add m_scope_lim:2 bitfield and set_scope_lim/scope_lim accessors - sat_clause.cpp: Initialize m_scope_lim=0 in constructor, copy in copy_clause - sat_solver.cpp (mk_nary_clause): Set scope_lim when adding learned clauses - sat_solver.cpp (user_pop): After gc_vars, remove learned clauses with scope_lim > new scope level (handles up to 3 levels of nesting precisely) Agent-Logs-Url: https://github.com/Z3Prover/z3/sessions/c23a0600-90dd-45a1-b64f-c286d041f373 Co-authored-by: NikolajBjorner <3085284+NikolajBjorner@users.noreply.github.com>

Agent-Logs-Url: https://github.com/Z3Prover/z3/sessions/c23a0600-90dd-45a1-b64f-c286d041f373 Co-authored-by: NikolajBjorner <3085284+NikolajBjorner@users.noreply.github.com>

NikolajBjorner · 2026-04-03T20:01:44Z

@copilot create a benchmark where this makes a difference. Add a parameter to enable-disable this feature to allow testing.

…d clause cleanup - src/params/sat_params.pyg: Add gc.learned_pop (bool, default true) parameter - src/sat/sat_config.h: Add m_gc_learned_pop field to config - src/sat/sat_config.cpp: Read gc.learned_pop from params - src/sat/sat_solver.cpp: Gate the user_pop cleanup on m_config.m_gc_learned_pop - src/test/sat_gc.cpp: New benchmark showing 6000+ clauses removed per run - src/test/main.cpp: Register tst_sat_gc in FOR_EACH_EXTRA_TEST - src/test/CMakeLists.txt: Add sat_gc.cpp to test sources Agent-Logs-Url: https://github.com/Z3Prover/z3/sessions/83cecb0d-37c4-4f96-a987-7d73ae8fabc3 Co-authored-by: NikolajBjorner <3085284+NikolajBjorner@users.noreply.github.com>

Copilot · 2026-04-03T20:47:59Z

@copilot create a benchmark where this makes a difference. Add a parameter to enable-disable this feature to allow testing.

Added in commit a636fb3:

Parameter: sat.gc.learned_pop (bool, default true) — gates the learned clause cleanup in user_pop. Pass false to disable and observe the pollution effect.

Benchmark (src/test/sat_gc.cpp, tst_sat_gc): sets up a near-phase-transition 3-SAT formula on 200 outer-scope variables (ratio 4.5), runs 3 push/pop cycles, and compares the learned clause count before the final check-sat. With the default gc.learned_pop=true the clause database is clean (0 extra clauses); with gc.learned_pop=false over 6000 stale clauses accumulate. The test runs in ~0.1 s.

The mechanism: conflicts during the inner-scope check that are derived entirely from outer-scope clauses (which carry no scope-guard literal) produce learned lemmas that survive gc_vars. The benchmark triggers this by using a formula dense enough that outer-scope conflicts arise naturally. The VERIFY(lrn_with <= lrn_without) assertion documents the invariant.

NikolajBjorner · 2026-04-03T21:22:53Z

the "fix" while plausible improvement for something has nothing to do with the original bug report.
The original bug report creates benchmarks that don't use the sat core (unless the user specified "sat.smt=true" which I didn't see)

Initial plan

9b6d87f

Copilot AI assigned Copilot and NikolajBjorner Apr 3, 2026

Copilot started work on behalf of NikolajBjorner April 3, 2026 17:55 View session

Copilot AI linked an issue Apr 3, 2026 that may be closed by this pull request

push/pop scopes affect performance of subsequent proofs #9220

Open

Copilot AI and others added 2 commits April 3, 2026 18:33

Improve comments in push/pop scope cleanup code

1218520

Agent-Logs-Url: https://github.com/Z3Prover/z3/sessions/c23a0600-90dd-45a1-b64f-c286d041f373 Co-authored-by: NikolajBjorner <3085284+NikolajBjorner@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Fix push/pop scopes affecting performance of proofs~~ Fix push/pop scopes leaking learned clauses into subsequent proofs Apr 3, 2026

Copilot finished work on behalf of NikolajBjorner April 3, 2026 18:35

Copilot AI requested a review from NikolajBjorner April 3, 2026 18:35

Copilot started work on behalf of NikolajBjorner April 3, 2026 20:01 View session

Copilot finished work on behalf of NikolajBjorner April 3, 2026 20:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix push/pop scopes leaking learned clauses into subsequent proofs#9221

Fix push/pop scopes leaking learned clauses into subsequent proofs#9221
Copilot wants to merge 4 commits intomasterfrom
copilot/fix-push-pop-scopes-performance

Copilot AI commented Apr 3, 2026 •

edited

Loading

Uh oh!

NikolajBjorner commented Apr 3, 2026

Uh oh!

Copilot AI commented Apr 3, 2026

Uh oh!

NikolajBjorner commented Apr 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented Apr 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Approach

Changes

Uh oh!

NikolajBjorner commented Apr 3, 2026

Uh oh!

Copilot AI commented Apr 3, 2026

Uh oh!

NikolajBjorner commented Apr 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Apr 3, 2026 •

edited

Loading