Commit c76d9fd
committed
DAOS-18633 rebuild: abort orphaned reclaim rpt after PS leader switch
After PS leader switch, ds_rebuild_regenerate_task() only regenerates
rebuild tasks for DOWN/DRAIN/UP targets. RECLAIM tasks are not
regenerated because reintegrated targets are already UPIN. This
leaves orphaned rpt on every target with a stale leader term, whose
IV updates are silently dropped by the new leader (no matching rgt).
The result is sp_rebuilding > 0 permanently, blocking EC aggregation
and causing system-wide performance degradation.
Fix: detect stale leader term in rebuild_tgt_status_check_ult() and
abort the orphaned rpt.
TODO: persist in-progress reclaim tasks in RDB so they can be
properly re-triggered on PS leader step_up.
Signed-off-by: Wang Shilong <shilong.wang@hpe.com>1 parent b34e4e8 commit c76d9fd
1 file changed
+17
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2972 | 2972 | | |
2973 | 2973 | | |
2974 | 2974 | | |
| 2975 | + | |
| 2976 | + | |
| 2977 | + | |
| 2978 | + | |
| 2979 | + | |
| 2980 | + | |
| 2981 | + | |
| 2982 | + | |
| 2983 | + | |
| 2984 | + | |
| 2985 | + | |
| 2986 | + | |
| 2987 | + | |
| 2988 | + | |
| 2989 | + | |
| 2990 | + | |
| 2991 | + | |
2975 | 2992 | | |
2976 | 2993 | | |
2977 | 2994 | | |
| |||
0 commit comments