feat(execution): Predictive Pipeline + CodeRabbit review fixes #579

rafaelscosta wants to merge 3 commits into SynkraAI:main from
Conversation
…ning Story 9.5 of Epic 9 (Persistent Memory Layer). Implements Phase 2 of the Agent Immortality Protocol (SynkraAI#482) — Persistence layer.

Features:
- Record decisions with context, rationale, and alternatives
- Track outcomes (success/partial/failure) with confidence scoring
- Auto-detect categories from description keywords
- Find relevant past decisions for context injection (AC7)
- Pattern detection across recurring decisions (AC9)
- Time-based confidence decay for relevance scoring
- Persistence to .aiox/decisions.json

37 unit tests covering all features.
…task results. Predictive pipeline that estimates outcomes before execution using historical patterns. Implements weighted k-NN with feature vectors, EWMA for duration estimation, anomaly detection, risk assessment, and an agent/strategy recommendation engine. 89 unit tests covering all scenarios.
…ion, atomic writes

- Fix _enqueueWrite to rethrow errors after emitting, keeping the chain rejected
- Add _recalculateModelStats after auto-prune splice to keep stats consistent
- Sanitize numeric features with Number.isFinite guards in _extractFeatures
- Honor minSamplesForPrediction threshold in _stagePredict
- Reverse durationValues before EWMA so highest-similarity samples get largest weight
- Add _ensureLoaded guard to recordDecision and save in decision-memory
- Use atomicWriteSync instead of fs.writeFileSync in decision-memory save
- Improve test assertions: verify unique IDs in concurrent test

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
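The "reverse durationValues before EWMA" fix can be sketched in isolation. An EWMA fold gives the last element the largest weight, so a list ordered most-similar-first must be reversed before folding for the closest neighbor to dominate. The `ewma` helper, alpha, and sample durations below are illustrative assumptions, not the repo's actual implementation:

```javascript
// Minimal EWMA sketch (hypothetical helper, not the repo's code). The fold
// weights later elements more heavily: the LAST value has the largest weight.
function ewma(values, alpha = 0.5) {
  return values.reduce((acc, v) => alpha * v + (1 - alpha) * acc);
}

// Neighbor durations ordered most-similar-first (100 is the closest match).
const durations = [100, 200, 400];

const buggy = ewma(durations);                // least-similar sample dominates
const fixed = ewma([...durations].reverse()); // most-similar sample dominates
```

With these numbers the reversed fold lands closer to the most similar neighbor's duration, which is the behavior the commit message describes.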
@rafaelscosta is attempting to deploy a commit to Pedro Valério Lopez's projects Team on Vercel. A member of the Team first needs to authorize it.
Walkthrough

This PR introduces two new core modules for agentic decision-making: PredictivePipeline for forecasting task outcomes using historical data and k-NN similarity matching, and DecisionMemory for persistent cross-session decision recording and pattern detection. Includes a compatibility wrapper, comprehensive test suites, and manifest updates.
Sequence Diagrams

sequenceDiagram
actor Client
participant Pipeline as PredictivePipeline
participant Persist as Persistence Layer
Client->>Pipeline: predict(taskSpec) or recordOutcome(outcome)
activate Pipeline
Pipeline->>Pipeline: 1. Preprocess: extract features
Pipeline->>Pipeline: 2. Match: compute similarity, find k-NN
Pipeline->>Pipeline: 3. Predict: weighted metrics, EWMA, estimates
Pipeline->>Pipeline: 4. Score: confidence, anomaly detection
Pipeline->>Pipeline: 5. Recommend: best agent/strategy
Pipeline->>Persist: save outcomes (lazy, sequenced writes)
Persist-->>Pipeline: persisted
Pipeline-->>Client: prediction result + events
deactivate Pipeline
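The Match step in the diagram above (compute similarity, find k-NN) can be sketched as a weighted nearest-neighbor lookup. The feature vectors, inverse-distance similarity formula, and names below are assumptions for illustration, not the module's actual matcher:

```javascript
// Illustrative k-NN matcher sketch (assumed names; not the PredictivePipeline
// code). Similarity is an inverse-distance score over numeric feature vectors:
// 1 for identical vectors, approaching 0 as they diverge.
function similarity(a, b) {
  const dist = Math.sqrt(a.reduce((s, v, i) => s + (v - b[i]) ** 2, 0));
  return 1 / (1 + dist);
}

function kNearest(history, query, k) {
  return history
    .map((h) => ({ ...h, score: similarity(h.features, query) }))
    .sort((x, y) => y.score - x.score) // most similar first
    .slice(0, k);
}

const history = [
  { duration: 100, features: [1, 2] },
  { duration: 500, features: [9, 9] },
  { duration: 120, features: [1, 3] },
];
const neighbors = kNearest(history, [1, 2], 2);
```

The resulting most-similar-first ordering is what the Predict stage then consumes for its weighted metrics.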
sequenceDiagram
actor Client
participant Memory as DecisionMemory
participant Persist as Persistence Layer
Client->>Memory: recordDecision(spec) or findRelevant(taskSpec)
activate Memory
Memory->>Memory: auto-detect category & extract keywords
Memory->>Persist: save decision (atomic write, schema versioned)
Persist-->>Memory: persisted
Memory->>Memory: check pattern threshold
alt Pattern detected
Memory->>Memory: emit PATTERN_DETECTED event
end
Memory-->>Client: result + events (DECISION_RECORDED, PATTERN_DETECTED)
deactivate Memory
Client->>Memory: getRelevant(taskSpec)
activate Memory
Memory->>Memory: keyword similarity + time decay scoring
Memory->>Memory: filter by category, success rate, confidence
Memory-->>Client: sorted relevant decisions
deactivate Memory
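The "keyword similarity + time decay scoring" step in the second diagram can be sketched with an exponential half-life decay. The function name, half-life value, and parameters below are assumptions, not DecisionMemory's actual formula:

```javascript
// Hypothetical time-decayed relevance scoring sketch (assumed names and
// half-life; the module's real decay curve may differ). A decision's base
// keyword-similarity score is worth half as much after each half-life.
function relevanceScore(similarity, ageDays, halfLifeDays = 30) {
  const decay = Math.pow(0.5, ageDays / halfLifeDays); // exponential decay
  return similarity * decay;
}

const fresh = relevanceScore(0.8, 0);   // no decay applied
const stale = relevanceScore(0.8, 30);  // one half-life old: half the score
```

Sorting candidates by this combined score is one simple way to get the "sorted relevant decisions" the diagram returns.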
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~45 minutes

🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed

❌ Failed checks (1 warning)
✅ Passed checks (2 passed)
Welcome to aiox-core! Thanks for your first pull request.
What happens next?
- Automated checks will run on your PR
- A maintainer will review your changes
- Once approved, we'll merge your contribution!
PR Checklist:
- Tests pass (npm test)
- Linting passes (npm run lint)
- Commit messages follow Conventional Commits
Thanks for contributing!
Actionable comments posted: 7
🧹 Nitpick comments (2)
tests/core/execution/predictive-pipeline.test.js (2)
12-17: Use the repo's absolute import form here. This relative require(...) makes the test brittle to file moves and bypasses the project import convention. Switch it to the repository's absolute module path.

As per coding guidelines (**/*.{js,jsx,ts,tsx}): Use absolute imports instead of relative imports in all code.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@tests/core/execution/predictive-pipeline.test.js` around lines 12 - 17, The test imports core symbols using a brittle relative path; update the require call that pulls in PredictivePipeline, PipelineStage, RiskLevel, and DEFAULTS to use the repository’s absolute module path for the core execution predictive-pipeline module (follow the project's absolute import convention) so tests use the canonical module entry rather than a relative file path; locate the line importing those symbols and replace the relative require with the repo's absolute module specifier for the same module.
250-263: Assert the derived stats after auto-prune, not just the array length. The regression fixed in this PR was stale taskType/agent/strategy aggregates after pruning. This test still goes green if outcomes is trimmed to 5 while those derived counters remain wrong. Seed at least one task/agent/strategy that should be fully pruned and assert getStats() drops it too.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@tests/core/execution/predictive-pipeline.test.js` around lines 250 - 263, The test currently only checks outcomes length but not that derived aggregates were updated after pruning; update the test that creates PredictivePipeline (small) to seed outcomes with at least one unique taskType/agent/strategy that will be completely pruned when maxOutcomes is exceeded (use recordOutcome to create multiple entries so older entries for that task/agent/strategy are removed), then call getStats() and assert that the returned derived aggregates (the taskType/agent/strategy counts or lists provided by getStats()) no longer include the pruned task/agent/strategy; ensure you reference the same PredictivePipeline instance (small), use recordOutcome to insert the distinguishing entries, and assert on getStats() fields rather than only stats.outcomes length.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In @.aiox-core/core/execution/predictive-pipeline.js:
- Around line 697-703: findSimilarTasks and assessRisk call _extractFeatures
directly and will throw a TypeError on null/undefined input; add an explicit
nullish guard at the start of each public method (findSimilarTasks and
assessRisk) to validate taskSpec (e.g., if (taskSpec == null) throw new
TypeError("taskSpec is required")); this ensures callers get a clear validation
error instead of a property-access crash and keeps _extractFeatures unchanged.
- Around line 300-314: Summary: recordOutcome() currently mutates in-memory
state before persistence, causing double-counting/inconsistent snapshots when
persistence fails; serialize the whole logical transaction and roll back on
failure. Fix: wrap the entire critical section in a mutex/serial lock (e.g.,
add/use this._transactionLock or a small async lock around recordOutcome),
perform the in-memory updates (push to this._outcomes, increment
this._stats.outcomesRecorded, call this._updateModelStats, prune via
this._recalculateModelStats when > this.maxOutcomes), then call
this._persistOutcomes and this._persistModel while still holding the lock; if
either persist rejects, revert the in-memory changes (pop the pushed records,
restore previous stats/model state) before releasing the lock and rethrow the
error; ensure the lock/serialization also prevents concurrent recordOutcome runs
to avoid race conditions.
- Around line 529-531: The prediction stage incorrectly uses `|| 3` which treats
an explicit minSamplesForPrediction = 0 as falsy and falls back to 3; update the
check inside the `_runStage(PipelineStage.PREDICT, ...)` callback so it uses a
nullish-coalescing style default (i.e., respect 0) when reading
`this.minSamplesForPrediction` before comparing to `neighbors.length`, and call
`_defaultPrediction(features)` only when the resolved minimum is >
neighbors.length.
In @.aiox-core/core/memory/decision-memory.js:
- Around line 252-254: updateOutcome currently reads this.decisions without
ensuring persisted decisions are loaded, causing valid IDs to return null for
fresh DecisionMemory instances; modify updateOutcome to await this.load() (same
guard used in recordDecision / save) before searching this.decisions, then
proceed to find the decision by id and update outcome/notes and call this.save()
as appropriate so cross-session behavior matches recordDecision/save.
- Around line 508-521: The recommendation for a new pattern is incorrectly set
to the "underperformed" message when all similar decisions are still
Outcome.PENDING because successCount and failureCount are both zero; update the
logic in the block that builds the pattern object (where similar, outcomes,
successCount, failureCount are computed and recommendation is assigned) to
detect the case outcomes.length === 0 and set a neutral recommendation like
"Insufficient resolved outcomes to recommend; monitor for more data." instead of
choosing success or failure branches.
- Around line 147-150: The loader currently only checks data.schemaVersion
before assigning this.decisions and this.patterns, which allows non-array types
to slip through and later crash; update the block that handles schemaVersion
(the code referencing schemaVersion, this.decisions, this.patterns) to validate
that data.decisions and data.patterns are arrays via Array.isArray; if either is
not an array, treat the file as corrupted by logging/warning (use the existing
logger if available), set the offending property to an empty array
(this.decisions = [] / this.patterns = []), and persist or mark the state
accordingly so downstream array-only operations are safe.
In `@tests/core/execution/predictive-pipeline.test.js`:
- Around line 324-335: Remove the test's registration of the spy on the shared
fixture "pipeline" and only attach it to the instance under test "p" (i.e., call
p.on('high-risk-detected', spy) only), then tighten the assertion to assert the
exact call count from the instance under test (e.g.,
expect(spy).toHaveBeenCalledTimes(1) or expect(spy.mock.calls.length).toBe(1))
after calling p.predict(...); this ensures only emissions from
PredictivePipeline p.predict() are measured.
---
Nitpick comments:
In `@tests/core/execution/predictive-pipeline.test.js`:
- Around line 12-17: The test imports core symbols using a brittle relative
path; update the require call that pulls in PredictivePipeline, PipelineStage,
RiskLevel, and DEFAULTS to use the repository’s absolute module path for the
core execution predictive-pipeline module (follow the project's absolute import
convention) so tests use the canonical module entry rather than a relative file
path; locate the line importing those symbols and replace the relative require
with the repo's absolute module specifier for the same module.
- Around line 250-263: The test currently only checks outcomes length but not
that derived aggregates were updated after pruning; update the test that creates
PredictivePipeline (small) to seed outcomes with at least one unique
taskType/agent/strategy that will be completely pruned when maxOutcomes is
exceeded (use recordOutcome to create multiple entries so older entries for that
task/agent/strategy are removed), then call getStats() and assert that the
returned derived aggregates (the taskType/agent/strategy counts or lists
provided by getStats()) no longer include the pruned task/agent/strategy; ensure
you reference the same PredictivePipeline instance (small), use recordOutcome to
insert the distinguishing entries, and assert on getStats() fields rather than
only stats.outcomes length.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
Run ID: 8fdd3a3a-e9c6-4b49-be5d-b101c6803f54
📒 Files selected for processing (6)
- .aios-core/core/execution/predictive-pipeline.js
- .aiox-core/core/execution/predictive-pipeline.js
- .aiox-core/core/memory/decision-memory.js
- .aiox-core/install-manifest.yaml
- tests/core/execution/predictive-pipeline.test.js
- tests/core/memory/decision-memory.test.js
    this._outcomes.push(record);
    this._stats.outcomesRecorded++;

    // Update model stats
    this._updateModelStats(record);

    // Auto-prune if exceeding max
    if (this._outcomes.length > this.maxOutcomes) {
      const excess = this._outcomes.length - this.maxOutcomes;
      this._outcomes.splice(0, excess);
      this._recalculateModelStats();
    }

    await this._persistOutcomes();
    await this._persistModel();
Serialize the whole recordOutcome() transaction, not just the file writes.
If either persistence step fails, the promise rejects after _outcomes, _model, and _stats have already been changed. A retry will double-count the same logical outcome, and a first-write/second-write split can leave disk snapshots inconsistent on the next load.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In @.aiox-core/core/execution/predictive-pipeline.js around lines 300 - 314,
Summary: recordOutcome() currently mutates in-memory state before persistence,
causing double-counting/inconsistent snapshots when persistence fails; serialize
the whole logical transaction and roll back on failure. Fix: wrap the entire
critical section in a mutex/serial lock (e.g., add/use this._transactionLock or
a small async lock around recordOutcome), perform the in-memory updates (push to
this._outcomes, increment this._stats.outcomesRecorded, call
this._updateModelStats, prune via this._recalculateModelStats when >
this.maxOutcomes), then call this._persistOutcomes and this._persistModel while
still holding the lock; if either persist rejects, revert the in-memory changes
(pop the pushed records, restore previous stats/model state) before releasing
the lock and rethrow the error; ensure the lock/serialization also prevents
concurrent recordOutcome runs to avoid race conditions.
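The serialize-and-roll-back pattern this prompt describes can be sketched with a promise-chain mutex. `OutcomeStore`, `persistFn`, and the snapshot logic below are illustrative assumptions, not the PR's actual code:

```javascript
// Illustrative sketch of the serialize-and-rollback pattern (assumed names).
// A promise chain serializes callers; a snapshot is restored on failure so a
// failed persist cannot leave the in-memory state double-counted.
class OutcomeStore {
  constructor(persistFn) {
    this._outcomes = [];
    this._persist = persistFn;
    this._lock = Promise.resolve(); // serial mutex: callers queue here
  }

  recordOutcome(record) {
    const run = this._lock.then(async () => {
      const snapshotLen = this._outcomes.length;
      this._outcomes.push(record); // in-memory mutation first...
      try {
        await this._persist(this._outcomes); // ...then persistence
      } catch (err) {
        this._outcomes.length = snapshotLen; // roll back on failure
        throw err;
      }
    });
    // Keep the chain alive even when this call rejects.
    this._lock = run.catch(() => {});
    return run;
  }
}
```

A failing persist leaves the store unchanged and rethrows, so retrying the same logical outcome cannot double-count it.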
    return this._runStage(PipelineStage.PREDICT, () => {
      if (neighbors.length < (this.minSamplesForPrediction || 3)) {
        return this._defaultPrediction(features);
Respect an explicit minSamplesForPrediction: 0.
The constructor preserves 0 via ??, but || 3 converts it back to 3 here. Any caller that intentionally disables the sample threshold still gets default predictions.
Suggested fix
- if (neighbors.length < (this.minSamplesForPrediction || 3)) {
+ if (neighbors.length < this.minSamplesForPrediction) {
      return this._defaultPrediction(features);
    }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In @.aiox-core/core/execution/predictive-pipeline.js around lines 529 - 531, The
prediction stage incorrectly uses `|| 3` which treats an explicit
minSamplesForPrediction = 0 as falsy and falls back to 3; update the check
inside the `_runStage(PipelineStage.PREDICT, ...)` callback so it uses a
nullish-coalescing style default (i.e., respect 0) when reading
`this.minSamplesForPrediction` before comparing to `neighbors.length`, and call
`_defaultPrediction(features)` only when the resolved minimum is >
neighbors.length.
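The distinction this comment hinges on can be shown in isolation: `||` falls back on any falsy value, while `??` falls back only on null/undefined. The variable names below are just for illustration:

```javascript
// Why `|| 3` loses an explicit 0 while `?? 3` preserves it.
const explicitZero = 0;
let unset; // undefined

const withOr = explicitZero || 3;       // explicit 0 silently becomes 3
const withNullish = explicitZero ?? 3;  // explicit 0 is respected
const fallback = unset ?? 3;            // undefined still falls back to 3
```

This is why the constructor's `??` preserves a caller's `minSamplesForPrediction: 0` but the `|| 3` in the predict stage discards it.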
  findSimilarTasks(taskSpec, opts = {}) {
    this._ensureLoaded();

    const limit = opts.limit ?? 10;
    const minSimilarity = opts.minSimilarity ?? 0;
    const features = this._extractFeatures(taskSpec);
Guard nullish taskSpec in the public read APIs.
findSimilarTasks() and assessRisk() call _extractFeatures() directly, so null / undefined inputs currently explode with a property-access TypeError instead of a targeted validation error.
Suggested fix
findSimilarTasks(taskSpec, opts = {}) {
this._ensureLoaded();
+ if (!taskSpec || typeof taskSpec !== 'object') {
+ throw new Error('taskSpec must be an object');
+ }
const limit = opts.limit ?? 10;
const minSimilarity = opts.minSimilarity ?? 0;
const features = this._extractFeatures(taskSpec);
@@
assessRisk(taskSpec) {
this._ensureLoaded();
+ if (!taskSpec || typeof taskSpec !== 'object') {
+ throw new Error('taskSpec must be an object');
+ }
const features = this._extractFeatures(taskSpec);
  const neighbors = this._stageMatch(features);

Also applies to: 747-750
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In @.aiox-core/core/execution/predictive-pipeline.js around lines 697 - 703,
findSimilarTasks and assessRisk call _extractFeatures directly and will throw a
TypeError on null/undefined input; add an explicit nullish guard at the start of
each public method (findSimilarTasks and assessRisk) to validate taskSpec (e.g.,
if (taskSpec == null) throw new TypeError("taskSpec is required")); this ensures
callers get a clear validation error instead of a property-access crash and
keeps _extractFeatures unchanged.
      if (data.schemaVersion === this.config.schemaVersion) {
        this.decisions = data.decisions || [];
        this.patterns = data.patterns || [];
      }
Treat non-array persisted collections as corruption.
schemaVersion is the only shape check here. If decisions or patterns is an object/string in an otherwise matching file, later calls hit array-only operations and crash.
Suggested fix
- if (data.schemaVersion === this.config.schemaVersion) {
- this.decisions = data.decisions || [];
- this.patterns = data.patterns || [];
- }
+ if (data.schemaVersion === this.config.schemaVersion) {
+ this.decisions = Array.isArray(data.decisions) ? data.decisions : [];
+ this.patterns = Array.isArray(data.patterns) ? data.patterns : [];
+  }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In @.aiox-core/core/memory/decision-memory.js around lines 147 - 150, The loader
currently only checks data.schemaVersion before assigning this.decisions and
this.patterns, which allows non-array types to slip through and later crash;
update the block that handles schemaVersion (the code referencing schemaVersion,
this.decisions, this.patterns) to validate that data.decisions and data.patterns
are arrays via Array.isArray; if either is not an array, treat the file as
corrupted by logging/warning (use the existing logger if available), set the
offending property to an empty array (this.decisions = [] / this.patterns = []),
and persist or mark the state accordingly so downstream array-only operations
are safe.
  updateOutcome(decisionId, outcome, notes = '') {
    const decision = this.decisions.find(d => d.id === decisionId);
    if (!decision) return null;
Lazy-load before looking up the decision ID.
A fresh DecisionMemory instance with an existing .aiox/decisions.json will return null for valid IDs here until callers remember to await load() manually. This mutator needs the same load guard as recordDecision() / save() to preserve the cross-session behavior the module is adding.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In @.aiox-core/core/memory/decision-memory.js around lines 252 - 254,
updateOutcome currently reads this.decisions without ensuring persisted
decisions are loaded, causing valid IDs to return null for fresh DecisionMemory
instances; modify updateOutcome to await this.load() (same guard used in
recordDecision / save) before searching this.decisions, then proceed to find the
decision by id and update outcome/notes and call this.save() as appropriate so
cross-session behavior matches recordDecision/save.
    if (similar.length >= this.config.patternThreshold - 1) {
      const outcomes = similar.map(d => d.outcome).filter(o => o !== Outcome.PENDING);
      const successCount = outcomes.filter(o => o === Outcome.SUCCESS).length;
      const failureCount = outcomes.filter(o => o === Outcome.FAILURE).length;

      const pattern = {
        id: `pattern-${this.patterns.length + 1}`,
        category: newDecision.category,
        description: `Recurring ${newDecision.category} decision: "${newDecision.description}"`,
        occurrences: similar.length + 1,
        successRate: outcomes.length > 0 ? successCount / outcomes.length : 0,
        recommendation: successCount > failureCount
          ? 'This approach has historically worked well. Consider reusing.'
          : 'This approach has historically underperformed. Consider alternatives.',
Use a neutral recommendation when no resolved outcomes exist yet.
When every similar decision is still pending, successCount and failureCount are both 0, so this falls into the "underperformed" branch and emits misleading advice with zero evidence.
Suggested fix
- recommendation: successCount > failureCount
- ? 'This approach has historically worked well. Consider reusing.'
- : 'This approach has historically underperformed. Consider alternatives.',
+ recommendation: outcomes.length === 0
+ ? 'Pattern observed, but no completed outcomes exist yet.'
+ : successCount > failureCount
+ ? 'This approach has historically worked well. Consider reusing.'
+      : 'This approach has historically underperformed. Consider alternatives.',
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In @.aiox-core/core/memory/decision-memory.js around lines 508 - 521, The
recommendation for a new pattern is incorrectly set to the "underperformed"
message when all similar decisions are still Outcome.PENDING because
successCount and failureCount are both zero; update the logic in the block that
builds the pattern object (where similar, outcomes, successCount, failureCount
are computed and recommendation is assigned) to detect the case outcomes.length
=== 0 and set a neutral recommendation like "Insufficient resolved outcomes to
recommend; monitor for more data." instead of choosing success or failure
branches.
  it('should emit high-risk-detected for risky tasks', () => {
    // No history → low confidence → potentially high risk
    const spy = jest.fn();
    pipeline.on('high-risk-detected', spy);

    // Predict with no history, anomaly threshold set low
    const p = new PredictivePipeline(tmpDir, { highRiskThreshold: 0.3 });
    p.on('high-risk-detected', spy);
    p.predict({ taskType: 'never-seen-before' });

    // Should have been called at least once (either from pipeline or p)
    expect(spy.mock.calls.length).toBeGreaterThanOrEqual(1);
Listen only on the instance under test.
Registering the same spy on both pipeline and p makes this assertion non-specific; a future emission from the shared fixture could satisfy it even if p.predict() stops emitting. Attach the spy only to p and assert the exact call count.
Suggested tightening
it('should emit high-risk-detected for risky tasks', () => {
- // No history → low confidence → potentially high risk
const spy = jest.fn();
- pipeline.on('high-risk-detected', spy);
-
- // Predict with no history, anomaly threshold set low
const p = new PredictivePipeline(tmpDir, { highRiskThreshold: 0.3 });
p.on('high-risk-detected', spy);
p.predict({ taskType: 'never-seen-before' });
-
- // Should have been called at least once (either from pipeline or p)
- expect(spy.mock.calls.length).toBeGreaterThanOrEqual(1);
+ expect(spy).toHaveBeenCalledTimes(1);
  });

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@tests/core/execution/predictive-pipeline.test.js` around lines 324 - 335,
Remove the test's registration of the spy on the shared fixture "pipeline" and
only attach it to the instance under test "p" (i.e., call
p.on('high-risk-detected', spy) only), then tighten the assertion to assert the
exact call count from the instance under test (e.g.,
expect(spy).toHaveBeenCalledTimes(1) or expect(spy.mock.calls.length).toBe(1))
after calling p.predict(...); this ensures only emissions from
PredictivePipeline p.predict() are measured.
@Pedrovaleriolopez @oalanicolas, the Predictive Pipeline module is already in our PR #575 (opened Mar 10, before this one). Same feature, complete implementation with 89 unit tests.
Summary
CodeRabbit Fixes Applied
- _emitSafeError to keep the promise chain rejected
- Recalculate taskTypeStats, agentStats, strategyStats after splice(0, excess)
- Number.isFinite() guards for complexity, contextSize, agentExperience
- _stagePredict now checks neighbors.length < minSamplesForPrediction
- Reverse durationValues before EWMA so highest-similarity neighbors get largest weight
- recordDecision (now async) and save call _ensureLoaded() before mutating state
- atomicWriteSync (write-to-tmp + rename) instead of fs.writeFileSync
- async/await for recordDecision
- Verify unique IDs via new Set(results.map(r => r.id))

Test Results
Supersedes #575
🤖 Generated with Claude Code
Summary by CodeRabbit
New Features
Tests