feat(rl-env): populate observation with task instruction

`WAALiveAdapter._get_observation()` does not populate `raw_observation` with the task instruction. `RLEnvironment` stores `_current_task.instruction` but never exposes it on observations.

This means RL agents using observations cannot see what task they should perform.

**Fix:** populate `raw_observation["instruction"]` in `observe()`, or add a public `current_instruction` property to `RLEnvironment`.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(rl-env): populate observation with task instruction #77

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

feat(rl-env): populate observation with task instruction #77

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions