More precise logics for n_actions in Dataset and Simulator

# Possible Issue
In bandit feedback, n_actions are set as ```int(self.action.max() + 1)```, which doesn't raise any error in above code,
assuming that logs generated by ```policy``` covered all possible actions.

However, to be more precise, I think ```n_actions``` should be explicitly given, rather than extracted from log data.
And if changed, the above code might raise error.
If 1000 possible actions and only 0~998 actions exist in bandit _feedback and somehow policy selected action 999,
this might raise out-of-index error.

# Idea
1. BanditFeedback data is given ```n_actions``` explicitly.
Rather than:
https://github.com/st-tech/zr-obp/blob/55ab57ea214db3e69550e48c466ca8d52074145e/obp/dataset/real.py#L78-L81

2. Use ```n_actions``` directly in ```convert_to_action_dist```
Rather than:
https://github.com/st-tech/zr-obp/blob/55ab57ea214db3e69550e48c466ca8d52074145e/obp/simulator/simulator.py#L75-L78

	@property
	def n_actions(self) -> int:
	"""Number of actions."""
	return int(self.action.max() + 1)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More precise logics for n_actions in Dataset and Simulator #45

Possible Issue

Idea

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

	action_dist = convert_to_action_dist(
	n_actions=bandit_feedback["action"].max() + 1,
	selected_actions=np.array(selected_actions_list),
	)

More precise logics for n_actions in Dataset and Simulator #45

Description

Possible Issue

Idea

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions