User Guide
Core Components
Belief
Belief.config_id
Belief.from_config()
Belief.inplace_update()
Belief.sample()
Belief.update()
Environment
Environment.discount_factor
Environment.name
Environment.space_info
Environment.reward_range
Environment.output_dir
Environment.debug
Environment.cache_visualization()
Environment.compute_metrics()
Environment.config_id
Environment.from_dict()
Environment.get_metric_names()
Environment.initial_observation_dist()
Environment.initial_state_dist()
Environment.is_equal_observation()
Environment.is_terminal()
Environment.logger
Environment.observation_model()
Environment.reward()
Environment.reward_batch()
Environment.sample_next_step()
Environment.state_transition_model()
Environment.to_dict()
SpaceType
SpaceType.DISCRETE
SpaceType.CONTINUOUS
SpaceType.MIXED
Policy
Policy.environment
Policy.discount_factor
Policy.name
Policy.log_path
Policy.debug
Policy.action()
Policy.config_id
Policy.get_info_variable_names()
Policy.get_space_info()
Policy.load()
Policy.logger
Policy.save()
Examples
API Reference