class documentation

class NoRandomActionMechanism(RandomActionMechanism):

View In Hierarchy

This class inherits from the RandomActionMechanism class and represents a random action mechanism in which random actions are never executed. The random action probability is identically equal to 0 throughout the entire learning process.

Method reset This abstract method must be implemented by any concrete subclass. It is invoked by an RL agent during the initialization process and it must initialize or reset all internal state maintained by the random action mechanism.
Method step This abstract method must be implemented by any concrete subclass. It is invoked by an RL agent at the end of each iteration of the learning process and it must update the internal state of the random action mechanism based on the previous best score and the current best score.
Property random_action_probability This abstract property must be implemented by any concrete subclass. It must return the current random action probability as a float value from the interval [0, 1].
def reset(self):

This abstract method must be implemented by any concrete subclass. It is invoked by an RL agent during the initialization process and it must initialize or reset all internal state maintained by the random action mechanism.

def step(self, previous_best_score: float, current_best_score: float):

This abstract method must be implemented by any concrete subclass. It is invoked by an RL agent at the end of each iteration of the learning process and it must update the internal state of the random action mechanism based on the previous best score and the current best score.

Parameters
previous_best_score:floatThe value of the best score before the current iteration, given as a float.
current_best_score:floatThe value of the best score after the current iteration, given as a float.
random_action_probability: float =

This abstract property must be implemented by any concrete subclass. It must return the current random action probability as a float value from the interval [0, 1].