imitation.scripts.ingredients.expert#
This ingredient provides an expert policy.
The expert policy is either loaded from disk or from the HuggingFace Model Hub or is a test policy (e.g., random or zero). The supported policy types are:
ppoandsac: A policy trained with SB3.Needs a path in the loader_kwargs.
<algo>-huggingface(algo can be ppo or sac):A policy trained with SB3 and uploaded to the HuggingFace Model Hub. Will load the model from the repo
<organization>/<algo>-<env_name>. You can set the organization with the organization key inloader_kwargs. The default is HumanCompatibleAI.
random: A policy that takes random actions.zero: A policy that takes zero actions.
Functions
|
|
|