• ylaiOP
    link
    fedilink
    arrow-up
    5
    ·
    edit-2
    8 months ago

    In the case of Google/DeepMind’s SIMA it is an instruction-following agent for simpler, but menial tasks in a game. It is particularly not autonomous, and has no notion of reward. And what is being used is a modified behavior cloning with a text-conditioned policy network.