• Ephera
    2 years ago

    The major difference I see is that current AIs are only narrow AI. They have very few sensors and are optimized for very few tasks.

    Broad AI or human intelligence involves tons of sensors/senses which may not directly be involved in a given task, but still allow you to judge its success independently. We also need to perform many different tasks, some of which may be similar to a new task we need to tackle.
    And humans spend several decades running around with those senses in different situations, performing different tasks, constantly evaluating their own success.

    For example, writing a poem. ChatGPT et al. can do that. But they can’t listen to someone reading their poem to judge how the rhythm of the words activates their reward system for successful pattern predictions, the way it does for humans.

    They also don’t have complex associations with certain words. When we speak of a dreary sky, we associate coldness from sensing it with our skin, and we associate a certain melancholy, from our brain not producing the right hormones to keep us fully awake.
    A narrow AI doesn’t have a multitude of sensors plus training data for them, so it cannot have such impressions.

    • blank_sl8
      2 years ago

      Google especially is working on multimodal models that handle language, image, audio, etc. understanding in the same model. Their latest work, PaLM-E, demonstrates that learning in one domain (e.g. images) can indirectly benefit the model’s performance in other domains (e.g. text) without additional training in the other domain.