LLMs are solving MCAT, the bar test, SAT etc like they’re nothing. At this point their performance is super human. However they’ll often trip on super simple common sense questions, they’ll struggle with creative thinking.

Is this literally proof that standard tests are not a good measure of intelligence?

  • t_var_s
    link
    fedilink
    English
    arrow-up
    3
    ·
    4 months ago

    Tests built for humans are not tests built for machines.