David Gerard@awful.systemsM to TechTakes@awful.systemsEnglish · il y a 6 moisRemember how ChatGPT totally aced the bar exam? Wow! yeah, turns out that was just a liewww.nytimes.comexternal-linkmessage-square205fedilinkarrow-up1630arrow-down10file-textcross-posted to: fuck_ai@lemmy.world
arrow-up1630arrow-down1external-linkRemember how ChatGPT totally aced the bar exam? Wow! yeah, turns out that was just a liewww.nytimes.comDavid Gerard@awful.systemsM to TechTakes@awful.systemsEnglish · il y a 6 moismessage-square205fedilinkfile-textcross-posted to: fuck_ai@lemmy.world
minus-squarevrighter@discuss.tchncs.delinkfedilinkEnglisharrow-up8arrow-down1·il y a 6 moiseven if that wasn’t the case, a 90% success rate is absolutely abysmal in practice.
minus-squareCouldbealeotard@lemmy.worldlinkfedilinkEnglisharrow-up45·il y a 6 mois90th percentile means it performed equal or better than 90% of the comparisons, no? Not that it got 90% score.
even if that wasn’t the case, a 90% success rate is absolutely abysmal in practice.
90th percentile means it performed equal or better than 90% of the comparisons, no? Not that it got 90% score.