☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to technology@hexbear.netEnglish · 2 days agoDeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learningarxiv.orgexternal-linkmessage-square0fedilinkarrow-up112arrow-down10cross-posted to: technology@lemmygrad.mlmachinelearning
arrow-up112arrow-down1external-linkDeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learningarxiv.org☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to technology@hexbear.netEnglish · 2 days agomessage-square0fedilinkcross-posted to: technology@lemmygrad.mlmachinelearning