ylaiEnglish · 8 months agoHow Chain-of-Thought Reasoning Helps Neural Networks Computeplus-squarewww.quantamagazine.orgexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkHow Chain-of-Thought Reasoning Helps Neural Networks Computeplus-squarewww.quantamagazine.orgylaiEnglish · 8 months agomessage-square0fedilink
ylaiEnglish · 10 months agoEvaluating LLMs with WeightWatcher Part III: The Magic of Mistral, a Story of Dragon Kingsplus-squarecalculatedcontent.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkEvaluating LLMs with WeightWatcher Part III: The Magic of Mistral, a Story of Dragon Kingsplus-squarecalculatedcontent.comylaiEnglish · 10 months agomessage-square0fedilink
ylaiEnglish · 1 year agoConvNets Match Vision Transformers at Scaleplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkConvNets Match Vision Transformers at Scaleplus-squarearxiv.orgylaiEnglish · 1 year agomessage-square0fedilink
ylaiEnglish · 1 year agoInside the Matrix: Visualizing Matrix Multiplication, Attention and Beyondplus-squarepytorch.orgexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkInside the Matrix: Visualizing Matrix Multiplication, Attention and Beyondplus-squarepytorch.orgylaiEnglish · 1 year agomessage-square0fedilink
CanadaPlus@lemmy.sdf.orgEnglish · 1 year agoWhat is the state of the art on putting text samples into a latent space?message-squaremessage-square2fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1message-squareWhat is the state of the art on putting text samples into a latent space?CanadaPlus@lemmy.sdf.orgEnglish · 1 year agomessage-square2fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoAttention Is All You Needplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1imageAttention Is All You Needplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoAttention Is Off By Oneplus-squarewww.evanmiller.orgexternal-linkmessage-square0fedilinkarrow-up16arrow-down10
arrow-up16arrow-down1external-linkAttention Is Off By Oneplus-squarewww.evanmiller.orgmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoChatGPT an ENFJ, Bard an ISTJ: Empirical Study on Personalities of Large Language Modelsplus-squarelemmy.intai.techimagemessage-square1fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imageChatGPT an ENFJ, Bard an ISTJ: Empirical Study on Personalities of Large Language Modelsplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square1fedilink
manitcor@lemmy.intai.techMEnglish · edit-21 year agoPersonality Traits in Large Language Modelsplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imagePersonality Traits in Large Language Modelsplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · edit-21 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoLarge Language Models as General Pattern Machinesplus-squarelemmy.intai.techimagemessage-square2fedilinkarrow-up15arrow-down10
arrow-up15arrow-down1imageLarge Language Models as General Pattern Machinesplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square2fedilink
manitcor@lemmy.intai.techMEnglish · edit-21 year agoLarge Language Models as Tool Makersplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imageLarge Language Models as Tool Makersplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · edit-21 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoLanguage models can explain neurons in language modelsplus-squareopenai.comexternal-linkmessage-square0fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkLanguage models can explain neurons in language modelsplus-squareopenai.commanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
taters@lemmy.intai.techEnglish · edit-21 year agoCurious Replay for Model-based Adaptationplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imageCurious Replay for Model-based Adaptationplus-squarelemmy.intai.techtaters@lemmy.intai.techEnglish · edit-21 year agomessage-square0fedilink
taters@lemmy.intai.techEnglish · edit-21 year agoThe imperative for regulatory oversight of large language models (or generative AI) in healthcareplus-squarewww.nature.comexternal-linkmessage-square0fedilinkarrow-up12arrow-down10
arrow-up12arrow-down1external-linkThe imperative for regulatory oversight of large language models (or generative AI) in healthcareplus-squarewww.nature.comtaters@lemmy.intai.techEnglish · edit-21 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoMicrosoft Announces: LongNet - Scaling LLM Transformers to 1,000,000,000 Tokens & Context Lengthplus-squaremessage-squaremessage-square2fedilinkarrow-up15arrow-down10
arrow-up15arrow-down1message-squareMicrosoft Announces: LongNet - Scaling LLM Transformers to 1,000,000,000 Tokens & Context Lengthplus-squaremanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square2fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoLarge Language Models Enable Few-Shot Clusteringplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up15arrow-down10
arrow-up15arrow-down1imageLarge Language Models Enable Few-Shot Clusteringplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoPreference Ranking Optimization for Human Alignmentplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up14arrow-down10
arrow-up14arrow-down1imagePreference Ranking Optimization for Human Alignmentplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · 1 year agoPushing the Limits of Machine Design Automated CPU Design with AIplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1imagePushing the Limits of Machine Design Automated CPU Design with AIplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · 1 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · edit-21 year agoIs ChatGPT A Good Translator? Yes With GPT-4 As The Engineplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up18arrow-down10
arrow-up18arrow-down1imageIs ChatGPT A Good Translator? Yes With GPT-4 As The Engineplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · edit-21 year agomessage-square0fedilink
manitcor@lemmy.intai.techMEnglish · edit-21 year agoSequenceMatch - Imitation Learning for Autoregressive Sequence Modelling with Backtrackingplus-squarelemmy.intai.techimagemessage-square0fedilinkarrow-up14arrow-down10
arrow-up14arrow-down1imageSequenceMatch - Imitation Learning for Autoregressive Sequence Modelling with Backtrackingplus-squarelemmy.intai.techmanitcor@lemmy.intai.techMEnglish · edit-21 year agomessage-square0fedilink