Machine Learning

machinelearning

☆ Yσɠƚԋσʂ ☆ · English · 1 month ago

Neurosymbolic AI -- Why, What, and How

arxiv.org

☆ Yσɠƚԋσʂ ☆ · English · 1 month ago

Classical Sorting Algorithms as a Model of Morphogenesis: self-sorting arrays reveal unexpected competencies in a minimal model of basal intelligence

arxiv.org

☆ Yσɠƚԋσʂ ☆ · English · 2 months ago

Genie 2: A large-scale foundation world model

deepmind.google

☆ Yσɠƚԋσʂ ☆ · English · 2 months ago

A good primer on what to expect running local LLMs

nullprogram.com

Shamar@feddit.it · English · 3 months ago

A community statement supporting the Open Source Definition (OSD)

osd.fyi

☆ Yσɠƚԋσʂ ☆ · English · 4 months ago

How ‘Embeddings’ Encode What Words Mean

www.quantamagazine.org

☆ Yσɠƚԋσʂ ☆ · English · 5 months ago

New AI model “learns” how to simulate Super Mario Bros. from video footage

arstechnica.com

☆ Yσɠƚԋσʂ ☆ · English · 5 months ago

Reflection 70B holds its own against even the top closed-source models (Claude 3.5 Sonnet, GPT-4o)

huggingface.co

☆ Yσɠƚԋσʂ ☆ · English · 5 months ago

It’s Not Intelligent If It Always Halts: A Critical Perspective on Current Approaches to AGI

www.lifeiscomputation.com

☆ Yσɠƚԋσʂ ☆ · English · 5 months ago

The Difference Between Speaking and Thinking

www.theatlantic.com

☆ Yσɠƚԋσʂ ☆ · English · 5 months ago

Diffusion Models Are Real-Time Game Engines

gamengen.github.io

☆ Yσɠƚԋσʂ ☆ · English · 5 months ago

Liger Kernel is a collection of Triton kernels designed specifically for LLM training. It can effectively increase multi-GPU training throughput by 20% and reduce memory usage by 60%.

github.com

☆ Yσɠƚԋσʂ ☆ · English · 5 months ago

Transformer Explainer

poloclub.github.io

☆ Yσɠƚԋσʂ ☆ · English · 5 months ago

Alibaba claims no. 1 spot in AI math models with Qwen2-Math

venturebeat.com

yboutros@infosec.pub · English · 6 months ago

How to convert a positionally encoded predicted embedding from a decoder to its matching token?

☆ Yσɠƚԋσʂ ☆ · English · 6 months ago

New Open-Source AI Image Generator Beats Midjourney, SD3 and Auraflow

decrypt.co

☆ Yσɠƚԋσʂ ☆ · English · 6 months ago

AI models collapse when trained on recursively generated data

www.nature.com

☆ Yσɠƚԋσʂ ☆ · English · 7 months ago

RouteLLM: An Open-Source Framework for Cost-Effective LLM Routing

lmsys.org

☆ Yσɠƚԋσʂ ☆ · English · 7 months ago

Alibaba's Qwen LLM model leading open source rankings

huggingface.co

☆ Yσɠƚԋσʂ ☆ · English · 7 months ago

By using the same techniques Google used to solve Go (MCTS and backprop), Llama8B gets 96.7% on math benchmark GSM8K. That’s better than GPT-4, Claude and Gemini, with 200x fewer parameters!

arxiv.org
