Then it was pulling its calculations directly from a web source, not using generative large language models. I’m not saying a chatbot can’t do this, I’m saying language models can’t do this.
Apparently Bing Chat is able to do some maths. I asked Bing multiple variations of this question based on different speeds of the solar sail (e.g. what if it travels at 50% the speed of light). It was able to calculate both the travel time and the time dilation.
If it is only pulling the answer from web sources, how did it handle the variable speeds?
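For reference, the arithmetic in question is short. This is a minimal sketch, assuming a constant cruise speed with no acceleration or deceleration phase. The function name `trip_times` and the 4.0 light-year distance in the usage note are hypothetical, chosen only for illustration, and nothing here says how Bing actually produces its answer.

```python
import math


def trip_times(distance_ly: float, speed_fraction_c: float) -> tuple[float, float]:
    """Return (earth_years, ship_years) for a constant-speed trip.

    distance_ly: distance in light years (hypothetical input).
    speed_fraction_c: speed as a fraction of the speed of light, 0 < v < 1.
    """
    if not 0 < speed_fraction_c < 1:
        raise ValueError("speed must be strictly between 0 and 1 (fraction of c)")

    # Travel time as measured by an observer at rest relative to the start point.
    earth_years = distance_ly / speed_fraction_c

    # Lorentz factor: gamma = 1 / sqrt(1 - v^2 / c^2), with v given as a fraction of c.
    gamma = 1.0 / math.sqrt(1.0 - speed_fraction_c ** 2)

    # Time dilation: the traveller's clock runs slower by a factor of gamma.
    ship_years = earth_years / gamma
    return earth_years, ship_years
```

Calling `trip_times(4.0, 0.5)` gives the Earth-frame travel time and the shorter time on the sail's own clock. Changing the second argument covers the "what if it travels at a different speed" variations.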
Your failure in reasoning here is assuming that all of them are purely and only language models: that they receive no training beyond language modeling and, for example, aren’t fed any kind of pop science math.
It’s clear that this is true of models like ChatGPT, but isn’t the Bing thing powered by GPT-4 with a number of other enhancements? Fixing this “can’t do math” thing is low-hanging fruit for development improvements.
It’s also possible that Bing’s chatbot is using a math-specific plugin in addition to its websearching plugin.