Without the same training data you wouldn’t be able to recreate the results even when having the computing power. Thus it’s not fully open source. Training data is a part of the source to create the result, “LLM”. It’s like having to add your own lines of code to open source program to make it work because the company doesn’t provide it.
Without the same training data you wouldn’t be able to recreate the results even when having the computing power. Thus it’s not fully open source. Training data is a part of the source to create the result, “LLM”. It’s like having to add your own lines of code to open source program to make it work because the company doesn’t provide it.