Do such models exist? Yes. Are they the big-boy models anyone’s really using? Ehhh not really.
There are in-use models that are “here’s a thing do whatever good luck,” which is at least as open-source as any MIT project. (Permissive licenses being “here is the code, have a nice life.”) Very few models are properly reproducible, because even when their training data includes DVDs you probably own, it also includes a ton of random internet pages that maybe don’t exist anymore. The push for ever-larger models, trained on as much stuff as possible, makes the use of “open source” regrettable or even deceptive choice. But quite a few are unrestricted for whatever weird shit you want to get up to.
Do such models exist? Yes. Are they the big-boy models anyone’s really using? Ehhh not really.
There are in-use models that are “here’s a thing do whatever good luck,” which is at least as open-source as any MIT project. (Permissive licenses being “here is the code, have a nice life.”) Very few models are properly reproducible, because even when their training data includes DVDs you probably own, it also includes a ton of random internet pages that maybe don’t exist anymore. The push for ever-larger models, trained on as much stuff as possible, makes the use of “open source” regrettable or even deceptive choice. But quite a few are unrestricted for whatever weird shit you want to get up to.