New Pleias 1.0 LLMs trained exclusively on openly licensed data

From Simon Willison's Weblog
I wrote about the Common Corpus public domain dataset back in March. Now Pleias, the team behind Common Corpus, have released the first family of models that are: [...] trained exclusively on open data, meaning data that are either non-copyrighted or are published under a permissible...