Hacker News

Ok, but what if in the future I could guarantee that my generative model was not trained on the work I want to replicate? Say library X is the only library in town for some task, but it has a restrictive license. Can I use a model that was guaranteed not to have been trained on X to generate a new library Z that competes with X under a more permissive license? What if someone looks and finds a lot of similarities?



I wish you luck proving it wasn't trained on the original library, or on any work that itself infringed it.

I think there could be a market for "permissive/open models" in the future, where a company specifically trains LLMs on a corpus of public domain or permissively licensed text/code only, and you can verify it by downloading the corpus yourself and reproducing the exact same model if desired. Proving that all MIT-licensed code is itself non-infringing is probably impossible, though; at that point copyright law is meaningless, because everyone would be in violation if you dig deep enough.
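A minimal sketch of what the "download the corpus yourself and verify it" part could look like: filter files against a permissive-license allowlist, then publish a canonical fingerprint of the surviving corpus so anyone can reproduce it. All file names, license tags, and contents below are hypothetical, for illustration only.

```python
import hashlib

# Hypothetical allowlist of permissive license identifiers.
PERMISSIVE = {"MIT", "BSD-3-Clause", "Apache-2.0", "Unlicense", "CC0-1.0"}

def filter_corpus(files):
    """Keep only files whose declared license is on the permissive allowlist."""
    return [f for f in files if f["license"] in PERMISSIVE]

def corpus_fingerprint(files):
    """Hash the corpus in a canonical (path-sorted) order, so anyone who
    downloads the same files can reproduce the published fingerprint."""
    h = hashlib.sha256()
    for f in sorted(files, key=lambda f: f["path"]):
        h.update(f["path"].encode())
        h.update(f["content"].encode())
    return h.hexdigest()

# Toy two-file corpus: one permissive file, one copyleft file.
corpus = [
    {"path": "a.py", "license": "MIT", "content": "print('a')"},
    {"path": "b.py", "license": "GPL-3.0", "content": "print('b')"},
]
kept = filter_corpus(corpus)
print([f["path"] for f in kept])      # only the MIT-licensed file survives
print(corpus_fingerprint(kept))       # stable, reproducible fingerprint
```

Of course this only verifies the *declared* license of each file, which is exactly the weak link: it can't prove the MIT-labeled code wasn't itself copied from somewhere else.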

This is what Adobe is ostensibly trying to do with its generative image model, Firefly.

https://en.wikipedia.org/wiki/Adobe_Firefly



