“The use of copyrighted material in training large language models (LLMs) has sparked legal battles and takedown notices. In the Netherlands, anti-piracy group BREIN takes credit for forcing the popular 'GEITje' LLM offline, which in part was trained on copyrighted texts.”
To clarify, the issue is not the use of copyrighted materials as such; it is the use of copyrighted materials without the owner's express authorization.
https://torrentfreak.com/llm-taken-down-following-legal-pressure-from-anti-piracy-group-250128/