AI companies claim their tools couldn't exist without training on copyrighted material. It turns out they could; it's just really hard. To prove it, AI researchers trained a new model that's less powerful but much more ethical. That's because the LLM's dataset uses only public domain and openly licensed material.

The paper (via The Washington Post) was a collaboration between 14 institutions. The authors represent universities like MIT, Carnegie Mellon and the University of Toronto. Nonprofits like the Vector Institute and the Allen Institute for AI also contributed.

The group built an 8 TB ethically sourced dataset. Among the data was a set of 130,000 books in the Library of Congress. After inputting the material, they trained a s [...]
OpenAI is calling on the Trump administration to give AI companies an exemption to train their models on copyrighted material. In a blog post spotted by The Verge, the company this week published its [...]
Disney is going after another generative AI tool, accusing ByteDance and its recently released Seedance 2.0 of using its copyrighted material without permission. As first reported by Axios, the Wal [...]
OpenAI claims that Chinese startups are persistently trying to copy the technology of American AI companies. In line with that claim, OpenAI says it and partner Microsoft have been banning accounts suspecte [...]
Three YouTube channels have banded together and filed a class action lawsuit against Apple, as first spotted by MacRumors. According to the lawsuit, the creators behind h3h3 Productions, MrShortGameGo [...]
The UK's House of Lords just voted to add an amendment to a data bill that mandates that tech companies disclose which copyright-protected works were used to train AI models, as reported by The G [...]
Federal Judge Vince Chhabria has ruled in favor of Meta in the case brought by 13 book authors, including Sarah Silverman, who sued the company for training its large language model on their published work without [...]