Microsoft deletes blog telling users to train AI on pirated Harry Potter books



“I think that the regurgitation and the creation of fan fiction, they both could flag copyright issues, in that fan fiction often has to take from the expressive elements, a copyrighted character, a character that’s famous enough to be protected by a copyright law or plot stories or sequences,” Smith said. “If these things are copied and reproduced, then that output could be potentially infringing.”

But it’s also still a gray area. Looking at the blog, Smith said, “I would be concerned,” but “I wouldn’t say it’s automatically infringement.”

Smith told Ars that, in pulling the blog, Microsoft “was probably smart,” since courts have only generally said that training AI on copyrighted books is fair use. But courts continue to probe questions about pirated AI training materials.

On the deleted Kaggle dataset page, Maindola previously explained that to source the data, he “downloaded the ebooks and then converted them to txt files.”

Microsoft may have infringed copyrights

If Microsoft ever faced questions as to whether the company knowingly used pirated books to train the example models, fair use “could be a difficult argument,” Smith said.

Hacker News commenters suggested the blog could be considered fair use, since the training guide was for “educational purposes,” and Smith said that Microsoft could raise some “good arguments” in its defense.

However, she also suggested that Microsoft could be deemed liable for contributing to infringement on some level after leaving the blog up for a year. Before it was removed, the Kaggle dataset was downloaded more than 10,000 times.

“The ultimate result is to create something infringing by saying, ‘Hey, here you go, go grab that infringing stuff and use that in our system,’” Smith said. “They could potentially have some sort of secondary contributory liability for copyright infringement, downloading it, as well as then using it to encourage others to use it for training purposes.”



Source link

  • Related Posts

    What the Rise of AI Scientists May Mean for Human Research

    Ahead of an artificial intelligence conference held last April, peer reviewers considered papers written by “Carl” alongside other submissions. What the reviewers did not know was that, unlike other authors,…

    Job titles of the future: Breast biomechanic

    A sports necessity Wearing a bra that’s too tight can limit breathing. Wearing one that’s too loose can create back, shoulder, and neck pain. Pain can also be caused by…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    Trump has other tariff options after Supreme Court strikes down his worldwide import taxes

    Trump has other tariff options after Supreme Court strikes down his worldwide import taxes

    What the Rise of AI Scientists May Mean for Human Research

    What the Rise of AI Scientists May Mean for Human Research

    Police to question Andrew’s former protection officers over his Epstein links | UK news

    Police to question Andrew’s former protection officers over his Epstein links | UK news

    Next Week on Xbox: New Games for February 23 to 27

    Next Week on Xbox: New Games for February 23 to 27

    Trump says he’s considering a limited strike on Iran to force a deal.

    Melissa McCarthy Dons Crystal McQueen Booties for ‘Colbert’

    Melissa McCarthy Dons Crystal McQueen Booties for ‘Colbert’