Uncategorized

HarperCollins is asking authors to license their books for AI training

An illustration of a glitchy pencil writing on paper.
Image: Hugo Herrera / The Verge

HarperCollins has agreed with an unnamed AI tech company to let the company use some nonfiction titles to train its models, 404 Media reports, but only if authors opt-in to having their books be used for training. Some authors are currently suing companies like OpenAI, accusing them of copyright infringement for training AI models on their works without permission.

According to a statement HarperCollins gave to 404 Media, the agreement protects authors’ “underlying value of their works and our shared revenue and royalty streams.” Author Daniel Kibblesmith posted screenshots of an email showing that he would be paid $2,500 if he allowed one of his books to be licensed.

Here is the full statement given to 404 Media:

HarperCollins has reached an agreement with an artificial intelligence technology company to allow limited use of select nonfiction backlist titles for training AI models to improve model quality and performance. While we believe this deal is attractive, we respect the various views of our authors, and they have the choice to opt in to the agreement or to pass on the opportunity.

HarperCollins has a long history of innovation and experimentation with new business models. Part of our role is to present authors with opportunities for their consideration while simultaneously protecting the underlying value of their works and our shared revenue and royalty streams. This agreement, with its limited scope and clear guardrails around model output that respects author’s rights, does that.

HarperCollins didn’t immediately reply to a request for comment from The Verge.