Google and Reddit have announced a huge content licensing deal, reportedly worth a whopping $60 million - but Reddit users are pissed.
Why, you might ask? Well, the deal involves Google using content posted by users on Reddit to train its AI models, chiefly its newly launched Google Gemini AI suite. It makes sense; Reddit contains a wealth of information and users typically talk colloquially, which Google is probably hoping will make for a more intelligent and more conversational AI service. However, this also essentially means that anything you post on Reddit now becomes fuel for the AI engine, something many users are taking umbrage at.
While the very first thing that came to mind was MIT’s insane Reddit-trained ‘psychopath AI’ from years ago, it’s fair to say that AI model training has come a long way since then - so hooking it up to Reddit hopefully won’t turn Gemini into a raving lunatic.
The deal, announced yesterday by Reddit in a blog post, will have other benefits as well: since many people specifically append ‘reddit’ to their search queries when looking for the answer to a question, Google aims to make getting to the relevant content on Reddit easier. Reddit plans to use Google’s Vertex AI to improve its own internal site search functionality, too, so Reddit users will enjoy a boost to the user experience - rather than getting absolutely nothing in return for their training data.
Do Redditors deserve a cut of that $60 million?
A lot of Reddit users have been complaining about the deal in various threads on the site, for a wide variety of reasons. Some users have privacy worries, some voiced concerns about the quality of output from an AI trained on Reddit content (which, let’s be honest, can get pretty toxic), and others simply don’t want their posts ‘stolen’ to train an AI.
Unfortunately for any unhappy Redditors, the site’s Terms of Service do mean that Reddit can (within reason) do whatever it wants with your posts and comments. Calling the content ‘stolen’ is inaccurate: if you’re a Reddit user, you’re the product, and Reddit is the one selling.
Personally, I’m glad to see a company actually getting paid for providing AI training data, unlike the legal grey-area dodginess of previous chatbots and AI art tools that were trained on data scraped from the internet for free without user consent. By agreeing to the Reddit TOS, you’re essentially consenting to your data being used for this.
Some users are positively incensed by this though, claiming that if they’re the ones making the content, surely they should be entitled to a slice of the AI pie. I’m going to hand out some tough love here: that’s a ridiculous and naive argument. Do these people believe they deserve a cut of ad revenue too, since they made a hit post that drew thousands of people to Reddit? This isn’t the same as AI creators quietly nabbing work from independent artists on Twitter.
At the end of the day, you’re never going to please everyone. If this deal has actual potential to improve not just Google Gemini, but Google Search in general (as well as Reddit’s site search), then the benefits arguably outweigh the costs - although I do think Reddit has a moral obligation to ensure that all of its users are fully informed about the use of their data.
A few paragraphs in the TOS aren’t enough, guys: you know full well nobody reads those.