Is there an actual way to stop it? I don’t think so. At least, moving to the fediverse would stop any particular corporation from having the monopoly of it, prevent reddit-like abuse of power, would give users more power, among a few other things.
How restrictive do you want to be with the accounts? If you’re too restrictive, there won’t be enough users. If you’re not restrictive enough, the data will be used for AI training.
That defeats the purpose of a knowledge base. The whole reason why everyone is using SO is that you don’t need an account to access it and it’s fully indexed by Google.
The real question is why the fuck are people ok with Google indexing SO and not OpenAI? Doesn’t make any fucking sense.
The real question is why the fuck are people ok with Google indexing SO and not OpenAI? Doesn’t make any fucking sense.
Because Google is free and OpenAI isn’t. It’s one thing to take free content, index it, then allow anyone to access that index. It’s another thing when you take free content, index it, then hide that index behind a paywall.
Are you sure? Because Google is not free at all, you’re paying for it through privacy invasion and ads. While ChatGPT is actually free to use for end users - no ads, nothing.
As your link is for custom enterprise solutions, it’s worth noting that Google has the same shit which also costs money https://cloud.google.com/pricing/
It’s “freemium”, not free. There is a difference. You can’t use ChatGPT 4 without paying as well as the API. Also, you are limited in the number of prompts you can make per hour before you are put on pause and asked to pay.
Search engines like Ecosia, DuckDuckGo, etc. don’t ask you for money. Regardless how intensively you use it. (They might come with other drawbacks though like Google with privacy, environment, ethical principles, …)
What does Google’s cloud service have to do with what we’re discussing (Google indexing content vs. SO OpenAI doing it)? They’re not even similar services.
The price difference is that google steals your data. That’s it. OpenAI steals data, ask for money to use most of their models, and buy even more data from other companies stealing user data (like google and SO). Also indexing web pages is not even the “stealing” part of google, it’s just not comparable.
Yes, training AI on user data for free then selling the end product is a reasonable thing to be concerned about. It’d be different if the product was free or the data was sold to them with user consent.
SO has announced a subscription-based service trained on user data for free, and not only there’s not even opt-out, they’re mass-banning users for trying to “opt-out” manually. Tell me one thing here that’s not completely fucked up.
Maybe we need a technical questions and answers siteon the fediverse!
Not gonna stop your knowledge being fed to an AI.
Is there an actual way to stop it? I don’t think so. At least, moving to the fediverse would stop any particular corporation from having the monopoly of it, prevent reddit-like abuse of power, would give users more power, among a few other things.
what about instances that need you to be logged in to view posts and require authorized requests for federation?
All it needs is an account to access troves of training data?
That should be manually approved
How restrictive do you want to be with the accounts? If you’re too restrictive, there won’t be enough users. If you’re not restrictive enough, the data will be used for AI training.
That defeats the purpose of a knowledge base. The whole reason why everyone is using SO is that you don’t need an account to access it and it’s fully indexed by Google.
The real question is why the fuck are people ok with Google indexing SO and not OpenAI? Doesn’t make any fucking sense.
Because Google is free and OpenAI isn’t. It’s one thing to take free content, index it, then allow anyone to access that index. It’s another thing when you take free content, index it, then hide that index behind a paywall.
Are you sure? Because Google is not free at all, you’re paying for it through privacy invasion and ads. While ChatGPT is actually free to use for end users - no ads, nothing.
https://openai.com/api/pricing/
No, it’s free https://chatgpt.com/
As your link is for custom enterprise solutions, it’s worth noting that Google has the same shit which also costs money https://cloud.google.com/pricing/
It’s “freemium”, not free. There is a difference. You can’t use ChatGPT 4 without paying as well as the API. Also, you are limited in the number of prompts you can make per hour before you are put on pause and asked to pay.
Search engines like Ecosia, DuckDuckGo, etc. don’t ask you for money. Regardless how intensively you use it. (They might come with other drawbacks though like Google with privacy, environment, ethical principles, …)
What does Google’s cloud service have to do with what we’re discussing (Google indexing content vs.
SOOpenAI doing it)? They’re not even similar services.Edit: SO -> OpenAI
The price difference is that google steals your data. That’s it. OpenAI steals data, ask for money to use most of their models, and buy even more data from other companies stealing user data (like google and SO). Also indexing web pages is not even the “stealing” part of google, it’s just not comparable.
Yes, training AI on user data for free then selling the end product is a reasonable thing to be concerned about. It’d be different if the product was free or the data was sold to them with user consent.
SO has announced a subscription-based service trained on user data for free, and not only there’s not even opt-out, they’re mass-banning users for trying to “opt-out” manually. Tell me one thing here that’s not completely fucked up.
But it’s free. Unlike Google.
Nothing stopping them from scraping that too