141

Stack Overflow bans users en masse for rebelling against OpenAI partnership — users banned for deleting answers to prevent them being used to train ChatGPT (www.tomshardware.com)

submitted 1 week ago by [email protected] to c/[email protected]

229 comments fedilink hide all child comments

top 50 comments

sorted by: hot top controversial new old

[-] [email protected] 251 points 1 week ago

If this is true, then we should prepare to be shout at by chatgpt why we didnt knew already that simple error.

[-] [email protected] 188 points 1 week ago

ChatGPT now just says “read the docs!” To every question

[-] [email protected] 152 points 1 week ago

Hey ChatGPT, how can I ...

"Locking as this is a duplicate of [unrelated question]"

[-] [email protected] 47 points 1 week ago

And then links to a similar sounding but ultimately totally unrelated site.

load more comments (1 replies)

load more comments (3 replies)

load more comments (2 replies)

[-] [email protected] 152 points 1 week ago

Take all you want, it will only take a few hallucinations before no one trusts LLMs to write code or give advice

[-] [email protected] 66 points 1 week ago

[…]will only take a few hallucinations before no one trusts LLMs to write code or give advice

Because none of us have ever blindly pasted some code we got off google and crossed our fingers ;-)

[-] [email protected] 67 points 1 week ago* (last edited 1 week ago)

It's way easier to figure that out than check ChatGPT hallucinations. There's usually someone saying why a response in SO is wrong, either in another response or a comment. You can filter most of the garbage right at that point, without having to put it in your codebase and discover that the hard way. You get none of that information with ChatGPT. The data spat out is not equivalent.

load more comments (1 replies)

load more comments (3 replies)

[-] [email protected] 41 points 1 week ago

We should already be at that point. We have already seen LLMs' potential to inadvertently backdoor your code and to inadvertently help you violate copyright law (I guess we do need to wait to see what the courts rule, but I'll be rooting for the open-source authors).

If you use LLMs in your professional work, you're crazy. I would never be comfortably opening myself up to the legal and security liabilities of AI tools.

load more comments (2 replies)

load more comments (23 replies)

[-] [email protected] 139 points 1 week ago

I got an email ban.

1609 hours logged 431 solved threads

[-] [email protected] 98 points 1 week ago* (last edited 1 week ago)

Reddit/Stack/AI are the latest examples of an economic system where a few people monetize and get wealthy using the output of the very many.

load more comments (2 replies)

[-] [email protected] 88 points 1 week ago

First, they sent the missionaries. They built communities, facilities for the common good, and spoke of collaboration and mutual prosperity. They got so many of us to buy into their belief system as a result.

Then, they sent the conquistadors. They took what we had built under their guidance, and claimed we "weren't using it" and it was rightfully theirs to begin with.

[-] [email protected] 74 points 1 week ago

You really don't need anything near as complex as AI...a simple script could be configured to automatically close the issue as solved with a link to a randomly-selected unrelated issue.

load more comments (4 replies)

[-] [email protected] 73 points 1 week ago

Maybe we should replace Stack Overflow with another site where experts can exchange information? We can call it "Experts Exchange".

[-] [email protected] 60 points 1 week ago

Expert Sex Change?

load more comments (5 replies)

[-] [email protected] 33 points 1 week ago

codidact ... Stack overflow had a mass exodus of mods a 2-3 years ago and a some of them made codidact.

load more comments (1 replies)

[-] [email protected] 72 points 1 week ago

Oh I didn't consider deleting my answers. Thanks for the good idea ~~Barbra~~ StackOverflow.

load more comments (6 replies)

[-] [email protected] 67 points 1 week ago

Messages that people post on Stack Exchange sites are literally licensed CC-BY-SA, the whole point of which is to enable them to be shared and used by anyone for any purpose. One of the purposes of such a license is to make sure knowledge is preserved by allowing everyone to make and share copies.

[-] [email protected] 74 points 1 week ago

That license would require chatgpt to provide attribution every time it used training data of anyone there and also would require every output using that training data to be placed under the same license. This would actually legally prevent anything chatgpt created even in part using this training data from being closed source. Assuming they obviously aren't planning on doing that this is massively shitting on the concept of licensing.

load more comments (4 replies)

[-] [email protected] 56 points 1 week ago

Share Alike

I can't wait to download my own version of the latest gpt model

load more comments (1 replies)

[-] [email protected] 63 points 1 week ago

I despise this use of mod power in response to a protest. It's our content to be sabotaged if we want - if Stack Overlords disagree then to hell with them.

I'll add Stack Overflow to my personal ban list, just below Reddit.

load more comments (2 replies)

[-] [email protected] 63 points 1 week ago

See, this is why we can't have nice things. Money fucks it up, every time. Fuck money, it's a shitty backwards idea. We can do better than this.

load more comments (3 replies)

[-] [email protected] 58 points 1 week ago

Eventually, we will need a fediverse version of StackOverflow, Quora, etc.

[-] [email protected] 72 points 1 week ago

Those would be harvested to train LLMs even without asking first. 😐

[-] [email protected] 42 points 1 week ago

At this point I’m assuming most if not all of these content deals are essentially retroactive. They already scrapped the content and found it useful enough to try and secure future use, or at least exclude competitors.

load more comments (2 replies)

[-] [email protected] 29 points 1 week ago

Honestly? I'm down with that. And when the LLM's end up pricing themselves out of usefulness, we'll still have the fediverse version. Having free sites on the net with solid crowd-sourced information is never a bad thing even if other people pick up the data and use it.

It's when private sites like Duolingo and Reddit crowd source the information and then slowly crank down the free aspect that we have the problems.

The Ad sponsored web model is not viable forever.

load more comments (1 replies)

[-] [email protected] 26 points 1 week ago

I’d rather the harvesting be open to all than only the company hosting it.

load more comments (6 replies)

load more comments (12 replies)

[-] [email protected] 56 points 1 week ago

primary use for AI is self destructing your website.

load more comments (3 replies)

[-] [email protected] 51 points 1 week ago

Begun, the AI wars have.

Faces on T-shirts, you must print print. Fake facts into old forum comments, you must edit. Poison the data well, you must.

load more comments (1 replies)

[-] [email protected] 49 points 1 week ago

The enshittification is very real and is spreading constantly. Companies will leech more from their employees and users until things start to break down. Acceleration is the only way.

[-] [email protected] 49 points 1 week ago

Data should be socialized and machine learning algorithms should be nationalized for public use.

[-] [email protected] 28 points 1 week ago

Better yet, copyright should be abolished completely.

load more comments (3 replies)

load more comments (1 replies)

[-] [email protected] 48 points 1 week ago

Reddit did almost the same and don't forget guys to delete your Reddit account

load more comments (6 replies)

[-] [email protected] 48 points 1 week ago

I'm going to run out of sites at this pace.

[-] [email protected] 40 points 1 week ago

Right? It seems like the modern internet is made up of like 5 monolithic sites, and unlimited SEO spam.

I know that's not literally true, but it sure feels like it.

load more comments (1 replies)

[-] [email protected] 45 points 1 week ago

A malicious response by users would be to employ an LLM instructed to write plausibly sounding but very wrong answers to historical and current questions, then an army of users upvoting the known wrong answer while downvoting accurate ones. This would poison the data I would think.

load more comments (3 replies)

[-] [email protected] 44 points 1 week ago

And the enshittification continues...

load more comments (1 replies)

[-] [email protected] 34 points 1 week ago

So they pulled a "reddit"?

load more comments (2 replies)

[-] [email protected] 31 points 1 week ago

Aren’t a lot of answers outdated on stackoverflow?

[-] [email protected] 69 points 1 week ago* (last edited 1 week ago)

You are now banned from stackoverflow

[-] [email protected] 29 points 1 week ago

And if you try to delete your comment, you'll be DOUBLE BANNED.

load more comments (4 replies)

load more comments (5 replies)

[-] [email protected] 31 points 1 week ago

RIP in pieces Stack Overflow

load more comments (2 replies)

[-] [email protected] 30 points 1 week ago* (last edited 1 week ago)

Time to download the last dump: https://archive.org/details/stackexchange

E: Seeding.

[-] [email protected] 28 points 1 week ago* (last edited 1 week ago)

While I think the reaction of StackOverflow is not good, I don't understand the users either.

EDIT: seems like the language model won't be free, I understand then.

[-] [email protected] 30 points 1 week ago

OpenAI is a terribly misleading name.

load more comments (2 replies)

[-] [email protected] 24 points 1 week ago* (last edited 1 week ago)

If we can't delete our questions and answers, can we poison the well by uploading masses of shitty questions and answers? If they like AI we could have it help us generate them.

[-] [email protected] 32 points 1 week ago

Poison the well by using AI-generated comments and answers. There isn't currently a way to reliably determine if content is human or AI-generated, and training AI on AI is the equivalent of inbreeding.

load more comments (2 replies)

load more comments (1 replies)

load more comments

this post was submitted on 08 May 2024

141 points (95.5% liked)

Technology

54242 readers

4434 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

founded 11 months ago

MODERATORS

[email protected]