this post was submitted on 13 Nov 2024
496 points (95.8% liked)

Technology

59340 readers
6004 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] [email protected] 11 points 1 hour ago

Thank fuck. Can we have cheaper graphics cards again please?

I'm sure a RTX 4090 is very impressive, but it's not £1800 impressive.

[–] [email protected] 2 points 1 hour ago* (last edited 1 hour ago)

Marcus is right, incremental improvements in AIs like ChatGPT will not lead to AGI and were never on that course to begin with. What LLMs do is fundamentally not "intelligence", they just imitate human response based on existing human-generated content. This can produce usable results, but not because the LLM has any understanding of the question. Since the current AI surge is based almost entirely on LLMs, the delusion that the industry will soon achieve AGI is doomed to fall apart - but not until a lot of smart speculators have gotten in and out and made a pile of money.

[–] [email protected] 22 points 4 hours ago

It's so funny how all this is only a problem within a capitalist frame of reference.

[–] [email protected] 16 points 4 hours ago (4 children)

The hype should go the other way. Instead of bigger and bigger models that do more and more - have smaller models that are just as effective. Get them onto personal computers; get them onto phones; get them onto Arduino minis that cost $20 - and then have those models be as good as the big LLMs and Image gen programs.

[–] [email protected] 2 points 42 minutes ago

Other than with language models, this has already happened: Take a look at apps such as Merlin Bird ID (identifies birds fairly well by sound and somewhat okay visually), WhoBird (identifies birds by sound, ) Seek (visually identifies plants, fungi, insects, and animals). All of them work offline. IMO these are much better uses of ML than spammer-friendly text generation.

[–] [email protected] 7 points 1 hour ago

This has already started to happen. The new llama3.2 model is only 3.7GB and it WAAAAY faster than anything else. It can thow a wall of text at you in just a couple of seconds. You're still not running it on $20 hardware, but you no longer need a 3090 to have something useful.

[–] [email protected] 1 points 1 hour ago* (last edited 1 hour ago)

Well, you see, that's the really hard part of LLMs. Getting good results is a direct function of the size of the model. The bigger the model, the more effective it can be at its task. However, there's something called compute efficient frontier (technical but neatly explained video about it). Basically you can't make a model more effective at their computations beyond said linear boundary for any given size. The only way to make a model better, is to make it larger (what most mega corps have been doing) or radically change the algorithms and method underlying the model. But the latter has been proving to be extraordinarily hard. Mostly because to understand what is going on inside the model you need to think in rather abstract and esoteric mathematical principles that bend your mind backwards. You can compress an already trained model to run on smaller hardware. But to train them, you still need the humongously large datasets and power hungry processing. This is compounded by the fact that larger and larger models are ever more expensive while providing rapidly diminishing returns. Oh, and we are quickly running out of quality usable data, so shoveling more data after a certain point starts to actually provide worse results unless you dedicate thousands of hours of human labor producing, collecting and cleaning the new data. That's all even before you have to address data poisoning, where previously LLM generated data is fed back to train a model but it is very hard to prevent it from devolving into incoherence after a couple of generations.

[–] [email protected] 1 points 2 hours ago (1 children)

That would be innovation, which I'm convinced no company can do anymore.

It feels like I learn that one of our modern innovations was already thought up and written down into a book in the 1950s, and just wasn't possible at that time due to some limitation in memory, precision, or some other metric. All we did was do 5 decades of marginal improvement to get to it, while not innovating much at all.

[–] [email protected] 2 points 2 hours ago

Are you talking about something specific?

[–] [email protected] 12 points 5 hours ago

This is why you're seeing news articles from Sam Altman saying that AGI will blow past us without any societal impact. He's trying to lessen the blow of the bubble bursting for AI/ML.

[–] [email protected] 9 points 5 hours ago

It's been 5 minutes since the new thing did a new thing. Is it the end?

[–] [email protected] 6 points 6 hours ago (1 children)

As I use copilot to write software, I have a hard time seeing how it'll get better than it already is. The fundamental problem of all machine learning is that the training data has to be good enough to solve the problem. So the problems I run into make sense, like:

  1. Copilot can't read my mind and figure out what I'm trying to do.
  2. I'm working on an uncommon problem where the typical solutions don't work
  3. Copilot is unable to tell when it doesn't "know" the answer, because of course it's just simulating communication and doesn't really know anything.

2 and 3 could be alleviated, but probably not solved completely with more and better data or engineering changes - but obviously AI developers started by training the models on the most useful data and strategies that they think work best. 1 seems fundamentally unsolvable.

I think there could be some more advances in finding more and better use cases, but I'm a pessimist when it comes to any serious advances in the underlying technology.

[–] [email protected] 5 points 4 hours ago* (last edited 4 hours ago) (1 children)

Not copilot, but I run into a fourth problem:
4. The LLM gets hung up on insisting that a newer feature of the language I'm using is wrong and keeps focusing on "fixing" it, even though it has access to the newest correct specifications where the feature is explicitly defined and explained.

[–] [email protected] 4 points 1 hour ago (1 children)

Oh god yes, ran into this asking for a shell.nix file with a handful of tricky dependencies. It kept trying to do this insanely complicated temporary pull and build from git instead of just a 6 line file asking for the right packages.

[–] [email protected] 3 points 1 hour ago (1 children)

"This code is giving me a return value of X instead of Y"

"Ah the reason you're having trouble is because you initialized this list with brackets instead of new()."

"How would a syntax error give me an incorrect return"

"You're right, thanks for correcting me!"

"Ok so like... The problem though."

[–] [email protected] 1 points 51 minutes ago

Yeah, once you have to question its answer, it's all over. It got stuck and gave you the next best answer in it's weights which was absolutely wrong.

You can always restart the convo, re-insert the code and say what's wrong in a slightly different way and hope the random noise generator leads it down a better path :)

I'm doing some stuff with translation now, and I'm finding you can restart the session, run the same prompt and get better or worse versions of a translation. After a few runs, you can take all the output and ask it to rank each translation on correctness and critique them. I'm still not completely happy with the output, but it does seem that sometime if you MUST get AI to answer the question, there can be value in making it answer it across more than one session.

[–] [email protected] 23 points 9 hours ago (5 children)

Huh?

The smartphone improvements hit a rubber wall a few years ago (disregarding folding screens, that compose a small market share, improvement rate slowed down drastically), and the industry is doing fine. It's not growing like it use to, but that just means people are keeping their smartphones for longer periods of time, not that people stopped using them.

Even if AI were to completely freeze right now, people will continue using it.

Why are people reacting like AI is going to get dropped?

[–] [email protected] 6 points 3 hours ago* (last edited 3 hours ago)

People are dumping billions of dollars into it, mostly power, but it cannot turn profit.

So the companies who, for example, revived a nuclear power facility in order to feed their machine with ever diminishing returns of quality output are going to shut everything down at massive losses and countless hours of human work and lifespan thrown down the drain.

This will have an economic impact quite large as many newly created jobs go up in smoke and businesses who structured around the assumption of continued availability of high end AI need to reorganize or go out of business.

Search up the Dot Com Bubble.

[–] [email protected] 14 points 5 hours ago

Because in some eyes, infinite rapid growth is the only measure of success.

[–] [email protected] 3 points 6 hours ago (1 children)

Because novelty is all it has. As soon as it stops improving in a way that makes people say "oh that's neat", it has to stand on the practical merits of its capabilities, which is, well, not much.

[–] [email protected] 5 points 5 hours ago (2 children)

I’m so baffled by this take. “Create a terraform module that implements two S3 buckets with cross-region bidirectional replication. Include standard module files like linting rules and enable precommit.” Could I write that? Yes. But does this provide an outstanding stub to start from? Also yes.

And beyond programming, it is otherwise having positive impact on science and medicine too. I mean, anybody who doesn’t see any merit has their head in the sand. That of course must be balanced with not falling for the hype, but the merits are very real.

[–] [email protected] 2 points 1 hour ago

The merits are real. I do understand the deep mistrust people have for tech companies, but there's far too much throwing out of the baby with the bath water.

As a solo developer, LLMs are a game-changer. They've allowed me to make amazing progress on some of my own projects that I've been stuck on for ages.

But it's not just technical subjects that benefit from LLMs. ChatGPT has been a great travel guide for me. I uploaded a pic of some architecture in Berlin and it went into the history of it, I asked it about some damage to an old church in Spain - turned out to be from the Spanish civil war, where revolutionaries had been mowed down by Franco's firing squads.

Just today, I was getting help from an LLM for an email to a Portuguese removals company. I sent my message in English with a Portuguese translation, but the guy just replied back with a single sentence in broken English:

"Yes a can , need tho mow m3 you need delivery after e gif the price"

The first bit is pretty obviously "Yes I can" but I couldn't really be sure what he was trying to say with the rest of it. So I asked ChatGPT who responded:

It seems he's saying he can handle the delivery but needs to know the total volume (in cubic meters) of your items before he can provide a price. Here's how I’d interpret it:

“Yes, I can [do the delivery]. I need to know the [volume] in m³ for delivery, and then I’ll give you the price.”

Thanks to LLMs, I'm able to accomplish so many things that would have previously taken multiple internet searches and way more effort.

[–] [email protected] 2 points 2 hours ago (1 children)

There's a pretty big difference between chatGPT and the science/medicine AIs.

And keep in mind that for LLMs and other chatbots, it's not that they aren't useful at all but that they aren't useful enough to justify their costs. Microsoft is struggling to get significant uptake for Copilot addons in Microsoft 365, and this is when AI companies are still in their "sell below cost and light VC money on fire to survive long enough to gain market share" phase. What happens when the VC money dries up and AI companies have to double their prices (or more) in order to make enough revenue to cover their costs?

[–] [email protected] 1 points 14 minutes ago

Nothing to argue with there. I agree. Many companies will go out of business. Fortunately we’ll still have the llama3’s and mistral’s laying around that I can run locally. On the other hand cost justification is a difficult equation with many variables, so maybe it is or will be in some cases worth the cost. I’m just saying there is some merit.

[–] [email protected] 9 points 9 hours ago (1 children)

People pay real money for smartphones.

[–] [email protected] 9 points 8 hours ago

People pay real Money for AIaaS as well..

[–] [email protected] 3 points 9 hours ago
[–] [email protected] 13 points 10 hours ago (1 children)

I am so tired of the ai hype and hate. Please give me my gen art interest back please just make it obscure again to program art I beg of you

[–] [email protected] 3 points 5 hours ago* (last edited 5 hours ago) (1 children)

It's still quite obscure to actually mess with AI art instead of just throwing prompts at it, resulting in slop of varying quality levels. And I don't mean controlnet, but github repos with comfyui plugins with little explanation but a link to a paper, or "this is absolutely mathematically unsound but fun to mess with". Messing with stuff other than conditioning or mere model selection.

[–] [email protected] 2 points 2 hours ago

I know, it's actually still a beautiful community but much harder to talk to outsiders about

[–] [email protected] 4 points 8 hours ago

Theres no bracing for this, OpenAI CEO said the same thing like a year ago and people are still shovelling money at this dumpster fire today.

[–] [email protected] 4 points 8 hours ago

It's had all the signs of a bubble for the last few years.

[–] [email protected] 17 points 12 hours ago

Good. I look forward to all these idiots finally accepting that they drastically misunderstood what LLMs actually are and are not. I know their idiotic brains are only able to understand simple concepts like "line must go up" and follow them like religious tenants though so I'm sure they'll waste everyone's time and increase enshitification with some other new bullshit once they quietly remove their broken (and unprofitable) AI from stuff.

[–] [email protected] 5 points 10 hours ago

Sigh I hope LLMs get dropped from the AI bandwagon because I do think they have some really cool use cases and love just running my little local models. Cut government spending like a madman, write the next great American novel, or eliminate actual jobs are not those use cases.

load more comments
view more: next ›