technology
On the road to fully automated luxury gay space communism.
Spreading Linux propaganda since 2020
- Ways to run Microsoft/Adobe and more on Linux
- The Ultimate FOSS Guide For Android
- Great libre software on Windows
- Hey you, the lib still using Chrome. Read this post!
Rules:
- 1. Obviously abide by the sitewide code of conduct. Bigotry will be met with an immediate ban
- 2. This community is about technology. Offtopic is permitted as long as it is kept in the comment sections
- 3. Although this is not /c/libre, FOSS related posting is tolerated, and even welcome in the case of effort posts
- 4. We believe technology should be liberating. As such, avoid promoting proprietary and/or bourgeois technology
- 5. Explanatory posts to correct the potential mistakes a comrade made in a post of their own are allowed, as long as they remain respectful
- 6. No crypto (Bitcoin, NFT, etc.) speculation, unless it is purely informative and not too cringe
- 7. Absolutely no tech bro shit. If you have a good opinion of Silicon Valley billionaires please manifest yourself so we can ban you.
These things suck and will literally destroy the world and the human spirit from the inside out no matter who makes them
I think this kind of statement needs to be more elaborate to have proper discussions about it.
LLMs can really be summarized as “squeezing the entire internet into a black box that can be queried at will”. It has many use cases but even more potential for misuse.
All forms of AI (artificial intelligence in the literal sense) as we know it (i.e., not artificial general intelligence or AGI) are just statistical models that do not have the capacity to think, have no ability to reason and cannot critically evaluate or verify a certain piece of information, which can equally come from legitimate source or some random Reddit post (the infamous case of Google AI telling you to put glue on your pizza can be traced back to a Reddit joke post).
These LLM models are built by training on the entire internet’s datasets using a transformer architecture that has very good memory retention, and more recently, with reinforcement learning with human input to reduce their tendency to produce incorrect output (i.e. hallucinations). Even then, these dataset require extensive tweaking and curation and OpenAI famously employ Kenyan workers at less than $2 per hour to perform the tedious work of dataset annotation used for training.
Are they useful if you just need to pull up a piece of information that is not critical in the real world? Yes. Is it useful if you don’t want to do your homework and just let the algorithm solve everything for you? Yes (of course, there is an entire discussion about future engineers/doctors who are “trained” by relying on these AI models and then go on to do real things in the real world without developing the capacity to think/evaluate for themselves). Would you ever trust it if your life depends on it (i.e. building a car, plane or a house, or treating an illness)? Hell no.
A simple test case is to ask yourself if you would ever trust an AI model over a trained physician to treat your illness? A human physician has access to real-world experience that an AI will never have (no matter how much medical literature it can devour on the internet), has the capacity to think and reason and thus the ability to respond to anomalies which have never been seen before.
An AI model needs thousands of images to learn the difference between a cat and a dog, a human child can learn that with just a few examples. Without a huge input dataset (helped annotated by an army of underpaid Kenyan workers), the accuracy is simply crap. The fundamental process of learning is very different between the two, and until we have made advances on AGI (which is as far as you could get from the current iterations of AI), we’ll always have to deal with the potential misuses of AI in our lives.
that's a deeply reactionary take
LLMs are literally reactionary by design but go off
What does that even mean
they "react" to your input and every letter after i guess?? lmao
Hard disk drives are literally revolutionary by design because they spin around. Embrace the fastest spinning and most revolutionary storage media
sorry sweaty, ssds are problematic
Scratch a SSD and a NVMe bleeds.
Sufi whirling is the greatest expression of revolutionary spirit in all of time.
Pushing glasses up nose further than you ever thought imaginable *every token after
hey man come here i have something to show you
It's a model with heavy cold war liberalism bias (due to information being fed to it), unless you prompt it - you'll get freedom/markets/entrepreneurs out of it for any problem. As people are treating them as gospel of the impartial observer -
The fate of the world will be ultimately decided on garbage answers spewed out by an LLM trained on Reddit posts. That’s just how the future leaders of the world will base their decisions on.
That's not the technology's fault though, it's just that the technology is produced by an imperialist capitalist society that treats cold war propaganda as indisputable fact.
Feed different data to the machine and you will get different results. For example if you just train a model on CIA declassified documents it will be able to answer questions about the real role of the CIA historically. Add a subjective point of view on these events and it can either answer you with right wing bullshit if that's what you gave it, or a marxist analysis of the CIA as an imperialist weapon that it is.
As with technology in general, it's effect on society lies with the hands that wield it.
"let's just use autocorrect to create the future this is definitely cool and not regressive and reactionary and a complete recipe for disaster"
It's technology with many valid use-cases. The misapplication of the technology by capital doesn't make the tech itself inherently reactionary.
LLMs literally cannot do anything else other than reproduce data it has been given. The closer the output is to the input, the better it is. Now if the input is "all the data that capitalism has produced" then the expected output is "an infinite amount of variations on that data". That's why it is reactionary.
It's incredibly power hungry.
The context of the discussion is that it's already 50x less power hungry than just a little while ago.
For now. We've been seeing great strides in reducing that power hunger recently, including by the LLM that's the subject of this post.
That also doesn't make it inherently reactionary.
We've been seeing great strides in reducing that power hunger recently, including by the LLM that's the subject of this post.
Due to the market economy in both the United State and China, further development of LLM efficiency is probably the worst thing that could possibly happen. Even if China did not want to subject LLMs to market forces, they are going to need to compete with the US. This is going further accelerate the climate disaster.
Again, an issue with capitalism and not the technology itself.
Well I agree with you there. Too bad there's all this capitalism.
For now. Are we supposed to just halt all technological progress because capitalism is inevitably going to misuse it? Should we stop trying to develop new medical treatments and drugs because capitalism is going to prevent all but the wealthiest from accessing them in our lifetime?
Regardless, my point was that the tech itself isn't inherently reactionary. Not that it won't be misused under capitalism.
Vacuum tubes were too
This is a stupid take. I like the autocorrect analogy generally, but this veers into Luddite-ism.
Let me add, the way we're pushed to use LLMs is pretty dumb and a waste of time and resources, but the technology has pretty fascinating use-cases in material and drug discovery.
🙄
how do you measure performance of an llm? ask it how many 'r's there are in 'strawberry' and how many times you have to say 'no thats wrong' until it gets 3
Basically speed and power usage to process a query. Also, there's been tangible progress in doing reasoning with unsupervised learning seen in DeepSeek R1 and approaches such as neurosymbolics. These types of models can actually explain the steps they take to arrive at the answer, and you can correct them.
it requires fewer tons of CO2 to tell you that 757 * 128 = 3042
They use synthetic AI generated benchmarks
It's computer silicon blowing itself basically