this post was submitted on 27 Jan 2025
139 points (97.3% liked)
Futurology
It'll be interesting to see if this model was so cheap because the Chinese skipped years of development and got a jump start by stealing tech from other AI companies.
DeepSeek put out a highly detailed paper explaining how they optimized their model training, released the model itself, released their reinforcement learning code, put permissive open source licenses on everything... and people wonder if they got there by stealing stuff, because they're Chinese. Sheesh.
Tbf, the reputation has been earned. Look at the incredible volume of bunk science coming out of China. The pervasive spying campaigns. The loads of off-brand software and hardware. It's not like there isn't reason to be suspicious.
ah yes the spying ...
There's no need for that image because I'm not making superficial claims of data collection, I'm talking about theft and potentially deadly attacks.
Here you go:
They've shown themselves to be more than capable
U.S. Government Disrupts Botnet People’s Republic of China Used to Conceal Hacking of Critical Infrastructure
They've infiltrated
DOJ confirms FBI operation that mass-deleted Chinese malware from thousands of US computers
They've stolen
Industrial espionage: How China sneaks out America's technology secrets
It could get deadly
US sanctions China cyber firm for potentially deadly ransomware attack
Both their private and public sectors have been implicated a number of times.
Projection
Ask it how to stop Putin:
To counter Putin's forces in this apocalyptic scenario, here’s a strategic plan:
Using LLMs for that purpose is not very intelligent.
They tend to lack highly specialized logic and spatial reasoning, as well as long-term consistency. Also, dimwits are part of the training set.
Yeah, because the Chinese are directed and controlled by the CCP, who've made it the bedrock of their entire economy by straight up gankin' Western technology and patents. Yes, the motherfucking Chinese. Thieves!
Even if that were true, it's fair game. After all, the OpenAI models etc. are entirely based on stolen content as well.
It cost so little because all the previous open source work was already done, and a lot of the research work had already been knocked out. Building models isn't the time-consuming process it used to be; it's the training, testing, retraining loop that's expensive.
If you're just building a model that is focused on specific things (like coding, math, and logic), then you don't need large swathes of content from the internet. You can just train it on already solved, freely available information. If you want to piss away money on an LLM that also knows how many celebrities each celebrity has diddled, well, that costs a lot more to make.
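To make the "already solved" point a bit more concrete, here's a rough sketch of why that kind of data is cheap to use: when a problem has a known answer, you can score a model's output with a few lines of code instead of human labels. This is only an illustration. The function names and the `\boxed{...}` answer convention are my own assumptions, not anything taken from DeepSeek's paper or code.

```python
import re
from typing import Optional


def extract_final_answer(completion: str) -> Optional[str]:
    """Return the text inside the last \\boxed{...} in the completion, or None.

    Assumption for this sketch: the model is asked to wrap its final
    answer in \\boxed{...}. Nested braces are not handled.
    """
    matches = re.findall(r"\\boxed\{([^{}]*)\}", completion)
    return matches[-1].strip() if matches else None


def accuracy_reward(completion: str, reference: str) -> float:
    """Score 1.0 if the extracted answer equals the known solution, else 0.0."""
    answer = extract_final_answer(completion)
    if answer is None:
        return 0.0
    return 1.0 if answer == reference.strip() else 0.0


if __name__ == "__main__":
    # Hypothetical completions for a problem whose known answer is "42".
    print(accuracy_reward("6 * 7 = 42, so the result is \\boxed{42}", "42"))
    print(accuracy_reward("I think it's \\boxed{41}", "42"))
```

A plain string comparison like this would miss equivalent answers written differently (say `0.5` vs `1/2`), so a real setup would need something smarter. It's just to show that checking against a known solution costs next to nothing.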
From someone in the field
https://github.com/huggingface/open-r1
Unfortunately, that's not very clear without more detail. What kind of reward model are they talking about?
This is potentially a 1000x difference in required resources, assuming you believe DeepSeek's quoted figure for spending, so it would have to be an extraordinary change.