Futurology

3404 readers

198 users here now

founded 2 years ago

MODERATORS

Cheap-to-run Open Source Chinese AI is equalling and bettering the AI investors are pouring hundreds of billions of dollars into, which may have profound future implications. (self.futurology)

submitted 9 months ago by Lugh to c/futurology

11 comments fedilink hide all child comments

Throughout 2024 Open Source AI has been slowly catching up with investor-funded AI, but in the first weeks of 2025 that has dramatically accelerated. Now Open Source isn't just catching up, it is arguably better and superior to investor-funded AI.

Restrictions on chip imports seem to be driving Chinese innovation, not slowing them down. Using lesser chips, they've optimized AI to run cheaper and more efficiently, but be just as powerful. Not only that, they've open-sourced that AI.

Where does that leave the hundreds of billions poured into investor-funded AI? Who knows. But they've no product to sell that people can't get elsewhere way cheaper or for free.

This also means AI will become decentralized and democratized. Many thought it would just be in the hands of Big Tech, but the exact opposite scenario is playing out.

What are the economic implications? AI hype is keeping the US stock market afloat - how long can that last?

Source

you are viewing a single comment's thread
view the rest of the comments

[–] voracitude@lemmy.world 4 points 9 months ago* (last edited 9 months ago) (3 children)

Yes, really. But like I said, I was running it locally in ~~LMStudio~~ ollama. I think it was the 1B model, which is pretty small; I'd be surprised if that's what's running on their service.

Edit: it was ollama webui turns out, and the 1.5B model. This won't make it any more believable because image manipulation exists, but I shared a screenshot with my colleague Thursday when it happened:

[–] FaceDeer@fedia.io 9 points 9 months ago

I think it was the 1B model

Well there you go, you took a jet ski and then complained that it was having difficulty climbing steep inclines in mountains.

Small models like that are not going to "know" much. Their purpose is generally to process whatever information you give them. For example you could use one to quickly and cheaply categorize documents based on their contents, or use one as a natural-language interface you could use to ask it to execute commands on other tools.

[–] pennomi@lemmy.world 6 points 9 months ago (1 children)

Ah yeah, anything under 7B gets kind of dicey imo. Especially if quantized.

[–] cyd@lemmy.world 3 points 9 months ago* (last edited 9 months ago)

The smaller models are not the actual R1 model, they are distills of R1 based on smaller models like Qwen. Those were released by Deepseek as proofs of concept for how to imbue weaker models with reasoning abilities. It's like seeing a dancing bear: the marvel is not that it dances well, but that it dances at all.