this post was submitted on 05 Mar 2025
9 points (100.0% liked)

Technology

1060 readers
25 users here now

A tech news sub for communists

founded 2 years ago
MODERATORS
top 3 comments
sorted by: hot top controversial new old
[–] marl_karx@lemmygrad.ml 3 points 2 days ago (1 children)

Isnt deepseek based on qwen? at least the distilled models?

[–] yogthos@lemmygrad.ml 3 points 2 days ago

I think so, but this looks like an update of qwen with some new tricks.

[–] yogthos@lemmygrad.ml 7 points 3 days ago* (last edited 3 days ago)

can grab it here

I find it absolutely wild how quickly we went from needing a full blown data centre to run models of this scale to being able to run them on a laptop.