Isn't DeepSeek based on Qwen? At least the distilled models?
this post was submitted on 05 Mar 2025
I think so, but this looks like an update of Qwen with some new tricks.
can grab it here
I find it absolutely wild how quickly we went from needing a full-blown data centre to run models of this scale to being able to run them on a laptop.