KrasnaiaZvezda

joined 2 years ago
[–] KrasnaiaZvezda@lemmygrad.ml 4 points 1 month ago

First of all, DeepSeek-R1-0528-Qwen3-8B is not the Deepseek model people refer to when talking about Deepseek, so that's misleading to say the least. The actual Deepseek model is the 671B parameter model which they breafly mention but is not the main topic of the article as one would assume from the title. That model is really good, the best open source and one of the best in general, and it is possible to run locally, but requires some 200GB RAM/VRAM to run at the smallest qualities and 800GB+ RAM/VRAM if running at full quality.

As for the model the article is about and that you mentioned, it is based on the Qwen3-8B model which can be run in as little as ~5GB available RAM/VRAM quantized to q4_k_m, ie. it can be run on computers and even in some phones.

As for the target audience, anyone wanting privacy in their LLM uses or simply not paying for an API access for use in automation tasks or research. As this is a thinking version though it will take quite a few tokens to get to an answer, so it's better for people that have a GPU or those who simply need something more powerful locally sometimes.

[–] KrasnaiaZvezda@lemmygrad.ml 4 points 2 months ago

Since it is only decaying through Beta minus decay (a neutron decaying into a proton that stays in the nucleus, an electron that can be ejected and an antineutrino that can pass through the Earth without interacting with anything) it probably depends on how well it can capture the electrons, ie. it kinda depends on how efficient it is.

I'm also curious about the power curve. I guess the power falls with time, following the half life of the element (100 years for Ni63, meaning about 70% left after 50 years), but it would be nice if the article talked more about it, like if the diamond semiconductor can last that long. It should also be interesting how future batteries could use many elements with many decay routes that could keep the power output closer to constant for very long.

[–] KrasnaiaZvezda@lemmygrad.ml 8 points 3 months ago (1 children)

So, does anyone know how this technology works? Does it use quantum entangled photons or is it something else?

[–] KrasnaiaZvezda@lemmygrad.ml 5 points 3 months ago (1 children)

At least part of it seems to be a "guard-model" identifying that the topic might be illegal or something like that and just stoping the main model from going further while saying "can't do that". Other Chinese models, from what I heard, might handle things like this better, although in DeepSeek's case they might have gone too far because they might have been under attack early on from bad faith people asking all sorts of "questions" about China and things like that, but I can't say for sure.

As the other person said, you can either try running the model yourself, or if you don't have a computer with at least 400GB RAM lying around you can look for other services hosting DeepSeek. I can't guarantee they won't have "guard-models"/censorship themselves but there should be some without it.

But another point to consider though is that DeepSeek was still trained on a lot of western ~~data~~ propaganda, so don't expect impartiallity from DS or any other model for that matter. We may still be a ways off of models that can actually understand their bias and correct it properly.

[–] KrasnaiaZvezda@lemmygrad.ml 2 points 3 months ago

wouldn’t it just make more sense to rebuild the facility to be more efficient for cheaper and more sensible machinery

Yes, but that takes time and we need to have a good and mature enough tecnology for it to make at least some "financial" sense. So, if the humanoid robots can do work that humans can do then it means that we could try make/have AIs make new robots and new enviroments that work well toghether without consideration for human form.

Basically, from the first factory robots half a century ago until a few years back there wasn't that much of an advance that could allow robots to work alone in anything but a few cases. Now though the technology that allows for humanoid robots is the technology that allows non humanoid robots to work a lot better in basically all cases, so we will finally begin seeing more and more robots and infrastructure being built to take advantage of the new advances, at the same time we'll also get humanoid robots for other tasks, like some of the ones that involve working closelly with people...

I was never convinced by the argument that “Humanoid robots are better suited to working in environments that were built for humans”, like, WE are the ones who make these environments

And they aren't built for free so they were built to work, and now that the technology is allowing some of that to change it will gradually change.

[–] KrasnaiaZvezda@lemmygrad.ml 16 points 4 months ago (4 children)

The vast majority of infrastructure is made for humanoids, so having humanoid robots using what we have is much simpler. And then, when building new things with and for robot workers then they can be co-developed with non humanoid robots for maximum efficiency.

[–] KrasnaiaZvezda@lemmygrad.ml 9 points 4 months ago

Nice to see airships making such a nice comeback!

[–] KrasnaiaZvezda@lemmygrad.ml 5 points 5 months ago

It is more expensive although Deepseek's model is quite cheaper than the others making this much less of a factor. Aditionally these "reasoning models" aren't necessecerilly better for every task though, so for many things a normal, and cheaper model, might be prefered still.

[–] KrasnaiaZvezda@lemmygrad.ml 1 points 5 months ago

Agreed. My point was about the process. I am saying she signed off on a lucrative grift. But it makes her a grifter rather than a crypto sis. I would call her a crypto sis if she endorsed the technology. But rich people only care about money and not the technology.

Saying something like that instead, pehaps

[–] KrasnaiaZvezda@lemmygrad.ml -1 points 5 months ago (2 children)

Partly is because the way you said it seemed like you wouldn't have used the same words if it was a middle aged man, part is because it seemed to imply that she couldn't understand this even if she tried to, while again making no mention of the men not being able to understand it, and if anything, assuming they do understand it, at least better than her.

[–] KrasnaiaZvezda@lemmygrad.ml 0 points 5 months ago (4 children)

But I am sure she is clueless about it.

That sounds sexist as well...

view more: next ›