pcalau12i

joined 4 months ago
[–] pcalau12i@lemmygrad.ml 1 points 1 month ago

So usually this is explained with two scientists, Alice and Bob, on far away planets. They’re each in the possession of a particle that is entangled with the other, and in a superposition of state 1 and state 2.

This "usual" way of explaining it is just overly complicating it and making it seem more mystical than it actually is. We should not say the particles are "in a superposition" as if this describes the current state of the particle. The superposition notation should be interpreted as merely a list of probability amplitudes predicting the different likelihoods of observing different states of the system in the future.

It is sort of like if you flip a coin, while it's in the air, you can say there is a 50% chance it will land heads and a 50% chance it will land tails. This is not a description of the coin in the present as if the coin is in some smeared out state of 50% landed heads and 50% landed tails. It has not landed at all yet!

Unlike classical physics, quantum physics is fundamentally random, so you can only predict events probabilistically, but one should not conflate the prediction of a future event to the description of the present state of the system. The superposition notation is only writing down probability amplitudes of the likelihoods of what you will observe (state 1 or state 2) of the particles in the future event that you go to the interact with it and is not a description of the state of the particles in the present.

When Alice measures the state of her particle, it collapses into one of the states, say state 1. When Bob measures the state of his particle immediately after, before any particle travelling at light speed could get there, it will also be in state 1 (assuming they were entangled in such a way that the state will be the same).

This mistreatment of the mathematical notation as a description of the present state of the system also leads to confusing language like "it collapses into one of the states" as if the change in a probability distribution represents a physical change to the system. The mental picture people say this often have is that the particle literally physically becomes the probability distribution prior to measuring it---the particle "spreads out" like a wave according to the probability amplitudes of the state vector---and when you measure the particle, this allows you to update the probabilities, and so they must interpret this as the wave physically contracting into an eigenvalue---it "collapses" like a house of cards.

But this is, again, overcomplicating things. The particle never spreads out like a wave and it never "collapses" back into a particle. The mathematical notation is just a way of capturing the likelihoods of the particle showing up in one state or the other, and when you measure what state it actually shows up in, then you can update your probabilities accordingly. For example, if you the coin is 50%/50% heads/tails and you observe it land on tails, you can update the probabilities to 0%/100% heads/tails because you know it landed on tails and not heads. Nothing "collapsed": you're just observing the actual outcome of the event you were predicting and updating your statistics accordingly.

[–] pcalau12i@lemmygrad.ml 2 points 1 month ago

Any time you do something to the particles on Earth, the ones on the Moon are affected also

The no-communication theorem already proves that manipulating one particle in an entangled pair has no impact at al on another. The proof uses the reduced density matrices of the particles which capture both their probabilities of showing up in a particular state as well as their coherence terms which capture their ability to exhibit interference effects. No change you can make to one particle in an entangled pair can possibly lead to an alteration of the reduced density matrix of the other particle.

[–] pcalau12i@lemmygrad.ml 1 points 1 month ago

I don't think solving the Schrodinger equation really gives you a good idea of why quantum mechanics is even interesting. You also shouldstudy very specific applications of it where it yields counterintuitive outcomes to see why it is interesting, such as in the GHZ experiment.

[–] pcalau12i@lemmygrad.ml 2 points 1 month ago* (last edited 1 month ago)

There is no "consciousness." False belief in "consciousness" is a product of Kantianism, which itself was heavily inspired by Newtonian physics (Kant was heavily inspired by Newton), which we have changed some categories over the years but the fundamentals have not and have become deeply integrated into western psyche in how we think about the world, and probably in many other cultures as well.

Modern day philosophers have just renamed Kant's phenomena to "consciousness" or "subjective experience" and renamed his "noumena" to "matter." Despite the renaming, the categories are still treated identically: the "consciousness" is everything we perceive, and the "matter" is something invisible, the true physical thing-in-itself beyond our perception and what "causes" our perception.

Since all they have done is rename Kant's categories, they do not actually solve Kant's mind-body problem, but have just rediscovered it and thus renamed it in the form of the "hard problem of consciousness," which is ultimately the same exact problem just renamed: that there seems to be a "gap" between this "consciousness" and "matter."

Most modern day philosophers seem to split into two categories. The first are the "promissory materialists" who just say there is a real problem here but shrug their shoulders and say one day science will solve it so we don't have to worry about it, but give no explanation of what a solution could even possibly look like. The second are the mystics who insist this "consciousness" can't be reconciled with "matter" because it must be some fundamental force of reality. They talk about things like "consciousness fields" or "cosmic consciousness" or whatever.

However, both are wrong. Newtonian physics is not an accurate represent of reality, we already know this, and so the Kantian mindset inspired from it should also be abandoned. When you abandon the Kantian mindset, there is no longer a need for the "phenomena" and "noumena" division, or, in modern lingo, there is no longer a need for the "consciousness" and "matter" division. There is just reality.

Imagine you are looking at a candle. The apparent size of the candle you will see will depend upon how far you are away from it: if you are further away it appears smaller. Technically, light doesn't travel at an infinite speed, and so the further away you are, the further in the past you are seeing the candle. The candle also may appear a bit different under different lighting conditions.

A Kantian would say there is a true candle, the "candle-in-itself," or, in modern lingo, the material candle, the "causes" all these different perceptions. The perceptions themselves are then said to be brain-generated, not part of the candle, not even something real at all, but something purely immaterial, part of the phenomena, or, in modern lingo, part of "consciousness."

If every possible perception of the candle is part of "consciousness," then the candle-in-itself, the actual material object, must be independent of perception, i.e. it's invisible. No observation can reveal it because all observations are part of "consciousness." This is the Kantian worldview: everything we perceive is part of a sort of illusion created within the mind as opposed to the "true" world that is entirely imperceptible. The mind-body problem, or in modern lingo the "hard problem," then arises as to how an entirely imperceptible (non-phenomenal/non-conscious) world can give rise to what we perceive in a particular configuration.

However, the Kantian worldview is a delusion. In Newtonian physics, if I launch a cannonball from point A to point B, simply observing it at point A and point B is enough to fill in the gaps to say where the object was at every point in between A and B independently of anything else. This Newtonian worldview allows us to conceive of the cannonball as a thing-in-itself, an object with its own inherent properties that can be meaningful conceived of existing even when in complete isolation, that always has an independent of history of how it ends up where it does.

As Schrodinger pointed out, this mentality does not apply to modern physics. If you fire a photon from point A to point B and observe it at those two points, you cannot always meaningfully fill in the gaps of what the photon was doing in between those two points without running into contradictions. As Schrodinger concluded, one has to abandon the notion that particles really are independent autonomous entities with their own independent existence that can be meaningfully conceived of in complete isolation. They only exist from moment to moment in the context of whatever they are interacting with and not in themselves.

If this is true for particles, it must also be true of everything made up of particles: there is no candle-in-itself either. It's a high-level abstraction that doesn't really exist. What we call the "candle" is not an independent unobservable entity separate from all our different perceptions of it, but what we call the candle is precisely the totality of all the different ways it is and can be perceived, all the different ways it interacts with other objects from those objects' perspectives.

Kant justified the noumena by arguing that it makes no sense to talk about objects "appearing" (the word "phenomena" means "the appearance of") without there being something that is doing the appearing (the noumena). He is correct on this, but for a different reason. We should not use this to justify the noumena, but it shows that if we reject the noumena, we must also reject the phenomena ("consciousness"): it makes no sense to treat the different instances of a candle as some sort of separate "consciousness" realm, or some sort of illusion or whatever independent of the real material world as it really is.

No, what we perceive directly is material reality as it actually is. Reality is what you are immersed in every day, what surrounds you, what you are experiencing in this very moment. It is not some illusion from which there is a "true" invisible reality beyond it. When you look at the candle, you are seeing the candle as it really is from your own perspective. That is the real candle in the real world. The Kantian distinction between noumena-phenomena (or between "matter" and "consciousness") should be abandoned. It is just not compatible with the modern physical sciences.

But I know no one will even know what I'm talking about, so writing this is rather pointless. Kantianism is too deeply ingrained into the western psyche, people cannot even comprehend that it is possible to criticize it because it underlies how they think about everything. This nonsense debate about "consciousness" will continue forever, in ten thousand years people will still be arguing over it, because it's an intrinsic problem that arises out of the dualistic structure in Kantian thinking. If you begin from the get-go with an assumption that there is a division between mind and matter, you cannot close this division without contradicting yourself, which leads to this debate around "consciousness." But it seems unrealistic at this point to get people to abandon this dualistic way of thinking, so it seems like the "consciousness" debate will proceed forever.

[–] pcalau12i@lemmygrad.ml 2 points 1 month ago (1 children)

You have not made any point at all. Your first reply to me entirely ignored the point of my post which you did not read followed with an attack, I reply pointing out you ignored the whole point of my post and just attacked me without actually respond to it, and now you respond again with literally nothing of substance at all just saying "you're wrong! touch grass! word salad!"

You have nothing of substance to say, nothing to contribute to the discussion. You are either a complete troll trying to rile me up, or you just have a weird emotional attachment to this topic and felt an emotional need to respond and attack me prior to actually thinking up a coherent thing to criticize me on. Didn't your momma ever teach you that "if you have nothing positive or constructive to say, don't say anything at all"? Learn some manners, boy. Blocked.

[–] pcalau12i@lemmygrad.ml 1 points 1 month ago

They are incredibly efficient for short-term production, but very inefficient for long-term production. Destroying the environment is a long-term problem that doesn't have immediate consequences on the businesses that engage in it. Sustainable production in the long-term requires foresight, which requires a plan. It also requires a more stable production environment, i.e. it cannot be competitive because if you are competing for survival you will only be able to act in your immediate interests to avoid being destroyed in the competition.

Most economists are under a delusion known as neoclassical economics which is literally a nonphysical theory that treats the basis of the economy as not the material world we actually live in but abstract human ideas which are assumed to operate according to their own internal logic without any material causes or influences. They then derive from these imagined "laws" regarding human ideas (which no one has ever experimentally demonstrated but were just invented in some economists' armchair one day) that humans left to be completely free to make decisions without any regulations at all will maximize the "utils" of the population, making everyone as happy as possible.

With the complete failure of this policy leading to the US Great Depression, many economists recognized this was flawed and made some concessions, such as with Keynesianism, but they never abandoned the core idea. In fact, the core idea was just reformulated to be compatible with Keynesianism in what is called the neoclassical synthesis. It still exists as a fundamental belief to most every economist that completely unregulated market economy without any plan at all will automagically produce a society with maximal happiness, and while they will admit some caveats to this these days (such as the need for a central organization to manage currency in Keynesianism), these are treated as an exception and not the rule. Their beliefs are still incompatible with long-term sustainable planning because in their minds the success of markets from comes util-maximizing decisions built that are fundamental to the human psyche and so any long-term plan must contradict with this and lead to a bad economy that fails to maximize utils.

The rise of Popperism in western academia has also played a role here. A lot of material scientists have been rather skeptical of the social sciences and aren't really going to take arguments like those based in neoclassical economics which is based largely in mysticism about human free will seriously, and so a second argument against long-term planning was put forward by Karl Popper which has become rather popular in western academia. Popper argued that it is impossible to learn from history because it is too complicated with too many variables and you cannot control them all. You would need a science that studies how human societies develop in order to justify a long-term development plan into the future, but if it's impossible to study them to learn how they develop because they are too complicated, then it is impossible to have such a science, and thus impossible to justify any sort of long-term sustainable development plan. It would always be based on guesswork and so more likely to do more harm than good. Popper argued that instead of long-term development plans, the state should instead be purely ideological, what he called an "open society" operating purely on the ideology of liberalism rather getting involved in economics.

As long as both neoclassical economics and Popperism are dominate trends in western academia there will never be long-term sustainable planning because they are fundamentally incompatible ideas.

[–] pcalau12i@lemmygrad.ml 1 points 1 month ago (3 children)

You did not read what I wrote, so it is unironic you call it "word salad" when you are not even aware of the words I wrote since you had an emotional response and wrote this reply without actually addressing what I argued. I stated that it is impossible to have an very large institution without strict rules that people follow, and this requires also the enforcement of the rules, and that means a hierarchy as you will have rule-enforcers.

Also, you are insisting your personal definition of anarchism is the one true definition that I am somehow stupid for disagreeing with, yet anyone can just scroll through the same comments on this thread and see there are other people disagreeing with you while also defending anarchism. A lot of anarchists do not believe anarchism means "no hierarchy," like, seriously, do you unironically believe in entirely abolishing all hierarchies? Do you think a medical doctor should have as much authority on how to treat an injured patient as the janitor of the same hospital? Most anarchists aren't even "no hierarchy" they are "no unjustified hierarchy."

The fact you are entirely opposed to hierarchy makes your position even more silly than what I was criticizing.

[–] pcalau12i@lemmygrad.ml 3 points 1 month ago* (last edited 1 month ago) (8 children)

All libertarian ideologies (including left and right wing anarchism) are anti-social and primitivist.

It is anti-social because it arises from a hatred of working in a large groups. It's impossible to have any sort of large-scale institution without having rules that people want to follow, and libertarian ideology arises out of people hating to have to follow rules, i.e. to be a respectable member of society, i.e. they hate society and don't want to be social. They thus desire very small institutions with limited rules and restrictions. Right-wing libertarians envision a society dominated by small private businesses while left-wing libertarians imagine a society dominated by either small worker-cooperative, communes, or some sort of community council.

Of course, everyone of all ideologies opposes submitting to hierarchies they find unjust, but hatred of submitting to hierarchies at all is just anti-social, as any society will have rules, people who write the rules, people who enforce the rules. It is necessary for any social institution to function. It is part of being an adult and learning to live in a society to learn to obey the rules, such as traffic rules. Sometimes it is annoying or inconvenient, but you do it because you are a respectable member of society and not a rebellious edgelord who makes things harder on everyone else because they don't obey basic rules.

It is primitivist because some institutions simply only work if they are very large. You cannot have something like NASA that builds rocket ships operated by five people. You are going to always need an enormous institution which will have a ton of people, a lot of different levels of command ("hierarchy"), strict rules for everyone to follow, etc. If you tried to "bust up" something like NASA or SpaceX to be small businesses they simply would lose their ability to build rocket ships at all.

Of course, anarchists don't mind, they will say, "who cares about rockets? They're not important." It reminds me of the old meme that spread around where someone asked anarchists how their tiny communes would be able to organize current massive supply chains in our modern societies and they responded by saying that the supply chain would be reduced to just people growing beans in their backyard and eating it, like a feudal peasant. They won't even defend that their system could function as well as our modern economy but just says modern marvels of human engineering don't even matter, because they are ultimately primitivists at heart.

I never understood the popularity of libertarian and anarchist beliefs in programming circles. We would never have entered the Information Age if we had an anarchism or libertarian system. No matter how much they might pretend these are the ideal systems, they don't even believe it themselves. If a libertarian has a serious medical illness, they are either going to seek medical help at a public hospital or a corporate hospital. Nobody is going to seek medical help at a "hospital small business" ran out of someone's garage. We all intuitively and implicitly understand that large swathes of economy that we all take advantage of simply cannot feasibly be ran by small organizations, but libertarians are just in denial.

[–] pcalau12i@lemmygrad.ml 5 points 1 month ago* (last edited 1 month ago)

Anarchism thus becomes meaningless as anyone who defends certain hierarchies obviously does so because they believe they are just. Literally everyone on earth is against "unjust hierarchies" at least in their own personal evaluation of said hierarchies. People who support capitalism do so because they believe the exploitative systems it engenders are justifiable and will usually immediately tell you what those justifications are. Sure, you and I might not agree with their argument, but that's not the point. To say your ideology is to oppose "unjust hierarchies" is to not say anything at all, because even the capitalist, hell, even the fascist would probably agree that they oppose "unjust hierarchies" because in their minds the hierarchies they promote are indeed justified by whatever twisted logic they have in their head.

Telling me you oppose "unjust hierarchies" thus tells me nothing about what you actually believe, it does not tell me anything at all. It is as vague as saying "I oppose bad things." It's a meaningless statement on its own without clarifying what is meant by "bad" in this case. Similarly, "I oppose unjust hierarchies" is meaningless statement without clarifying what qualifies "just" and "unjust," and once you tell me that, it would make more sense you label you based on your answer to that question. Anarchism thus becomes a meaningless word that tells me nothing about you. For example, you might tell me one unjust hierarchy you want to abolish is prison. It would make more sense for me to call you a prison abolitionist than an anarchist since that term at least carries meaning, and there are plenty of prison abolitionists who don't identify as anarchist.

[–] pcalau12i@lemmygrad.ml 0 points 2 months ago* (last edited 2 months ago)

There is no "fundamentally" here, you are referring to some abstraction that doesn't exist. The models are modified during the fine-tuning process, and the process trains them to learn to adopt DeepSeek R1's reasoning technique. You are acting like there is some "essence" underlying the model which is the same between the original Qwen and this model. There isn't. It is a hybrid and its own thing. There is no such thing as "base capability," the model is not two separate pieces that can be judged independently. You can only evaluate the model as a whole. Your comment is just incredibly bizarre to respond to because you are referring to non-existent abstractions and not actually speaking of anything concretely real.

The model is neither Qwen nor DeepSeek R1, it is DeepSeek R1 Qwen Distill as the name says. it would be like saying it's false advertising to say a mule is a hybrid of a donkey and a horse because the "base capabilities" is a donkey and so it has nothing to do with horses, and it's really just a donkey at the end of the day. The statement is so bizarre I just do not even know how to address it. It is a hybrid, it's its own distinct third thing that is a hybrid of them both. The model's capabilities can only be judged as it exists, and its capabilities differ from Qwen and the original DeepSeek R1 as actually scored by various metrics.

Do you not know what fine-tuning is? It refers to actually adjusting the weights in the model, and it is the weights that define the model. And this fine-tuning is being done alongside DeepSeek R1, meaning it is being adjusted to take on capabilities of R1 within the model. It gains R1 capabilities at the expense of Qwen capabilities as DeepSeek R1 Qwen Distill performs better on reasoning tasks but actually not as well as baseline models on non-reasoning tasks. The weights literally have information both of Qwen and R1 within them at the same time.

Speaking of its "base capabilities" is a meaningless floating abstraction which cannot be empirically measured and doesn't refer to anything concretely real. It only has its real concrete capabilities, not some hypothetical imagined capabilities. You accuse them of "marketing" even though it is literally free. All DeepSeek sells is compute to run models, but you can pay any company to run these distill models. They have no financial benefit for misleading people about the distill models.

You genuinely are not making any coherent sense at all, you are insisting a hybrid model which is objectively different and objectively scores and performs differently should be given the exact same name, for reasons you cannot seem to actually articulate. It clearly needs a different name, and since it was created utilizing the DeepSeek R1 model's distillation process to fine-tune it, it seems to make sense to call it DeepSeek R1 Qwen Distill. Yet for some reason you insist this is lying and misrepresenting it and it actually has literally nothing to do with DeepSeek R1 at all and it should just be called Qwen and we should pretend it is literally the same model despite it not being the same model as its training weights are different (you can do a "diff" on the two model files if you don't believe me!) and it performs differently on the same metrics.

There is simply no rational reason to intentionally want to mislabel the model as just being Qwen and having no relevance to DeepSeek R1. You yourself admitted that the weights are trained on R1 data so they necessarily contain some R1 capabilities. If DeepSeek was lying and trying to hide that the distill models are based on Qwen and Llama, they wouldn't have literally put that in the name to let everyone know, and released a paper explaining exactly how those were produced.

It is clear to me that you and your other friends here have some sort of alternative agenda that makes you not want to label it correctly. DeepSeek is open about the distill models using Qwen and Llama, but you want them to be closed and not reveal that they also used DeepSeek R1. The current name for it is perfectly fine and pretending it is just a Qwen model (or Llama, for the other distilled versioned) is straight-up misinformation, and anyone who downloads the models and runs them themselves will clearly see immediately that they perform differently. It is a hybrid model correctly called what they are: DeepSeek R1 Qwen Distill and DeepSeek R1 Llama Distill.

[–] pcalau12i@lemmygrad.ml 5 points 2 months ago* (last edited 2 months ago) (2 children)

The 1.5B/7B/8B/13B/32B/70B models are all officially DeepSeek R1 models, that is what DeepSeek themselves refer to those models as. It is DeepSeek themselves who produced those models and released them to the public and gave them their names. And their names are correct, it is just factually false to say they are not DeepSeek R1 models. They are.

The "R1" in the name means "reasoning version one" because it does not just spit out an answer but reasons through it with an internal monologue. For example, here is a simple query I asked DeepSeek R1 13B:

Me: can all the planets in the solar system fit between the earth and the moon?

DeepSeek: Yes, all eight planets could theoretically be lined up along the line connecting Earth and the Moon without overlapping. The combined length of their diameters (approximately 379,011 km) is slightly less than the average Earth-Moon distance (about 384,400 km), allowing them to fit if placed consecutively with no required spacing.

However, on top of its answer, I can expand an option to see its internal monologue it went through before generating the answer, which you can find the internal monologue here because it's too long to paste.

What makes these consumer-oriented models different is that that rather than being trained on raw data, they are trained on synthetic data from pre-existing models. That's what the "Qwen" or "Llama" parts mean in the name. The 7B model is trained on synthetic data produced by Qwen, so it is effectively a compressed version of Qen. However, neither Qwen nor Llama can "reason," they do not have an internal monologue.

This is why it is just incorrect to claim that something like DeepSeek R1 7B Qwen Distill has no relevance to DeepSeek R1 but is just a Qwen model. If it's supposedly a Qwen model, why is it that it can do something that Qwen cannot do but only DeepSeek R1 can? It's because, again, it is a DeepSeek R1 model, they add the R1 reasoning to it during the distillation process as part of its training. They basically use synthetic data generated from DeepSeek R1 to fine-tune readjust its parameters so it adopts a similar reasoning style. It is objectively a new model because it performs better on reasoning tasks than just a normal Qwen model. It cannot be considered solely a Qwen model nor an R1 model because its parameters contain information from both.

[–] pcalau12i@lemmygrad.ml 2 points 2 months ago* (last edited 2 months ago)

As I said, they will likely come to the home in form of cloud computing, which is how advanced AI comes to the home. You can run some AI models at home but they're nowhere near as advanced as cloud-based services and so not as useful. I'm not sure why, if we ever have AGI, it would need to be run at home. It doesn't need to be. It would be nice if it could be ran entirely at home, but that's no necessity, just a convenience. Maybe your personal AGI robot who does all your chores for you only works when the WiFi is on. That would not prevent people from buying it, I mean, those Amazon Fire TVs are selling like hot cakes and they only work when the WiFi is on. There also already exists some AI products that require a constant internet connection.

It is kind of similar with quantum computing, there actually do exist consumer-end home quantum computers, such as Triangulum, but it only does 3 qubits, so it's more of a toy than a genuinely useful computer. For useful tasks, it will all be cloud-based in all likelihood. The NMR technology Triangulum is based on, it's not known to be scalable, so the only other possibility that quantum computers will make it to the home in a non-cloud based fashion would be optical quantum computing. There could be a breakthrough there, you can't rule it out, but I wouldn't keep my fingers crossed. If quantum computers become useful for regular people in the next few decades, I would bet it would be all through cloud-based services.

view more: ‹ prev next ›