this post was submitted on 08 Aug 2025
chapotraphouse
I wanna see the results if you ask ChatGPT the same question a million times. What percentage of responses would actually get the correct number?
I think that heavily depends on whether it gets the initial answer right, since it will use that as context
When you're calling it through the API, you can simply choose not to pass it any prior context.
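Roughly like this (an untested sketch using the openai Python client; the model name and the counting question are just placeholders):

```python
# Hammer the API with the same question, no conversation history,
# and tally how often the answer comes back right.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

QUESTION = "How many r's are in the word strawberry?"
CORRECT = "3"
N = 1000  # a million would get expensive fast

correct = 0
for _ in range(N):
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model
        messages=[{"role": "user", "content": QUESTION}],  # fresh context every call
    )
    if CORRECT in resp.choices[0].message.content:
        correct += 1

print(f"{correct / N:.1%} of responses contained the right number")
```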
Fair enough
It depends on the temperature. There's a parameter you can play with that adds some randomness to the responses (with temperature at 0, LLMs are essentially deterministic). Sometimes the F1 or F2 score is used to measure correctness across many questions, but I don't have a great understanding of how that metric works or what ChatGPT's is.
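For what it's worth, F1 is just the harmonic mean of precision and recall over a labelled test set. A toy version with made-up numbers:

```python
# Toy F1 calculation: compare model answers against gold labels.
# "Positive" just means the label we care about; the counts are invented.
def f1_score(true_positives: int, false_positives: int, false_negatives: int) -> float:
    precision = true_positives / (true_positives + false_positives)
    recall = true_positives / (true_positives + false_negatives)
    return 2 * precision * recall / (precision + recall)

print(f1_score(true_positives=80, false_positives=10, false_negatives=20))  # ~0.842
```

F2 is the same idea, except it weights recall more heavily than precision.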