this post was submitted on 03 Aug 2025
353 points (97.1% liked)

Technology

[–] BreadstickNinja@lemmy.world 7 points 1 day ago (2 children)

Yes, Ollama or a range of other backends (Ooba, Kobold, etc.) can run LLMs locally. Hugging Face hosts a huge number of models suited to different tasks: coding, storywriting, general-purpose use, and so on. If you run both the backend and the frontend locally, then no one monetizes your data.
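For anyone curious what "running locally" looks like in practice: Ollama exposes a local HTTP API on port 11434 by default. A minimal sketch of querying it from Python (the model name "llama3" is just an example; this assumes you have the server running and the model pulled):

```python
import json
import urllib.request

# Ollama's local HTTP API listens on port 11434 by default.
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_request(model: str, prompt: str) -> dict:
    # stream=False asks for one complete JSON response instead of chunks
    return {"model": model, "prompt": prompt, "stream": False}

def ask(model: str, prompt: str) -> str:
    payload = json.dumps(build_request(model, prompt)).encode()
    req = urllib.request.Request(
        OLLAMA_URL,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        # The non-streaming reply carries the full completion in "response"
        return json.loads(resp.read())["response"]

if __name__ == "__main__":
    print(ask("llama3", "In one sentence, what is RAID?"))
```

Everything stays on your machine: the only network traffic is to localhost.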

The part I'd argue the previous poster is glossing over a bit is performance. Unless you have an enterprise-grade GPU cluster sitting in your basement, you're going to make compromises on speed and/or quality relative to the giant models that run on commercial services.

[–] tal@lemmy.today 1 points 1 day ago (1 children)

It's also going to cost more, because you'll almost certainly be using your hardware only a tiny fraction of the time.

[–] BreadstickNinja@lemmy.world 2 points 1 day ago* (last edited 1 day ago)

Possibly, yes. There are models that will run on a consumer-grade GPU you might already own or would have bought anyway, in which case you could say there's no incremental cost. But the issue is that performance will be limited: the smaller models are forgetful and prone to getting stuck in loops of repeated phrases.

So if you instead custom-build a workstation with two 5090s or an RTX Pro 6000, something that pushes you up into the 100 GB VRAM tier, then absolutely, just as you said, you'll be spending thousands of dollars that probably won't pay off relative to renting cloud GPU time.
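A crude back-of-the-envelope way to see why those tiers matter (a rule of thumb, not a benchmark): weight memory is roughly parameter count times bytes per weight, plus some headroom for the KV cache and activations. The 20% overhead factor here is an assumption, not a measured figure:

```python
def estimate_vram_gb(params_billions: float, bits_per_weight: int,
                     overhead: float = 1.2) -> float:
    """Rough VRAM needed to load a model: weights at the given
    quantization, plus ~20% headroom for KV cache and activations."""
    bytes_per_weight = bits_per_weight / 8
    return params_billions * bytes_per_weight * overhead

# A 70B-parameter model at 4-bit quantization comes out around 42 GB:
# too big for a single consumer card, but within reach of the
# multi-GPU or workstation-class builds discussed above.
print(round(estimate_vram_gb(70, 4), 1))
```

The same arithmetic shows why small quantized models (say, 7B at 4 bits, roughly 4 GB) fit comfortably on ordinary gaming GPUs while the big ones don't.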

[–] puck@lemmy.world 1 points 1 day ago

Thanks for the info. Yeah, I was wondering what kind of hardware you'd need to host LLMs locally with decent performance, and your post clarifies that. I doubt many people have the kind of hardware required.