this post was submitted on 28 Jan 2025
63 points (86.2% liked)

Technology

61263 readers
3973 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 
  • Chinese AI lab DeepSeek launched the DeepSeek-R1 model, rivaling OpenAI in math reasoning and code generation.

  • The model is (in part?*)open-sourced for global research use.

  • Requires way less computing power than competitors like Meta.

  • Competes with OpenAI in critical areas such as mathematical reasoning, code generation, and cost efficiency

  • Overcame U.S. chip export restrictions through optimized architecture.

  • Big Tech are sore loosers

*DeepSeek employs a dual licensing structure for its models. The codebase for DeepSeek-Coder-V2 is released under the MIT License, which allows unrestricted use, modification, and distribution. However, the pre-trained models are governed by the DeepSeek License Agreement, permitting research and commercial use with specific restrictions to prevent harmful applications. While DeepSeek's models are open in many aspects, some argue they do not fully meet all criteria for being considered "open source" due to these licensing nuances

all 9 comments
sorted by: hot top controversial new old
[–] sunzu2@thebrainbin.org 22 points 3 days ago

The biggest thing deepseek did though is that it exposed entire US "AI" industry for the grifters that they are.

These same parasites ruined crypto too

[–] Giooschi@lemmy.world -4 points 3 days ago (5 children)

It's not open-source, stop spreading disinformation. The core of the product are the model weights and no source is provided for them, making them irreproducible. This is as open source as distributing a single exe file because after all you can read the assembly code, no?

[–] Gumus@lemmy.world 10 points 3 days ago

I prefer to call these models "open-weights". However, "open-source" is widely used and understood in this context. Not an intentional disinformation.

[–] dreadbeef@lemmy.dbzer0.com 4 points 3 days ago* (last edited 2 days ago)

You are fighting a losing battle. I understand why you think that, but the organization that owns the trademarks of open source do not agree with you (or me). I also disagree with that organization's definition of open source AI, but they own the legal right to define the meaning of "open source" in the technology trade, the trade in which they own the trade mark. But laws are laws and you either abide by them (as a corp, what are you gonna do?) or don't (fuck yeah, commit crimes).

Weights are the only thing you'll get from "open source" ai. You need to look for stricter legal definitions to meet your understandable criteria.

[–] ekZepp@lemmy.world 6 points 3 days ago* (last edited 2 days ago) (1 children)

I've used the original title of the article and checked some sources.

https://github.com/deepseek-ai/DeepSeek-V2/blob/main/LICENSE-CODE

https://github.com/huggingface/open-r1

Give me better sources and i'll change the title. 👇

[–] ekZepp@lemmy.world 8 points 3 days ago

Still waiting for links, but feel free to downvote instead 🤷

[–] filister@lemmy.world 0 points 3 days ago (1 children)

Please show me an LLM model that is really open source. My understanding is that most of the open models are open weights. For the record Mistral is also releasing Open weights models.

[–] Giooschi@lemmy.world 2 points 3 days ago

The fact that no widely used LLM is open source is not a good reason to change its meaning.