this post was submitted on 25 Sep 2025
        
      
      81 points (94.5% liked)
      Technology
    76365 readers
  
      
      1268 users here now
      This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related news or articles.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
        founded 2 years ago
      
      MODERATORS
      
    you are viewing a single comment's thread
view the rest of the comments
    view the rest of the comments
NPUs are very scammy, with all use vendor specific proprietary, often undocumented, implementations that are often incompatible with previous vendor architectures. Microsoft is makeing DirectML, but AMD/Intel (different NPUs that keep changing) aren't fully supported. Copilot does manage to do some minimal AI use. Their small LLM is snapdragon elite only. but 27 tokens/s for 1.6gb ram (4 bit int quantized) is much lower than x86 (or gpu) performance on similar sized models. ultra low power use is the benefit, but so far, any chip die space given to NPU is, IMO, a waste of money, partly because it is a dark black box that only Microsoft has the key to.
Yeah I agree on these fronts. The hardware might be good but software frameworks need to support it, which historically has been very hit or miss.