Futurology

DeepSeek has open-sourced DeepSeek-OCR, a 3B-parameter model that upends how AI will deal with text, by being far more efficient than anything else available. (eu.36kr.com)

submitted 1 week ago by Lugh to c/futurology

FauxLiving@lemmy.world 3 points 1 week ago

DeepSeek was able to efficiently encode visual information for use by downstream networks. In this case, the downstream network was a language model trained on an OCR task.

The news is the technique; the OCR software is just a demonstration of it. Encoding visual information efficiently is also key for robots that use trained networks in their feedback control loops. Being able to process 10 times as much visual data on the same hardware is a very significant increase in capability.
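
To put rough numbers on that last point, here is a back-of-the-envelope sketch. It assumes the 10x figure acts as a simple per-page token reduction; the token budget, the tokens-per-page value, and the function name `pages_per_budget` are all made up for illustration and are not taken from DeepSeek's release.

```python
def pages_per_budget(token_budget: int,
                     text_tokens_per_page: int,
                     compression_ratio: float) -> int:
    """Count how many pages fit in a fixed token budget when each page
    costs text_tokens_per_page / compression_ratio tokens."""
    tokens_per_page = text_tokens_per_page / compression_ratio
    return int(token_budget // tokens_per_page)


# Hypothetical numbers, chosen only to make the arithmetic easy to follow.
budget = 8_000      # tokens the downstream network can take in
per_page = 1_000    # tokens one page costs as plain text

print(pages_per_budget(budget, per_page, 1.0))    # no compression
print(pages_per_budget(budget, per_page, 10.0))   # 10x compression
```

With the same budget, the compressed encoding fits ten times as many pages, which is the "same hardware, 10 times the data" argument in miniature.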