• About Us
  • Contact Us
  • Advertise
  • Privacy Policy
  • Guest Post
No Result
View All Result
Digital Phablet
  • Home
  • NewsLatest
  • Technology
    • Education Tech
    • Home Tech
    • Office Tech
    • Fintech
    • Digital Marketing
  • Social Media
  • Gaming
  • Smartphones
  • AI
  • Reviews
  • Interesting
  • How To
  • Home
  • NewsLatest
  • Technology
    • Education Tech
    • Home Tech
    • Office Tech
    • Fintech
    • Digital Marketing
  • Social Media
  • Gaming
  • Smartphones
  • AI
  • Reviews
  • Interesting
  • How To
No Result
View All Result
Digital Phablet
No Result
View All Result

Home » DeepSeek Open-Source 3B OCR Model: 97% Accuracy Breaks Text Compression Limits

DeepSeek Open-Source 3B OCR Model: 97% Accuracy Breaks Text Compression Limits

Seok Chen by Seok Chen
October 21, 2025
in AI
Reading Time: 1 min read
A A
D658DCF1DB5D11485D3F21BE52A2C69C381D90B8 size103 w579 h328.png
ADVERTISEMENT

Select Language:

In recent news, DeepSeek has made a significant breakthrough by open-sourcing its latest research— the DeepSeek-OCR model—on GitHub. The team has unveiled this advanced optical character recognition (OCR) model, which boasts approximately 3 billion parameters. This marks their initial exploration into the feasibility of using “optical 2D mapping compression” technology for processing long-text contexts.

ADVERTISEMENT

The core structure of the DeepSeek-OCR model consists of two main components: the DeepEncoder and the DeepSeek3B-MoE-A570M decoder. The DeepEncoder is designed to operate efficiently under high-resolution input conditions, maintaining low activation levels while achieving high compression ratios and generating an adequate number of visual tokens. These visual tokens are then precisely transformed into textual information by the decoder.

According to experimental results, the model demonstrates impressive performance when the number of text tokens is kept within ten times the number of visual tokens—that is, with a compression rate of less than 10x. Under these conditions, the OCR recognition accuracy reaches as high as 97%. Even when the compression rate is increased to 20x, the model maintains an accuracy level of around 60%, showcasing its robustness and potential for handling heavily compressed long texts.

The research team emphasizes that this development opens new avenues for the study of long contextual compression techniques and offers fresh insights into the memory and forgetting mechanisms within large language models. This breakthrough could pave the way for more efficient processing of lengthy textual data in various artificial intelligence applications.

ChatGPT ChatGPT Perplexity AI Perplexity Gemini AI Logo Gemini AI Grok AI Logo Grok AI
Google Banner
ADVERTISEMENT
Seok Chen

Seok Chen

Seok Chen is a mass communication graduate from the City University of Hong Kong.

Related Posts

Which Character Should You Play as in Raccoin: Coin Pusher Roguelike?
Gaming

Which Character Should You Play as in Raccoin: Coin Pusher Roguelike?

April 3, 2026
AI

Title: US Tech Layoffs Hit Record High in 2023, Over 50,000 Laid Off So Far

April 3, 2026
Understanding Constipation: Causes & Easy Remedies
Health

Understanding Constipation: Causes & Easy Remedies

April 3, 2026
USA vs Isreal vs Iran Military Strength
Infotainment

USA Israel and Iran Military Strength Comparison

April 3, 2026
Next Post
How to Play as Ryu in Every Chapter of Ninja Gaiden 4 by Completing and Solving

How to Play as Ryu in Every Chapter of Ninja Gaiden 4 by Completing and Solving

  • About Us
  • Contact Us
  • Advertise
  • Privacy Policy
  • Guest Post

© 2026 Digital Phablet

No Result
View All Result
  • Home
  • News
  • Technology
    • Education Tech
    • Home Tech
    • Office Tech
    • Fintech
    • Digital Marketing
  • Social Media
  • Gaming
  • Smartphones

© 2026 Digital Phablet