• About Us
  • Contact Us
  • Advertise
  • Privacy Policy
  • Guest Post
No Result
View All Result
Digital Phablet
  • Home
  • NewsLatest
  • Technology
    • Education Tech
    • Home Tech
    • Office Tech
    • Fintech
    • Digital Marketing
  • Social Media
  • Gaming
  • Smartphones
  • AI
  • Reviews
  • Interesting
  • How To
  • Home
  • NewsLatest
  • Technology
    • Education Tech
    • Home Tech
    • Office Tech
    • Fintech
    • Digital Marketing
  • Social Media
  • Gaming
  • Smartphones
  • AI
  • Reviews
  • Interesting
  • How To
No Result
View All Result
Digital Phablet
No Result
View All Result

Home » DeepSeek-OCR 2 Launches: AI Reads Complex Documents Like Humans

DeepSeek-OCR 2 Launches: AI Reads Complex Documents Like Humans

Seok Chen by Seok Chen
January 27, 2026
in AI
Reading Time: 1 min read
A A
4FC16A93FAACA53639D703BD604D365352FBC701 size90 w690 h588.jpg
ADVERTISEMENT

Select Language:

On January 27th, the DeepSeek team announced the release of their new research paper titled “DeepSeek-OCR 2: Visual Causal Flow,” along with the open-source availability of the DeepSeek-OCR 2 model. This innovative model introduces a novel encoding architecture called DeepEncoder V2, which dynamically adjusts the processing sequence of visual information based on the semantics of the image. Essentially, the model intelligently sorts visual content before performing text recognition, mimicking human reading patterns more closely.

ADVERTISEMENT

Traditionally, visual language models have segmented images into multiple visual tokens, processing them in a fixed, grid-like order from the top-left to bottom-right. While straightforward, this approach doesn’t align well with how humans navigate complex documents, tables, or mathematical formulas—often jumping between elements based on semantic and logical relationships.

The DeepSeek research team emphasized that their breakthrough stems from rethinking the conventional treatment of visual data. Especially in scenarios involving complex layouts, visual elements often possess logical sequences and hierarchies. Relying solely on spatial order can limit a model’s understanding of the structural and semantic relationships within a document.

To validate their model’s effectiveness, the team conducted extensive testing on the OmniDocBench v1.5 benchmark, which includes a wide variety of Chinese and English documents such as academic papers, magazines, and reports. The benchmark assesses capabilities like text recognition, formula parsing, table reconstruction, and reading order comprehension.

ADVERTISEMENT

Results from these evaluations are promising. When working with lower visual token limits, DeepSeek-OCR 2 achieved an overall accuracy score of 91.09%, representing a 3.73% improvement over its predecessor. Notably, in reading order accuracy, the model significantly reduced the editing distance from 0.085 to 0.057, indicating a more precise understanding of document structures and logic.

ChatGPT ChatGPT Perplexity AI Perplexity Gemini AI Logo Gemini AI Grok AI Logo Grok AI
Google Banner
ADVERTISEMENT
Seok Chen

Seok Chen

Seok Chen is a mass communication graduate from the City University of Hong Kong.

Related Posts

AI

Wu Yonghui Takes Over Byte Seed This Year

February 9, 2026
Lawmaker: Ghislaine Maxwell Will Not Answer Questions in Deposition
News

Lawmaker: Ghislaine Maxwell Will Not Answer Questions in Deposition

February 9, 2026
Top 10 Hardest vs Easiest Languages to Learn

Hardest Languages to Learn

1)  Ma
Infotainment

Top 10 Hardest and Easiest Languages to Learn

February 9, 2026
Trump Criticizes Bad Bunny's Super Bowl Halftime Show as 'One of the Worst'
Entertainment

Trump Criticizes Bad Bunny’s Super Bowl Halftime Show as ‘One of the Worst’

February 9, 2026
Next Post

How to Reset Your HP 7000 Series Smart Tank and Fix Printing Issues

  • About Us
  • Contact Us
  • Advertise
  • Privacy Policy
  • Guest Post

© 2026 Digital Phablet

No Result
View All Result
  • Home
  • News
  • Technology
    • Education Tech
    • Home Tech
    • Office Tech
    • Fintech
    • Digital Marketing
  • Social Media
  • Gaming
  • Smartphones

© 2026 Digital Phablet