Digital Phablet

DeepSeek-OCR 2 Launches: AI Reads Complex Documents Like Humans

by Seok Chen
January 27, 2026
in AI

On January 27, the DeepSeek team released a research paper titled “DeepSeek-OCR 2: Visual Causal Flow” and open-sourced the accompanying DeepSeek-OCR 2 model. The model introduces a new encoder architecture, DeepEncoder V2, which dynamically adjusts the order in which visual information is processed based on the semantics of the image. In effect, the model sorts visual content into a sensible reading sequence before performing text recognition, which more closely mimics how humans read.

Traditionally, vision-language models split an image into visual tokens and process them in a fixed, grid-like order from top-left to bottom-right. While straightforward, this approach does not match how humans navigate complex documents, tables, or mathematical formulas, where readers often jump between elements based on semantic and logical relationships.
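
To make the mismatch concrete, here is a minimal toy sketch in Python. It is not DeepSeek's code: the grid, the block labels, and the helper names are all hypothetical, and it only shows how a fixed raster scan over a two-column page interleaves the columns, whereas a human reader would finish the left column first.

```python
# Toy illustration only (hypothetical labels, not DeepSeek's implementation).
# A 2-row x 4-column grid of image patches covering a two-column page:
# "L1"/"L2" are text blocks in the left column, "R1"/"R2" in the right column.
grid = [
    ["L1", "L1", "R1", "R1"],
    ["L2", "L2", "R2", "R2"],
]

def raster_order(patch_grid):
    """Flatten patches row by row, from top-left to bottom-right."""
    return [patch for row in patch_grid for patch in row]

def collapse_runs(seq):
    """Collapse consecutive repeats so each visit to a block appears once."""
    out = []
    for item in seq:
        if not out or out[-1] != item:
            out.append(item)
    return out

# Order in which a fixed raster scan encounters the text blocks.
raster_reading = collapse_runs(raster_order(grid))

# Order a human would typically follow: left column first, then right column.
human_reading = ["L1", "L2", "R1", "R2"]

print(raster_reading)  # ['L1', 'R1', 'L2', 'R2']
print(human_reading)   # ['L1', 'L2', 'R1', 'R2']
```

The raster scan hops between columns on every row, which is the kind of ordering the article says a semantics-aware encoder is meant to avoid.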

The DeepSeek research team said the advance comes from rethinking how visual data is conventionally ordered. In complex layouts especially, visual elements often follow logical sequences and hierarchies. Relying solely on spatial order can limit a model’s understanding of the structural and semantic relationships within a document.

To validate their model’s effectiveness, the team conducted extensive testing on the OmniDocBench v1.5 benchmark, which includes a wide variety of Chinese and English documents such as academic papers, magazines, and reports. The benchmark assesses capabilities like text recognition, formula parsing, table reconstruction, and reading order comprehension.

Results from these evaluations are promising. Under a lower visual token budget, DeepSeek-OCR 2 achieved an overall score of 91.09%, a 3.73% improvement over its predecessor. In reading order, the model reduced the edit distance from 0.085 to 0.057 (lower is better), indicating a more precise understanding of document structure and logic.
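
For readers unfamiliar with the metric, the sketch below shows one common way a reading-order edit distance can be computed: a Levenshtein distance over sequences of block labels, divided by the longer sequence length. The normalization and the block labels here are assumptions for illustration; the benchmark's exact scoring procedure may differ.

```python
def edit_distance(pred, ref):
    """Levenshtein distance between two sequences (insert, delete, substitute)."""
    prev = list(range(len(ref) + 1))
    for i, p in enumerate(pred, start=1):
        curr = [i]
        for j, r in enumerate(ref, start=1):
            cost = 0 if p == r else 1
            curr.append(min(
                prev[j] + 1,         # delete p from pred
                curr[j - 1] + 1,     # insert r into pred
                prev[j - 1] + cost,  # substitute p with r, or match
            ))
        prev = curr
    return prev[-1]

def normalized_edit_distance(pred, ref):
    """Edit distance scaled to [0, 1]; 0 means the orders are identical.

    Dividing by the longer length is an assumed normalization.
    """
    if not pred and not ref:
        return 0.0
    return edit_distance(pred, ref) / max(len(pred), len(ref))

# Hypothetical block labels for a two-column page.
reference = ["L1", "L2", "R1", "R2"]   # ground-truth reading order
predicted = ["L1", "R1", "L2", "R2"]   # a raster-style prediction

print(normalized_edit_distance(predicted, reference))  # 0.5
print(normalized_edit_distance(reference, reference))  # 0.0
```

On this scale, a drop from 0.085 to 0.057 means the predicted order needs fewer corrections, relative to its length, to match the reference order.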

Seok Chen

Seok Chen is a mass communication graduate from the City University of Hong Kong.


© 2026 Digital Phablet
