Close Menu
  • Home
  • Education
  • Health
  • National News
  • Politics
  • Relationship & Wellness
  • World News
What's Hot

Watch: Donald Trump shares 'unclassified' video of US bombing Iran's crucial oil hub Kharg Island – The Times of India

March 14, 2026

Amid Anoushka Shankar’s claims saying Rishab was not Pandit Ravi Shankar’s student, old video proving otherwise resurfaces. Watch

March 14, 2026

India Russian Oil Imports: 'White House now begging India': Iran FM Araghchi mocks US U-turn on Russian oil – The Times of India

March 14, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Global News Bulletin
SUBSCRIBE
  • Home
  • Education
  • Health
  • National News
  • Politics
  • Relationship & Wellness
  • World News
Global News Bulletin
Home»National News»DeepSeek’s new AI model can generate 200K pages of training data daily on a single GPU
National News

DeepSeek’s new AI model can generate 200K pages of training data daily on a single GPU

editorialBy editorialOctober 22, 2025No Comments4 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link
DeepSeek’s new AI model can generate 200K pages of training data daily on a single GPU
Share
Facebook Twitter LinkedIn Pinterest Email Copy Link

Chinese AI startup DeepSeek has released a new multimodal AI model, which it said is capable of processing large and complex documents using significantly fewer tokens.

The Huangzhou-based company said that DeepSeek-OCR uses visual perception as a medium to compress text for large language models (LLMs) more efficiently. Both the source code and weights of the model are publicly available via online developer platforms Hugging Face and GitHub. In its research, DeepSeek found that using “vision encoders” to compress text for LLMs would enable them to process massive amounts of text at lower computing costs.

“Through DeepSeek-OCR, we demonstrate that vision-text compression can achieve significant token reduction (7-20×) for different historical context stages, offering a promising direction for addressing long-context challenges in large language models,” the company said in a technical paper accompanying the model’s release.

I quite like the new DeepSeek-OCR paper. It’s a good OCR model (maybe a bit worse than dots), and yes data collection etc., but anyway it doesn’t matter.

The more interesting part for me (esp as a computer vision at heart who is temporarily masquerading as a natural language… https://t.co/AxRXBdoO0F

— Andrej Karpathy (@karpathy) October 20, 2025

The launch of DeepSeek-OCR reflects the company’s continued focus on improving the efficiency of LLMs while driving down the costs of building and using them. The company is said to have taken a similar approach in developing its breakthrough open-weight models V3 and R1, which made waves across the tech industry for achieving performance comparable to cutting-edge models like OpenAI’s o1 at only a fraction of the cost.

Story continues below this ad

What are the components of the new AI model?

With DeepSeek-OCR, the company aims to tackle a key limitation of LLMs: handling long contexts without running into memory limits. Its core hypothesis is that processing text as images can be more computationally efficient than processing raw digital text. The new OCR model serves as a proof-of-concept for this idea.

The model comprises two parts: a 380 million-parameter DeepEncoder used to analyse each image and produce a compressed version of it; and a 570 million-active parameter text generator built on top of another three billion-parameter mixture of experts (MoE) language model.

Component 1: DeepEncoder

380M

Parameters for image analysis & compression

Component 2: Text Generator

570M

Active parameters for text generation

3 Billion

Parameter MoE Language Model Foundation

Indian Express InfoGenIE

vs GOT-OCR2.0

100

Vision tokens used

vs 256 tokens/page

vs MinerU2.0

<800

Vision tokens used

vs 6000+ tokens/page

Superior Efficiency

61% Fewer

Tokens than GOT-OCR2.0 | 87% fewer than MinerU2.0

Indian Express InfoGenIE

DeepSeek’s researchers said that they trained the OCR model with 30 million PDF pages in roughly 100 languages, including 25 million in Chinese and English, along with 10 million synthetic diagrams, five million chemical formulae, and one million geometric figures.

How has DeepSeek performed on benchmarks?

The OCR model is capable of compressing text by up to a factor of ten while retaining 97 per cent of the original information, as per the technical paper. It can be used to process a wide range of document types including plain text, diagrams, chemical formulae, and geometric figures while being able to keep the original formatting, output plain text, and even provide general image descriptions. However, the requirement of ‘vision tokens’ is also likely to vary based on the document size and image resolution.

Story continues below this ad

In sum, DeepSeek-OCR can generate training data for LLMs and vision language models (VLMs) at a scale of more than 200,000 pages per day while running on a single Nvidia A100 GPU.

The OCR model was evaluated on two benchmarks, the OmniDocBench test that is used to evaluate a model’s document parsing capabilities and the Fox benchmark test used to evaluate the focusing capabilities of vision language models on dense PDF documents.

“On OmniDocBench, it surpasses GOT-OCR2.0 (256 tokens/page) using only 100 vision tokens, and outperforms MinerU2.0 (6000+ tokens per page on average) while utilising fewer than 800 vision tokens,” the paper read.

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleMumbai Fire: Midnight blaze in Navi Mumbai high-rise: How 10 residents survived flames in 12-storey building; 4 killed | Navi Mumbai News – The Times of India
Next Article Struck over the head: Israeli settlers attack Palestinian farmers in West Bank; video captures assault on woman – The Times of India
editorial
  • Website

Related Posts

Amid Anoushka Shankar’s claims saying Rishab was not Pandit Ravi Shankar’s student, old video proving otherwise resurfaces. Watch

March 14, 2026

A Rs 9 cr Tamil film made with no ‘big hero’ returns 6x its cost: Radhika Sarathkumar on Thaai Kizhavi’s true success without jacking up ticket prices

March 14, 2026

Inside Shaheen Bagh’s Iftar scene: Sadaf Hussain on Ande ka Halwa, Chapli Kebabs and more

March 14, 2026

RBSE 10th Result 2026 Date: Rajasthan Board Class 10th results by March 20

March 14, 2026

UPSC Key: Impeaching the CEC, Essential Commodities Act invoked, and Patenting heritage crafts

March 14, 2026

Why sovereign AI for India is a strategic hedge against ‘compute divide’

March 14, 2026
Add A Comment
Leave A Reply Cancel Reply

Economy News

Watch: Donald Trump shares 'unclassified' video of US bombing Iran's crucial oil hub Kharg Island – The Times of India

By editorialMarch 14, 2026

Hours after claiming that US forces “totally obliterated” Iran’s key oil hub at Kharg Island,…

Amid Anoushka Shankar’s claims saying Rishab was not Pandit Ravi Shankar’s student, old video proving otherwise resurfaces. Watch

March 14, 2026

India Russian Oil Imports: 'White House now begging India': Iran FM Araghchi mocks US U-turn on Russian oil – The Times of India

March 14, 2026
Top Trending

Watch: Donald Trump shares 'unclassified' video of US bombing Iran's crucial oil hub Kharg Island – The Times of India

By editorialMarch 14, 2026

Hours after claiming that US forces “totally obliterated” Iran’s key oil hub…

Amid Anoushka Shankar’s claims saying Rishab was not Pandit Ravi Shankar’s student, old video proving otherwise resurfaces. Watch

By editorialMarch 14, 2026

4 min readChennaiUpdated: Mar 12, 2026 07:32 AM IST With henna on…

India Russian Oil Imports: 'White House now begging India': Iran FM Araghchi mocks US U-turn on Russian oil – The Times of India

By editorialMarch 14, 2026

Iranian foreign minister Seyed Abbas Araghchi (ANI photo) Iran’s foreign minister Seyed…

Subscribe to News

Get the latest sports news from NewsSite about world, sports and politics.

Facebook X (Twitter) Pinterest Vimeo WhatsApp TikTok Instagram

News

  • Education
  • Health
  • National News
  • Relationship & Wellness
  • World News
  • Politics

Company

  • Information
  • Advertising
  • Classified Ads
  • Contact Info
  • Do Not Sell Data
  • GDPR Policy
  • Media Kits

Services

  • Subscriptions
  • Customer Support
  • Bulk Packages
  • Newsletters
  • Sponsored News
  • Work With Us

Subscribe to Updates

Get the latest creative news from FooBar about art, design and business.

© Copyright Global News Bulletin.
  • Privacy Policy
  • Terms
  • Accessibility
  • Website Developed by Digital Strikers

Type above and press Enter to search. Press Esc to cancel.