A Reddit user has expressed concern about recent changes Google has made to its Gemini models, particularly the increased costs and decreased quality.
Increased Costs, Decreased Quality
The user, who used Gemini 2.0 Flash for OCR (Optical Character Recognition), data extraction, and summarization tasks, noted an approximately sixfold increase in output costs with subsequent versions (Gemini 2.5/3). At the same time, accuracy did not improve and, in some fine-tuning tests, even worsened.
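To give a sense of the kind of workload being described, here is a minimal sketch of an OCR / data-extraction call against a Gemini Flash model using Google's google-genai Python SDK. The model name, prompt, file name, and the per-token prices in the closing comment are illustrative assumptions, not figures taken from the original post.

```python
# Minimal sketch of a document-extraction call against a Gemini Flash model.
# Assumptions: the google-genai SDK is installed (`pip install google-genai`),
# GEMINI_API_KEY is set in the environment, and "invoice.png" is a local scan.
import os

from google import genai
from google.genai import types

client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])

with open("invoice.png", "rb") as f:
    image_bytes = f.read()

response = client.models.generate_content(
    model="gemini-2.0-flash",  # the budget-tier model the post refers to
    contents=[
        types.Part.from_bytes(data=image_bytes, mime_type="image/png"),
        "Extract the invoice number, date, and total amount as JSON.",
    ],
)

print(response.text)

# Rough cost intuition (illustrative prices, not from the post): at roughly
# $0.40 vs $2.50 per 1M output tokens, a sixfold jump turns a $40/month
# output bill into about $250/month for the same volume.
```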
End of the Budget Model
The cheapest option, Gemini 2.5-flash-lite, has been marked as EOL (End Of Life) without a 3-flash-lite successor being announced. This makes long-term planning difficult for those who rely on these models for specific tasks.
Search for Alternatives
The user has opened a ticket with Google to request an EOL extension for Gemini Flash models or a low-cost successor. In the meantime, they are looking for an alternative LLM for OCR and data extraction that offers simple and managed fine-tuning at a reasonable price.
On-premise deployments come with their own trade-offs. AI-RADAR offers analytical frameworks on /llm-onpremise to evaluate these aspects.