Google Cloud adds quantitative models for science, admitting LLMs can’t do math

Large language models, the kind that power most of today’s AI industry, are remarkably good with words and surprisingly unreliable with numbers. Anyone who has asked them to solve a differential equation or a numerical optimization problem knows this: linguistic statistics are not enough when mathematical rigor matters. Google Cloud’s latest move openly acknowledges this, adding specialist models built for science to its marketplace.

Beyond the domain of words

The company has announced the integration of “large quantitative models” (LQMs) developed by SandboxAQ, a spin-off from Alphabet. These models are not just variants of LLMs; they are architectures designed to digest equations, laboratory data, and physical simulations. The aim is to pair them with Gemini, Google’s generalist model, to cover domains where language understanding alone falls short. It’s a clear signal: the generative AI race is pushing toward vertical specialization, where the “one model to rule them all” mindset gives way to families of models optimized for specific tasks.

LQMs are not disguised language models. They operate on quantitative representations, are trained on heterogeneous numerical datasets, and can tackle problems in computational chemistry, materials science, or fluid dynamics. They don’t generate text — they produce quantitative predictions. This fundamentally shifts the computational load profile: no longer just tokens per second, but high-precision floating-point operations, often with memory and bandwidth requirements reminiscent of supercomputers rather than LLM inference servers.

The deployment dilemma: cloud versus on-premise

Bringing these models to Google Cloud’s marketplace instantly makes available computational capabilities that were once confined to labs with dedicated HPC infrastructure. Yet for many scientific organizations, cloud processing raises unresolved issues. Experimental data, intellectual property on new materials, simulations tied to sensitive projects — in these contexts, data sovereignty is non-negotiable. GDPR and similar regulations impose strict localization constraints, while transfer and storage costs can drive total cost of ownership (TCO) up unpredictably.

Those evaluating on-premise deployment for scientific models, however, face unconventional hardware demands. Consumer GPUs or standard workstations won’t cut it: you need accelerators with native FP64 support, ample VRAM, fast interconnects, and often parallel storage for terabyte-scale datasets. The gap between a cloud license fee and the cost of a local cluster must be carefully weighed, factoring in upfront CapEx, energy consumption, and the expertise required to orchestrate complex pipelines. The decision is never just technological — it’s strategic.

A market growing in sophistication

The move by Google Cloud and SandboxAQ signals that the AI ecosystem is maturing beyond the chatbot bubble. Demand for scientific research tools is pushing vendors to offer alternative models, integrable via standard APIs but built on different mathematics. For teams working on simulation, drug discovery, or climate modeling, the arrival of LQMs on a cloud marketplace adds a new option, but it doesn’t eliminate the need to carefully assess where workloads should run. The debate between cloud flexibility and on-premise control, here more than elsewhere, is bound to remain open.