Hardware for Local Intelligence
Benchmarks, GPU sizing guides, and workstation builds for sovereignty.
AMD ROCm 7.2 Released with Extended Radeon Graphics Card Support
AMD has released ROCm 7.2, a significant update to its open-source GPU compute stack. The new version extends support to more Radeon graphics...
PyTorch 2.10: Optimizations and Numerical Debugging
The new PyTorch 2.10 release introduces significant improvements in performance and tools for numerical debugging. Key features include...
Trump promises nuclear datacenter permits in 3 weeks
Donald Trump promised to expedite permits for nuclear-powered data centers. Jensen Huang, CEO of Nvidia, presented his vision of AI at Davos.
PyTorch 2.10 Released With More Improvements For AMD ROCm & Intel GPUs
PyTorch 2.10 is out today as the latest feature update to this widely-used deep learning library. The new PyTorch release continues improving...
Intel axes 12th Gen Alder Lake and 4th Gen Xeon Sapphire Rapids
Intel has announced the end-of-life (EOL) for its 12th Generation Alder Lake and 4th Generation Xeon Sapphire Rapids processors. Customers will...
Nvidia Dethrones Apple as TSMC’s Largest Customer
Nvidia CEO Jensen Huang confirmed that his company has overtaken Apple as TSMC's biggest customer, becoming its top client after more than 20...
NVIDIA GB10 CPU Performance Challenged AMD Ryzen AI Max+ in Linux Tests
The NVIDIA GB10 superchip, designed for AI, has been tested in traditional Linux scenarios to evaluate its CPU performance. Phoronix benchmarks...
OpenAI aims to ship its first device in 2026, and it could be earbuds
OpenAI is on track to announce its first hardware device, possibly earbuds, in 2026. OpenAI Chief Global Affairs Officer Chris Lehane said that...
GLM 4.7: How to Run with llama.cpp and Flash Attention
Here's how to get GLM 4.7 working on llama.cpp using Flash Attention for improved performance. The guide includes configuration details and a link...
Nvidia CEO Jensen Huang to visit China as H200 shipments loom
Nvidia CEO Jensen Huang is heading to China in late January for a customary Lunar New Year visit. The trip gains importance as it coincides with...
Fix for GLM 4.7 Flash Merged into llama.cpp
A fix for an issue related to GLM 4.7 Flash has been merged into llama.cpp. In parallel, FA (Fused Attention) support for CUDA is under...
Customer Buys RTX 5080, Receives Relabelled RTX 5060 Ti
An Amazon customer was scammed: instead of an RTX 5080 graphics card, they received a relabelled RTX 5060 Ti. The package was sold and shipped by...
Linux: One Line of Code Reduces Latency on Xeon CPUs by 5x
A Linux kernel patch aims to significantly reduce wake-up latency on modern Intel Xeon servers. The modification, involving a single line of code,...
OpenAI sets 2026 as year for practical AI adoption, eyes hardware debut and new revenue streams
OpenAI has set 2026 as the key year for the widespread adoption of truly usable artificial intelligence solutions. The company is also looking at...
Fracttal raises $35M to expand AI-driven maintenance
Fracttal, a Madrid-based company specializing in AI-powered maintenance solutions, has closed a $35 million funding round led by Riverwood...
Building an LM from Scratch: Day 6 Update
An enthusiast shares progress on building a language model (LM) from scratch. After stabilizing the system, the focus shifted to training,...
AdaFRUGAL: Adaptive Memory-Efficient Training with Dynamic Control
A new framework, AdaFRUGAL, promises to drastically reduce memory consumption and training times for large language models (LLMs). Through dynamic...
Intel recruits Qualcomm GPU chief to lead future AI PC efforts
Intel has recruited a former Qualcomm GPU executive to lead its future AI PC efforts. This strategic move aims to strengthen Intel's position in...
OpenAI reportedly charts five hardware devices, starting with 'Sweetpea' audio product
OpenAI is reportedly planning to enter the hardware market with a series of devices. The first in line is said to be 'Sweetpea', an audio product....
China's AI industry reshapes as GPUs rise to be core strategic asset
China's artificial intelligence sector is undergoing a profound transformation, with GPUs taking on an increasingly central role as strategic...
Nvidia unveils Alpamayo platform for L4 self-driving
Nvidia has announced Alpamayo, a new platform designed for the development of Level 4 self-driving vehicles. The platform aims to provide car...
Nvidia challenges Apple's longtime TSMC priority
Nvidia aims to displace Apple as TSMC's priority customer. The competition to secure TSMC's manufacturing capabilities is intensifying, with...
Anthropic's CEO Criticizes Nvidia and US over China Chip Exports
Anthropic CEO Dario Amodei has strongly criticized Nvidia and the US administration regarding the sale of chips to China. The statements are...
Inventec doubles 2026 capex to US$1 billion for AI servers
Inventec has announced a doubling of its planned capital expenditure for 2026, bringing it to US$1 billion. The decision is driven by growing...
Anthropic CEO: Selling H200s to China like giving nukes to North Korea
Anthropic CEO Dario Amodei isn’t happy about the US allowing Nvidia to sell GPUs to Chinese companies, and likened the decision to giving nuclear...
AMD Making It Easier To Install vLLM For ROCm
AMD has introduced a simpler method for installing vLLM on Radeon/Instinct hardware via ROCm. A new Python wheel facilitates installation without...
GLM-4.7-Flash: impressive benchmarks on H200 and RTX 6000 Ada
The GLM-4.7-Flash model demonstrates remarkable performance in new benchmarks. On a single H200 GPU, it achieves a peak throughput of 4,398 tokens...
Linux 7.0: Intel GPU Firmware Updates on Non-x86 Systems Ready
Support for updating Intel discrete GPU firmware on non-x86 systems is coming with Linux 7.0. The necessary patches are ready for integration into...
Windows 11, not AI, kick-started the PC upgrade cycle
In 2025, corporate IT hardware upgrades were driven by the necessity to maintain support, rather than excitement for new AI-related features. IT...
LocalLLaMA: The unstoppable rise of local language models
A Reddit post highlights the surprising capabilities of language models running locally with LocalLLaMA. The discussion emphasizes how these...
Looking for general AI news?
< AI-RADAR MAIN