Hardware for Local Intelligence
Benchmarks, GPU sizing guides, and workstation builds for sovereignty.
Musk pushes Terafab as AI chip crunch intensifies
Elon Musk is considering the creation of a chip factory, named Terafab, to address the increasing shortage of semiconductors needed for artificial...
Adata invests US$3 million in KonstTech to boost AI computing infrastructure
Adata has invested US$3 million in KonstTech to boost its AI computing infrastructure. The investment aims to strengthen KonstTech's capabilities...
Innodisk says AI success depends on software-hardware integration
Innodisk emphasizes that the integration between hardware and software is crucial for the success of artificial intelligence implementations,...
AI Gets Scary Good: A Behind-the-Scenes Look at STH
An exclusive preview from ServeTheHome (STH) offers a behind-the-scenes look at the company. The original article, titled 'AI Got Scary Good',...
AMD GAIA: Web UI for Local AI Agents with Privacy-First Approach
AMD has released a new version of GAIA, the AI agent framework for Ryzen AI hardware. Version 0.17 introduces Agent UI, a privacy-focused web...
Meta to fund natural gas power plants for Louisiana AI data center
Meta partners with Entergy to build seven new natural gas power plants. The goal is to deliver 7 gigawatts of power to its planned AI data center...
Chinese universities performing military research acquired Super Micro servers with sanctioned Nvidia AI chips
Public documents reveal that Chinese universities involved in military research acquired Super Micro servers equipped with Nvidia chips subject to...
Ambitious modder bolts a 360mm server AIO onto an RTX 3080, slashes VRAM temps in half
An enthusiast successfully mounted a 360mm server AIO cooler on an RTX 3080 graphics card, achieving a near 50% reduction in VRAM temperatures and...
Minisforum AI X1 Pro 470 review: AMD's Gorgon Point in a sleek mini PC desktop
Review of the Minisforum AI X1 Pro 470, a mini PC desktop integrating the AMD Gorgon Point platform. This compact device is designed for...
Aivres Showcases NVIDIA Vera Rubin at NVIDIA GTC 2026
Aivres showcased NVIDIA Vera CPUs and Rubin GPUs at NVIDIA GTC 2026. Blackwell Ultra and BlueField-4 DPUs were also on display. The event offered...
M5 Max vs M3 Max Inference Benchmarks: Qwen3.5 on MacBook Pro
Inference performance comparison of Qwen 3.5 models on 16-inch MacBook Pro, equipped with M5 Max and M3 Max chips (40 GPU cores, 128GB unified...
Holy Stone Enterprise expands Japan and Taiwan capacity, signaling tighter MLCC supply for AI power supplies
Holy Stone Enterprise is expanding its production capacity in Japan and Taiwan. This move signals a potential tightening of the supply of...
Google TurboQuant running Qwen 3.5 Locally on MacBook Air
An experiment demonstrates how Google's TurboQuant algorithm enables running the Qwen 3.5–9B model with a 20000 token context window on a MacBook...
Google's TurboQuant-v3: LLM Weight Compression on Consumer GPUs
Google introduces TurboQuant-v3, a technique for compressing the weights of large language models (LLMs), reducing VRAM usage and accelerating...
Testing DirectStorage with GPU decompression — do Blackwell GPUs have the upper hand?
A DirectStorage test with GPU decompression raises questions about the performance of future Blackwell GPUs. The article analyzes the implications...
Microsoft tightens Windows kernel security
Microsoft is tightening requirements for Windows kernel drivers, excluding those not compliant with the Windows Hardware Compatibility Program...
Apple Still Plans to Sell iPhones When It Turns 100
As the tech giant turns 50, WIRED spoke to executives about how they plan to win in the AI era. The company is looking to the future, planning to...
Intel Xe Driver Improves Memory Pressure / Out-Of-Memory Behavior For vRAM With Linux 7.1
The Intel Xe graphics driver for Linux is set to receive significant updates with the arrival of kernel 7.1. The changes involve a new user-space...
Local LLMs in Manufacturing: An Underrated Use Case
The use of large language models (LLMs) in industrial environments, directly in factories, is emerging as a high-value, yet under-discussed...
GLM-5.1 Released: Hope for Open Source Version
The release of GLM-5.1 has been announced. The open-source community hopes for an open-source release of the model. No further technical details...
GLM 5.1 Released: Updates for Language Models
Version 5.1 of GLM, a language model, has been released. The announcement was shared via the LocalLLaMA online community, a forum dedicated to...
AMD ROCm 7.12 Tech Preview Brings More Consumer APU & GPU Support
AMD has released ROCm 7.12 as the newest tech preview, working towards the presumed ROCm 8.0 release. This release extends support to a greater...
VibeVoice 9B: New open-source benchmark for medical STT
A recent study benchmarked 31 speech-to-text (STT) models on medical audio. Microsoft's VibeVoice-ASR 9B stands out as the open-source leader with...
AMDGPU Driver For Linux 7.1: Debug Improvements, New Hardware IP
Ahead of Linux 7.1, updates have been released for the AMDGPU/AMDKFD kernel driver. These updates primarily focus on debug improvements and the...
Intel Arc Pro B70: Preliminary Testing Results and Performance
Preliminary testing results for the Intel Arc Pro B70 graphics card have surfaced, focusing on performance in mixed usage scenarios, including...
Taiwan's ALi bets on custom chips for 2026 turnaround
Taiwan's ALi (Acer Laboratories Inc.) is investing in the development of custom chips with the goal of a turnaround by 2026. The strategy focuses...
Google TurboQuant: LLM memory reduced by 6x, AI inference cost curve reset
Google introduces TurboQuant, a technique that promises to drastically reduce the memory footprint of large language models (LLMs), with a...
SK Hynix keeps HBM shipments steady, targets HBM4E sample this year
SK Hynix keeps HBM (High Bandwidth Memory) shipments steady and plans to release the first HBM4E samples by the end of the year. The Nvidia Vera...
AI drives semiconductor market to US$1.8 trillion by 2030
The semiconductor market is poised for exponential growth, reaching US$1.8 trillion by 2030, primarily driven by artificial intelligence. China is...
TSMC prioritises AI, core clients; 3nm capacity remains constrained
Semiconductor manufacturer TSMC is prioritizing chip production for artificial intelligence applications and its core clients. The 3nm production...
Looking for general AI news?
< AI-RADAR MAIN