📁 Hardware

This Hardware archive tracks the practical side of local AI infrastructure: GPUs, NPUs, mini PCs, edge accelerators, memory bandwidth, and the power-efficiency tradeoffs that directly affect LLM inference. We prioritize benchmark-backed updates and deployment notes that are useful for real build decisions, from compact home labs to enterprise pilot clusters. Use this stream to compare total cost of ownership, thermal constraints, and model-fit scenarios across current devices, then go deeper with our hardware pillar guide and related LLM coverage.

SanDisk has relaunched its Optimus SSD line with PCIe 5.0 models in 2TB and 4TB capacities. The new Optimus GX Pro 8100 drives start at $999 for the 2TB model and $1,799 for the 4TB version, a 5% price increase over previous models. Older WD Black versions remain a cheaper alternative.

2026-02-07 Source

Venture capital firm Benchmark Capital has announced a $225 million investment in Cerebras Systems, a manufacturer of processors dedicated to artificial intelligence. Benchmark has been an investor in Cerebras since 2016, supporting the development of alternative solutions to Nvidia's GPUs.

2026-02-07 Source

GPUs and accelerators use specialized engines for matrix multiplication (GEMM). This article analyzes the precision of accumulators in these engines, revealing that, for hardware efficiency reasons, the effective precision may be lower than expected. A method for verifying accumulator precision using Triton is presented, with results showing how precision can vary depending on the hardware and configuration.
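
The idea of probing accumulator precision can be illustrated without any GPU. The sketch below is not the article's Triton method; it is a pure-Python simulation under stated assumptions: the helper names (`round_to_mantissa`, `accumulate`, `probe_mantissa_bits`) are hypothetical, exponent range and subnormals are ignored, and the "accumulator" is modeled simply by rounding the running sum to a chosen number of mantissa bits after each addition.

```python
import math

def round_to_mantissa(x, bits):
    """Round x to `bits` explicit mantissa bits (round-to-nearest, ties-to-even).
    Exponent range and subnormals are ignored in this simplified model."""
    if x == 0.0 or not math.isfinite(x):
        return x
    m, e = math.frexp(x)            # x == m * 2**e with 0.5 <= |m| < 1
    scale = 2 ** (bits + 1)         # explicit bits plus the implicit leading bit
    return math.ldexp(round(m * scale) / scale, e)

def accumulate(values, bits):
    """Sum values while rounding the running total to `bits` mantissa bits,
    mimicking an accumulator narrower than the inputs might suggest."""
    acc = 0.0
    for v in values:
        acc = round_to_mantissa(acc + v, bits)
    return acc

def probe_mantissa_bits(add, max_bits=52):
    """Estimate the effective mantissa width of a black-box `add(a, b)`:
    find the largest k for which adding 2**-k to 1.0 still changes the result."""
    for k in range(1, max_bits + 1):
        if add(1.0, 2.0 ** -k) == 1.0:
            return k - 1
    return max_bits

# One large term followed by many small ones: a narrow accumulator drops them.
inputs = [1.0] + [2.0 ** -12] * 4096
wide = accumulate(inputs, 23)     # FP32-like mantissa width
narrow = accumulate(inputs, 10)   # FP16-like mantissa width
print(wide, narrow)

# Recover the simulated width by treating the adder as a black box.
print(probe_mantissa_bits(lambda a, b: round_to_mantissa(a + b, 23)))
```

On real hardware the same kind of input pattern would be fed through the matrix engine itself rather than a simulated adder; the simulation only shows why small contributions vanish once the running sum outgrows the accumulator's mantissa.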

2026-02-06 Source

New Linux benchmarks examine the performance of Intel's Panther Lake Core Ultra X7 358H CPU with a higher power budget. The tests reveal significant generational improvements, particularly in energy efficiency, and confirm the excellent performance of the Intel Arc B390 GPU with open-source drivers.

2026-02-06 Source

AMD continues the development of its LLVM compiler stack for future GPUs. A new target, GFX1170, also identified as RDNA 4m, has been introduced. This update adds to the ongoing work on GFX1250 and GFX13 targets, expanding support for AMD's upcoming graphics architectures.

2026-02-06 Source

Qualcomm is making it easier to use Snapdragon X1 Elite on Linux. Previously, necessary firmware files had to be fetched from the Windows 11 on ARM partition. Now, QUPv3 firmware bits have been integrated into the linux-firmware.git repository, greatly simplifying the process for Linux users.

2026-02-06 Source

MetaOptics, headquartered in Singapore and maintaining close ties with Taiwan, is developing heat-resistant metalenses for integration into CPUs. This technology could significantly improve the thermal management of processors.

2026-02-06 Source

A user reported a significant throughput increase, up to 26 tokens/second, using the Qwen3-Coder-Next-Q4_K_S model with llama.cpp on an RTX 5090. The optimization was achieved by offloading MoE expert tensors to the CPU and quantizing the KV cache.
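
As a rough illustration of that kind of setup, the sketch below assembles a llama.cpp `llama-server` command line that keeps MoE expert tensors on the CPU and quantizes the KV cache. The user's actual settings are not given in the summary, so every value here is an assumption: the model filename, the `q8_0` cache type, the layer count, and the tensor-name pattern are illustrative only, and `build_llama_server_args` is a hypothetical helper.

```python
import shlex

def build_llama_server_args(model_path, kv_type="q8_0", gpu_layers=99):
    """Assemble an argv list for llama.cpp's llama-server (illustrative values only)."""
    return [
        "llama-server",
        "-m", model_path,
        "-ngl", str(gpu_layers),         # offload layers to the GPU...
        "-ot", r"\.ffn_.*_exps\.=CPU",   # ...but override MoE expert tensors to stay on the CPU
        "-ctk", kv_type,                 # quantized K cache
        "-ctv", kv_type,                 # quantized V cache
    ]

# Hypothetical model filename, used only to show the resulting command.
args = build_llama_server_args("Qwen3-Coder-Next-Q4_K_S.gguf")
print(shlex.join(args))
```

Depending on the llama.cpp build, a quantized V cache may also require flash attention to be enabled, and the best split between GPU and CPU tensors depends on available VRAM, so the values above are a starting point rather than a recommendation.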

2026-02-06 Source

South Korea is making significant investments in artificial intelligence, supported by a hardware infrastructure powered by over 260,000 Nvidia GPUs. This strategic move aims to position the country as a leader in the AI sector, with a focus on advanced applications and research.

2026-02-06 Source

A Reddit user benchmarked the Strix Halo's iGPU, testing various software configurations with 13 LLM models and 15 different llama.cpp builds. The aim was to evaluate the impact of ROCm, Vulkan, and various compilation options on inference performance.

2026-02-05 Source

Nvidia is reportedly developing DLSS 4.5, an advanced version of its upscaling technology that could eliminate the need for denoisers in ray tracing. This is thanks to a Transformer model that reconstructs ray-traced reflections more accurately.

2026-02-05 Source

The first Linux benchmarks are in for the Intel Arc B390 GPU, which is integrated into high-end Panther Lake models. The Xe3 GPU, equipped with 12 Xe cores, promises interesting performance in desktop and mobile environments for graphics and compute workloads.

2026-02-05 Source