Hardware – AI News & Articles

📁 Hardware AI generated

Hybrid LLM Architectures and the CPU Bottleneck: The Qwen 27B Case on RTX 3090 Ti

A user experienced lower-than-expected Inference performance with Qwen 3.6 27B on an RTX 3090 Ti. Analysis revealed that the model's hybrid SSM architecture requires significant CPU processing per token, creating a bottleneck on older processors lacking AVX-VNNI/AVX-512 instructions. This highlights the importance of careful hardware evaluation for on-premise LLM deployments.

2026-04-30 Fonte

📁 Hardware AI generated

Google and the Future of AI Chips: The Shift Towards Specialized Accelerators

Google is shifting the development of its TPU chips towards more specialized solutions, moving away from a universal approach. This evolution reflects a trend in the AI industry favoring efficiency and performance for specific workloads, with significant implications for on-premise deployment strategies and TCO evaluation.

2026-04-30 Fonte

📁 Hardware AI generated

AMD and Linux: New Patches to Optimize Page Migration and Performance

AMD has released new patches for the Linux kernel, aimed at accelerating page migration. This work, originally started by NVIDIA, is now being continued by AMD engineers, leveraging batch copies and hardware offloading to significantly improve performance. The initiative is crucial for optimizing memory usage in GPU architectures, with positive implications for intensive workloads, especially in on-premise deployments.

2026-04-30 Fonte

📁 Hardware AI generated

NVIDIA Working on ACPI CPPC v4 Support for Linux: Optimizing On-Premise CPU Performance

NVIDIA engineers are developing ACPI CPPC v4 support for the Linux `acpi_cppc` driver. This revision of the ACPI 6.6 standard aims to enhance the operating system's management of CPU core performance using an abstract scale. Optimizing collaborative processor control is crucial for on-premise infrastructures, impacting energy efficiency and TCO for AI workloads, where every hardware component contributes to overall performance and operational sustainability.

2026-04-30 Fonte

📁 Hardware AI generated

Mosaic SoC Raises $3.8M for Low-Power Perception Chips for Spatial Computing

Mosaic SoC has raised $3.8 million in a Pre-Seed round to develop dedicated perception chips. These components aim to bring real-time spatial intelligence to energy-constrained devices, such as smart glasses and smartphones, enabling new form factors and persistent AI features directly at the edge. The company differentiates itself with a proprietary multi-core architecture, designed to maximize performance per watt.

2026-04-30 Fonte

📁 Hardware AI generated

Korea's 'father of HBM' foresees AI memory surge as Google's TurboQuant faces real-world tests

Professor Kim Jung-ho of KAIST, known as the "father of HBM," has made a significant prediction: the demand for AI memory could increase a thousandfold. Concurrently, Google's TurboQuant technology is undergoing rigorous real-world tests. These developments highlight the crucial importance of advanced memory solutions and optimization techniques for managing growing AI workloads, with significant implications for both on-premise and cloud deployments.

2026-04-30 Fonte

📁 Hardware AI generated

Powertech Raises Stakes: $1.6 Billion for AI Packaging and Sector Growth

Powertech, a leading Taiwanese Outsource Semiconductor Assembly and Test (OSAT) company, has announced a significant increase in its capital expenditure, reaching $1.6 billion. This initiative aims to boost production capacity in the AI component packaging sector, a strategic area for the evolution of artificial intelligence and for infrastructures supporting on-premise LLM workloads.

2026-04-30 Fonte

📁 Hardware AI generated

Automotive Architecture Innovation: Rising Semiconductor Demand

Despite a slump in vehicle sales, innovation in automotive architectures is raising technological barriers for chips. This trend is driving greater semiconductor integration, highlighting the increasing complexity and computing needs in the sector, with direct implications for companies' deployment strategies and TCO.

2026-04-30 Fonte

📁 Hardware AI generated

Taiwan's Episil triples capex, focusing on silicio photonics for AI

Taiwanese wafer manufacturer Episil has announced a significant increase in capital expenditure (capex), tripling its investment to accelerate the development and production of silicio photonics solutions. This strategic move aims to support the growing demand for high-performance AI infrastructure, underscoring the importance of advanced connectivity and power efficiency technologies for future AI deployments.

2026-04-30 Fonte

📁 Hardware AI generated

Taiwan's Optics Industry Finds a New Role in the AI Imaging Boom

Taiwan's optics industry is strategically redefining its role within the AI imaging ecosystem. This evolution highlights the critical importance of advanced hardware components for visual data acquisition and processing, a key consideration for enterprises evaluating on-premise or edge deployments. The convergence of optics and artificial intelligence presents new challenges and opportunities for data sovereignty and infrastructural control, central aspects for technical decision-makers.

2026-04-30 Fonte

📁 Hardware AI generated

Samsung Highlights Stable 4nm Tech Amid Growing AI, Automotive Demand

Samsung has emphasized the stability of its 4-nanometer process technology, highlighting its crucial role in meeting the increasing demand from the artificial intelligence and automotive sectors. The ability to produce reliable and high-performing chips at this scale is fundamental for developing advanced solutions, both for on-premise data centers and edge applications.

2026-04-30 Fonte

📁 Hardware AI generated

Lightelligence Lists in Hong Kong, CPO Commercialization in Focus for AI

Lightelligence, a Chinese photonics chipmaker, has completed its listing in Hong Kong. The company is focusing on the commercialization of Co-Packaged Optics (CPO), a crucial technology for next-generation AI infrastructures. This move highlights the increasing importance of integrated optical solutions for handling intensive LLM workloads, offering advantages in throughput and latency for on-premise deployments.

2026-04-30 Fonte

📁 Hardware AI generated

Intel Lunar Lake: CPU Performance Gains on Linux

This analysis focuses on the evolution of Intel Lunar Lake CPU performance on Linux systems. Following an examination of Xe2 integrated graphics performance gains, attention now shifts to the processor's computational capabilities. Benchmarks, conducted over a one-year period starting from April 2025, aim to outline how CPU performance has developed in this operating environment, offering insights for those evaluating hardware for on-premise workloads.

2026-04-29 Fonte

📁 Hardware AI generated

Sanctioned Chinese AI Firm SenseTime Releases Image Model Optimized for Speed and Chinese Chips

Despite US restrictions limiting its access to advanced technology, Chinese AI firm SenseTime has launched a new image model. The model is designed for speed and optimized to run on Chinese-made chips, highlighting a strategic pivot towards Open Source initiatives.

2026-04-29 Fonte

📁 Hardware AI generated

llama.cpp: Native NVFP4 Accelerates Prompt Processing on Blackwell

A recent llama.cpp benchmark reveals that native NVFP4 support significantly improves prompt processing performance (up to 68%) for the Qwen3.6-27B-NVFP4 model on an NVIDIA RTX 5090 GPU. Token generation speed remains unchanged. This advantage is crucial for on-premise workloads requiring rapid ingestion of long contexts, such as RAG and document analysis.

2026-04-29 Fonte

📁 Hardware AI generated

The "Silicio Lottery": Unexpected Variability in Cloud GPU Performance

Joint research reveals significant performance variations among GPUs of the same model, a phenomenon known as the "silicio lottery." This impacts the value of renting cloud resources for AI workloads, with differences up to 38% in memory bandwidth for H200 SXM GPUs. The primary cause lies in manufacturing variations of the chips themselves, making benchmarking rented instances an essential practice.

2026-04-29 Fonte

📁 Hardware AI generated

Framework's New RTX 5070 12GB Graphics Module Debuts at $1,199

Framework has introduced a new RTX 5070 graphics module with 12GB of VRAM, priced at $1,199. This represents a 72% increase over the previous 8GB version, which cost $699. The company stated that the module's final cost is influenced by external factors, highlighting the challenges in the supply chain and hardware pricing within the industry.

2026-04-29 Fonte

📁 Hardware AI generated

Qwen3.6 27B on Dual RTX 5060 Ti 16GB: On-Premise Performance Analysis

A detailed analysis explores the capabilities of the Qwen3.6 27B model on a local setup featuring two NVIDIA RTX 5060 Ti 16GB GPUs. Tests show performance of approximately 60-66 tokens per second and the ability to handle an extended context window up to 204,800 tokens, albeit with very tight VRAM margins. This study provides concrete insights for those evaluating on-premise LLM deployment with mid-range hardware.

2026-04-29 Fonte

📁 Hardware AI generated

Palit Centralizes Galax Management: GPU Brand Continues Operations

Palit Group has announced an internal reorganization centralizing the management of its GPU brand, Galax. Despite the change, the company confirmed that the Galax brand, known for its high-performance graphics cards like the HOF line, will continue to operate in the market. This move, described as "pre-planned," aims to optimize operations under the Palit Group umbrella, ensuring continuity for customers and the industry.

2026-04-29 Fonte

📁 Hardware AI generated

Intel 18A: Wafer Optimization Boosts Revenue and CPU Availability

New details reveal how Intel is increasing revenue per wafer through careful production optimization. According to analyses, a reduction in yield variability across each wafer, particularly for the 18A node, allows for a greater number of marketable CPUs, thereby improving the efficiency and profitability of the manufacturing process.

2026-04-29 Fonte