Topic / Trend Rising

AI Hardware & Infrastructure

This trend covers the intense development, manufacturing, and deployment of specialized AI hardware like GPUs, CPUs, and memory, alongside the expansion and optimization of data centers and cooling solutions crucial for AI workloads.

Detected: 2026-04-02 · Updated: 2026-05-26

Related Coverage

2026-05-26 DigiTimes

InPsytech and the Chiplet Era: AI Connectivity in the Spotlight

The rise of chiplet technology is highlighting the crucial importance of advanced connectivity solutions for artificial intelligence. In this scenario, Taiwanese company InPsytech positions itself as a key player, thanks to its specialized intellectu...

#Hardware #LLM On-Premise #DevOps
2026-05-26 DigiTimes

Wonderful Hi-Tech Sees Rising Demand from Satellites and AI Data Centers

Wonderful Hi-Tech is experiencing a significant surge in demand, driven by the expansion of AI-dedicated data centers and low-orbit satellite connectivity projects. This trend highlights the growing need for robust and high-performance infrastructure...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-26 DigiTimes

AMD and Nvidia Deepen Investments in Taiwan's Semiconductor Ecosystem

AMD and Nvidia are increasing their investments in Taiwan's semiconductor ecosystem. This strategic move highlights the island's central role in advanced chip manufacturing, which is crucial for the development and deployment of Large Language Models...

#Hardware #LLM On-Premise #DevOps
2026-05-26 DigiTimes

Intel's Rio Rancho Fab: A Testbed for AI-Era Chip Packaging Innovation

Intel's Rio Rancho facility is emerging as a critical hub for developing advanced chip packaging technologies, essential for meeting the growing demands of artificial intelligence. This hardware innovation is vital for companies evaluating on-premise...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-26 Phoronix

Meta Relaunches CacheLib: An Answer to Soaring DRAM Costs in the AI Era

Meta has released a new version of CacheLib, its open-source caching engine, after a two-year hiatus. This move comes amid "astronomical" DRAM costs in 2026, exacerbated by the increasing demand linked to AI. CacheLib, originally designed to optimize...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-26 DigiTimes

Taiwan MLCC Makers Boost Production Amid AI Server Demand

Taiwanese multi-layer ceramic capacitor (MLCC) manufacturers are responding to increasing demand for essential components in AI servers. This trend highlights the critical role of the hardware supply chain for AI infrastructure, with direct implicati...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-26 DigiTimes

Intel CEO in Taiwan: Strategic AI Hardware Moves Pre-Computex

Lip-Bu Tan, Intel's CEO, is visiting Taiwan for a series of closed-door meetings ahead of Computex. This mission underscores the island's crucial role in the global semiconductor supply chain and its strategic implications for the AI hardware market,...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-26 DigiTimes

Micron Boosts US DDR4 Production: AI Demand Tightens Global Supply

Micron has announced an expansion of its DDR4 memory production capacity in the United States. This strategic move responds to increasing global demand, largely driven by artificial intelligence applications, which is keeping the supply of essential ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-26 DigiTimes

AI Demand Surge Threatens to Deepen Global MLCC Shortages

Holy Stone Enterprise forecasts a worsening global shortage of Multi-Layer Ceramic Capacitors (MLCCs), driven by the surge in power demand for artificial intelligence. This situation could have significant repercussions on the availability of critica...

#Hardware #LLM On-Premise #DevOps
2026-05-26 DigiTimes

Castrol Enters AI Data Center Liquid Cooling: Focus on Testing and Services

Castrol, a company known in the lubricants sector, is expanding into the liquid cooling market for AI-dedicated data centers. The initiative involves offering testing and lifecycle management services for these solutions. This move highlights the inc...

#Hardware #LLM On-Premise #DevOps
2026-05-26 DigiTimes

Powerchip Unveils 3D AI Foundry with 3D WoW DRAM Stacking at COMPUTEX 2026

Powerchip announced its 3D AI Foundry, a new manufacturing capability integrating Wafer-on-Wafer (WoW) DRAM stacking, at COMPUTEX 2026. This innovation aims to enhance the performance and efficiency of AI-dedicated chips, offering significant potenti...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-25 DigiTimes

Nvidia's Vera CPU: A New Front in the Data Center Chip Race

Nvidia is intensifying competition in the data center chip sector with the introduction of its Vera CPU. This move marks a new front in the hardware innovation race, where the integration between CPU and GPU becomes crucial for performance and energy...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-25 404 Media

Growing Opposition to Data Centers: An Obstacle for AI Infrastructure

The expansion of data centers, crucial for AI, faces increasing bipartisan opposition in the United States. Local communities and states are introducing moratoriums and construction bans, citing concerns about energy and water consumption, noise, and...

#Hardware #LLM On-Premise #DevOps
2026-05-25 DigiTimes

CATL Considers deepSeek Stake: A Signal for AI and Infrastructure

Battery giant CATL is reportedly considering an investment in AI startup deepSeek. This move highlights the growing importance of artificial intelligence across diverse sectors and raises questions about deployment strategies for AI companies, partic...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-25 DigiTimes

Edge AI Accelerates Demand for Edge Computing and the IPC Industry

The growing adoption of Artificial Intelligence solutions directly on physical hardware, particularly for edge computing, is driving demand for edge infrastructure. This phenomenon positively impacts order visibility for Industrial PC (IPC) manufactu...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-25 DigiTimes

Kawasaki Opens Physical AI Center in Silicon Valley, Deepening Nvidia Ties

Kawasaki has inaugurated a new artificial intelligence center in Silicon Valley. This initiative, highlighting the company's commitment to the AI sector, aims to further consolidate its collaboration with Nvidia, a key player in the development of ha...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-25 DigiTimes

The New Frontier in the AI Chip War: Nvidia and AMD's Strategic Moves

Nvidia and AMD are redefining their strategies in the artificial intelligence chip market. Nvidia's reporting pivot and AMD's $10 billion investment in Taiwan signal a crucial phase in the competition for AI hardware dominance, with direct implicatio...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-25 DigiTimes

Nvidia's Vera CPU Push: A Boost for LPDDR Memory Outlook

Nvidia is expanding its presence in the CPU market with the Vera project, a move expected to strengthen the demand for LPDDR memory. This strategy has significant implications for major manufacturers like Samsung and SK Hynix, highlighting the evolvi...

#Hardware #LLM On-Premise #DevOps
2026-05-25 DigiTimes

Global PMX and AI Server Cooling: A Response to Compute Demand

Global PMX is shifting its focus towards AI server cooling solutions, responding to the escalating demand for compute power. This move highlights the critical importance of thermal management for AI infrastructures, particularly in on-premise deploym...

#Hardware #LLM On-Premise #DevOps
2026-05-25 DigiTimes

AI Accelerates Demand for Passive Components: The Case of MLCCs

Ample Electronic reports a significant surge in demand for Multi-Layer Ceramic Capacitor (MLCC) passive components, crucial for modern electronics, driven by the increasing adoption of artificial intelligence. This trend highlights AI's impact on the...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-25 DigiTimes

AI Data Centers Drive 800V HVDC Adoption: Impact on Asian Supply Chain

The escalating demand for artificial intelligence infrastructure is accelerating the adoption of 800V HVDC power systems in data centers. This transition, aimed at enhancing efficiency and power density, significantly impacts the supply chain, partic...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-25 DigiTimes

Huawei Invests in InP Chips to Boost AI Optical Networking

Huawei has announced a strategic investment in Milphoton Semiconductor, a startup specializing in Indium Phosphide (InP) based chips. This initiative aims to strengthen optical networking capabilities for artificial intelligence infrastructures, a cr...

#Hardware #LLM On-Premise #DevOps
2026-05-24 ServeTheHome

APC PowerForge: Text-to-3D Transformation with Dell and NVIDIA at DTW 2026

At Dell Tech World 2026, APC unveiled the PowerForge system, a rack solution developed in collaboration with Dell and NVIDIA. The demonstration highlighted its ability to generate 3D models directly from a text prompt, then physically print them in r...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-24 LocalLLaMA

BitCPM-CANN: Native 1.58-bit LLM Training on Ascend NPU

The BitCPM-CANN research introduces a training system for 1.58-bit (ternary) Large Language Models (LLMs) optimized for Huawei Ascend NPUs. This innovation allows for maintaining high reasoning capabilities on models up to 8 billion parameters, with ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-23 TechCrunch AI

xAI and SpaceX's Energy Shift: From Solar to Orbital Data Centers

xAI's recent pivot towards natural gas and SpaceX's interest in orbital data centers signal a potential departure from Elon Musk's promised solar-electric economy vision. This shift raises questions about future AI infrastructure, its environmental i...

#LLM On-Premise #DevOps
2026-05-22 Phoronix

OpenCL 3.1.1: A Crucial Update for AI and HPC Performance

The Khronos Group has released OpenCL 3.1.1, an update aimed at resolving a potential performance regression identified in the previous 3.1 version. This specification, fundamental for Artificial Intelligence and High-Performance Computing workloads,...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-22 LocalLLaMA

OpenBMB and BitCPM-CANN 1.58 bit: LLM Efficiency on Huawei Ascend

OpenBMB has introduced BitCPM-CANN, an LLM featuring 1.58-bit quantization. This approach aims to optimize inference efficiency by reducing memory footprint and computational requirements. The model is currently undergoing testing on the Huawei Ascen...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-22 Wired AI

The Gulf's AI Expansion: The Undersea Cable Challenge

The rapid development of artificial intelligence in the Gulf region is straining existing internet infrastructure. With the stakes rising for AI workloads, hyperscalers are pushing for a re-evaluation of undersea networks, highlighting the growing re...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-22 DigiTimes

AMD: AI Demand Is 'Absolutely Real' as CPUs Regain Focus

Lisa Su, AMD's CEO, has confirmed the robust demand in the artificial intelligence sector, highlighting a renewed interest in the role of CPUs. This strategic shift suggests an evolution in AI workload architecture, with significant implications for ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-22 DigiTimes

EverDisplay Reshuffles Leadership: A Signal for the Future of On-Premise AI?

EverDisplay announced a significant overhaul of its board, appointing a former Hua Hong executive as its new chairman. This strategic move raises questions about future directions in the technology sector, particularly concerning the supply chain and...

#Hardware #LLM On-Premise #DevOps
2026-05-22 DigiTimes

Wuhan and Huagong Tech: A 12.8 Tbps Optical Module for Chinese AI

Wuhan's optics hub in China is bolstering its commitment to artificial intelligence with the debut of a 12.8 Tbps optical module, developed by Huagong Tech. This component is crucial for AI infrastructure, enabling high-speed interconnects necessary ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-22 DigiTimes

Nvidia's Accelerated Innovation Cycle Strains AI Supply Chain

Nvidia's rapid iteration cycle in the artificial intelligence sector is creating significant strain across its supply chain. This dynamic impacts the availability of crucial hardware for on-premise deployments, raising questions about costs, lead tim...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-22 DigiTimes

Glass Substrates: BOE and Corning Explore New Frontiers for AI Packaging

BOE and Corning are focusing their efforts on developing glass substrates for AI chip packaging. This innovation aims to overcome the limitations of current technologies, offering higher interconnect density, improved thermal, and electrical properti...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-22 DigiTimes

AION: A €10 Billion AI Data Center Campus for Europe

The French AION consortium is seeking European Union funding to build an ambitious AI-dedicated data center campus. The project, estimated at €10 billion, aims to strengthen European digital sovereignty by providing large-scale on-premise AI infrastr...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 DigiTimes

Nvidia LPX: The Niche Silicon for High-Speed Tokens

Nvidia has unveiled LPX, a new silicon designed for specific workloads requiring high-speed token processing. This solution is positioned as a niche offering, optimized for high performance in contexts where rapid response and critical data managemen...

#Hardware #LLM On-Premise #DevOps
2026-05-21 DigiTimes

AI Agents Fuel Arm CPU Demand Surge: Over 6 Million Units Expected by 2026

The Tech Forum 2026 highlights a significant increase in Arm CPU demand, primarily driven by the adoption of AI agents. Projections indicate that shipments will exceed 6 million units by 2026, signaling Arm's expanding role in the artificial intellig...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 The Next Web

Brett Adcock’s Hark Secures $700M for AI Hardware

Hark, the AI hardware startup founded by Brett Adcock, has raised over $700 million in a Series A round, achieving a $6 billion valuation. The company focuses on developing an integrated chip-and-model stack, emerging from stealth after initial fundi...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 Phoronix

NVIDIA RTX PRO Blackwell: New Professional GPUs Tested on Linux

NVIDIA has introduced its new range of professional RTX PRO "Blackwell" graphics cards, designed for workstations. The analysis focuses on their performance in a Linux environment, evaluating how they compare against competing solutions from AMD Rade...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 Tom's Hardware

Nvidia: Soaring Memory Costs Drive AI Systems to $7.8 Million

An analysis of Nvidia's AI system costs reveals a staggering 485% surge in memory expenses, now accounting for a quarter of the total system cost. The latest AI systems are priced at $7.8 million, with individual Rubin GPUs costing $50,000. This scen...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 Phoronix

AMD Enhances Ryzen AI Drivers on Linux with Expandable Heap Support

AMD engineers are continuing the development of the AMDXDNA driver for Ryzen AI NPUs on Linux. The latest enhancement involves the introduction of expandable heap support, a crucial feature for optimizing memory management and improving AI workload p...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 Tom's Hardware

Taiwan Cracks Down on Nvidia AI Chip Smuggling, Raids 12 Locations

Taiwan has launched its first formal operation against Nvidia AI chip smuggling, conducting raids at 12 locations and seeking three fugitives. The investigation, which also involves Super Micro, focuses on allegations of document forgery and fraudule...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 Tom's Hardware

Meta MTIA: The Custom AI Accelerator and Its Infrastructure Implications

Meta is investing in the development of MTIA, its custom AI ASIC. This strategic move, common among large operators, aims to optimize performance for specific workloads like Large Language Models, reducing TCO and ensuring greater control over the en...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 Tom's Hardware

AMD EPYC Venice: Production Ramp Begins for 2nm 256-Core HPC Chip

AMD has commenced mass production for its EPYC Venice processor, a 256-core HPC chip built on 2-nanometer technology. This new architecture promises a significant performance leap, positioning itself as a key solution for demanding data center and on...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 Tom's Hardware

Musk and ASML: The $119 Billion TeraFab Chipmaking Project in Texas

Elon Musk is reportedly planning an ambitious semiconductor manufacturing megaproject, named TeraFab, with an estimated investment of $119 billion in Texas. ASML's CEO has confirmed direct discussions with Musk, highlighting the seriousness of the in...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 The Next Web

AMD Invests Over $10 Billion in Taiwan's AI Ecosystem for Helios Platform

AMD has announced a multi-year investment exceeding $10 billion in Taiwan's semiconductor ecosystem. The initiative aims to strengthen strategic partnerships and expand advanced packaging manufacturing for next-generation AI infrastructure, including...

#Hardware #LLM On-Premise #DevOps
2026-05-21 DigiTimes

AI and the Supply Chain: Taiwan's 5G FWA in 2026

Taiwan's 5G FWA CPE industry is set for a rebound in shipments by Q1 2026. However, this growth is challenged by an "AI-driven supply crunch," putting pressure on the global supply chain and raising questions for infrastructure deployments, especiall...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 DigiTimes

Nvidia's CPU Strategy: Deployment Models for AI Infrastructure

Nvidia is outlining its strategy for commercializing its CPUs, a key component for AI infrastructure. The company intends to offer these solutions through various deployment models, an approach that reflects the growing complexity and diverse require...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 DigiTimes

AMD Commits $10 Billion to Taiwan for AI Infrastructure and Packaging

AMD has announced an investment exceeding $10 billion in Taiwan's ecosystem. The goal is to enhance AI packaging and infrastructure capacity. This strategic move aims to support the escalating demand for AI hardware, with significant implications for...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 Tech.eu

Muybridge Secures $16M for Software-Defined Imaging and GPU Systems

Muybridge, a Norwegian imaging technology company, has closed an oversubscribed $16 million Series A funding round. The company develops a software-defined imaging platform that replaces traditional broadcast camera infrastructure with 4K sensors and...

#Hardware #LLM On-Premise #DevOps
2026-05-21 DigiTimes

AI Chip Boom Strains ABF Substrate Supply Chain

The rapid expansion of the artificial intelligence chip market is creating significant strain on the supply chain for ABF (Ajinomoto Build-up Film) substrates. These components are crucial for assembling high-performance processors, such as GPUs, whi...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 DigiTimes

Efficiency and Strategy: Managing Capacity in AI Infrastructure

Strategic capacity management and the elimination of inefficiencies are essential for any technology supply chain. In the LLM sector, this translates into critical decisions for on-premise deployment, where hardware optimization, VRAM, and TCO are ke...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 DigiTimes

Nvidia Redefines AI Market: New Segments and Strategic Partnerships

Nvidia has announced the creation of the ACIE sub-segment, a partnership with Anthropic, and an emphasis on Physical AI. These strategic moves indicate a growing specialization in the artificial intelligence market, with significant implications for ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 DigiTimes

Starcloud: A $2.2 Billion Growth and the Orbital Bet for AI's Energy Crisis

Starcloud has seen its valuation soar from $10 million to $2.2 billion in just 17 months. The company aims to address the growing energy crisis of artificial intelligence with an innovative solution, described as "orbital." This rapid development hig...

#Hardware #LLM On-Premise #DevOps
2026-05-21 DigiTimes

FII Challenges Broadcom and Nvidia as CPO Race Shifts to System Integration

The competitive landscape for Co-Packaged Optics (CPO) is undergoing a transformation, with FII emerging as a challenger to industry giants like Broadcom and Nvidia. Competition is increasingly shifting towards system integration, a crucial factor fo...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 DigiTimes

OSE Targets AI Server SMT Growth Driven by Memory Demand

OSE, a key player in semiconductor assembly and test services, is strategically focusing on Surface Mount Technology (SMT) for AI servers. This move is supported by increasing demand for memory components, which is improving the company's market outl...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 TechCrunch AI

Jensen Huang: AI Agent CPUs Represent a $200 Billion Market for Nvidia

Jensen Huang, Nvidia's CEO, has identified a significant new market valued at $200 billion. The company plans to focus on developing CPUs specifically for AI agents, marking a potential strategic expansion beyond its traditional GPU dominance. This m...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-21 DigiTimes

Nvidia's Revenue Surges 85%, Data Center Sales Drive AI Expansion

Nvidia reported an impressive 85% growth in overall revenue, with data center segment sales jumping by 92%. These results underscore the exponential demand for dedicated AI hardware infrastructure, particularly for Large Language Models workloads. Th...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-20 The Next Web

France Bids $10 Billion for EU AI Gigafactory Site

A consortium of French companies, led by Iliad's Scaleway, has submitted a bid of approximately $10 billion to host one of the five 'AI gigafactories' planned by the European Union. The AION consortium, which includes partners like Hugging Face and S...

#Hardware #LLM On-Premise #DevOps
2026-05-20 Tom's Hardware

China Bans Nvidia 5090D V2: A Signal for AI Technological Independence

During CEO Jensen Huang's visit, China reportedly banned the Nvidia 5090D V2 GPU. This move is part of Beijing's strategy to promote the adoption of locally produced chips by domestic tech companies, highlighting growing geopolitical tensions in the ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-20 AI News

Alibaba Redefines the AI Race with Chips and LLMs for Agents

Alibaba has unveiled the Zhenwu M890 AI processor, a multi-year silicon roadmap, and the new Qwen 3.7-Max LLM. This strategic move aims to build an integrated AI stack, focusing on AI agents and technological sovereignty, thereby reducing reliance on...

#Hardware #LLM On-Premise #DevOps
2026-05-20 DigiTimes

Alibaba T-Head Bolsters AI Infrastructure with Zhenwu M890

Alibaba T-Head, the semiconductor division of the Chinese tech giant, is intensifying its commitment to developing dedicated artificial intelligence infrastructure. The introduction of the Zhenwu M890 marks a significant step in this direction, aimin...

#Hardware #LLM On-Premise #DevOps
2026-05-20 DigiTimes

Liquid Cooling: A Necessity for AI Servers at Tech Forum 2026

At Tech Forum 2026, liquid cooling emerged as an indispensable solution for AI servers. The increasing power density of AI hardware stacks, essential for Large Language Models, is driving the adoption of more efficient thermal dissipation technologie...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-20 DigiTimes

Google and Blackstone's Joint Venture for TPU Leasing: Impact on ASIC Demand

Google and Blackstone have announced a joint venture focused on leasing Tensor Processing Units (TPUs). This initiative aims to facilitate access to specialized AI hardware, potentially boosting demand for Application-Specific Integrated Circuits (AS...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-20 DigiTimes

Quanta Cloud Technology Invests in California for AI Server Expansion

Quanta Cloud Technology (QCT) has allocated US$61.71 million for a new facility lease in California. This investment aims to support the increasing demand for AI-dedicated servers, reflecting the trend of companies enhancing their physical infrastruc...

#Hardware #LLM On-Premise #DevOps
2026-05-20 DigiTimes

China Intensifies Push for AI Chip Self-Reliance After Trump-Xi Talks

Following a meeting between former President Trump and President Xi, it has emerged that China is intensifying its efforts to achieve self-reliance in AI chip production. This strategic move underscores the critical importance of controlling the sili...

#Hardware
2026-05-20 DigiTimes

AI Rack Boom Drives Nvidia's Growth Forecasts

Supply chains are increasingly optimistic about Nvidia's upcoming financial results. The expectation of strong performance is fueled by robust demand for artificial intelligence infrastructure, particularly "AI racks," reflecting an acceleration in e...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-20 DigiTimes

China's OSATs Pursue Enhanced Role Amid AI Chip Packaging Strain

Chinese Outsourced Semiconductor Assembly and Test (OSAT) companies are intensifying their involvement in the global supply chain, driven by the surging demand for AI chips. This push aims to alleviate pressure on the packaging market, a critical bot...

#Hardware #LLM On-Premise #DevOps
2026-05-19 ServeTheHome

AMD Unveils EPYC 8005 Series: Up to 84 Zen 5 Cores with 225W TDP

AMD has released details of its new EPYC 8005 processor series, based on the Zen 5 architecture. Featuring configurations of up to 84 cores and a 225W TDP, this line represents a significant evolution for servers, offering a balance between core dens...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-19 The Next Web

Meta Invests Over $200 Billion in Hyperion, Its AI Campus in Louisiana

Meta is building Hyperion, a massive AI data center campus in Louisiana. With an estimated cost exceeding $200 billion, the project represents the most expensive private infrastructure in U.S. history, an investment that has grown exponentially since...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-19 DigiTimes

Dell AI Factory Exceeds 5,000 Enterprise Clients, Driven by Nvidia Demand

Dell Technologies announced that its AI Factory initiative has surpassed 5,000 enterprise clients. This milestone highlights the growing adoption of dedicated AI infrastructure solutions, with particularly strong demand for Nvidia-powered platforms. ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-19 TechWire Asia

AMD and Yotta-Scale AI: Malaysia at the Core of Infrastructure Strategy

AMD identifies Malaysia as a strategic pillar for AI infrastructure development in Southeast Asia, anticipating the rise of yotta-scale AI. This evolution compels enterprises to rethink infrastructure planning, favoring open and distributed systems t...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-02 DigiTimes

Asahi Kasei Enters AI Chip Fiberglass Market, Challenging Nittobo

Asahi Kasei has announced its entry into the AI chip fiberglass market, a critical sector for advanced hardware component manufacturing. This move aims to challenge Nittobo's dominant position, signaling an intensification of competition in the AI ma...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-02 DigiTimes

Nvidia Invests $2 Billion in Marvell for NVLink Fusion Integration with ASICs

Nvidia has announced a $2 billion investment in Marvell, aiming to integrate NVLink Fusion technology directly into ASICs. This strategic move seeks to enhance interconnection capabilities for custom chips, accelerating the development of optimized h...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-02 DigiTimes

Chinese Companies Capture Nearly 41% of Domestic AI Accelerator Server Market

Chinese enterprises have secured a significant market share, almost 41%, in the domestic AI accelerator server sector. This highlights a growing local capability in providing critical infrastructure for Large Language Models (LLM) workloads and other...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-02 DigiTimes

The Arm Architecture Redefines AI Servers: Towards a Post-x86 Era

Hyperscalers are re-engineering AI server CPUs by adopting the Arm architecture, signaling a potential shift from the x86 era. This transition promises greater energy efficiency and flexibility, with significant implications for TCO and data sovereig...

#Hardware #LLM On-Premise #DevOps
2026-04-02 DigiTimes

Huawei's AI Strategy: Infrastructure at the Core

Huawei's 2025 annual report, featuring insights from Meng Wanzhou, highlights the company's AI strategy, which begins with foundational infrastructure. This vision underscores the importance of robust, scalable architecture to support the development...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-02 DigiTimes

Microsoft Invests $6.5 Billion in Southeast Asia AI Buildout

Microsoft has announced a $6.5 billion investment to boost its artificial intelligence infrastructure in Southeast Asia, with a specific focus on Singapore and Thailand. This strategic move underscores the region's growing importance as a tech hub an...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-02 Phoronix

AMD GPU Driver Optimizations Arriving with Linux 7.1

AMD is introducing new optimizations for its GPU drivers, including the DC Idle Manager and Multi-SDMA Engine, slated for the Linux 7.1 kernel. These updates aim to enhance the efficiency and performance of AMD graphics cards, a crucial aspect for on...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-01 TechCrunch AI

Meta: Hyperion AI data center powered by ten gas plants

Meta is building its next AI data center, named Hyperion. This critical infrastructure for artificial intelligence workloads will be powered by ten new natural gas plants. Meta's energy choice for a project of this magnitude raises questions about pr...

#Hardware #LLM On-Premise #DevOps
2026-04-01 TechCrunch AI

Cognichip Raises $60M to Advance AI-Designed Chips for AI

Cognichip has secured $60 million in funding to pursue an innovative approach: leveraging artificial intelligence to design the very chips that power AI applications. The company aims to revolutionize the semiconductor industry by promising to reduce...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-01 Tom's Hardware

DRAM and NAND: Prices Soar in Q2 Due to AI Server Demand

According to Trendforce, DRAM and NAND Flash prices are set for significant increases in Q2, with projected jumps of 63% and up to 75% respectively. These increases follow substantial hikes in Q1 and are attributed to the surging demand for AI server...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-01 Tom's Hardware

Gigabyte X870E Aorus Xtreme AI Top: The Hardware Foundation for On-Premise AI

The Gigabyte X870E Aorus Xtreme AI Top positions itself as a flagship motherboard designed for high-performance systems. Its architecture is relevant for building AI workstations or servers in self-hosted environments, where stability, connectivity, ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-01 DigiTimes

Fujitsu and Rapidus: Japan's 1.4nm AI Chip Production Takes Shape

Fujitsu has announced plans for the production of cutting-edge AI chips, based on 1.4-nanometer technology. Manufacturing will take place in Japan, in collaboration with Rapidus, at the company's first fab in Chitose, Hokkaido. Operations are schedul...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-01 DigiTimes

Silicio Photonics and Advanced Packaging: Pillars for Future AI

Recent discussions at Touch Taiwan highlighted the increasing importance of Silicio Photonics (SiPh) and advanced packaging. These technologies are considered crucial for overcoming current hardware limitations and enabling the next generation of AI ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-01 DigiTimes

SDI targets AI and heat spreader growth

SDI, a technology sector player, is directing its growth strategies towards artificial intelligence and heat spreader development. This move reflects the increasing demand for advanced thermal solutions, crucial for managing the heat generated by int...

#Hardware #LLM On-Premise #DevOps
2026-04-01 DigiTimes

Micron Reportedly Developing Stacked GDDR to Meet AI Memory Demand

Micron is reportedly developing a new generation of GDDR memory using stacked technology to address the increasing demands of AI workloads. This innovation is crucial for the evolution of infrastructures hosting Large Language Models, directly impact...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-01 DigiTimes

Nvidia and Marvell: The $2 Billion Bet Redefining AI Alliances

Nvidia has invested $2 billion in Marvell, transforming a potential rival into a strategic partner. This move highlights the importance of collaborations for AI infrastructure, with significant implications for enterprises evaluating on-premise deplo...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-01 DigiTimes

Nvidia Aims for Full AI Stack Ownership with Three-System Strategy

Nvidia is expanding its offerings beyond GPUs, aiming to provide comprehensive AI solutions. This strategic move, based on a three-system approach, seeks to consolidate control over the entire AI pipeline, from computation to software. The goal is to...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-01 DigiTimes

China GPU Maker Biren Triples Revenue on AI Data Center Demand

Chinese GPU manufacturer Biren has reported impressive revenue growth, tripling its earnings due to increasing demand from artificial intelligence data centers. This trend highlights the strong expansion of the AI hardware market, with a particular f...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-01 DigiTimes

Arm and Tesla Reshape AI Chip Market: Impact on Supply Chains and Memory

The AI chip landscape is undergoing a profound transformation, driven by the rise of Arm architecture and custom silicio development strategies from companies like Tesla. These shifts are redefining global supply chains and fueling a surging demand f...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-31 Tom's Hardware

Tryx Stage 360 AIO: The All-in-One Approach for On-Premise AI Infrastructure

The Tryx Stage 360 AIO is presented as an All-in-One solution promising a distinctive user experience, focused on design and quiet operation. For companies evaluating on-premise Large Language Model (LLM) deployment, adopting integrated systems can o...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-31 The Next Web

Oracle Cuts Thousands of Jobs to Fund AI Data Centers

Oracle is undergoing a significant workforce reorganization, with estimates suggesting up to 30,000 layoffs. The goal is to free up an estimated $8-10 billion to finance massive investments in AI infrastructure and data centers. These decisions, affe...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-31 Phoronix

Intel Panther Lake & Linux AI/LLM Debates Dominated Q1

The first quarter saw intense activity within the Linux landscape, with upcoming Intel Panther Lake processors and discussions surrounding Large Language Models (LLM) and artificial intelligence taking center stage. These topics generated significant...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-31 The Register AI

Agentic AI: Arm calls for new CPUs, Intel pushes back

Arm and Nvidia have unveiled specific CPUs designed to run agentic AIs, such as OpenClaw, suggesting a need for dedicated architectures. This view, however, is challenged by Intel, whose Data Center chief does not believe a radical shift in CPU desig...

#Hardware #LLM On-Premise #DevOps
2026-03-31 Tech.eu

Nebius Announces 310 MW AI Mega Data Center in Finland

Nebius, a European AI infrastructure company, has announced the construction of a 310 MW data center in Lappeenranta, Finland, expected to be operational by 2027. The facility will be one of Europe's largest dedicated AI data centers, used for traini...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-31 The Next Web

Microsoft Commits Over $1 Billion to Cloud and AI Infrastructure in Thailand

Microsoft has announced an investment exceeding $1 billion in Thailand between 2026 and 2028. The initiative aims to bolster the country's cloud and AI infrastructure, encompassing data center construction, cybersecurity enhancement, the development ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-31 DigiTimes

MediaTek and Airoha Strengthen Open Source Platform for Edge AI

MediaTek and Airoha are intensifying their collaboration on an open-source platform for the telecommunications sector. The initiative aims to compete with established players like Broadcom and Qualcomm, focusing specifically on developing solutions f...

#Hardware #LLM On-Premise #DevOps
2026-03-31 DigiTimes

Lens Technology Shifts Focus to AI Servers, Robotics, and Aerospace

Lens Technology, known for its iPhone component manufacturing, is expanding its operations. The company is now concentrating on strategic sectors such as artificial intelligence servers, robotics, and aerospace. This move marks a significant diversif...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-31 ServeTheHome

Gigabyte Showcases NVIDIA Vera Rubin Platforms and More at GTC 2026

At GTC 2026, Gigabyte unveiled its latest hardware innovations, with a particular focus on new platforms built around the NVIDIA Vera Rubin architecture. These next-generation systems and components are designed to tackle the most intensive Large Lan...

#Hardware #LLM On-Premise #DevOps
2026-03-31 DigiTimes

Chinese GPU Maker Moore Threads Secures $91 Million AI Cluster Order

Chinese GPU manufacturer Moore Threads has secured a $91 million order for an AI cluster. This deal highlights the increasing demand for dedicated artificial intelligence infrastructure and the emerging role of new players in the global LLM hardware ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-31 DigiTimes

CPU Resurgence Reshapes AI Chip Demand: Terafab Funding Questions Emerge

The AI chip market is undergoing a transformation, with an unexpected resurgence of CPUs beginning to redefine hardware requirements for artificial intelligence. This trend raises questions about future investments in manufacturing infrastructures li...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-31 DigiTimes

Nvidia: Vera Rubin Design Unfinalized, Focus on Supply Chain Diversification

Nvidia is still finalizing the design of its "Vera Rubin" compute tray. This development phase coincides with a corporate strategy aimed at diversifying its supply chain. Nvidia's move highlights the importance of mitigating risks associated with the...

#Hardware #LLM On-Premise #DevOps
2026-03-31 DigiTimes

The Rise of Custom Chips: Taiwan Responds to ASIC Demand for AI

The increasing demand for custom chips, known as ASICs, is prompting Taiwanese firms to strengthen their presence in this market segment. This trend reflects the need for more efficient and specialized hardware solutions to handle intensive LLM and A...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-30 DigiTimes

Samsung SDI strengthens LFP supply chain for US AI data centers

Samsung SDI is expanding its LFP cathode supply chain, targeting the growing US market for Energy Storage Systems (ESS) in AI data centers. This strategic move, involving Posco Future M, highlights the critical role of energy infrastructure in suppor...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-30 The Next Web

Rebellions Raises $400M for AI Inference Chips, Valued at $2.34 Billion

South Korean fabless AI chip company Rebellions, focused on AI Inference, has closed a $400 million pre-IPO funding round, reaching a $2.34 billion valuation. Backed by giants like Samsung and SK Hynix, the company targets US customers such as Meta a...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-30 The Register AI

South Korea's Rebellions Raises $400M for Global Rack-Scale AI Platform

South Korean AI chip startup Rebellions, backed by SK Telecom, has secured $400 million in a pre-IPO funding round. This investment aims to support the global expansion of its new rack-scale compute platform, designed for enterprises and sovereign cl...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-30 DigiTimes

Arm Expands Beyond Licensing with New AI CPU Platform

Arm is redefining its traditional licensing business model by introducing an innovative CPU platform specifically designed for artificial intelligence workloads. This strategic move aims to offer optimized hardware solutions for AI, potentially influ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-30 DigiTimes

DRAM Scaling Limits: New Memory Crucial for On-Premise AI

DRAM scalability is reaching its limits, while next-generation memories face delays. Atomera's MST technology promises to improve power and bandwidth efficiency, offering benefits comparable to a manufacturing node transition, a key factor for on-pre...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-30 DigiTimes

AI compute shifts to inference, reshaping data center bottlenecks

DIGITIMES Research's analysis highlights a transition in the AI computing landscape: the focus is increasingly shifting towards inference. This change, presented at AI EXPO 2026 by Jim Hsiao, senior analyst, is redefining the challenges and bottlenec...

#Hardware #LLM On-Premise #Fine-Tuning
2026-03-29 Tom's Hardware

New Cambridge chip slashes AI energy use

A new chip developed at Cambridge promises to drastically reduce the energy consumption of artificial intelligence systems. The component uses a new type of memristor with a switching current approximately one million times lower than conventional de...

#LLM On-Premise #DevOps
2026-03-29 The Next Web

European investments: focus on AI infrastructure

Last week saw a surge of investments in Europe, with a particular focus on infrastructural layers. Funding spanned diverse sectors such as semiconductor physics, orbital logistics, defense systems, and artificial intelligence, signaling a strong inte...

#LLM On-Premise #DevOps
2026-03-28 The Next Web

Kandou AI raises $225 million to bet on copper interconnects

Swiss company Kandou AI, specializing in copper-based chip-to-chip interconnect technologies, has secured a $225 million Series A funding round. The investment, led by Maverick Silicio, includes strategic participation from SoftBank, Synopsys, Cadenc...

2026-03-28 Tom's Hardware

Meta to fund natural gas power plants for Louisiana AI data center

Meta partners with Entergy to build seven new natural gas power plants. The goal is to deliver 7 gigawatts of power to its planned AI data center in Louisiana, ensuring sufficient energy for compute-intensive operations.

#Hardware #LLM On-Premise #DevOps
2026-03-28 ServeTheHome

Aivres Showcases NVIDIA Vera Rubin at NVIDIA GTC 2026

Aivres showcased NVIDIA Vera CPUs and Rubin GPUs at NVIDIA GTC 2026. Blackwell Ultra and BlueField-4 DPUs were also on display. The event offered a glimpse into NVIDIA's upcoming hardware architectures for advanced workloads.

#Hardware #LLM On-Premise #DevOps
2026-03-27 TechCrunch AI

AI Infrastructure: Real-World Resistance Emerges

The expansion of AI infrastructure into the real world is meeting resistance. An AI company offered an 82-year-old woman $26 million to build a data center on her land, but she refused. Tensions are rising regarding the territorial and social impact ...

#LLM On-Premise #DevOps
2026-03-27 Phoronix

AMD ROCm 7.12 Tech Preview Brings More Consumer APU & GPU Support

AMD has released ROCm 7.12 as the newest tech preview, working towards the presumed ROCm 8.0 release. This release extends support to a greater number of consumer APUs and GPUs, expanding options for developers using the ROCm ecosystem.

#Hardware
2026-03-27 DigiTimes

Taiwan's ALi bets on custom chips for 2026 turnaround

Taiwan's ALi (Acer Laboratories Inc.) is investing in the development of custom chips with the goal of a turnaround by 2026. The strategy focuses on specialized hardware solutions for emerging markets, with a particular focus on optimizing performanc...

#Hardware #LLM On-Premise #DevOps
2026-03-27 DigiTimes

SK Hynix keeps HBM shipments steady, targets HBM4E sample this year

SK Hynix keeps HBM (High Bandwidth Memory) shipments steady and plans to release the first HBM4E samples by the end of the year. The Nvidia Vera Rubin AI platform highlights the growing demand for advanced memory in AI systems.

#Hardware #LLM On-Premise #DevOps
2026-03-26 LocalLLaMA

Qwen 3.5 27B: 1.1M tok/s on B200s, configurations on GitHub

Qwen 3.5 27B achieved 1.1 million tokens per second using 96 B200 GPUs across 12 nodes, thanks to optimizations like DP=8 over TP=8, a context window reduced to 4K, FP8 KV cache, and MTP-1 speculative decoding. Scaling efficiency reached 96.5% on 12 ...

#Hardware #LLM On-Premise #DevOps
← Back to All Topics