AI Hardware Race Intensifies

2026-02-07 • TechCrunch AI

Benchmark raises $225M in special funds to double down on Cerebras

Venture capital firm Benchmark Capital has announced a $225 million investment in Cerebras Systems, a manufacturer of processors dedicated to artificial intelligence. Benchmark has been an investor in Cerebras since 2016, supporting the development o...

#Hardware #LLM On-Premise #Fine-Tuning

2026-02-07 • DigiTimes

Microsoft's Maia 200 chip targets AI inference cost advantage, not Nvidia rivalry

Microsoft unveiled Maia 200, a chip designed to optimize AI inference costs. The goal is not to compete directly with Nvidia, but to offer a more cost-effective solution for specific workloads. The chip is intended for Microsoft data centers.

#Hardware #LLM On-Premise #DevOps

2026-02-07 • DigiTimes

AI demand spillover lifts 2026 general-purpose server shipments 10%

The increasing demand for artificial intelligence applications is having a significant impact on the server market. General-purpose server shipments are projected to increase by 10% in 2026, driven by the need for more powerful computing infrastructu...

#LLM On-Premise #Fine-Tuning #DevOps

2026-02-06 • The Register AI

Record Investments: Big Tech to Spend $635 Billion on AI Infrastructure

Amazon, Google, Meta, and Microsoft are projected to collectively invest approximately $635 billion in infrastructure, with a significant portion allocated to datacenters and AI infrastructure. This figure surpasses Israel's GDP and the entire global...

#LLM On-Premise #DevOps

2026-02-06 • Tom's Hardware

Infineon allegedly hikes prices of power switches and ICs amid AI boom

Infineon has reportedly increased the prices of its power switches and integrated circuits (ICs). This move, apparently linked to the expansion of artificial intelligence, could have repercussions on the production costs of a wide range of electronic...

2026-02-06 • Phoronix

Pushing The Intel Panther Lake CPU Performance Further On Linux

New Linux benchmarks examine the performance of Intel's Panther Lake Core Ultra X7 358H CPU with a higher power budget. The tests reveal significant generational improvements, particularly in energy efficiency, and confirm the excellent performance o...

#Hardware #LLM On-Premise #DevOps

2026-02-06 • Phoronix

AMD Prepares the Ground for RDNA 4 GPUs with GFX1170 Target

AMD continues the development of its LLVM compiler stack for future GPUs. A new target, GFX1170, also identified as RDNA 4m, has been introduced. This update adds to the ongoing work on GFX1250 and GFX13 targets, expanding support for AMD's upcoming ...

#Hardware

2026-02-06 • DigiTimes

TSMC’s 3nm bet in Japan signals a deeper Taiwan-Japan tech pact

TSMC's investment in 3nm technology in Japan signals a strengthening of technological collaboration between Taiwan and Japan. This strategic move could have significant implications for the global semiconductor supply chain and international technolo...

2026-02-06 • DigiTimes

Pegatron partners with Sysgration to expand BBU for US-made AI servers

Pegatron is partnering with Sysgration to expand the production of Battery Backup Units (BBUs) for AI servers manufactured in the US. This collaboration aims to strengthen the domestic supply chain for critical AI server components.

#LLM On-Premise #DevOps

2026-02-06 • DigiTimes

MetaOptics drives heat-resistant metalenses into CPUs

MetaOptics, headquartered in Singapore and maintaining close ties with Taiwan, is developing heat-resistant metalenses for integration into CPUs. This technology could significantly improve the thermal management of processors.

2026-02-06 • DigiTimes

PSMC narrows losses as DRAM prices and AI demand boost revenue

Memory manufacturer PSMC reports narrowing losses, driven by rising DRAM prices and increasing demand for artificial intelligence solutions. This positive trend reflects an improving semiconductor market.

#LLM On-Premise #DevOps

2026-02-06 • DigiTimes

CSPs turn to custom silicon to break Nvidia dependence

Cloud service providers (CSPs) are exploring custom silicon solutions to diversify their hardware options and reduce dependence on traditional vendors like Nvidia. This trend could lead to new architectures optimized for specific workloads.

#Hardware #LLM On-Premise #DevOps

2026-02-06 • DigiTimes

Google doubles AI capex, turning TPU ASIC orders into high-stakes supplier race

Google is significantly increasing its investments in AI infrastructure, particularly in TPU ASICs. This move intensifies competition among suppliers and signals a strong push towards custom hardware solutions for artificial intelligence workloads.

#Hardware #LLM On-Premise #Fine-Tuning

2026-02-06 • DigiTimes

Foxconn sees 35% revenue increase in January on AI server demand

Foxconn reports a 35% revenue increase in January, driven by strong demand for AI servers. This reflects the growing importance of specialized hardware for AI workloads, both in the cloud and on-premise.

#Hardware #LLM On-Premise #DevOps

2026-02-06 • DigiTimes

Wistron posts strongest January on AI server growth

Taiwanese manufacturer Wistron reported an exceptionally positive January, driven by strong demand for servers dedicated to artificial intelligence. This highlights the growing market interest in specialized hardware solutions for AI workloads.

#Hardware #LLM On-Premise #Fine-Tuning

2026-02-06 • DigiTimes

AI and AP Drive Load Board Shipments with January Revenue Up

According to DIGITIMES, artificial intelligence and advanced applications (AP) are boosting shipments of load boards. January revenues show growth, indicating strong demand in the sector.

#Hardware #LLM On-Premise #DevOps

2026-02-06 • DigiTimes

Taiwan LED equipment maker FitTech wins arbitration, recovers NT$1.49b from China's Sanan

Taiwanese LED equipment maker FitTech has won an international arbitration against China's Sanan Optoelectronics, recovering NT$1.49 billion (approximately $46 million). The dispute concerned alleged contract violations. The decision highlights the i...

#LLM On-Premise #DevOps

2026-02-06 • DigiTimes

Cerebras raises US$1 billion, valuation nearly triples in 6 months

Cerebras Systems has announced a funding round that nearly triples its valuation in just six months. The company focuses on developing specialized hardware for artificial intelligence workloads, particularly for training large models.

#Hardware #LLM On-Premise #Fine-Tuning

2026-02-05 • LocalLLaMA

gWorld: 8B model beats 402B Llama 4 by generating web code

Trillion Labs and KAIST AI introduced gWorld, an open-weight visual world model for mobile GUIs. gWorld, available in 8B and 32B versions, generates executable web code instead of pixels, surpassing larger models like Llama 4 in accuracy. This approa...

#LLM On-Premise #Fine-Tuning #DevOps

2026-02-05 • Tom's Hardware

Tenstorrent reduces Tensix Cores on Blackhole p150 via Firmware Update

Tenstorrent announced a reduction in the number of Tensix cores on its Blackhole p150 cards, from 140 to 120, via a firmware update. The company anticipates a 1-2% performance drop for existing users. New cards will ship with 120 Tensix cores.

#Hardware #LLM On-Premise #DevOps

2026-02-05 • Tom's Hardware

Western Digital details 14-platter 3.5-inch HAMR HDD designs with 140 TB and beyond

Western Digital announces the development of 3.5-inch HDDs (Hard Disk Drives) based on HAMR (Heat-Assisted Magnetic Recording) technology with a capacity reaching 140 TB, thanks to the use of 14 platters. This technology promises to significantly inc...

#LLM On-Premise #DevOps

2026-02-05 • Phoronix

Intel Arc B390 Graphics Performance On Linux With Panther Lake

The first Linux benchmarks are in for the Intel Arc B390 GPU, which is integrated into high-end Panther Lake models. The Xe3 graphics, equipped with 12 Xe cores, promise interesting performance in desktop and mobile environments for graphics and compute workloads.

#Hardware #LLM On-Premise #DevOps

2026-02-05 • DigiTimes

Google AI platform win elevates Innoscience's 8-inch GaN manufacturing clout

Google's selection of Innoscience for its AI platform highlights the importance of GaN (gallium nitride) manufacturing on 8-inch wafers. This technology promises to improve the efficiency and performance of artificial intelligence systems, opening ne...

#LLM On-Premise #DevOps

2026-02-05 • DigiTimes

Siemens expands EDA stack with AI metrology acquisition of Canopus AI

Siemens Digital Industries Software has announced the acquisition of Canopus AI, a move aimed at enhancing its Electronic Design Automation (EDA) stack with advanced AI-powered metrology capabilities. The integration is expected to improve semiconduc...

2026-02-05 • DigiTimes

Alphabet's US$185 billion hardware mandate: Breaking the AI supply bottleneck

Alphabet plans to invest US$185 billion in hardware infrastructure dedicated to artificial intelligence. The initiative aims to overcome current supply chain bottlenecks and ensure the computing capacity needed for its ambitious AI projects.

#Hardware #LLM On-Premise #DevOps

2026-02-05 • DigiTimes

Qualcomm reports record results, flags memory constraints

Qualcomm reported record financial results for Q1FY26. However, the company anticipates potential limitations related to memory availability in the near term, a factor that could impact deliveries and the ability to meet demand.

#LLM On-Premise #DevOps

2026-02-05 • DigiTimes

Jensen Huang: AI factories will power a trillion-dollar reindustrialization

According to Jensen Huang, CEO of NVIDIA, AI factories are the engine of a new wave of reindustrialization. These specialized infrastructures will be fundamental for the development and deployment of advanced AI solutions in various industrial sector...

#Hardware #LLM On-Premise #DevOps

2026-02-05 • DigiTimes

Infineon's fiscal 1Q26 resilience highlights AI-driven growth amid cyclical pressures

Infineon's resilience in fiscal Q1 2026 highlights how growth in the artificial intelligence sector is offsetting cyclical market pressures. The company demonstrates its ability to navigate economic challenges through a strategic focus on AI.

#LLM On-Premise #DevOps

2026-02-05 • DigiTimes

Alphabet pledges record $185 billion capital spend as AI fuels cloud boom

Alphabet plans to invest a record $185 billion, fueled by cloud growth and AI opportunities. The company aims to strengthen its infrastructure to support the increasing demand for AI and cloud services.

#Hardware #LLM On-Premise #DevOps

2026-02-05 • DigiTimes

Japan's Ibiden channels US$3.3bn into IC substrate expansion for AI servers

Japanese manufacturer Ibiden is investing heavily in expanding its production of IC substrates. The goal is to meet the growing demand for servers dedicated to artificial intelligence. The total investment amounts to US$3.3 billion.

#Hardware #LLM On-Premise #DevOps

2026-02-05 • DigiTimes

AEMC joint venture to build semiconductor adhesives R&D center in Southern Taiwan Science Park

AEMC will build a research and development center dedicated to semiconductor adhesives in the Southern Taiwan Science Park. The initiative aims to strengthen the company's position in the electronics materials market.

2026-02-04 • TechCrunch AI

Positron challenges Nvidia with AI chips: $230M Series B round

Positron has raised $230 million in a Series B funding round, with participation from the Qatar Investment Authority. The company aims to compete with Nvidia in the artificial intelligence chip market, amid growing demand and with Qatar aiming to dev...

#Hardware

2026-02-04 • DigiTimes

TI and NXP report strong results as AI data center power management boosts semiconductor packaging demand

Texas Instruments and NXP report strong results, driven by increasing demand for power management solutions in AI data centers. The rising complexity of power systems for GPUs and accelerators is boosting the advanced semiconductor packaging sector.

#Hardware #Fine-Tuning

2026-02-04 • DigiTimes

Intel CEO unveils plans to enter GPU market dominated by Nvidia

Intel's CEO has announced plans to enter the GPU market, currently dominated by Nvidia. This strategic move could bring new dynamics to the hardware acceleration sector for artificial intelligence and graphics workloads.

#Hardware #LLM On-Premise #DevOps

2026-02-04 • DigiTimes

Nvidia shapes the HBM4 race as Samsung, SK Hynix jockey for position

The race for HBM4 memory production intensifies, with Nvidia playing a key role in defining the specifications. Samsung and SK Hynix are vying for leadership in this sector crucial for future GPUs and AI accelerators.

#Hardware #LLM On-Premise #DevOps

2026-02-04 • DigiTimes

Vanguard International Semiconductor sees strong 2026 AI server power demand

Vanguard International Semiconductor anticipates strong growth in power demand for AI servers starting in 2026. The company expects a significant impact on the semiconductor market, with implications for hardware manufacturers and cloud service provi...

#LLM On-Premise #DevOps

2026-02-04 • DigiTimes

AI drives ODM/EMS growth despite weak consumer electronics in 2025

The ODM/EMS sector anticipates growth in 2025, primarily driven by the demand for AI-based solutions, offsetting the slowdown in the consumer electronics market. This trend highlights the increasing importance of AI as an engine for innovation and ec...

#Hardware #LLM On-Premise #DevOps

2026-02-04 • DigiTimes

AMD prioritizes supply chain for second-half AI ramp

AMD is focusing its efforts on optimizing its supply chain to support the increasing demand for AI solutions in the second half of the year. This strategic move aims to ensure the availability of necessary components for the production and distributi...

#Hardware #LLM On-Premise #DevOps

2026-02-04 • DigiTimes

Nvidia expands validation and testing, silicon photonics could be GTC 2026 focus

Nvidia is expanding its validation and testing processes. The company may focus on silicon photonics as a key element for future GPUs, with potential announcements at GTC 2026. This technology promises to significantly improve the speed and energy ef...

#Hardware

2026-02-04 • DigiTimes

AMD: Financial Results Meet Expectations, AI Market Awaits More

AMD reported solid financial results, but the AI market's expectations, particularly regarding dedicated solutions, remain partially unmet. Investors are awaiting more concrete signs of AMD's ability to compete in the rapidly expanding AI sector.

#Hardware #LLM On-Premise #DevOps

2026-02-04 • DigiTimes

Supermicro’s AI boom comes with a risk: one customer, 63% of revenue

Supermicro's growth in the artificial intelligence sector is remarkable, but the company is heavily reliant on a single customer, who generates 63% of its revenue. This concentration represents a significant risk to future financial stability.

#LLM On-Premise #DevOps

2026-02-04 • DigiTimes

Samsung Electro-Mechanics poised to enter Nvidia NVSwitch substrate supply chain

Samsung Electro-Mechanics is preparing to enter the supply chain for Nvidia NVSwitch substrates. This strategic move could strengthen Samsung's position in the high-performance component market for artificial intelligence and accelerated computing ap...

#Hardware #LLM On-Premise #DevOps

2026-02-03 • TechCrunch AI

Intel to enter the Nvidia-dominated GPU market

Intel is ramping up efforts to compete in the GPU market, currently dominated by Nvidia. The company is building a dedicated team and will develop a GPU strategy focused on customer needs. This marks a significant evolution in the graphics processor ...

#Hardware #LLM On-Premise #DevOps

2026-02-03 • Tom's Hardware

Intel is co-developing new Z-Angle Memory for AI data centers

Intel and SoftBank subsidiary Saimemory are collaborating to develop Z-Angle Memory (ZAM), a vertically stacked memory for AI data centers. ZAM promises 2 to 3x more capacity, greater bandwidth, and half the power consumption compared to current solu...

#Hardware #LLM On-Premise #DevOps

2026-02-03 • LocalLLaMA

Intel Xeon 600 Workstation CPUs Launched: Up To 86 Cores

Intel has launched the new Xeon 600 series processors for workstations, offering up to 86 cores. These processors support memory speeds of up to 8000 MT/s and 128 PCIe Gen5 lanes, with a TDP of 350W and overclocking support. They are positioned as an alternative ...

#Hardware #LLM On-Premise #DevOps

2026-02-03 • Tom's Hardware

Photonics and high-speed data movement is the next big AI bottleneck

Generative AI is pushing demand across the industry. Data interconnects, such as silicon photonics, may well be the next big bottleneck that hyperscalers need to pay attention to. Following copper, power, DRAM, and NAND, data movement speed bec...

#LLM On-Premise #DevOps

2026-02-03 • DigiTimes

AI redraws data center power architecture as rack-level energy systems go mainstream

Data centers are evolving to support the increasing workloads of artificial intelligence. New rack-level power architectures are emerging to better manage the energy demands of high-performance GPUs, optimizing efficiency and reducing operating costs...

#Hardware #LLM On-Premise #Fine-Tuning

2026-02-03 • DigiTimes

China's AI chip swarm hits mass scale, challenging Nvidia

China is scaling up its AI chip production, eroding Nvidia's dominant position in the Chinese market. This push for technological self-sufficiency could reshape the AI hardware landscape, with significant implications for companies operating in the s...

#Hardware #LLM On-Premise #DevOps

2026-02-03 • DigiTimes

King Slide and Nan Juen listed in Nvidia server rail supply chain

King Slide and Nan Juen are listed in Nvidia's server rail supply chain. Fositek is positioning itself as a promising contender in this growing market. Competition for supplying high-performance server components is increasing.

#Hardware #LLM On-Premise #DevOps

2026-02-03 • DigiTimes

Nvidia reclaims cooling control as AI CDU ushers software-defined thermal management

Nvidia introduces AI CDU (Coolant Distribution Unit), signaling a software-defined approach to thermal management in AI data centers. This development could optimize the energy efficiency and performance of on-premise inference and training systems.

#Hardware #LLM On-Premise #Fine-Tuning

2026-02-02 • Tom's Hardware

Ryzen 7 9850X3D: Factory Overclock of the 9800X3D?

Binning data from 13 Ryzen 7 9850X3D samples suggests the CPU is essentially a 9800X3D with higher voltages to achieve higher clock speeds. The 9850X3D's single-core performance advantage appears to stem primarily from this factory overclock.

#LLM On-Premise #DevOps

2026-02-02 • DigiTimes

Computex: Huang heralds a new phase in the AI race

NVIDIA CEO Jensen Huang is setting the stage for Computex, signaling an intensification of competition in the artificial intelligence sector. The event is expected to shed light on the latest hardware and software innovations powering the next wave o...

#Hardware #LLM On-Premise #DevOps

2026-02-02 • Tom's Hardware

Jensen Huang warns TSMC needs to 'work very hard' to meet AI demand

Nvidia CEO Jensen Huang says TSMC needs to work very hard to expand capacity in order to keep up with AI demand. Huang says Nvidia's demand alone may force TSMC to double its capacity over the next decade.

#Hardware #LLM On-Premise #DevOps

2026-02-02 • DigiTimes

Nvidia GB200 fuels chassis sector pivot to liquid cooling, rack integration

The introduction of the Nvidia GB200 GPU is accelerating the adoption of liquid cooling systems and rack-level integration in the chassis sector. This transition is driven by the need to manage the increased power density and thermal requirements of ...

#Hardware #LLM On-Premise #DevOps

2026-02-02 • DigiTimes

Semco's Tianjin MLCC plant capacity runs flat out on EV, AI server expansion

Semco's Tianjin MLCC plant is operating at full capacity to meet the increasing demand for components used in AI servers and electric vehicles. The expansion is driven by the need for high-performance multilayer ceramic capacitors (MLCC) for advanced...

2026-02-02 • DigiTimes

Taiwan PCB makers vie for AI server market with new 2026 capacity

Taiwanese printed circuit board (PCB) manufacturers are investing in new production capacity, expected by 2026, to meet the growing demand for AI servers. This strategic move aims to position Taiwanese companies as key suppliers in a rapidly expandin...

#LLM On-Premise #DevOps

2026-02-02 • DigiTimes

Nvidia: Huang expects TSMC capacity to double, reaffirms OpenAI investment

Nvidia CEO Jensen Huang anticipates TSMC's capacity to double by 2026. He also highlighted memory supply challenges and reaffirmed Nvidia's investment in OpenAI. Huang's Taiwan visit underscores the region's strategic importance for the company.

#Hardware #LLM On-Premise #DevOps

2026-02-02 • DigiTimes

Nvidia speeds up silicon photonics momentum in 2026, optical supply chain mobilizes

Nvidia is targeting mass production of silicon photonics solutions by 2026. This development could significantly impact the optical supply chain, paving the way for faster and more efficient interconnects for future high-performance computing workloa...

#Hardware #LLM On-Premise #DevOps

2026-02-02 • DigiTimes

Taiwan's cooling and power solutions rise to serve soaring AI chips demand

Taiwan's cooling and power solutions industry is responding to the increasing demand for AI chips. The ability to manage increased power consumption and heat dissipation is crucial for the efficient operation of AI systems.

#Hardware #LLM On-Premise #DevOps

2026-02-01 • Phoronix

GNOME Resources 1.10 Adds Monitoring Support For AMD Ryzen AI NPUs

GNOME Resources 1.10, the newest version of this system monitoring app, introduces monitoring support for AMD Ryzen AI NPUs. This application is now used by default on distributions like the upcoming Ubuntu 26.04 LTS. The update also includes other u...

#Hardware

2026-02-01 • LocalLLaMA

OLMO 3.5: Hybrid Model for Efficient LLM Inference Coming Soon

AI2's OLMO 3.5 model combines standard transformer attention with linear attention using Gated DeltaNet. This hybrid approach aims to improve efficiency and reduce memory usage while maintaining model quality. The OLMO series is fully open source, fr...

#Fine-Tuning

2026-02-01 • DigiTimes

CSPs ramp up AI capex as supply chain gains confidence

Cloud service providers (CSPs) are increasing investments in AI infrastructure, thanks to a more stable supply chain. This increase in CapEx is an indicator of the growing demand for computational resources for artificial intelligence and machine lea...

#Hardware #LLM On-Premise #DevOps

2026-01-31 • LocalLLaMA

M4 Max (128 GB) vs Ryzen AI Max+ (128 GB) for LLM Inference

A user is evaluating which device is best suited for large language model (LLM) inference in a production environment, considering speed and fine-tuning capabilities. The comparison is between an M4 Max-based Mac Studio and a GMKtec EVO-X2 AI Mini PC...

#LLM On-Premise #Fine-Tuning #DevOps

2026-01-31 • DigiTimes

Nvidia CEO calls ASIC rivalry "illogical" as R&D spending heads toward $45 billion

Nvidia's CEO downplays ASIC competition, calling it "illogical" given the company's massive R&D investments, nearing $45 billion. Nvidia continues to focus on GPUs for accelerating complex workloads.

#Hardware #LLM On-Premise #DevOps

2026-01-31 • DigiTimes

Nvidia CEO tells TSMC to "work harder" as 2026 demand surges

Nvidia's CEO urges TSMC to increase production capacity to meet the growing demand expected by 2026. The company plans to double its capacity over the next 10 years, signaling a strong expansion in the artificial intelligence and high-performance com...

#Hardware #LLM On-Premise #DevOps
