Topic / Trend Rising

AI Hardware Race Intensifies

The demand for AI hardware is surging, leading to increased competition among chip manufacturers and cloud service providers exploring custom silicio solutions. Companies like Nvidia, Intel, AMD, and even cloud providers are vying for dominance in the AI hardware market.

Detected: 2026-02-07 · Updated: 2026-02-07

Related Coverage

2026-02-07 TechCrunch AI

Benchmark raises $225M in special funds to double down on Cerebras

Venture capital firm Benchmark Capital has announced a $225 million investment in Cerebras Systems, a manufacturer of processors dedicated to artificial intelligence. Benchmark has been an investor in Cerebras since 2016, supporting the development o...

#Hardware #LLM On-Premise #Fine-Tuning
2026-02-07 DigiTimes

AI demand spillover lifts 2026 general-purpose server shipments 10%

The increasing demand for artificial intelligence applications is having a significant impact on the server market. General-purpose server shipments are projected to increase by 10% by 2026, driven by the need for more powerful computing infrastructu...

#LLM On-Premise #Fine-Tuning #DevOps
2026-02-06 The Register AI

Record Investments: Big Tech to Spend $635 Billion on AI Infrastructure

Amazon, Google, Meta, and Microsoft are projected to collectively invest approximately $635 billion in infrastructure, with a significant portion allocated to datacenters and AI infrastructure. This figure surpasses Israel's GDP and the entire global...

#LLM On-Premise #DevOps
2026-02-06 Phoronix

Pushing The Intel Panther Lake CPU Performance Further On Linux

New Linux benchmarks examine the performance of Intel's Panther Lake Core Ultra X7 358H CPU with a higher power budget. The tests reveal significant generational improvements, particularly in energy efficiency, and confirm the excellent performance o...

#Hardware #LLM On-Premise #DevOps
2026-02-06 Phoronix

AMD Prepares the Ground for RDNA 4 GPUs with GFX1170 Target

AMD continues the development of its LLVM compiler stack for future GPUs. A new target, GFX1170, also identified as RDNA 4m, has been introduced. This update adds to the ongoing work on GFX1250 and GFX13 targets, expanding support for AMD's upcoming ...

#Hardware
2026-02-06 DigiTimes

TSMC’s 3nm bet in Japan signals a deeper Taiwan-Japan tech pact

TSMC's investment in 3nm technology in Japan signals a strengthening of technological collaboration between Taiwan and Japan. This strategic move could have significant implications for the global semiconductor supply chain and international technolo...

2026-02-06 DigiTimes

MetaOptics drives heat-resistant metalenses into CPUs

MetaOptics, headquartered in Singapore and maintaining close ties with Taiwan, is developing heat-resistant metalenses for integration into CPUs. This technology could significantly improve the thermal management of processors.

2026-02-06 DigiTimes

CSPs turn to custom silicio to break Nvidia dependence

Cloud service providers (CSPs) are exploring custom silicio solutions to diversify their hardware options and reduce dependence on traditional vendors like Nvidia. This trend could lead to new architectures optimized for specific workloads.

#Hardware #LLM On-Premise #DevOps
2026-02-06 DigiTimes

Wistron posts strongest January on AI server growth

Taiwanese manufacturer Wistron reported an exceptionally positive January, driven by strong demand for servers dedicated to artificial intelligence. This highlights the growing market interest in specialized hardware solutions for AI workloads.

#Hardware #LLM On-Premise #Fine-Tuning
2026-02-06 DigiTimes

Cerebras raises US$1 billion, valuation nearly triples in 6 months

Cerebras Systems has announced a funding round that nearly triples its valuation in just six months. The company focuses on developing specialized hardware for artificial intelligence workloads, particularly for training large models.

#Hardware #LLM On-Premise #Fine-Tuning
2026-02-05 LocalLLaMA

gWorld: 8B model beats 402B Llama 4 by generating web code

Trillion Labs and KAIST AI introduced gWorld, an open-weight visual world model for mobile GUIs. gWorld, available in 8B and 32B versions, generates executable web code instead of pixels, surpassing larger models like Llama 4 in accuracy. This approa...

#LLM On-Premise #Fine-Tuning #DevOps
2026-02-05 Tom's Hardware

Tenstorrent reduces Tensor Cores on Blackhole p150 via Firmware Update

Tenstorrent announced a reduction in the number of Tensor cores on its Blackhole p150 cards, from 140 to 120, via a firmware update. The company anticipates a 1-2% performance drop for existing users. New cards will ship with 120 Tensor cores.

#Hardware #LLM On-Premise #DevOps
2026-02-05 Phoronix

Intel Arc B390 Graphics Performance On Linux With Panther Lake

First Linux benchmarks of the Intel Arc B390 GPU, integrated in high-end Panther Lake models. The Xe3 graphics card, equipped with 12 Xe cores, promises interesting performance in desktop and mobile environments for graphics and compute workloads.

#Hardware #LLM On-Premise #DevOps
2026-02-05 DigiTimes

Qualcomm reports record results, flags memory constraints

Qualcomm reported record financial results for Q1FY26. However, the company anticipates potential limitations related to memory availability in the near term, a factor that could impact deliveries and the ability to meet demand.

#LLM On-Premise #DevOps
2026-02-05 DigiTimes

Jensen Huang: AI factories will power a trillion-dollar reindustrialization

According to Jensen Huang, CEO of NVIDIA, AI factories are the engine of a new wave of reindustrialization. These specialized infrastructures will be fundamental for the development and deployment of advanced AI solutions in various industrial sector...

#Hardware #LLM On-Premise #DevOps
2026-02-04 TechCrunch AI

Positron challenges Nvidia with AI chips: $230M Series B round

Positron has raised $230 million in a Series B funding round, with participation from the Qatar Investment Authority. The company aims to compete with Nvidia in the artificial intelligence chip market, amid growing demand and with Qatar aiming to dev...

#Hardware
2026-02-04 DigiTimes

Intel CEO unveils plans to enter GPU market dominated by Nvidia

Intel's CEO has announced plans to enter the GPU market, currently dominated by Nvidia. This strategic move could bring new dynamics to the hardware acceleration sector for artificial intelligence and graphics workloads.

#Hardware #LLM On-Premise #DevOps
2026-02-04 DigiTimes

AI drives ODM/EMS growth despite weak consumer electronics in 2025

The ODM/EMS sector anticipates growth in 2025, primarily driven by the demand for AI-based solutions, offsetting the slowdown in the consumer electronics market. This trend highlights the increasing importance of AI as an engine for innovation and ec...

#Hardware #LLM On-Premise #DevOps
2026-02-04 DigiTimes

AMD prioritizes supply chain for second-half AI ramp

AMD is focusing its efforts on optimizing its supply chain to support the increasing demand for AI solutions in the second half of the year. This strategic move aims to ensure the availability of necessary components for the production and distributi...

#Hardware #LLM On-Premise #DevOps
2026-02-04 DigiTimes

AMD: Financial Results Meet Expectations, AI Market Awaits More

AMD reported solid financial results, but the AI market's expectations, particularly regarding dedicated solutions, remain partially unmet. Investors are awaiting more concrete signs of AMD's ability to compete in the rapidly expanding AI sector.

#Hardware #LLM On-Premise #DevOps
2026-02-04 DigiTimes

Supermicro’s AI boom comes with a risk: one customer, 63% of revenue

Supermicro's growth in the artificial intelligence sector is remarkable, but the company is heavily reliant on a single customer, who generates 63% of its revenue. This concentration represents a significant risk to future financial stability.

#LLM On-Premise #DevOps
2026-02-03 TechCrunch AI

Intel to enter the Nvidia-dominated GPU market

Intel is ramping up efforts to compete in the GPU market, currently dominated by Nvidia. The company is building a dedicated team and will develop a GPU strategy focused on customer needs. This marks a significant evolution in the graphics processor ...

#Hardware #LLM On-Premise #DevOps
2026-02-03 Tom's Hardware

Intel is co-developing new Z-Angle Memory for AI data centers

Intel and SoftBank subsidiary, Saimemory, are collaborating to develop Z-Angle Memory (ZAM), a vertical-stacked memory for AI data centers. ZAM promises 2 to 3x more capacity, greater bandwidth, and half the power consumption compared to current solu...

#Hardware #LLM On-Premise #DevOps
2026-02-03 LocalLLaMA

Intel Xeon 600 Workstation CPUs Launched: Up To 86 Cores

Intel has launched the new Xeon 600 series processors for workstations, offering up to 86 cores. These processors support memory up to 8000 MT/s, 128 PCIe Gen5 lanes, and a TDP of 350W with overclocking support. They are positioned as an alternative ...

#Hardware #LLM On-Premise #DevOps
2026-02-03 Tom's Hardware

Photonics and high-speed data movement is the next big AI bottleneck

Generative AI is pushing demand across the industry. Data interconnects, such as Silicio Photonics, may well be the next big bottleneck that hyperscalers need to be paying attention to. Following copper, power, DRAM, and NAND, data movement speed bec...

#LLM On-Premise #DevOps
2026-02-03 DigiTimes

China's AI chip swarm hits mass scale, challenging Nvidia

China is scaling up its AI chip production, eroding Nvidia's dominant position in the Chinese market. This push for technological self-sufficiency could reshape the AI hardware landscape, with significant implications for companies operating in the s...

#Hardware #LLM On-Premise #DevOps
2026-02-03 DigiTimes

King Slide and Nan Juen listed in Nvidia server rail supply chain

King Slide and Nan Juen are listed in Nvidia's server rail supply chain. Fositek is positioning itself as a promising contender in this growing market. Competition for supplying high-performance server components is increasing.

#Hardware #LLM On-Premise #DevOps
2026-02-02 Tom's Hardware

Ryzen 7 9850X3D: Factory Overclock of the 9800X3D?

Binning data from 13 Ryzen 7 9850X3D samples suggests the CPU is essentially a 9800X3D with higher voltages to achieve higher clock speeds. The single-core performance of the 9850X3D appears to primarily stem from this factory overclock.

#LLM On-Premise #DevOps
2026-02-02 DigiTimes

Computex: Huang heralds a new phase in the AI race

NVIDIA CEO Jensen Huang is setting the stage for Computex, signaling an intensification of competition in the artificial intelligence sector. The event is expected to shed light on the latest hardware and software innovations powering the next wave o...

#Hardware #LLM On-Premise #DevOps
2026-02-02 DigiTimes

Nvidia GB200 fuels chassis sector pivot to liquid cooling, rack integration

The introduction of the Nvidia GB200 GPU is accelerating the adoption of liquid cooling systems and rack-level integration in the chassis sector. This transition is driven by the need to manage the increased power density and thermal requirements of ...

#Hardware #LLM On-Premise #DevOps
2026-02-02 DigiTimes

Taiwan PCB makers vie for AI server market with new 2026 capacity

Taiwanese printed circuit board (PCB) manufacturers are investing in new production capacity, expected by 2026, to meet the growing demand for AI servers. This strategic move aims to position Taiwanese companies as key suppliers in a rapidly expandin...

#LLM On-Premise #DevOps
2026-02-02 DigiTimes

Nvidia: Huang expects TSMC capacity to double, reaffirms OpenAI investment

Nvidia CEO Jensen Huang anticipates TSMC's capacity to double by 2026. He also highlighted memory supply challenges and reaffirmed Nvidia's investment in OpenAI. Huang's Taiwan visit underscores the region's strategic importance for the company.

#Hardware #LLM On-Premise #DevOps
2026-02-01 Phoronix

GNOME Resources 1.10 Adds Monitoring Support For AMD Ryzen AI NPUs

GNOME Resources 1.10, the newest version of this system monitoring app, introduces monitoring support for AMD Ryzen AI NPUs. This application is now used by default on distributions like the upcoming Ubuntu 26.04 LTS. The update also includes other u...

#Hardware
2026-02-01 LocalLLaMA

OLMO 3.5: Hybrid Model for Efficient LLM Inference Coming Soon

AI2's OLMO 3.5 model combines standard transformer attention with linear attention using Gated Deltanet. This hybrid approach aims to improve efficiency and reduce memory usage while maintaining model quality. The OLMO series is fully open source, fr...

#Fine-Tuning
2026-02-01 DigiTimes

CSPs ramp up AI capex as supply chain gains confidence

Cloud service providers (CSPs) are increasing investments in AI infrastructure, thanks to a more stable supply chain. This increase in CapEx is an indicator of the growing demand for computational resources for artificial intelligence and machine lea...

#Hardware #LLM On-Premise #DevOps
2026-01-31 LocalLLaMA

M4 Max (128 GB) vs Ryzen AI Max+ (128 GB) for LLM Inference

A user is evaluating which device is best suited for large language model (LLM) inference in a production environment, considering speed and fine-tuning capabilities. The comparison is between an M4 Max-based Mac Studio and a GMKtec EVO-X2 AI Mini PC...

#LLM On-Premise #Fine-Tuning #DevOps
2026-01-31 DigiTimes

Nvidia CEO tells TSMC to "work harder" as 2026 demand surges

Nvidia's CEO urges TSMC to increase production capacity to meet the growing demand expected by 2026. The company plans to double its capacity over the next 10 years, signaling a strong expansion in the artificial intelligence and high-performance com...

#Hardware #LLM On-Premise #DevOps
← Back to All Topics