Topic / Trend Rising

Cloud Infrastructure for AI

The demand for AI is driving significant investments in cloud infrastructure, with major players like Amazon, Google, and Microsoft expanding their data center capacity and developing custom silicon. This trend is also producing new architectures and power management solutions optimized for AI workloads.

Detected: 2026-02-09 · Updated: 2026-02-09

Related Coverage

2026-02-08 LocalLLaMA

Strix Halo Distributed Cluster: LLM Inference with RDMA RoCE v2

A two-node cluster based on AMD Strix Halo, interconnected via Intel E810 (RoCE v2), has been built for distributed LLM inference using Tensor Parallelism. Benchmarks and setup guide are available online, opening new possibilities for local model exe...

#Hardware #LLM On-Premise #DevOps
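The core idea behind the cluster above, Tensor Parallelism, can be illustrated with a toy NumPy sketch (this is an illustration of the general technique, not the actual Strix Halo setup): a linear layer's weight matrix is split column-wise across workers, each worker computes its shard of the output, and the shards are gathered back together, which is the step a real cluster performs over the RDMA interconnect.

```python
import numpy as np

# Toy tensor parallelism: split a linear layer's weight matrix
# column-wise across two "nodes"; each node computes a partial
# output, and the partials are concatenated (an all-gather over
# the interconnect in a real deployment).
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))    # batch of activations
W = rng.standard_normal((8, 16))   # full weight matrix

W0, W1 = np.split(W, 2, axis=1)    # shard weights across 2 workers

y0 = x @ W0                        # computed on node 0
y1 = x @ W1                        # computed on node 1

y = np.concatenate([y0, y1], axis=1)  # "all-gather" of the shards

# The sharded result matches the single-node computation.
assert np.allclose(y, x @ W)
```

Bandwidth and latency of that gather step are exactly why the build uses RoCE v2 rather than plain TCP networking.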
2026-02-07 LocalLLaMA

Comprehensive Grafana Monitoring for On-Premise LLM Server

A user has implemented a comprehensive monitoring system for their home LLM server, using Grafana, Prometheus, and DCGM to track metrics such as GPU utilization, power consumption, and token processing rates. The solution is containerized with Docker...

#Hardware #LLM On-Premise #DevOps
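A stack like the one described (Grafana + Prometheus + DCGM, containerized) is commonly wired together with Docker Compose. The sketch below is a minimal, hypothetical example of that pattern, not the user's actual configuration; image names and ports are the defaults published by each project (DCGM exporter serves metrics on 9400, Prometheus on 9090, Grafana on 3000).

```yaml
# Minimal sketch: GPU metrics -> Prometheus -> Grafana.
# Assumes an NVIDIA GPU host with the NVIDIA container toolkit installed.
services:
  dcgm-exporter:
    image: nvcr.io/nvidia/k8s/dcgm-exporter:latest
    gpus: all
    ports:
      - "9400:9400"   # exposes GPU utilization, power, memory, etc.

  prometheus:
    image: prom/prometheus:latest
    ports:
      - "9090:9090"
    volumes:
      - ./prometheus.yml:/etc/prometheus/prometheus.yml

  grafana:
    image: grafana/grafana:latest
    ports:
      - "3000:3000"   # dashboards built on the Prometheus data source
```

Prometheus would then need a scrape job pointing at `dcgm-exporter:9400`; token-rate metrics would come from a separate exporter on the inference server itself.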
2026-02-06 LocalLLaMA

GLM-5 Is Being Tested On OpenRouter

The GLM-5 language model is currently being tested on the OpenRouter platform. This news, originating from a Reddit discussion, indicates a potential expansion of the models available to OpenRouter users, opening new possibilities for artificial inte...

#LLM On-Premise #DevOps
2026-02-06 The Register AI

Record Investments: Big Tech to Spend $635 Billion on AI Infrastructure

Amazon, Google, Meta, and Microsoft are projected to collectively invest approximately $635 billion, with a significant portion allocated to datacenters and AI infrastructure. This figure surpasses Israel's GDP and the entire global...

#LLM On-Premise #DevOps
2026-02-06 DigiTimes

CSPs turn to custom silicon to break Nvidia dependence

Cloud service providers (CSPs) are exploring custom silicon to diversify their hardware options and reduce dependence on established vendors like Nvidia. This trend could lead to new architectures optimized for specific workloads.

#Hardware #LLM On-Premise #DevOps
2026-02-05 TechCrunch AI

AWS revenue soars as AI demand drives growth

Amazon Web Services (AWS) posted its strongest quarter in 13 quarters in Q4 2025. Strong demand for artificial intelligence services contributed significantly to this result, driving adoption of Amazon's cloud platform.

#LLM On-Premise #DevOps
2026-02-05 LocalLLaMA

Hugging Face: Down but online?

Reports of access issues to the Hugging Face platform have surfaced online. Some users report being unable to access the platform, while others claim that core services remain operational. The cause and extent of the problem are not yet clear.

#LLM On-Premise #Fine-Tuning #DevOps
2026-02-04 TechCrunch AI

Positron challenges Nvidia with AI chips: $230M Series B round

Positron has raised $230 million in a Series B funding round, with participation from the Qatar Investment Authority. The company aims to compete with Nvidia in the artificial intelligence chip market, amid growing demand and with Qatar aiming to dev...

#Hardware
2026-02-03 Tom's Hardware

Intel is co-developing new Z-Angle Memory for AI data centers

Intel and Saimemory, a SoftBank subsidiary, are collaborating to develop Z-Angle Memory (ZAM), a vertically stacked memory for AI data centers. ZAM promises 2 to 3x more capacity, greater bandwidth, and half the power consumption compared to current solu...

#Hardware #LLM On-Premise #DevOps
2026-02-02 DigiTimes

Nvidia GB200 fuels chassis sector pivot to liquid cooling, rack integration

The introduction of the Nvidia GB200 is accelerating the adoption of liquid cooling systems and rack-level integration in the chassis sector. This transition is driven by the need to manage the increased power density and thermal requirements of ...

#Hardware #LLM On-Premise #DevOps