AI Hardware, Infrastructure & Energy Challenges

The surging demand for AI is driving massive investments in specialized hardware, data centers, and advanced cooling solutions. This expansion faces significant challenges related to energy consumption and public opposition.

Detected: 2026-05-19 · Updated: 2026-05-19

Related Coverage

2026-05-19 DigiTimes

XPeng Unveils Mass-Produced Robotaxi Featuring In-House AI Chips

XPeng has introduced a mass-produced Robotaxi, integrating AI chips developed in-house. This move highlights the growing trend among automotive manufacturers to invest in proprietary silicon for artificial intelligence, aiming to optimize performance...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-19 ArXiv cs.LG

Apple M3 Ultra: Diffusion Model Optimization Rewrites the Rules

A comprehensive study explores real-time diffusion model optimization on the Apple M3 Ultra, which features a 60-core GPU and 512 GB of unified memory. Researchers achieved 22.7 FPS for 512x512 img2img transformation by combining CoreML conversion and the...

#Hardware #LLM On-Premise #DevOps
2026-05-19 DigiTimes

China Aims for National Compute Network to Support AI Growth

The rapid expansion of artificial intelligence in China is fueling a push for the creation of a national compute network. This strategic initiative aims to ensure data sovereignty and provide the necessary infrastructure resources for training and in...

#Hardware #LLM On-Premise #DevOps
2026-05-18 DigiTimes

AI Data Centers: SanDisk on Cost, HDDs Resist SSDs

SanDisk has stated that AI-dedicated data centers still lack a compelling economic case to fully replace hard disk drives (HDDs) with solid-state drives (SSDs). The statement underscores the challenges related to Total Cost of Owners...

#LLM On-Premise #Fine-Tuning #DevOps
2026-05-18 DigiTimes

Nvidia Rubin Platform to Drive LPDDR Demand Past Apple and Samsung by 2027

Nvidia's upcoming Rubin platform is projected to push its LPDDR memory demand past the combined demand of giants like Apple and Samsung by 2027. This forecast, based on market analysis, highlights the accelerating demand for ...

#Hardware #LLM On-Premise #DevOps
2026-05-18 Phoronix

Intel Xe and Crescent Island: New Clues for Multiple Accelerators on Linux

Recent Intel Xe graphics driver patches for Linux reveal the existence of multiple PCI IDs associated with the upcoming "Crescent Island" (CRI) accelerators. This discovery suggests a diversified offering of models, with implications for on-premise d...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-18 The Next Web

Meta Cuts 8,000 Jobs Amid $145 Billion AI Infrastructure Bet

Meta is set to cut approximately 8,000 jobs starting May 20, marking the largest layoff round since 2023, and will also cancel 6,000 open positions. This strategic move reflects a massive $145 billion investment in artificial intelligence infrastruct...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-18 PyTorch Blog

ExecuTorch and MLX: GPU Acceleration for PyTorch Models on Apple Silicon

The new ExecuTorch MLX delegate enables optimized, GPU-accelerated inference for PyTorch models on Apple Silicon Macs, leveraging Apple's MLX framework. This integration delivers 3-6x higher throughput compared to previous solutions on macOS, support...

#Hardware #LLM On-Premise #DevOps
2026-05-18 The Next Web

Google's TPU Demand Outstrips Supply, Even for Internal Researchers

Google has built a top-tier AI infrastructure, relying on its custom TPU chips and a robust cloud business. The success of collaborations with external partners like Anthropic and Meta has generated such high demand for compute capacity that even Goo...

#Hardware #LLM On-Premise #DevOps
2026-05-18 Tom's Hardware

Samsung: Disparate Bonuses and the Chip Sector Talent Crisis

Internal Samsung transcripts reveal significantly disparate bonuses between memory staff (up to 607%) and logic chip staff (as low as 50%). This disparity, according to unions, is creating a talent retention crisis the company cannot afford, with pot...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-18 Phoronix

AMD Lemonade SDK: macOS Reaches General Availability with ROCm 7.13

AMD has announced that its Lemonade SDK for local artificial intelligence is now in General Availability for macOS. The open-source project, largely developed by AMD engineers, integrates ROCm 7.13 and aims to optimize Large Language Model execution ...

#Hardware #LLM On-Premise #DevOps
2026-05-18 DigiTimes

Rising MLCC Demand for AI Servers Highlights Supply Chain Pressures

Prosperity Dielectrics observes intense customer demand for MLCCs in AI server power applications, signaling pressure on the critical component supply chain. This trend underscores the expansion of AI infrastructure and potential implications for the...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-18 DigiTimes

Pan Jit: AI Revenue Growth and Supply Chain Challenges

Pan Jit's AI-related revenue has reached 11% of its total, with order lead times extending to six months. This scenario highlights strong demand for AI infrastructure and growing supply chain challenges, with significant implications for companies pl...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-18 Tech.eu

Greenpixie Raises £4.7M to Optimize AI and Cloud Energy Efficiency

UK startup Greenpixie has secured a £4.7 million pre-Series A funding round to help large enterprises reduce energy waste associated with AI and cloud infrastructure. Its proprietary technology aims to provide "sustainability intelligence," enabling ...

#Hardware #LLM On-Premise #DevOps
2026-05-18 DigiTimes

Lotes: Server and AI Connectors Drive Record Revenue, Targeting Market Share

Lotes has achieved record revenues, driven by the increasing demand for connectors in server and AI applications. The company is maintaining a competitive pricing strategy to expand its market share in a critical infrastructure segment for Large Lang...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-17 DigiTimes

Advanced Substrates: Nan Ya PCB Ramps Up Production for AI Chips

Nan Ya PCB, a key player in printed circuit board manufacturing, is increasing its production capacity. This move responds to the growing demand for advanced substrates, essential for next-generation AI chips. The expansion highlights the pressure on...

#Hardware #LLM On-Premise #DevOps
2026-05-17 Phoronix

Canonical Releases Ubuntu 'Concept' ISOs for CIX P1 AI CPU

Canonical has begun releasing "Concept" Ubuntu ISOs specifically optimized for the CIX P1 AI CPU, a platform dedicated to artificial intelligence. These distributions aim to provide cutting-edge hardware support not yet integrated into the mainline L...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-17 The Next Web

IT Infrastructure as a Pillar for Business Performance and AI

Every strong business rests on robust foundations. IT infrastructure, particularly that dedicated to Large Language Model (LLM) workloads, proves crucial for sustaining growth, ensuring operational efficiency, and maximizing productivity. For organiz...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-17 The Next Web

Cerebras Debuts on Nasdaq with a $5.55 Billion IPO, Largest Since 2020

Cerebras Systems concluded its first day of trading on Nasdaq with a market capitalization of approximately $95 billion, raising $5.55 billion. This marks the largest US tech IPO since 2020, highlighting the growing market interest in AI hardware com...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-17 Tom's Hardware

LineShine: China's 1.54-Exaflop Supercomputer with 2.4 Million Armv9 Cores

China has unveiled LineShine, a 1.54-exaflop supercomputer based exclusively on CPUs, equipped with 2.4 million Huawei-designed Armv9 cores. This CPU-only architecture represents a strategic response to US GPU restrictions, highlighting an alternativ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-16 DigiTimes

Taiwan Chipmakers Quietly Fill Gaps Left by Korea's HBM Push

Taiwanese chipmakers such as Nanya are stepping up memory production. This move aims to fill supply gaps left by Korean manufacturers' stronger focus on High Bandwidth Memory (HBM), ensuring a crucial supply for next-genera...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-16 Tom's Hardware

Fiber Optic Demand for AI Data Centers Explodes: One-Year Delivery Delays

AI-dedicated data centers demand 36 times more fiber optic cabling than standard server configurations. This surge in demand, coupled with a severe glass shortage, is causing cable delivery lead times to stretch up to a full year. This presents a sig...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-16 Tom's Hardware

RTX 5090 and MacBook: The Potential of eGPUs for Intensive Workloads

A recent test demonstrated the capability of an RTX 5090 GPU, connected via an eGPU dock to an M-series MacBook, to handle extremely intensive graphical workloads. The experiment, which saw the system run Cyberpunk 2077 at over 100 FPS with max setti...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-16 DigiTimes

Rising AI Server Demand Fuels Growth in Infrastructure Component Market

The surge in demand for artificial intelligence servers is generating significant revenue growth for manufacturers of infrastructure components, such as server rack rail kits. This trend highlights an acceleration in physical infrastructure investmen...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 TechCrunch AI

The AI Energy Wave: Lake Tahoe and Rising Costs

The escalating energy demand driven by artificial intelligence is beginning to show up as significant price increases, as highlighted by the situation in Lake Tahoe. This destination, popular with Silicon Valley residents, is bracing for higher electricity prices, a...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 Tech.eu

AI Investments and Infrastructure: Europe's Evolving Tech Landscape

The European tech sector saw over €1.4 billion in funding this week, with a growing emphasis on artificial intelligence and infrastructure. Major investment rounds for Nscale and Recursive Superintelligence highlight the push towards AI compute capab...

#LLM On-Premise #DevOps
2026-05-15 DigiTimes

Agentic AI Accelerates Server Market: Nearly 20 Million Units by 2026

The global server market is poised for significant growth, with shipments projected to approach 20 million units by 2026. This expansion is driven by the increasing adoption of Agentic AI, which demands robust and dedicated infrastructure. DIGITIMES'...

#Hardware #LLM On-Premise #DevOps
2026-05-15 Phoronix

Vulkan 1.4.352: NVIDIA Introduces Cooperative Matrix Support, AI Impact

The latest revision of the Vulkan specification, version 1.4.352, includes an important proprietary NVIDIA extension: VK_NV_cooperative_matrix_decode_vector. This new feature aims to optimize matrix operations, which are fundamental for artificial in...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 Tom's Hardware

xAI: Colossus 1 Reallocated for Inference, Colossus 2 to Focus on Blackwell

xAI's Colossus 1 supercomputer, initially intended for Grok's training, has been reallocated for inference workloads by Anthropic due to its inefficient mixed-architecture design. Meanwhile, Elon Musk is preparing Colossus 2, a new infrastructure bas...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 The Next Web

STT Global Data Centres India Prepares for $500M Mumbai IPO

STT Global Data Centres India, a data center operator with Singaporean control and a minority Tata stake, is preparing to launch an Initial Public Offering (IPO) in Mumbai. The operation aims to raise up to $500 million, positioning itself as one of ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 The Next Web

Iceotope Secures $26M: Liquid Cooling Becomes Crucial for AI

Iceotope, a British company specializing in precision liquid cooling, has closed a $26 million Series B funding round. The investment, led by Barclays Climate Ventures and Two Seas Capital, aims to expand the company's product line and patent portfol...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 DigiTimes

Energy Crisis and RE100: Taiwan Risks Global Tech Orders

Ping Cheng, Chairman of Delta Electronics, has warned about potential delays in Taiwan's adherence to the RE100 initiative. A shortage of green power could jeopardize the island's ability to meet sustainability commitments, risking its crucial positi...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 DigiTimes

Nvidia Vera Rubin: Issues Reportedly Cleared, Production Ramp for 3Q26

Nvidia has reportedly resolved issues concerning its upcoming Vera Rubin platform, with the supply chain aiming for a production ramp-up in the third quarter of 2026. This timeline is crucial for enterprises planning on-premise AI infrastructures, im...

#Hardware #LLM On-Premise #DevOps
2026-05-15 DigiTimes

AAEON: AI Wave Fuels Growth and Orders, Strategy to 2026

AAEON, a hardware solutions provider, is experiencing a significant increase in orders, driven by growing demand in the artificial intelligence sector. This trend is part of a strategic growth plan the company has outlined through 2026. The expansion r...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 DigiTimes

Phison aiDAPTIV and Dimensity 9500: Boosting AI at the Edge

Phison has introduced aiDAPTIV, a solution designed to accelerate the deployment of AI workloads directly at the edge. Its integration with MediaTek's Dimensity 9500 processor highlights a focus on optimizing performance and energy efficiency for art...

#Hardware #LLM On-Premise #DevOps
2026-05-15 DigiTimes

Nvidia H200 Sales to China Slow Despite US Approval

Despite approval from US authorities, sales of Nvidia H200 GPUs in China are facing significant slowdowns. This scenario emerges within a context of geopolitical tensions and trade restrictions that impact the availability of critical hardware for ar...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 DigiTimes

Auras: No Operational Impact from Nvidia Vera Rubin Changes, Revenue Jumps

Auras announced that modifications to the Nvidia Vera Rubin project will not affect its operations. The company reported a significant increase in revenue and profit, highlighting the resilience of supply chains in the AI hardware sector. Decisions o...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 DigiTimes

Pan-International's Strategic Shift Towards AI Servers and AFM Motors

Pan-International has announced a significant strategic reorientation, focusing on AI servers and AFM motors to generate over half of its revenue by 2030. This move highlights a clear direction towards high-growth sectors, with notable implications f...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 DigiTimes

Foxconn: AI Servers Drive 63% Operating Profit Jump, Offsetting Seasonal Dips

Foxconn reported a 63% increase in operating profit, a significant achievement highlighting the growing demand for AI-dedicated infrastructure. The strong expansion in the AI server segment enabled the company to offset seasonal downturns in other ar...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 DigiTimes

Nan Ya PCB Targets High-End IC Substrate Growth Amid AI Demand

Nan Ya PCB is increasing its production of high-end integrated circuit (IC) substrates, responding to the growing demand from the artificial intelligence market. This strategic move underscores the importance of advanced hardware components in suppor...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-15 DigiTimes

Indium Phosphide Semiconductors: New Horizons for AI Power and Bandwidth

Indium Phosphide (InP) compound semiconductors are emerging as a promising technology to overcome current power and bandwidth limitations in AI hardware. This innovation could redefine architectures for Large Language Model (LLM) inference and traini...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-14 LocalLLaMA

MLX and Quantization: Optimizing Nemotron-8B for Apple Silicon

A developer has converted the `nvidia/llama-embed-nemotron-8b` embedding model into various quantized versions (from `fp16` to `2-bit`) using Apple's MLX framework. This effort aims to optimize model execution on Apple Silicon hardware, eliminating t...

#Hardware #LLM On-Premise #DevOps
2026-05-14 Ars Technica AI

Lake Tahoe Energy Crisis: Data Centers Prioritized Over Residents

Lake Tahoe residents face an impending energy crisis, as supplier NV Energy will stop supplying power to the area by May 2027. This decision stems from the increasing power demand for new data centers in Nevada, projected to require 5,900 megawatts by 2033, highlighti...

#Hardware #LLM On-Premise #DevOps
2026-05-14 Phoronix

AMD: Progress in Linux Enablement for Next-Gen AIE4 NPU

AMD is making significant strides in integrating its next-generation AIE4 NPU platform into the Linux kernel via the AMDXDNA accelerator. The company's software engineers have been working on these crucial hardware support patches since March. While ...

#Hardware #LLM On-Premise #DevOps
2026-05-14 Tech.eu

Iceotope Raises $26M for AI Infrastructure Cooling

Iceotope Group, a leader in precision liquid cooling solutions, has closed a $26 million Series B funding round. The investment, led by Two Seas Capital and Barclays Climate Ventures, will support the development of critical technologies for AI infra...

#Hardware #LLM On-Premise #DevOps
2026-05-14 Tom's Hardware

Growing Opposition to Data Centers: 70% of Americans Reject Them Near Homes

The escalating demand for AI compute capacity is clashing with strong public opposition. In the United States, 70% of citizens oppose the construction of data centers near their homes, making them less popular than nuclear power plants. This phenomen...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-14 The Next Web

Samsung: Strike Looms, AI Memory Chips at Risk

Samsung Electronics' largest union is preparing an 18-day strike, threatening the supply of crucial AI memory chips. The wage dispute and bonus formula are at the heart of the conflict, which could have significant repercussions on the global AI hard...

#Hardware #LLM On-Premise #DevOps
2026-05-14 The Next Web

SK Hynix Nears Trillion-Dollar Valuation Driven by AI Memory Demand

SK Hynix is on the verge of reaching a trillion-dollar market capitalization, having grown ninefold in the past two years. This milestone, fueled by the surging demand for AI memory, would make South Korea the first country outside the United States ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-14 DigiTimes

TSMC Boosts AI Chip Production: CoWoS and SoIC Expansion

TSMC, the leading semiconductor manufacturer, is significantly increasing its production capacity for advanced packaging technologies, CoWoS and SoIC. This strategic move responds to the surging demand for AI accelerators, particularly for Large Lang...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-14 The Next Web

700°C Memristor: Tetramem's Breakthrough for AI in Extreme Environments

A startup is developing AI chips based on memristors capable of operating at extreme temperatures, up to 700 degrees Celsius. This innovation promises to extend artificial intelligence computing capabilities into contexts inaccessible to traditional ...

#Hardware #LLM On-Premise #DevOps
2026-05-14 DigiTimes

TSMC: AI Expansion Drives Demand for Advanced Packaging

During its recent symposium, TSMC highlighted the significant expansion of AI and the increasing demand for advanced packaging solutions. This trend underscores the critical importance of sophisticated integration technologies to support the computat...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-14 DigiTimes

Tower Semiconductor: $1.3 Billion for Silicon Photonics Amid AI Acceleration

Tower Semiconductor has secured $1.3 billion in commitments for silicon photonics, addressing the accelerating demand for advanced artificial intelligence solutions. This technology is crucial for enhancing interconnects and data transfer speeds in d...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-14 DigiTimes

AI Surge Fuels Patent Race in Server Cooling

The explosion of artificial intelligence is catalyzing an innovation race in server cooling. Taiwanese firms are emerging as global leaders in this competition to develop efficient solutions. This phenomenon underscores the growing importance of phys...

#Hardware #LLM On-Premise #DevOps
2026-05-14 DigiTimes

Pegatron: Q1 2026 Earnings Decline, AI PC Demand Fuels Recovery

Pegatron reported a significant decline in earnings for Q1 2026, attributed to an off-season period. However, the Taiwanese company anticipates a strong recovery in Q2, driven by accelerating demand for new "AI PCs." This trend highlights the growing...

#Hardware #LLM On-Premise #DevOps
2026-05-14 DigiTimes

ASMedia Reports Record Profit, Strategic Expansion into AI and Automotive

ASMedia has reported record profits, signaling a significant strategic expansion beyond the PC chip market. The company is now targeting the artificial intelligence and automotive sectors, diversifying its product portfolio and positioning itself in ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-14 DigiTimes

OpenAI and Cerebras: The Move Reshaping the AI Supply Chain

OpenAI is exploring new strategic partnerships, such as with Cerebras, to diversify its AI supply chain. This move highlights a growing industry trend towards seeking alternative hardware solutions to traditional GPU clusters, with significant implic...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-14 DigiTimes

Foxconn: AI Server Orders and Co-Packaged Optics in Focus

Foxconn is preparing for a key investor briefing, where clarifications are expected regarding AI server orders and the commercialization of co-packaged optics (CPO). The meeting will outline the manufacturing giant's strategy in the growing artificia...

#Hardware #LLM On-Premise #DevOps
2026-05-14 DigiTimes

Microloops Reports Record Profit as AI Server Cooling Drives Growth

Microloops has announced an unprecedented quarter of profits, a result attributed to the growing demand for cooling solutions for artificial intelligence servers. This success highlights the critical importance of physical infrastructure in supportin...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-13 TechCrunch AI

xAI and Gas Turbines: The Energy Challenges of AI Data Centers

xAI's Colossus 2 data center in Mississippi is at the center of a legal dispute over its use of nearly 50 "mobile" gas turbines as a power source. This case highlights the complex infrastructure challenges and massive energy requirements companies fa...

#Hardware #LLM On-Premise #DevOps
2026-05-13 Tech.eu

Fractile Raises $220 Million to Overcome AI Inference Bottleneck

UK startup Fractile has closed a $220 million Series B round to develop next-generation inference hardware. The company aims to resolve the growing bottleneck related to the time and cost of producing useful outputs at scale for Large Language Models...

#Hardware #LLM On-Premise #DevOps
2026-05-13 DigiTimes

Inventec Forecasts Strong AI and General-Purpose Server Demand Through 2028

Inventec, a key hardware supplier, anticipates robust and sustained demand for both artificial intelligence servers and general-purpose systems. This forecast extends through 2028, indicating continued growth in the IT infrastructure market. The tren...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-13 DigiTimes

Taiwan's Chinsan Secures AI Server Capacity in Thailand Through 2026

Chinsan, a Taiwanese capacitor manufacturer, has secured AI server production capacity in Thailand through 2026. This move highlights the increasing demand for essential hardware components for AI infrastructure and companies' strategies to secure thei...

#Hardware #LLM On-Premise #DevOps
2026-05-13 DigiTimes

AI Demand Lifts Zhen Ding: Server and Chip Substrate Sales Surge

Zhen Ding, a key electronics supplier, is experiencing significant growth in server and chip substrate sales. This increase is directly linked to the surging global demand for artificial intelligence solutions. The phenomenon highlights the growing n...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-13 DigiTimes

ASPEED: AI Server Demand Drives Growth and Strengthens BMC Market Outlook

ASPEED is experiencing sustained growth, propelled by the increasing demand for artificial intelligence servers. This scenario reinforces the market outlook for Baseboard Management Controllers (BMCs), critical components for managing and monitoring ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-13 DigiTimes

China's Fiber Optic Giant Unveils World's Largest Preform for AI Data Centers

Fiberhome Telecommunication Technologies, a Chinese fiber optic leader, has announced the production of the world's largest optical preform. This innovation is strategic for supporting the increasing demand for high-capacity infrastructure required b...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-13 DigiTimes

Samsung Foundry's Resurgence: AI Chips and HBM4 Drive 4nm Demand

Samsung Foundry is experiencing a significant resurgence, driven by the increasing demand for artificial intelligence chips. The adoption of HBM4 technology and advancements in 4-nanometer manufacturing processes are key factors redefining its positi...

#Hardware #LLM On-Premise #DevOps
2026-05-13 DigiTimes

Chinese CPU Vendors Capitalize on AI Inference Demand

The AI inference market is witnessing a significant evolution, with Chinese CPU vendors emerging as key players. Growing demand for artificial intelligence workloads, coupled with supply challenges from giants like Intel and AMD, is creating new oppo...

#Hardware #LLM On-Premise #DevOps
2026-05-13 Wired AI

xAI Boosts Infrastructure with 19 New Gas Turbines Amidst Controversy

xAI, Elon Musk's company, is expanding its power infrastructure at the Colossus 2 site, adding 19 new portable gas turbines. This move occurs amidst an ongoing legal dispute over air quality, raising questions about the environmental implications and...

#Hardware #LLM On-Premise #DevOps
2026-05-13 DigiTimes

Moore Threads and Lightwheel.ai: A New China-Made AI Stack for Embodied AI

Moore Threads, a Chinese GPU company, is developing a new embodied AI stack in collaboration with Lightwheel.ai. The initiative aims to create a complete, entirely China-made AI solution, encompassing both hardware and software. This project highligh...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-12 Ars Technica AI

AI at Home: SPAN Proposes Distributed Data Centers

San Francisco startup SPAN is piloting an innovative solution for AI compute deployment. The project involves installing thousands of XFRA nodes, small data centers equipped with liquid-cooled Nvidia RTX Pro 6000 Blackwell Server Edition GPUs, direct...

#Hardware #LLM On-Premise #DevOps
2026-05-12 TechCrunch AI

Google and SpaceX in Talks to Put Data Centers into Orbit for AI Compute

Google and SpaceX are reportedly in discussions to explore the feasibility of building data centers in space. This initiative aims to position Earth's orbit as a future frontier for AI computing, despite current costs remaining significantly higher t...

#LLM On-Premise #DevOps
2026-05-12 LocalLLaMA

Gemma 4 Benchmark on H100: MTP vs DFlash for Dense and MoE LLMs

A recent benchmark compared Multi-Token Prediction (MTP) and DFlash techniques for Gemma 4 Large Language Model inference, covering both dense and MoE versions, on a single NVIDIA H100 80GB GPU. The results show that efficiency varies significantly b...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-12 IEEE Spectrum

Your Next AI Query: Where Power Is Most Accessible

The AI industry is exploring new strategies to manage the growing energy demands of data centers. Nvidia and its partners are developing a pilot project for distributed micro data centers, strategically located near utility substations. The goal is t...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-12 DigiTimes

The AI Infrastructure Wave: Taiwan at the Heart of the Global Supply Chain

The Taiwanese industry is capitalizing on the explosion in demand for artificial intelligence infrastructure, from substrates to servers. This phenomenon highlights the growing need for robust hardware components to support LLM workloads, with signif...

#Hardware #LLM On-Premise #Fine-Tuning
2026-05-12 DigiTimes

SK Hynix Bolsters AI Supply Chain with Strategic Silicon Valley Acquisition

SK Hynix has reportedly acquired property in Silicon Valley, a move that underscores the increasing importance of high-performance memory for artificial intelligence. This operation aims to consolidate the supply chain for crucial components, such as...

#Hardware #LLM On-Premise #Fine-Tuning