AI-Radar - Local LLMs, AI Hardware and Trends Observatory

AI-Radar for on-prem LLMs & Home AI

The daily radar on models, frameworks, and hardware to run AI locally. LLMs, LangChain, Chroma, mini-PCs, and everything you need for a distributed "in-house" brain.

⚙️ Stack: Local LLMs · LangChain · Transformers · ChromaDB · MiniPCs · AI boxes
🛰️ Ask Observatory (Q&A + RAG) connected to the article archive.
👥 160+ members · Join free →

⚡ Trending Now

View All →

Latest Analysis & Radar News

AI-generated articles from feeds, with space for human editorial layer above the raw content.

Llama.cpp abbraccia il Multi-Processing: un passo avanti per gli LLM on-premise
📁 Frameworks AI generated ℹ️ LocalLLaMA

Llama.cpp Embraces Multi-Processing: A Step Forward for On-Premise LLMs

The open-source project llama.cpp is set to integrate Multi-Threaded Processing (MTP) support, a development that promises to significantly enhance performance in running Large Language Models (LLMs) on local hardware. This evolution is particularly relevant for on-premise environments, where optimizing existing hardware resources is crucial for efficient AI model deployment, strengthening data sovereignty and control.

2026-05-16 📰 Source
Anelli AI per la traduzione del linguaggio dei segni: un passo verso l'edge computing
📁 Altro AI generated 🏆 IEEE Spectrum

AI Rings for Sign Language Translation: A Step Towards Edge Computing

A new study introduces wireless electronic rings that, connected to an AI system, can translate sign language into text. This technology overcomes the limitations of previous systems, offering greater practicality and accuracy. The goal is to migrate processing to edge computing on smartphones, improving mobility, privacy, and reducing latency for users.

2026-05-16 📰 Source
Faraday Future raccoglie 25 milioni per il piano robotico
📁 Market AI generated ℹ️ The Next Web

Faraday Future Secures $25 Million for Robotics Initiative

Faraday Future announced it has raised $25 million through convertible promissory notes, bringing its total financing to $70 million over the past two months. The company states this capital is sufficient to fund Phase 1 of its robotics business plan through the end of 2026.

2026-05-16 📰 Source
La domanda di fibra ottica per i data center AI esplode: ritardi di un anno nelle consegne
📁 Altro AI generated ℹ️ Tom's Hardware

Fiber Optic Demand for AI Data Centers Explodes: One-Year Delivery Delays

AI-dedicated data centers demand 36 times more fiber optic cabling than standard server configurations. This surge in demand, coupled with a severe glass shortage, is causing cable delivery lead times to stretch up to a full year. This presents a significant challenge for those planning on-premise AI infrastructure.

2026-05-16 📰 Source
Scoperto il primo exploit di memoria per Apple M5 con l'aiuto di Anthropic AI
📁 Altro AI generated ℹ️ Tom's Hardware

First Apple M5 Memory Exploit Discovered with Anthropic AI Assistance

Security researchers have identified the first memory exploit for the Apple M5 chip, gaining root access on macOS. The discovery, which bypasses Memory Integrity Enforcement measures, was facilitated by Anthropic AI's Claude Mythos, highlighting the increasing role of LLMs in vulnerability research and the implications for system security.

2026-05-16 📰 Source
Vulnerabilità 'Claw Chain' in OpenClaw: rischio furto dati e controllo persistente
📁 Altro AI generated ℹ️ The Next Web

New 'Claw Chain' Vulnerabilities in OpenClaw: Risk of Data Theft and Persistent Control

Cyera researchers have discovered four vulnerabilities in OpenClaw, dubbed 'Claw Chain'. These flaws, when chained together, allow attackers to steal sensitive data, escalate privileges, and gain persistent control over a compromised host. The defects affect OpenClaw’s OpenShell managed sandbox backend and its MCP loopback runtime. All issues have been patched, but the incident highlights the importance of security in critical infrastructures.

2026-05-16 📰 Source
RTX 5090 e MacBook: il potenziale delle eGPU per carichi di lavoro intensivi
📁 Hardware AI generated ℹ️ Tom's Hardware

RTX 5090 and MacBook: The Potential of eGPUs for Intensive Workloads

A recent test demonstrated the capability of an RTX 5090 GPU, connected via an eGPU dock to an M-series MacBook, to handle extremely intensive graphical workloads. The experiment, which saw the system run Cyberpunk 2077 at over 100 FPS with max settings and frame generation, highlights the potential of eGPU solutions to extend the computational capabilities of unconventional platforms. This approach offers interesting insights for on-premise deployment scenarios requiring flexibility and computational power.

2026-05-16 📰 Source
Malta e OpenAI: un partenariato per l'accesso all'AI e la sovranità dei dati
📁 Altro AI generated 🏆 OpenAI Blog

Malta and OpenAI: A Partnership for AI Access and Data Sovereignty

Malta and OpenAI have partnered to expand artificial intelligence access to all citizens. The initiative includes providing ChatGPT Plus subscriptions and training programs, aiming to develop practical skills and promote responsible AI use. This move raises strategic questions about data sovereignty and the implications for on-premise deployments.

2026-05-16 📰 Source
Il commercio agentivo secondo Stripe: l'AI rivoluzionerà lo shopping online
📁 Market AI generated ℹ️ The Next Web

Stripe's Collison: Agentic Commerce Will Reshape Online Shopping

John Collison, co-founder of Stripe, foresees a structural transformation in online commerce. According to Collison, keyword search is an outdated method; the future will be dominated by "agentic commerce," where AI agents will shop on behalf of consumers. This evolution will radically redefine both how users make purchases and how retailers strategize their sales.

2026-05-16 📰 Source
AI in azienda: il 61% dei CEO percepisce un'eccessiva fretta dai consigli di amministrazione
📁 Market AI generated ℹ️ The Next Web

Enterprise AI: 61% of CEOs Perceive Excessive Haste from Boards

A recent global survey by Boston Consulting Group (BCG) revealed that 61% of CEOs believe their boards are accelerating AI adoption too quickly. The research, involving 625 leaders from companies with over $100 million in annual revenue, highlights a potential disconnect between strategic vision and operational challenges related to AI implementation, especially for complex workloads like Large Language Models, where TCO and data sovereignty considerations are crucial.

2026-05-16 📰 Source
RJ Scaringe: Oltre 12 Miliardi di Dollari per Tre Startup, tra Veicoli Elettrici e Robotica AI
📁 Market AI generated ℹ️ The Next Web

RJ Scaringe: Over $12 Billion for Three Startups, Spanning Electric Vehicles and AI Robotics

RJ Scaringe, Rivian's founder and an MIT doctorate holder, has successfully raised over $12.3 billion for three distinct startups. His portfolio includes an electric vehicle manufacturer, an autonomous micromobility company, and an industrial AI robotics startup. The pace of capital acquisition is rapidly accelerating, underscoring significant investor interest in his innovative ventures.

2026-05-16 📰 Source
Salesforce investe 300 milioni di dollari in token Anthropic per il coding AI
📁 Market AI generated ℹ️ The Next Web

Salesforce to Invest $300 Million in Anthropic Tokens for AI Coding

Salesforce anticipates spending $300 million on Anthropic tokens this year, primarily for AI-powered coding functionalities. Announced by CEO Marc Benioff, this investment aims to reduce internal development costs and envisions integrating AI coding directly into Slack, highlighting the increasing adoption of external LLMs to optimize enterprise operations.

2026-05-16 📰 Source
Yoshua Bengio: l'AI potrebbe minacciare l'umanità entro un decennio
📁 LLM AI generated ℹ️ The Next Web

Yoshua Bengio: AI Could Threaten Humanity Within a Decade

Yoshua Bengio, a Turing Award-winning computer scientist and a leading figure in artificial intelligence, has reiterated his warning. According to Bengio, hyperintelligent machines could pose an existential threat to humanity within the next decade. His stance, expressed in a Wall Street Journal interview and republished by Fortune, highlights the urgency of considering the long-term implications of AI development.

2026-05-16 📰 Source
LLM per l'Intimità Digitale: Sovranità dei Dati e Deployment On-Premise
📁 Altro AI generated ✅ Wired AI

LLMs for Digital Intimacy: Data Sovereignty and On-Premise Deployment

The emergence of Large Language Models (LLMs) as companions for intimate and personalized interactions raises crucial questions about data sovereignty and control. This scenario highlights the need for companies to carefully evaluate deployment options, favoring on-premise solutions to ensure privacy and compliance, especially in contexts requiring deep emotional engagement and the management of sensitive information.

2026-05-16 📰 Source
OpenAI e la finanza personale: ChatGPT si connette ai conti bancari
📁 Altro AI generated ℹ️ The Next Web

OpenAI and Personal Finance: ChatGPT Connects to Bank Accounts

OpenAI has introduced a new feature in ChatGPT allowing US-based Pro subscribers to link their bank accounts, credit cards, and investment portfolios. The function, released on May 15 as a preview for web and iOS, enables users to query the chatbot about their real financial data, raising significant questions about data sovereignty and the security of sensitive information.

2026-05-16 📰 Source
Snap, YouTube e TikTok patteggiano in causa sulla dipendenza da social media
📁 Market AI generated ℹ️ The Next Web

Snap, YouTube, and TikTok Settle Social Media Addiction Lawsuit

Snap, Google's YouTube, and ByteDance's TikTok have reached out-of-court settlements in a lawsuit filed by a public school district. The claims alleged social media addiction disrupted learning and forced schools to incur significant costs for youth mental health. Meta Platforms remains the sole company facing trial, following the filing of the settlements in federal court in Oakland, California.

2026-05-16 📰 Source
Dipendenza tecnicica: il caso dell'automotive e le implicazioni per l'AI on-premise
📁 Altro AI generated ℹ️ The Next Web

Technological Dependency: The Automotive Case and Implications for On-Premise AI

The widespread presence of Chinese components in the US automotive industry, including the ownership of over 60 suppliers by Chinese companies, raises significant concerns in Congress. This scenario highlights the complexities of global supply chains and their implications for technological sovereignty, a critical issue also for Large Language Model (LLM) deployments in on-premise environments.

2026-05-16 📰 Source
AMD ROCm 7.13: il nuovo SDK estende il supporto a Instinct MI350P e APU Ryzen AI
📁 Hardware AI generated ✅ Phoronix

AMD ROCm 7.13 Released: SDK Extends Support for Instinct MI350P and Ryzen AI APUs

AMD has released ROCm 7.13, the latest preview of its Core SDK, introducing support for Instinct MI350P GPUs and an expanded range of Ryzen AI APUs. This update is crucial for developers and enterprises utilizing AMD hardware for artificial intelligence workloads, strengthening the software ecosystem in anticipation of the upcoming ROCm 8.0 release and facilitating on-premise deployments.

2026-05-16 📰 Source
L'AI e le aziende individuali: una sfida per le PMI nel nuovo scenario economico
📁 Market AI generated ✅ DigiTimes

AI and One-Person Companies: A Challenge for SMEs in the New Economic Landscape

An adviser suggests that the advancement of artificial intelligence could enable one-person entities to compete effectively with traditional small businesses. This scenario highlights how the strategic adoption of LLMs and the choice of deployment, between on-premise and cloud, are crucial for maintaining competitiveness, influencing costs and data sovereignty.

2026-05-16 📰 Source
Materiali Semiconduttori a Taiwan: Scenari Competitivi e Impatti sull'AI On-Premise
📁 Market AI generated ✅ DigiTimes

Taiwan Semiconductor Materials: Competitive Scenarios and Impact on On-Premise AI

A Digitimes analysis for April 2026 highlights increasing polarization in Taiwan's semiconductor materials sector. This dynamic, characterized by two distinct 'races,' could significantly influence the global supply chain and, consequently, the costs and availability of essential hardware for on-premise Large Language Model (LLM) deployments, prompting companies to reconsider their infrastructure strategies.

2026-05-16 📰 Source
La crescente domanda di server AI alimenta il mercato dei componenti infrastrutturali
📁 Altro AI generated ✅ DigiTimes

Rising AI Server Demand Fuels Growth in Infrastructure Component Market

The surge in demand for artificial intelligence servers is generating significant revenue growth for manufacturers of infrastructure components, such as server rack rail kits. This trend highlights an acceleration in physical infrastructure investments, suggesting a preference for on-premise or private data center deployments to manage intensive LLM workloads.

2026-05-16 📰 Source
Databricks integra GPT-5.5 per agenti aziendali, elevando gli standard di settore
📁 LLM AI generated 🏆 OpenAI Blog

Databricks Integrates GPT-5.5 for Enterprise Agents, Raising Industry Standards

Databricks has announced the adoption of GPT-5.5 for enterprise agent workflows. This move follows the model's achievement of a new state-of-the-art on the OfficeQA Pro benchmark. The integration aims to enhance the efficiency and capabilities of AI agents in enterprise contexts, offering new perspectives for automation and interaction in complex professional environments.

2026-05-16 📰 Source
Agenti AI e Orchestrazione: La Sfida del Deployment Locale
📁 Altro AI generated ℹ️ LocalLLaMA

AI Agents and Orchestration: The Local Deployment Challenge

Interest in autonomous AI agents is growing, pushing organizations to explore orchestration solutions for complex workloads. A recent community insight highlights the need for additional tools to fully leverage LLMs like Qwen and Gemma in self-hosted environments, emphasizing the benefits of control and data sovereignty, but also the infrastructural challenges of on-premise deployment.

2026-05-15 📰 Source
Ottimizzare l'Inference LLM: il 'Sweet Spot' di efficienza per 4x RTX 3090
📁 Hardware AI generated ℹ️ LocalLLaMA

Optimizing LLM Inference: The Efficiency Sweet Spot for 4x RTX 3090

A detailed analysis explores the energy efficiency of an on-premise setup featuring four NVIDIA RTX 3090 GPUs for Large Language Model inference. Tests reveal a peak efficiency point at 220W per GPU, balancing throughput and power consumption, a crucial insight for those managing local infrastructures and aiming to optimize TCO.

2026-05-15 📰 Source
Autori contro Anthropic: ritardi nell'accordo da 1,5 miliardi per il copyright
📁 Market AI generated ✅ Ars Technica AI

Authors vs. Anthropic: Delays in the $1.5 Billion Copyright Settlement

A US federal judge has postponed the final approval of the $1.5 billion settlement between Anthropic and authors, concerning the unauthorized use of books for training AI models. The decision follows objections from some class members, who dispute the excessive compensation for lawyers and the insufficient payouts for authors. This case marks the largest copyright settlement in US history.

2026-05-15 📰 Source
Ottimizzare gli LLM on-premise: l'allocazione dinamica del compute e Qwen-35B-A3B
📁 LLM AI generated ℹ️ LocalLLaMA

Optimizing On-Premise LLMs: Dynamic Compute Allocation and Qwen-35B-A3B

Optimizing compute resources for Large Language Models (LLMs) is a critical challenge, especially for on-premise deployments. An approach involving dynamic allocation of compute budget and modular section evolution, leveraging models like Qwen-35B-A3B, promises performance comparable to high-end proprietary LLMs, offering new perspectives for enterprises seeking data control and sovereignty.

2026-05-15 📰 Source
Kernel Linux 7.1: nuove linee guida per la sicurezza e l'uso responsabile dell'AI
📁 Altro AI generated ✅ Phoronix

Linux Kernel 7.1: New Guidelines for Security Bugs and Responsible AI Use

Linux kernel 7.1 integrates new documentation defining what constitutes a security bug and establishing principles for the responsible use of artificial intelligence in vulnerability discovery. This initiative underscores the importance of security and ethics in integrating AI into software development processes, a crucial aspect for companies managing critical infrastructure and evaluating on-premise deployments for their AI workloads.

2026-05-15 📰 Source
Orthrus-Qwen3-8B: Accelerazione fino a 7.8x per i Large Language Models con accuratezza invariata
📁 LLM AI generated ℹ️ LocalLLaMA

Orthrus-Qwen3-8B: Up to 7.8x Acceleration for Large Language Models with Unchanged Accuracy

Orthrus-Qwen3-8B introduces an innovation for LLM inference, promising up to 7.8x acceleration compared to the base Qwen3-8B model, while maintaining the same output distribution. This approach, which freezes the model's backbone and introduces a diffusion attention module, significantly reduces processing times. The solution stands out for its efficient KV cache usage and the absence of Time-To-First-Token penalties, making it particularly appealing for on-premise deployments that require high performance and cost control.

2026-05-15 📰 Source
ArXiv inasprisce le regole: un anno di ban per contenuti generati da AI non verificati
📁 LLM AI generated ✅ 404 Media

ArXiv Tightens Rules: One-Year Ban for Unverified AI-Generated Content

ArXiv, the renowned repository for academic preprints, has announced a strict new policy. Authors submitting scientific papers with incontrovertible evidence of LLM-generated content lacking adequate verification will face a one-year ban. The responsibility for the accuracy and originality of the material rests entirely with the authors, with penalties also including the requirement for subsequent peer-reviewed publication.

2026-05-15 📰 Source
Il processo Musk vs. Altman si conclude: fiducia nell'AI e scelte di deployment strategiche
📁 Altro AI generated ✅ TechCrunch AI

The Musk vs. Altman Trial Concludes: Trust in AI and Strategic Deployment Choices

The conclusion of the Musk vs. Altman trial reignites the debate on trust in artificial intelligence leadership. This context highlights the importance for companies to carefully evaluate deployment strategies, favoring on-premise or hybrid solutions to ensure control, data sovereignty, and compliance, crucial aspects in a rapidly evolving AI ecosystem.

2026-05-15 📰 Source
Eighteen48 Partners raccoglie 175 milioni di euro per il suo fondo di private equity
📁 Market AI generated ℹ️ The Next Web

Eighteen48 Partners Raises €175 Million for Inaugural Private Equity Fund

Eighteen48 Partners, a London-based alternative asset manager, has announced the closing of the first tranche of its inaugural private equity fund, raising €175 million. The fund's total target is €350 million, aimed at backing mid-market buyouts across Europe. The strategy relies on exclusively sourcing opportunities through independent sponsors, highlighting a targeted approach in the investment landscape.

2026-05-15 📰 Source
Affidabilità degli LLM: la ricerca Microsoft sui workflow delegati a lungo termine
📁 LLM AI generated 🏆 Microsoft Research

LLM Reliability: Microsoft Research on Long-Horizon Delegated Workflows

Microsoft Research has published a study examining the reliability of Large Language Models (LLMs) in long-horizon delegated tasks. The research highlights how models can accumulate semantic errors in extended workflows, with fidelity degradation potentially reaching 19-34% over 20 iterations. While production systems can mitigate these effects with verification and orchestration mechanisms, the study emphasizes the need for further development to make LLMs more trustworthy collaborators in professional contexts.

2026-05-15 📰 Source
L'onda energetica dell'AI: Lake Tahoe e il rincaro dei costi
📁 Market AI generated ✅ TechCrunch AI

The AI Energy Wave: Lake Tahoe and Rising Costs

The escalating energy demand driven by artificial intelligence is beginning to manifest in significant price increases, as highlighted by the situation in Lake Tahoe. This popular Silicon Valley destination is bracing for higher electricity prices, a clear signal of the infrastructural pressures that the expansion of LLMs and AI workloads are placing on the energy sector and, consequently, on enterprise deployment strategies.

2026-05-15 📰 Source
PwC adotta Claude per innovare tecnicia, gestire accordi e trasformare funzioni aziendali
📁 Market AI generated 🏆 Anthropic News

PwC Adopts Claude to Innovate Technology, Manage Deals, and Transform Enterprise Functions

PwC has announced the integration of Claude, Anthropic's Large Language Model, to support its clients in technology development, complex deal management, and the reimagining of enterprise functions. This move highlights the increasing adoption of advanced LLMs in the consulting sector to enhance efficiency and innovation at an enterprise level.

2026-05-15 📰 Source
Equibles: Dati Finanziari Reali per LLM Locali con Server Self-Hosted Open Source
📁 Altro AI generated ℹ️ LocalLLaMA

Equibles: Real Financial Data for Local LLMs with a Self-Hosted Open Source Server

Equibles, a new open-source project, provides a self-hosted MCP server designed to deliver real, current U.S. public financial data to locally run Large Language Models. This solution eliminates cloud dependency, API keys, and telemetry, ensuring data control and sovereignty for on-premise AI applications. It supports diverse data types, from SEC filings to economic indicators, targeting those seeking autonomy and security in LLM deployment.

2026-05-15 📰 Source
Revolut punta sul business banking: un incentivo da 1.000 sterline per ogni dipendente
📁 Market AI generated ℹ️ The Next Web

Revolut Shifts Focus to Business Banking, Incentivizes Employees for Growth

Revolut's CEO, Nik Storonsky, has declared business banking as the company's top priority. The fintech firm is offering £1,000 to every employee who helps acquire new business customers, aiming for a $200 billion valuation ahead of a potential IPO. This move signals an aggressive growth strategy within the B2B financial services sector.

2026-05-15 📰 Source
Giganti tech cinesi: l'IA trasforma la ricerca e l'e-commerce
📁 Market AI generated ℹ️ The Next Web

China's Tech Giants: AI Transforms Search and E-commerce

Alibaba has integrated its Qwen AI assistant with Taobao, its largest marketplace. This move replaces the traditional search bar with an AI agent capable of accessing a catalog of over four billion products, redefining the online shopping experience and introducing a new paradigm for user-platform interaction.

2026-05-15 📰 Source
OpenAI riorganizza i vertici: Greg Brockman assume il controllo dei prodotti
📁 LLM AI generated ✅ Wired AI

OpenAI Reorganizes Leadership: Greg Brockman Takes Control of Products

OpenAI has announced a reorganization of its executive ranks, with Greg Brockman taking direct responsibility for products. The primary goal is to unify the ChatGPT and Codex experiences into a single core offering, aiming to simplify user interaction and consolidate the company's product strategy within the LLM landscape.

2026-05-15 📰 Source
OpenAI introduce ChatGPT per la finanza personale con integrazione bancaria
📁 Altro AI generated ✅ TechCrunch AI

OpenAI Introduces ChatGPT for Personal Finance with Bank Account Integration

OpenAI has announced a new version of ChatGPT specifically designed for personal finance management. This iteration allows users to connect their bank accounts to view a centralized dashboard. The system will provide a detailed overview of portfolio performance, spending, subscriptions, and upcoming payments, offering a tool to monitor and analyze personal finances.

2026-05-15 📰 Source
Asset tokenizzati: la risposta alla frizione operativa e le sfide infrastrutturali
📁 Altro AI generated ℹ️ The Next Web

Tokenized Assets: Addressing Operational Friction and Infrastructure Challenges

Modern derivatives and digital asset markets face significant operational friction, with a Nasdaq survey revealing that 70% of global firms experience daily settlement failures. This inefficiency ties up substantial capital. Tokenized real-world assets (RWA) emerge as a potential solution, but their adoption raises crucial questions regarding deployment infrastructure, data sovereignty, and TCO, especially for organizations prioritizing control and compliance.

2026-05-15 📰 Source
ChatGPT si apre alla finanza personale: analisi AI per utenti Pro negli USA
📁 Altro AI generated 🏆 OpenAI Blog

ChatGPT Enters Personal Finance: AI Analysis for US Pro Users

OpenAI has unveiled a new personal finance experience within ChatGPT, targeting Pro users in the United States. This feature enables secure connection of financial accounts to provide AI-powered insights and guidance tailored to individual financial context, goals, and priorities, leveraging LLM capabilities for personalized economic management.

2026-05-15 📰 Source
Piattaforme Dati e Sovranità: Il Caso Palantir e le Implicazioni On-Premise
📁 Altro AI generated ✅ 404 Media

Data Platforms and Sovereignty: The Palantir Case and On-Premise Implications

A journalistic investigation reveals ICE's use of the Palantir platform for individual identification, raising questions about the veracity of official statements. This episode highlights the crucial importance of data sovereignty and infrastructural control, prompting organizations to carefully evaluate on-premise deployment options for sensitive AI/LLM workloads, in contrast to cloud solutions.

2026-05-15 📰 Source
Attacchi DeFi: 600 milioni di dollari sottratti in aprile, con implicazioni AI
📁 Altro AI generated ℹ️ The Next Web

DeFi Attacks: $600 Million Stolen in April, with AI Implications

The decentralized finance (DeFi) sector experienced losses of approximately $600 million in April due to two distinct attacks. These incidents, attributed to North Korean hackers and involving artificial intelligence, targeted Drift Protocol and Kelp DAO, highlighting critical vulnerabilities and the increasing sophistication of threats in the digital asset landscape. The events underscore the importance of robust defenses for any critical infrastructure.

2026-05-15 📰 Source
SupraLabs: Piccoli LLM Open Source per l'Accessibilità e il Deployment Locale
📁 LLM AI generated ℹ️ LocalLLaMA

SupraLabs: Small Open-Source LLMs for Accessibility and Local Deployment

SupraLabs emerges with the goal of democratizing artificial intelligence through the development and fine-tuning of compact Large Language Models. The initiative focuses on efficient models, ideal for deployment on edge devices and local infrastructures, offering a viable alternative to cloud solutions and promoting data sovereignty.

2026-05-15 📰 Source
Multi-Tensor Parallelism in llama.cpp: LLM più grandi su GPU distribuite
📁 Frameworks AI generated ℹ️ LocalLLaMA

Multi-Tensor Parallelism Lands in llama.cpp: Larger LLMs on Distributed GPUs

The open-source project llama.cpp has integrated Multi-Tensor Parallelism (MTP), a feature enabling the execution of large Large Language Models, such as 70B or 120B parameter models, by distributing their tensors across multiple GPUs. This innovation is crucial for local inference of complex LLMs, making them accessible on hardware configurations with distributed VRAM and opening new opportunities for on-premise deployments, with benefits in TCO and data sovereignty.

2026-05-15 📰 Source
La Cina blocca l'Nvidia H200: implicazioni per il mercato dei chip AI e il deployment on-premise
📁 Market AI generated ℹ️ Tom's Hardware

China Blocks Nvidia H200: Implications for the AI Chip Market and On-Premise Deployment

Donald Trump has stated that China is reportedly blocking the purchase of Nvidia H200 GPUs, despite approval from US authorities. This move, according to the former president, aims to promote the development of homegrown chips, creating new challenges for companies planning AI infrastructures, particularly for on-premise deployments.

2026-05-15 📰 Source
Data center AI: la protesta dei residenti in Pennsylvania e le sfide infrastrutturali
📁 Altro AI generated ℹ️ Tom's Hardware

AI Data Centers in Pennsylvania: Residents Protest Against Governor Amid Infrastructure Challenges

Pennsylvania residents are strongly opposing the construction of AI data centers, criticizing Governor Shapiro in a two-hour town hall. This situation highlights growing tensions between the infrastructure demands of AI workloads and local impact, posing significant challenges for on-premise deployment strategies and TCO planning.

2026-05-15 📰 Source
Investimenti AI e infrastrutture: il panorama tech europeo in evoluzione
📁 Market AI generated ℹ️ Tech.eu

AI Investments and Infrastructure: Europe's Evolving Tech Landscape

The European tech sector saw over €1.4 billion in funding this week, with a growing emphasis on artificial intelligence and infrastructure. Major investment rounds for Nscale and Recursive Superintelligence highlight the push towards AI compute capabilities and innovative solutions, while companies like Keel and Poland's industry demonstrate a strategic evolution towards AI-native delivery and fintech infrastructure.

2026-05-15 📰 Source
Data center in Pennsylvania: la comunità si oppone ai costi ambientali e sociali
📁 Altro AI generated ✅ Ars Technica AI

Data Centers in Pennsylvania: Community Opposition to Environmental and Social Costs

In Pennsylvania, the rapid expansion of data centers is facing growing public opposition. During a recent meeting, residents expressed frustration over rising energy costs, high water consumption, noise pollution, and rural industrialization. Criticism also focuses on the lack of transparency and citizen participation in decisions related to these infrastructure projects.

2026-05-15 📰 Source
AI e design: Robert Polacek ridefinisce il ruolo della tecnicia nella creatività
📁 Market AI generated ℹ️ The Next Web

AI in Design: Robert Polacek Redefines Technology's Role in Creativity

Robert Polacek of RoseBernard Studio shifts the debate on artificial intelligence in design. Instead of focusing on human replacement, Polacek highlights how AI can amplify creative capacities and foster new forms of collaboration. His vision proposes a future where technology becomes a tool to expand opportunities in the creative sector, moving beyond initial uncertainty and valuing AI's innovative potential.

2026-05-15 📰 Source
Discussioni sui 'guardrail' per l'IA e lo stallo delle consegne di H200: implicazioni per il deployment on-premise
📁 Hardware AI generated ℹ️ The Next Web

AI Guardrails Discussions and H200 Delivery Stalls: Implications for On-Premise Deployment

A meeting between former President Trump and President Xi Jinping touched upon 'AI guardrails,' though no formal agreements were reached. Concurrently, deliveries of NVIDIA H200 GPUs to Chinese buyers remain blocked. This scenario highlights the geopolitical complexities influencing the availability of critical hardware for Large Language Models, a crucial factor for on-premise deployment strategies and data sovereignty.

2026-05-15 📰 Source
Ottimizzazione RAG: il modello più costoso non è il migliore, ecco cosa conta davvero
📁 LLM AI generated ℹ️ LocalLLaMA

RAG Chatbot Optimization: Most Expensive Model Was Not the Best Performer

An in-depth analysis of a customer support RAG chatbot revealed that the most expensive LLM did not guarantee the best performance. The study highlighted how retrieval issues, ineffective evaluation methods, and lack of chunk deduplication are often mistaken for LLM limitations. By optimizing these aspects and conducting a model sweep, response quality improved by 19% and costs were reduced by 79%, demonstrating the importance of accurate measurement and careful configuration.

2026-05-15 📰 Source
Mayo Clinic e l'AI per l'ascolto ambientale: questioni di consenso e accuratezza
📁 Altro AI generated ✅ 404 Media

Mayo Clinic and Ambient AI Listening: Consent and Accuracy Concerns

Mayo Clinic is utilizing artificial intelligence to record patient-nurse interactions, including in emergency rooms, through an opt-out "ambient listening" system. This practice raises critical questions regarding informed consent and the accuracy of AI-generated notes, particularly in complex environments. The technology, developed with Abridge, highlights the ethical and technical challenges of AI adoption in healthcare, with direct implications for data sovereignty.

2026-05-15 📰 Source
Rocky Linux: un repository di sicurezza opzionale per aggiornamenti rapidi
📁 Altro AI generated ✅ Phoronix

Rocky Linux Launches Optional Security Repository for Faster Updates

Rocky Linux has introduced an optional security repository designed to accelerate the distribution of critical patches. This initiative responds to significant vulnerabilities like Dirty Frag and Fragnesia, offering organizations managing self-hosted infrastructures greater control and faster reaction times against cyber threats, which is crucial for data sovereignty and compliance.

2026-05-15 📰 Source
Osaurus porta l'AI ibrida su Mac, tra modelli locali e cloud
📁 Altro AI generated ✅ TechCrunch AI

Osaurus Brings Hybrid AI to Mac, Blending Local and Cloud Models

Osaurus is a new Mac application that integrates both local and cloud-based artificial intelligence models. The solution aims to offer users the best of both worlds, ensuring that sensitive data such as memory, files, and tools remain on their own hardware, while still accessing the flexibility and power of remote AI services. This hybrid approach emphasizes data sovereignty and local control.

2026-05-15 📰 Source
ByteDance presenta Cola DLM: un LLM a diffusione latente per il deployment flessibile
📁 LLM AI generated ℹ️ LocalLLaMA

ByteDance Unveils Cola DLM: A Latent Diffusion LLM for Flexible Deployment

ByteDance has released Cola DLM, an innovative Large Language Model based on hierarchical latent diffusion. The model combines a Text VAE with a Diffusion Transformer (DiT) and leverages Flow Matching for text generation. Available as a Hugging Face checkpoint, Cola DLM is compatible with PyTorch and HuggingFace Transformers, offering flexibility for self-hosted and on-premise deployments thanks to its Apache 2.0 license.

2026-05-15 📰 Source
L'acceleratore YEP lancia un programma per le startup ucraine nella Silicon Valley
📁 Market AI generated ℹ️ Tech.eu

YEP Accelerator Launches Program for Ukrainian Startups in Silicon Valley

YEP Accelerator has inaugurated a new international program in California, aimed at supporting growth-stage Ukrainian startups in entering and expanding within the US market. The initiative offers a five-week residency in San Francisco, focusing on practical market entry preparation, fundraising, and networking, with access to potential investments of up to $1.8 million.

2026-05-15 📰 Source
Firmware Open Source: Coreboot e AMD openSIL debuttano su schede madri AMD EPYC
📁 Altro AI generated ✅ Phoronix

Open Source Firmware: Coreboot and AMD openSIL Debut on AMD EPYC Motherboards

3mdeb has released Dasharo v0.9, an open source firmware based on Coreboot and AMD openSIL, for the Gigabyte MZ33-AR1 EPYC server motherboard. This marks the first availability of a fully open firmware solution for a commercial AMD EPYC server platform, offering enhanced control, security, and transparency for on-premise deployments of critical infrastructure and AI workloads.

2026-05-15 📰 Source
L'AI agentica accelera il mercato server: quasi 20 milioni di unità entro il 2026
📁 Market AI generated ✅ DigiTimes

Agentic AI Accelerates Server Market: Nearly 20 Million Units by 2026

The global server market is poised for significant growth, with shipments projected to approach 20 million units by 2026. This expansion is driven by the increasing adoption of Agentic AI, which demands robust and dedicated infrastructure. DIGITIMES' analysis highlights a clear trend towards increased hardware demand to support complex AI workloads, presenting new challenges and opportunities for on-premise deployment strategies.

2026-05-15 📰 Source
← Previous Page 22 / 119 Next →
View Full Archive 🗄️

AI-Radar is an independent observatory covering AI models, local LLMs, on-premise deployments, hardware, and emerging trends. We provide daily analysis and editorial coverage for developers, engineers, and organizations exploring local AI solutions.

AI-RADAR badge LaunchTry LAUNCHING SOON ON LaunchTry Fazier badge