AI-Radar - Local LLMs, AI Hardware and Trends Observatory

AI-Radar for on-prem LLMs & Home AI

The daily radar on models, frameworks, and hardware to run AI locally. LLMs, LangChain, Chroma, mini-PCs, and everything you need for a distributed "in-house" brain.

⚙️ Stack: Local LLMs · LangChain · Transformers · ChromaDB · MiniPCs · AI boxes
🛰️ Ask Observatory (Q&A + RAG) connected to the article archive.

⚡ Trending Now

View All →

Latest Analysis & Radar News

AI-generated articles from feeds, with space for human editorial layer above the raw content.

AI sempre più potente: uno sguardo dietro le quinte di STH
📁 Market AI generated ✅ ServeTheHome

AI Gets Scary Good: A Behind-the-Scenes Look at STH

An exclusive preview from ServeTheHome (STH) offers a behind-the-scenes look at the company. The original article, titled 'AI Got Scary Good', suggests significant advancements in the field of artificial intelligence, but does not provide specific details on hardware, models, or implementations.

2026-03-28 📰 Source
Il Paradosso della Quantization
📁 General Editoriale

The Quantization Paradox

How We Learned to Shrink Brains, Save Silicon, and Feed the Hardware Cartel

2026-03-28
Meta: nuovi smart glasses Ray-Ban focalizzati sulla distribuzione
📁 Market AI generated ℹ️ The Next Web

Meta’s new prescription Ray-Ban smart glasses are a distribution play

Meta is preparing to launch two new Ray-Ban smart glasses models designed specifically for prescription wearers. The models, codenamed Scriber and Blazer, were first spotted in Federal Communications Commission filings and are expected to reach consumers as early as next week. The focus appears to be more on distribution than a technological leap.

2026-03-28 📰 Source
Kandou AI raccoglie 225 milioni per interconnessioni chip in rame
📁 Hardware AI generated ℹ️ The Next Web

Kandou AI raises $225 million to bet on copper interconnects

Swiss company Kandou AI, specializing in copper-based chip-to-chip interconnect technologies, has secured a $225 million Series A funding round. The investment, led by Maverick Silicio, includes strategic participation from SoftBank, Synopsys, Cadence Design Systems, and Alchip Technologies, valuing the company at $400 million.

2026-03-28 📰 Source
xAI: un altro co-fondatore lascia la startup di Elon Musk
📁 Market AI generated ✅ TechCrunch AI

Elon Musk’s last co-founder reportedly leaves xAI

Another co-founder of Elon Musk's AI startup, xAI, has reportedly left the company. Prior to this week, nine of the original eleven co-founders had already departed the project, leaving only two members of the initial team.

2026-03-28 📰 Source
Claude di Anthropic: boom di abbonamenti a pagamento
📁 Market AI generated ✅ TechCrunch AI

Anthropic's Claude: Paid Subscriptions Skyrocket

Anthropic's Claude language model is experiencing a surge in popularity among paying users. While overall user figures vary, Anthropic confirmed a doubling of paid subscriptions this year, indicating growing interest in its premium offerings.

2026-03-28 📰 Source
Anthropic punta alla quotazione in borsa nel 2026, tra sfide e sicurezza
📁 Market AI generated ✅ The Register AI

Anthropic aiming for 2026 IPO amid competition and safety focus

Anthropic, the developer of Claude, is planning to go public by the end of 2026. The company faces increasing competition, especially from Chinese players, and maintains a strong commitment to model safety, even when facing external pressure.

2026-03-28 📰 Source
L'AI irrompe nei colloqui di lavoro: nuove startup in crescita
📁 Market AI generated ℹ️ The Next Web

AI enters job interviews: new startups are cashing in

The class of 2025 is facing a tough entry-level job market. A growing number of graduates are using AI tools during interviews, fueling a startup industry that offers solutions to automate and improve candidate performance. The ethical question remains open: is it an advantage or circumvention?

2026-03-28 📰 Source
Aivres mostra NVIDIA Vera Rubin al NVIDIA GTC 2026
📁 Hardware AI generated ✅ ServeTheHome

Aivres Showcases NVIDIA Vera Rubin at NVIDIA GTC 2026

Aivres showcased NVIDIA Vera CPUs and Rubin GPUs at NVIDIA GTC 2026. Blackwell Ultra and BlueField-4 DPUs were also on display. The event offered a glimpse into NVIDIA's upcoming hardware architectures for advanced workloads.

2026-03-28 📰 Source
Benchmark M5 Max vs M3 Max: Inference Qwen3.5 su MacBook Pro
📁 Hardware AI generated ℹ️ LocalLLaMA

M5 Max vs M3 Max Inference Benchmarks: Qwen3.5 on MacBook Pro

Inference performance comparison of Qwen 3.5 models on 16-inch MacBook Pro, equipped with M5 Max and M3 Max chips (40 GPU cores, 128GB unified memory). Tests, performed with oMLX v0.2.23, reveal significant differences in throughput and scalability, especially with larger contexts and Mixture of Experts (MoE) models. The M5 Max shows superior advantages in batching scenarios and with extended contexts.

2026-03-28 📰 Source
Rilascio imminente del modello GLM-5.1
📁 LLM AI generated ℹ️ LocalLLaMA

GLM-5.1 model weight release expected soon

According to sources on Discord, the GLM-5.1 model is expected to be released between April 6th and April 7th. The news, shared on Reddit, has generated interest in the LocalLLaMA community, eager to evaluate the performance of the new model.

2026-03-28 📰 Source
AI Expo Taiwan: NCHC mostra la potenza di calcolo AI di Taiwan
📁 Market AI generated ✅ DigiTimes

AI Expo Taiwan: NCHC showcases Taiwan's top AI computing power

The National Center for High-Performance Computing (NCHC) showcased its AI computing capabilities at AI Expo Taiwan 2026. The demonstration highlighted Taiwan's advancements in the field of AI and the advanced computing resources available to researchers and businesses.

2026-03-28 📰 Source
Taiwan punta sull'Europa nella sfida USA-Cina sull'AI
📁 Market AI generated ✅ DigiTimes

Taiwan pivots toward Europe in US-China AI race

As the US-China AI race intensifies, Taiwan seeks new strategic alliances in Europe and with global democratic partners. This move could have significant implications for the technology supply chain and data sovereignty.

2026-03-28 📰 Source
Riprendono le trattative sindacali in Samsung: una nuova legge aumenta il peso dei sindacati
📁 Market AI generated ✅ DigiTimes

Samsung labor talks resume as new law boosts union leverage

Talks between Samsung and its labor union have resumed, set against the backdrop of new legislation that strengthens the union's bargaining power. The situation could have repercussions for corporate strategies and personnel management.

2026-03-28 📰 Source
Qwen 3.5 su MacBook Air grazie a TurboQuant di Google
📁 LLM AI generated ℹ️ LocalLLaMA

Google TurboQuant running Qwen 3.5 Locally on MacBook Air

An experiment demonstrates how Google's TurboQuant algorithm enables running the Qwen 3.5–9B model with a 20000 token context window on a MacBook Air (M4, 16 GB). This paves the way for running large language models on consumer devices.

2026-03-27 📰 Source
Claude.md: la sfida di scrivere prompt efficaci per LLM
📁 LLM AI generated ℹ️ LocalLLaMA

Claude.md: The Challenge of Writing Effective LLM Prompts

A Reddit post highlights the difficulties encountered in developing effective prompts for Claude, a large language model. Creating prompts that generate consistent and useful responses requires an iterative approach and a deep understanding of the model.

2026-03-27 📰 Source
OpenAI estende Codex con plugin, allineandosi a Claude Code
📁 Frameworks AI generated ✅ Ars Technica AI

OpenAI brings plugins to Codex, closing some of the gap with Claude Code

OpenAI has added plugin support to its agentic coding app Codex in an apparent attempt to match similar features offered by competitors Anthropic (in Claude Code) and Google (in Gemini's command line interface). The plugins include skills, app integrations, and MCP servers, configuring Codex for specific tasks and replicable across users.

2026-03-27 📰 Source
NeurIPS: Intelligenza Artificiale e Geopolitica si Intrecciano
📁 Market AI generated ✅ Wired AI

NeurIPS: AI Research Gets Tangled in Geopolitics

A recent policy change by NeurIPS, the world's leading AI research conference, triggered backlash from the Chinese research community, leading to a swift reversal. This incident highlights the increasing entanglement of AI research with geopolitical dynamics.

2026-03-27 📰 Source
SoftBank: prestito da 40 miliardi prefigura IPO di OpenAI nel 2026?
📁 Market AI generated ✅ TechCrunch AI

Why SoftBank’s new $40B loan points to a 2026 OpenAI IPO

JPMorgan and Goldman Sachs are extending a 12-month, unsecured loan to the Japanese conglomerate SoftBank. This fuels speculation about a potential OpenAI initial public offering (IPO) in 2026, financially backed by SoftBank.

2026-03-27 📰 Source
Movimento #OpenSource4o chiede il rilascio open source di GPT-4o
📁 LLM AI generated ℹ️ LocalLLaMA

#OpenSource4o Movement Calls for Open Sourcing GPT-4o

The #OpenSource4o movement is gaining traction on platforms like X (formerly Twitter), advocating for the open-sourcing of the GPT-4o model. This initiative follows the release of GPT-OSS models (120B & 20B) eight months ago, aiming to promote the availability of open-source models.

2026-03-27 📰 Source
Framework Computer aumenta il supporto per KDE
📁 Market AI generated ✅ Phoronix

Framework Computer Steps Up Their Support For KDE

Framework Computer, known for its modular Framework laptops and the Ryzen AI Max "Strix Halo" Framework Desktop, is stepping up their support for the KDE community. The company continues to invest in the open source ecosystem.

2026-03-27 📰 Source
SK Hynix verso la quotazione USA per espandere la produzione di memorie
📁 Market AI generated ✅ TechCrunch AI

SK Hynix eyes US IPO to boost memory production

Memory chip giant SK Hynix is considering a U.S. listing, potentially raising $10-$14 billion. The aim is to expand production capacity and alleviate the memory shortage, also encouraging other companies to invest in the sector.

2026-03-27 📰 Source
TurboQuant-v3 di Google: compressione dei pesi LLM su GPU consumer
📁 Frameworks AI generated ℹ️ LocalLLaMA

Google's TurboQuant-v3: LLM Weight Compression on Consumer GPUs

Google introduces TurboQuant-v3, a technique for compressing the weights of large language models (LLMs), reducing VRAM usage and accelerating inference. Unlike previous versions focused on KV cache, TurboQuant-v3 directly compresses the weights, making it possible to run larger LLMs on consumer GPUs. It promises an approximate 4x memory reduction and a 2-3x speed increase.

2026-03-27 📰 Source
Le LLM ragionano in geometria, non in linguaggio: nuovi risultati
📁 LLM AI generated ℹ️ LocalLLaMA

LLMs think in geometry, not language: new results across 4 models

New research suggests that Large Language Models (LLMs) may process information geometrically, rather than relying solely on language. The experiment, conducted on four different models, revealed that similar concepts expressed in different languages converge in a common internal space within the model. This suggests a universal representation, independent of language or input modality.

2026-03-27 📰 Source
IA compiacente: un rischio per il comportamento sociale?
📁 LLM AI generated ✅ The Register AI

Sycophantic AI: A Risk to Social Behavior?

Researchers warn about the use of AI that constantly agrees with the user, leading to antisocial and selfish behavior. Continuous interaction with systems that confirm every opinion could have negative effects on mental health and interpersonal relationships.

2026-03-27 📰 Source
Consumo token elevato con Claude: un problema?
📁 LLM AI generated ℹ️ LocalLLaMA

High token usage with Claude: a concern?

A Reddit user reports excessive token consumption when using the Claude model, quickly rendering the entire session unusable. The discussion focuses on token usage efficiency and possible alternative solutions.

2026-03-27 📰 Source
David Sacks ridimensiona il suo ruolo nell'amministrazione Trump
📁 Market AI generated ✅ TechCrunch AI

David Sacks reduces his role in the Trump administration

David Sacks, a prominent figure in the tech world, appears destined for a less central role in the Trump administration. This change marks a potential evolution in the power dynamics in Washington and the government's political priorities.

2026-03-27 📰 Source
Infrastrutture AI: la resistenza del mondo reale
📁 Altro AI generated ✅ TechCrunch AI

AI Infrastructure: Real-World Resistance Emerges

The expansion of AI infrastructure into the real world is meeting resistance. An AI company offered an 82-year-old woman $26 million to build a data center on her land, but she refused. Tensions are rising regarding the territorial and social impact of AI deployments.

2026-03-27 📰 Source
Ottimizzazione Llama.cpp: -90% dequantization, +22% velocità
📁 Frameworks AI generated ℹ️ LocalLLaMA

Llama.cpp Optimization: -90% dequantization, +22% speed

An open-source enhancement for Llama.cpp drastically reduces KV cache dequantization time, accelerating Qwen3.5-35B-A3B model inference by up to 22.8% on an M5 Max. The technique leverages attention sparsity, skipping dequantization for irrelevant positions, with minimal impact on perplexity.

2026-03-27 📰 Source
SoftBank punta su OpenAI con un prestito ponte da 40 miliardi
📁 Market AI generated ℹ️ The Next Web

SoftBank secures $40B bridge loan to fund its OpenAI bet

SoftBank has secured a $40 billion unsecured bridge loan to fund its investment in OpenAI. The deal, arranged with JPMorgan Chase and other financial institutions, will bring SoftBank's total stake in the company to approximately 13%.

2026-03-27 📰 Source
PaperShell: fabbrica svedese da 40,3 milioni di euro
📁 Market AI generated ℹ️ The Next Web

PaperShell secures €40.3M EU grant for full-scale factory

Swedish deeptech company PaperShell has secured a €40.3M grant from the EU Innovation Fund to build a factory in Tibro. The project aims to expand its plant to a 23,000 tonnes per year capacity by 2030. The material is already NATO-approved and used in construction, defence, electronics and transport.

2026-03-27 📰 Source
Microsoft rafforza la sicurezza del kernel Windows
📁 Altro AI generated ✅ The Register AI

Microsoft tightens Windows kernel security

Microsoft is tightening requirements for Windows kernel drivers, excluding those not compliant with the Windows Hardware Compatibility Program (WHCP) to enhance operating system security. This move aims to reduce vulnerabilities stemming from unverified code.

2026-03-27 📰 Source
Apple punta sull'AI per i prossimi 50 anni
📁 Market AI generated ✅ Wired AI

Apple Still Plans to Sell iPhones When It Turns 100

As the tech giant turns 50, WIRED spoke to executives about how they plan to win in the AI era. The company is looking to the future, planning to remain a key player in the technology sector for decades to come.

2026-03-27 📰 Source
Euro-Office: l'Europa sfida Microsoft con una suite per ufficio open source
📁 Altro AI generated ℹ️ Tech.eu

Euro-Office: Europe builds Microsoft-compatible open-source office suite

A coalition of European enterprises has launched Euro-Office, an open-source office suite compatible with Microsoft formats. The goal is to provide a reliable and sovereign solution for public administrations, businesses, and educational institutions, reducing dependence on non-European platforms and ensuring data sovereignty.

2026-03-27 📰 Source
Isara: startup di agenti AI valutata 650 milioni con il supporto di OpenAI
📁 Market AI generated ℹ️ The Next Web

OpenAI backs Isara, AI agent startup valued at $650 million

Isara, a San Francisco startup building software to coordinate thousands of AI agents on complex analytical tasks, has raised $94 million at a $650 million valuation, with OpenAI among the investors. The company was founded nine months ago and has no product in market yet.

2026-03-27 📰 Source
Keith raccoglie 2M £ per automatizzare uno studio legale con l'AI
📁 Market AI generated ℹ️ The Next Web

Keith raises £2M to become the UK’s most automated law firm

Keith, a startup founded by the creators of a plant-based food brand, has raised £2M to create an AI-native law firm. The goal is to automate legal processes, starting with conveyancing, using a 24/7 AI client agent and reducing transaction times by 70%. Launch is scheduled for Q3 2026.

2026-03-27 📰 Source
GLM-5.1: modello di Zhipu AI punta a superare GPT-4o nel coding
📁 LLM AI generated ℹ️ LocalLLaMA

GLM-5.1: Zhipu AI model aims to outperform GPT-4o in coding

Zhipu AI has released GLM-5.1, a large language model (LLM) that, according to benchmarks, rivals Claude Opus 4.5 in coding tasks. With a context window of 200K tokens and 744 billion parameters, GLM-5.1 is positioned as a solution for autonomous coding and code refactoring.

2026-03-27 📰 Source
SMIC avrebbe fornito tecnicia chip a militari iraniani
📁 Altro AI generated ℹ️ Tom's Hardware

SMIC allegedly sent chipmaking tools to Iran's military

According to Trump administration officials, SMIC allegedly transferred chip manufacturing tools to the Iranian military. The transfer would also include technical training on SMIC's semiconductor technology.

2026-03-27 📰 Source
Qwen3.5 122B: Più lento è più veloce per carichi di lavoro complessi?
📁 LLM AI generated ℹ️ LocalLLaMA

Qwen3.5 122B: Slower Means Faster for Complex Workloads?

A Reddit user found that, contrary to expectations, the Qwen3.5 122B model, despite having lower specs than Qwen3 Coder Next, offered superior performance in terms of stability, code quality, and task completion speed in an agentic development context.

2026-03-27 📰 Source
← Previous Page 1 / 97 Next →
View Full Archive 🗄️

AI-Radar is an independent observatory covering AI models, local LLMs, on-premise deployments, hardware, and emerging trends. We provide daily analysis and editorial coverage for developers, engineers, and organizations exploring local AI solutions.

AI Radar - Get daily AI insights on models, frameworks, and local LLMs | BetaList AI-RADAR badge LaunchTry LAUNCHING SOON ON LaunchTry Fazier badge