LLM Inference Benchmarks on Strix Halo iGPU
A user from the LocalLLaMA community has published the results of a series of benchmarks performed on the Strix Halo's iGPU (integrated GPU) across different software configurations. A total of 13 LLM models were tested against 15 different llama.cpp builds, varying the compute backend (ROCm or Vulkan), the gfx target version, hipblaslt (enabled/disabled), and rocWMMA support.
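The exact build flags used by the original poster are not reproduced here, but as a rough, hypothetical sketch of what such a build matrix might look like, the snippet below enumerates variants using current llama.cpp CMake options (GGML_HIP, GGML_VULKAN, GGML_HIP_ROCWMMA_FATTN, AMDGPU_TARGETS); the specific gfx targets and combinations are assumptions, not the poster's configuration.

```python
# Hypothetical sketch: enumerate llama.cpp build variants along the axes
# described in the benchmark (backend, gfx target, rocWMMA).
# Flag names follow current llama.cpp CMake options, but the exact
# combinations tested by the original poster are not known.
from itertools import product

GFX_TARGETS = ["gfx1151", "gfx1100"]  # assumed gfx versions for the Strix Halo tests

def cmake_flags(backend: str, gfx: str, rocwmma: bool) -> list[str]:
    """Compose a cmake configure command for one llama.cpp build variant."""
    flags = ["cmake", "-B", "build", "-DGGML_NATIVE=OFF"]
    if backend == "rocm":
        flags += ["-DGGML_HIP=ON", f"-DAMDGPU_TARGETS={gfx}"]
        if rocwmma:
            # rocWMMA-accelerated flash attention for the HIP backend
            flags += ["-DGGML_HIP_ROCWMMA_FATTN=ON"]
    elif backend == "vulkan":
        flags += ["-DGGML_VULKAN=ON"]
    return flags

# Build the matrix; Vulkan builds do not use gfx targets or rocWMMA, so
# redundant combinations are collapsed. hipblaslt on/off is typically a
# runtime toggle (ROCBLAS_USE_HIPBLASLT), not a build-time flag.
variants = []
for backend, gfx, rocwmma in product(["rocm", "vulkan"], GFX_TARGETS, [False, True]):
    if backend == "vulkan" and (rocwmma or gfx != GFX_TARGETS[0]):
        continue
    variants.append((backend, gfx, rocwmma, cmake_flags(backend, gfx, rocwmma)))

for backend, gfx, rocwmma, cmd in variants:
    print(backend, gfx, "rocWMMA" if rocwmma else "-", " ".join(cmd))
```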
The approach was to package each llama.cpp build in its own Docker image, avoiding dependency conflicts and simplifying the testing process. Some builds failed outright, but those failures were recorded as useful data points as well.
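A minimal sketch of this Docker-per-build approach is shown below: one image per variant, with failed builds and failed runs logged as results rather than aborting the sweep. The image tags, Dockerfile layout, model list, and mount paths are all placeholders, not the poster's actual setup; llama-bench is llama.cpp's standard benchmarking tool, and /dev/kfd and /dev/dri are the devices ROCm containers typically need.

```python
# Hypothetical sketch of the Docker-per-build benchmark driver.
# Image/tag names, Dockerfile names, model files, and mount paths are assumptions.
import subprocess

MODELS = ["llama-3.1-8b-q4_k_m.gguf"]  # placeholder model list
BUILDS = ["rocm-gfx1151", "rocm-gfx1151-rocwmma", "vulkan"]  # assumed image tags

def run(cmd: list[str]) -> subprocess.CompletedProcess:
    return subprocess.run(cmd, capture_output=True, text=True)

results = []
for tag in BUILDS:
    # Build an isolated image for this variant (Dockerfile.<tag> is hypothetical).
    build = run(["docker", "build", "-f", f"Dockerfile.{tag}", "-t", f"llamacpp:{tag}", "."])
    if build.returncode != 0:
        results.append({"build": tag, "status": "build failed"})
        continue
    for model in MODELS:
        # Run llama-bench inside the container; /srv/models is a bind-mounted model directory,
        # and ROCBLAS_USE_HIPBLASLT toggles hipblaslt at run time for the ROCm builds.
        bench = run([
            "docker", "run", "--rm", "--device=/dev/kfd", "--device=/dev/dri",
            "-v", "/srv/models:/models", "-e", "ROCBLAS_USE_HIPBLASLT=1",
            f"llamacpp:{tag}", "llama-bench", "-m", f"/models/{model}",
        ])
        results.append({
            "build": tag,
            "model": model,
            "status": "ok" if bench.returncode == 0 else "run failed",
            "output": bench.stdout,
        })

for r in results:
    print(r["build"], r.get("model", ""), r["status"])
```

Recording failures alongside successful runs mirrors the original approach, where non-working builds were treated as part of the findings rather than discarded.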
The complete results are available in the form of interactive tables, which allow comparison of the performance of different configurations.
For those evaluating on-premise deployments, there are trade-offs between performance, TCO, and compliance requirements. AI-RADAR offers analytical frameworks on /llm-onpremise to evaluate these aspects.