A recent experiment explored the architecture of large language models (LLMs), focusing on the effect of repeating layers within a model.
Experiment Details
The experiment, named RYS II, used the Qwen3.5 27B model and tested the hypothesis that LLMs may develop a kind of internal "universal language." Analysis of latent representations in the model's middle layers showed greater similarity between translations of the same content in Chinese and English than between different content in the same language. This suggests that the model may abstract concepts at a deeper level, independent of the input language.
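The comparison described above can be sketched as a cosine-similarity check between mid-layer representations. The vectors below are synthetic stand-ins; in practice they would be mean-pooled hidden states extracted from a middle transformer layer for each sentence.

```python
import numpy as np

def cosine(a, b):
    """Cosine similarity between two representation vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Synthetic mid-layer representations (illustrative only):
en_sent = np.array([0.90, 0.10, 0.20])  # English sentence
zh_sent = np.array([0.85, 0.15, 0.25])  # same content in Chinese
en_other = np.array([0.10, 0.90, 0.30]) # different content, same language

# Same content across languages is closer than different content
# within one language -- the pattern the experiment reports.
cross_lang = cosine(en_sent, zh_sent)
same_lang = cosine(en_sent, en_other)
```

The experiment's claim corresponds to `cross_lang > same_lang` holding systematically across sentence pairs, not just for a single example.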
Architecture and Results
Repeating blocks in the middle layers of the transformer architecture proved to be the most effective strategy. Several pre-trained models with different configurations have been made available on Hugging Face. The researcher suggests that fine-tuning the models with repeated layers could lead to state-of-the-art (SOTA) results for models of this size.
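The block-repetition idea can be illustrated with a minimal sketch. The helper below is hypothetical (the article does not publish the merge recipe) and treats the model as a simple list of layers, duplicating a middle slice to deepen the stack:

```python
def repeat_middle_layers(layers, start, end, times=2):
    """Build a deeper stack by repeating layers[start:end] `times` times.

    Hypothetical illustration of depth upscaling via block repetition;
    real merges operate on named transformer blocks, not list indices.
    """
    return layers[:start] + layers[start:end] * times + layers[end:]

# Toy 8-layer model; repeat the middle block (indices 3-4) twice.
stack = list(range(8))
grown = repeat_middle_layers(stack, 3, 5, times=2)
# grown -> [0, 1, 2, 3, 4, 3, 4, 5, 6, 7]
```

Because the repeated layers reuse existing weights, the parameter count on disk need not double with depth, though inference cost grows with the number of forward passes through the repeated block.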
Considerations
The original article mentions optimizing VRAM usage through specific formats. For those evaluating on-premise deployments, there are trade-offs between performance, total cost of ownership (TCO), and memory requirements that AI-RADAR helps to assess.
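As a rough sizing aid (an assumption-laden back-of-the-envelope, not a figure from the article), the VRAM needed just to hold the weights of a 27B-parameter model scales linearly with bytes per parameter, which is why reduced-precision formats matter for on-premise hardware:

```python
def weight_vram_gib(params_billions, bytes_per_param):
    """Approximate VRAM for model weights alone, in GiB.

    Ignores KV cache, activations, and framework overhead, which
    add substantially on top of this lower bound.
    """
    return params_billions * 1e9 * bytes_per_param / 2**30

# A 27B model at common precisions (weights only):
# fp16 (2 bytes/param) -> ~50 GiB
# int8 (1 byte/param)  -> ~25 GiB
# int4 (0.5 byte/param)-> ~13 GiB
for fmt, bpp in [("fp16", 2), ("int8", 1), ("int4", 0.5)]:
    print(f"{fmt}: {weight_vram_gib(27, bpp):.1f} GiB")
```

Even the most aggressive format here still excludes runtime memory, so real deployments should budget headroom above these figures.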