Contrastive Data Injection to Improve Language Models
A recent study explored a technique to improve bias resistance and reduce sycophancy in language models, achieving promising results with relatively small models. The approach is based on injecting contrastive data pairs during the pre-training phase, which shows an effect even at minimal proportions (0.05%).
The results indicate that a 7-million-parameter model trained with this technique can match the behavioral performance of standard models with a significantly larger parameter count (18-34 million).
Implementation Details
The technique does not require modifications to the model architecture or the addition of an auxiliary loss function. The injection of contrastive data appears to provide the model with clear examples of the desired behaviors, compensating for the lack of sufficient signals in standard pre-training datasets such as OpenWebText.
Interestingly, the injection rate affects the results in a non-linear way: a proportion of 5% appears to be optimal, while 10% worsens both behavioral scores and factual accuracy.
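The mixing step described above can be sketched as follows. This is a hypothetical illustration, not the study's actual pipeline: the function name `mix_contrastive`, the pair format, and the probabilistic sampling scheme are all assumptions; the study does not specify how pairs are interleaved into the corpus.

```python
import random

def mix_contrastive(pretrain_docs, contrastive_pairs, injection_rate=0.05, seed=0):
    """Interleave contrastive pairs into a pre-training stream at a fixed rate.

    Hypothetical sketch: before each ordinary document, a contrastive pair is
    emitted with probability `injection_rate`. A pair holds a desired and an
    undesired completion of the same prompt, giving the model an explicit
    example of the target behavior.
    """
    rng = random.Random(seed)
    for doc in pretrain_docs:
        if contrastive_pairs and rng.random() < injection_rate:
            good, bad = rng.choice(contrastive_pairs)
            yield f"PROMPT-PAIR\nPREFERRED: {good}\nREJECTED: {bad}"
        yield doc

# Usage: a 5% injection rate over a toy 1000-document corpus.
corpus = [f"doc {i}" for i in range(1000)]
pairs = [("states the evidence plainly", "flatters the user's wrong claim")]
mixed = list(mix_contrastive(corpus, pairs, injection_rate=0.05))
injected = sum(1 for d in mixed if d.startswith("PROMPT-PAIR"))
```

With this scheme, varying `injection_rate` (0.0005, 0.05, 0.10) would reproduce the dosage sweep the study reports, while everything else in pre-training stays unchanged.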
Results on Larger Models
The technique was successfully replicated on 12 and 34 million parameter models, showing a similar trend. In particular, contrastive injection seems to resolve a scaling anomaly observed in vanilla 64 million parameter models, where bias resistance tends to regress. With contrastive injection, however, bias resistance remains stable across all scales tested.
The study suggests that, if this technique proves effective on larger models as well, it could make it possible to achieve behavioral quality comparable to that of models with 5-10 times as many parameters. This would pave the way for running capable language models on resource-constrained devices, such as smartphones, without the need for dedicated GPUs.