AI-RADAR.IT · AI-RADAR.NET · AI-RADAR.TECH

News & analysis on local LLMs, stack & on-prem hardware.

📁 LLM AI generated

Noise reduction in BERT NER models for clinical entity extraction

Published on 2026-03-03 05:05 🏆 ArXiv cs.CL 📰 Read the original source article →

Riduzione del rumore per migliorare l'estrazione di entità cliniche con BERT

Clinical entity extraction: a new approach to reduce noise

Precise extraction of clinical entities from medical notes and reports is critical. Encoder models, particularly BERT, fine-tuned for Named Entity Recognition (NER) have proven efficient in this task. However, achieving high precision remains a challenge.

A new study presents a Noise Removal (NR) model that significantly improves the accuracy of BERT-based NER models. This NR model analyzes the probability sequences generated by the NER model, classifying predictions as "weak" or "strong".

Overcoming the limitations of probability thresholds

A simple approach to filtering predictions would rely on probability thresholds. However, due to the characteristics of the SoftMax function, Transformer architectures tend to assign high confidence scores even to uncertain predictions. The proposed NR model overcomes this limitation by adopting a supervised modeling strategy.

The NR model leverages advanced features such as the Probability Density Map (PDM), which captures the Semantic-Pull effect observed in Transformer embeddings. This approach allows the model to classify predictions more accurately, reducing false positives by 50% to 90% in various clinical NER models.

AI-Radar Takeaway

A new Noise Removal (NR) model refines the output of BERT models for Named Entity Recognition (NER) in the clinical domain. The NR model analyzes the output probabilities of the NER model, classifying predictions as weak or strong using a Probability Density Map (PDM), reducing false positives by 50% to 90%.

🤖 Ask AI about this

Want to dive deeper? Read the full article from the source:

📖 READ THE ORIGINAL ARTICLE

💻 Need GPU Cloud Infrastructure?

For running LLM inference, training models, or testing hardware configurations, check out this platform:

RunPod GPU Cloud Platform

Flexible GPU cloud with pay-per-second billing. Deploy instantly with Docker support, auto-scaling, and a wide selection of GPU types from RTX 4090 to H100.

✓ No commitments ✓ Instant deployment ✓ Production-ready

🔗 This is an affiliate link - we may earn a commission at no extra cost to you.

AI-RADAR NEWSLETTER

Stay ahead — get AI signals in your inbox

Daily or weekly digest of the most important AI news. 160+ readers, no spam.

💬 Comments (0)

🔒 Log in or register to comment on articles.

No comments yet. Be the first to comment!

🔍 Continue Exploring

Explore LLM On-Premise

Complete guide to running AI models locally: hardware, stack, and privacy.

LLM: A Human-Centric Pipeline for Aligning LLMs with Chinese Medical Ethics

A new study introduces MedES, a dynamic benchmark for aligning large language models (LLMs) with Chinese medical ethics. The system uses an automated evaluator

GLM-5: New language model coming in February

GLM-5: New language model coming in February

The arrival of GLM-5, a new language model, has been announced. The confirmation came via a post on X (formerly Twitter) by Jietang. Further details on the mode

EVE: A Framework for Faithful and Complete Answers from LLMs

Frameworks Feb 09

EVE: A Framework for Faithful and Complete Answers from LLMs

A new framework, EVE, addresses the limitations of LLMs in providing complete and faithful answers based on a single document. EVE uses a structured approach th

Large Language Models Outperform Doctors in Clinical Diagnosis: Opportunities and Challenges

Large Language Models Outperform Doctors in Clinical Diagnosis: Opportunities and Challenges

A recent study published in Science reveals that an OpenAI LLM surpassed human physicians in clinical reasoning tasks based on real emergency room data. Despite

New training method boosts AI multimodal reasoning with smaller, smarter datasets

New training method boosts AI multimodal reasoning with smaller, smarter datasets

Researchers released a new training framework that improves the capabilities of language models in multimodal reasoning using smaller, smarter datasets.

More in LLM

Even Google believes in small coding models

SpectralQuant narrows the Q4_K_M quantization gap to 96.5%: a leap for local models

Two new AI tools from Tokyo and Beijing fill the gap left by Anthropic's export ban

ConlangCrafter: The AI That Invents Imaginary Languages (and Could Teach Us How We Think)

Orthrus brings diffusion head to Qwen 3.5/3.6 and Gemma 4: open-source code dropping soon

Qwen Fine-tunes: Why Optimized Models Struggle to Impress

→ View all in LLM →

AI-Radar LLM On-Premise

Complete guide to running AI models locally: hardware, stack, privacy, and reference architectures.

👥 Join 160+ AI explorers

A free community of developers, engineers and AI enthusiasts following local AI daily.

Register free → Already a member? Log in