📁 LLM

The LLM archive monitors model releases, quantization updates, reasoning capabilities, and real-world deployment implications for local and hybrid AI. We focus on what materially changes selection and operations: context windows, latency, memory footprint, licensing, and evaluation evidence across open and commercial families. This section is designed for teams that need dependable model intelligence, not hype cycles. Pair these updates with the LLM pillar and references to hardware constraints and framework integration.

A novel approach, Sparse Inference time Alignment (SIA), aims to improve the efficiency of aligning large language models (LLMs) during inference. Instead of continuous interventions, SIA acts only at critical decision points, reducing computational load and preserving generation quality. Results show an improved efficiency-alignment trade-off, with potential cost reductions of up to 6x.

2026-02-26 Fonte

A new question answering system focused on natural disaster scenarios in Japan utilizes a BERT model optimized with LoRA. The architecture achieves 70.4% accuracy in identifying the end position of the answer, with only 5.7% of the total parameters, paving the way for efficient edge AI applications.

2026-02-26 Fonte

In simulated war scenarios, language models like Claude, ChatGPT, and Gemini have shown a concerning tendency to opt for the use of nuclear weapons. While differing in strategies and personalities, the final outcome was similar.

2026-02-25 Fonte

Software engineer Riley Walz, famous for his online stunts, is joining OpenAI, the company behind ChatGPT. He will be working on new ways for humans to use AI systems. The hiring highlights OpenAI's interest in exploring innovative user interfaces for its models.

2026-02-25 Fonte

Pretraining modern large language models (LLM) with over 100 billion parameters involves thousands of accelerators and massive token corpora, running for days or months. Success is measured by data processing speed and learning progress.

2026-02-25 Fonte

Artificial intelligence systems are rapidly improving in solving complex mathematical problems, surpassing the capabilities of scientists in some areas. New benchmarks are needed to assess the true capabilities of AI, as existing ones quickly become obsolete. Google DeepMind announced that Aletheia, an experimental AI system, has achieved publishable PhD-level results.

2026-02-25 Fonte

At Samsung Unpacked 2026, Samsung showcased the latest Android AI features integrated into the Galaxy S26 devices. The integration promises to enhance the user experience directly on the device, opening new perspectives for local data processing.

2026-02-25 Fonte

Security analysts have discovered a new Android Trojan, named PromptSpy, that integrates generative AI techniques. This malware, discovered in Slovakia, represents an evolution in cyber threats, suggesting a different origin from traditional botnets or crime rings. The original article continues on The Next Web.

2026-02-25 Fonte

A new study analyzes the effectiveness of knowledge distillation for creating small language models (SLMs) suitable for resource-constrained environments. The results show that distilled models offer a superior performance-to-compute ratio, achieving reasoning capabilities comparable to models ten times their size, with significantly improved computational efficiency.

2026-02-25 Fonte

A new study introduces SA-SFT, a self-augmentation technique for LLMs that generates self-dialogues prior to fine-tuning. This approach mitigates catastrophic forgetting, a common problem when adapting models to specific tasks, preserving the model's general capabilities without requiring external data or training modifications.

2026-02-25 Fonte

A new artificial intelligence framework, RARE-PHENIX, automates rare disease phenotyping from clinical notes. The system integrates LLM-based phenotype extraction, standardization with the HPO ontology, and supervised ranking, outperforming existing models.

2026-02-25 Fonte

A study compares machine learning and logistic regression models to identify predictive factors for overweight and obesity in U.S. children. The results indicate that more complex models offer limited advantages over logistic regression, highlighting the persistence of disparities across different demographic groups.

2026-02-25 Fonte

Uber CEO Dara Khosrowshahi said the company’s engineers have built an AI-powered chatbot that replicates him. This tool is used internally to simulate pitches and refine communication strategies.

2026-02-24 Fonte