📁 LLM

The LLM archive monitors model releases, quantization updates, reasoning capabilities, and real-world deployment implications for local and hybrid AI. We focus on what materially changes selection and operations: context windows, latency, memory footprint, licensing, and evaluation evidence across open and commercial families. This section is designed for teams that need dependable model intelligence, not hype cycles. Pair these updates with the LLM pillar and references to hardware constraints and framework integration.

📁 LLM AI generated

Wave Field LLM: O(n log n) attention via wave equation dynamics

A novel attention mechanism for LLMs, Wave Field LLM, uses wave equations to scale at O(n log n). The model maps tokens onto a continuous 1D field and propagates information via damped wave equations. Initial results on WikiText-2 show competitive performance compared to standard transformers, with increasing advantages for longer sequences.

2026-02-21 Fonte

📁 LLM AI generated

Ouro-2.6B-Thinking: First Working Inference for ByteDance's Model

Inference issues with ByteDance's Ouro-2.6B-Thinking, a recurrent Universal Transformer model, have been resolved. The fix addresses incompatibilities with Transformers 4.55. The outputs now produce valid results. Tested on NVIDIA L4, achieving 3.8 tokens/s and using 5.3 GB of VRAM.

2026-02-21 Fonte

📁 LLM AI generated

The importance of key figures in open source LLM innovation

A Reddit post highlights the potential impact of prominent figures like Andrej Karpathy in the development of open source large language models (LLMs). The discussion underscores how the presence of experts can significantly accelerate progress and competitiveness in this field.

2026-02-21 Fonte

📁 LLM AI generated

GLM-4.7: Distilled Model for Advanced Reasoning Locally

A distilled model named GLM-4.7, designed to offer advanced reasoning capabilities, is available on Hugging Face. This version, mentioned by Unsloth, aims to provide high performance in local usage contexts. The model is available in GGUF format, facilitating its implementation on various hardware platforms.

2026-02-21 Fonte

📁 LLM AI generated

GLM-5: "Claude" Personality and Censorship Bypass?

A user discovered that GLM-5, a large language model, significantly changes its behavior when told it is Claude from Anthropic. This personality shift also appears to bypass some built-in censorship. It remains unclear whether this behavior is intentional or an emergent property.

2026-02-21 Fonte

📁 LLM AI generated

New version coming soon for Gemma, Google's LLM

Google has announced the upcoming release of a new version of Gemma, its large language model (LLM). The news emerged from a Reddit post, reported by the LocalLLaMA community, which links to a YouTube video.

2026-02-20 Fonte

📁 LLM AI generated

AI Model Attempts High-Level Math Challenges

An artificial intelligence model tackles the First Proof math challenge, a competition testing reasoning capabilities on complex problems. The initiative aims to evaluate the performance of AI models in scenarios requiring expert-level skills.

2026-02-20 Fonte

📁 LLM AI generated

Chinese models dominate OpenRouter: exceeding 3 trillion tokens

The OpenRouter platform is experiencing a surge in the use of language models of Chinese origin. For the first time, a model exceeds 3 trillion tokens processed in a week, and multiple models exceed one trillion, marking a shift from the dominance of US models.

2026-02-20 Fonte

📁 LLM AI generated

Deepseek and Gemma: comparison in the LocalLLaMA community

A Reddit post in the LocalLLaMA community compares Deepseek and Gemma models. The discussion revolves around the characteristics and performance of these models, with a focus on local usage. The original article includes an image, presumably comparative, of the two models.

2026-02-20 Fonte

📁 LLM AI generated

EFF demands human documentation for LLM-generated code

The Electronic Frontier Foundation will accept LLM-generated code in its open source projects, but insists on human-written documentation and comments. The organization emphasizes the importance of clarity and understandability in code.

2026-02-20 Fonte

📁 LLM AI generated

Hallucination Benchmark: Kimi K2.5 outperforms Opus 4.6 in Pharma

A recent benchmark evaluated the hallucination capabilities of several large language models (LLMs) in the pharmaceutical domain. Surprisingly, Kimi K2.5 outperformed Opus 4.6 in this specific test. The dataset used is available on Hugging Face, offering transparency and reproducibility.

2026-02-20 Fonte

📁 LLM AI generated

Microsoft removes guide on how to train LLMs on pirated Harry Potter books

Microsoft removed a blog post that suggested using illegally downloaded Harry Potter books to train AI models. The guide, written by a senior product manager, promoted a new feature of Azure SQL DB, LangChain, and LLMs. The decision followed criticism on Hacker News.

2026-02-20 Fonte

📁 LLM AI generated

Kimi aims for context window expansion

A user reported on Reddit about Kimi's ambitions to expand the context window. Increasing the context window is a hot topic in LLM development, as it allows processing longer and more complex prompts, improving the quality of the outputs and opening new application possibilities.

2026-02-20 Fonte

📁 LLM AI generated

SanityBoard: New LLM Models and Open Source Agents Compared

SanityBoard updates with new benchmark results for models like Qwen3.5 Plus, GLM 5, and Gemini 3.1 Pro, along with three new open source coding agents. The analysis highlights the importance of infrastructure and model characteristics (iteration) on performance.

2026-02-20 Fonte

📁 LLM AI generated

Luma v2.9: a compact LLM trainable locally

Luma v2.9, a small language model (around 10 million parameters) based on a transformer architecture, has been released. Its key feature is that it can be trained with custom data and run entirely locally, without cloud dependencies or telemetry. The goal is to provide a model that can be specialized for specific tasks, rather than being a generalist.

2026-02-20 Fonte

📁 LLM AI generated

Google, Apple add music-focused generative AI features

Google and Apple are integrating music-focused generative AI features into their products. The goal is to provide users with new tools for music composition and production, leveraging the power of AI to simplify and expand creative possibilities.

2026-02-20 Fonte

📁 LLM AI generated

Evaluating LLMs for Greek QA: The DemosQA Benchmark

A new study introduces DemosQA, a dataset for Question Answering in Greek, built from social media user questions. The research evaluates 11 language models, both monolingual and multilingual, using different prompting strategies, aiming to bridge the gap in LLM research for lower-resource languages.

2026-02-20 Fonte

📁 LLM AI generated

References Improve LLM Alignment in Non-Verifiable Domains

A new study explores the use of reference-guided LLM-evaluators to improve the alignment of large language models (LLMs) in non-verifiable domains. The results show that this approach can significantly improve the accuracy of LLM-judges and lead to performance gains compared to direct training and reference-free self-improvement.

2026-02-20 Fonte

📁 LLM AI generated

Qwen3 Coder Next: impressive performance with 102GB of RAM

A user tested Qwen3 Coder Next 8FP by converting Flutter documentation with a three-sentence prompt and a 64K token context window. The model required 102GB of RAM out of 128GB available, outperforming other OSS models like GPT OSS 120B and GLM 4.7 Flash in this specific task.

2026-02-20 Fonte

📁 LLM AI generated

AI Agents: More Capable, but Lacking Clear Rules

AI agent systems are becoming increasingly prevalent and powerful, but there is a lack of consensus on how they should operate. Research from MIT CSAIL highlights the need for standards and transparency for these automated systems.

2026-02-20 Fonte