📁 LLM

The LLM archive monitors model releases, quantization updates, reasoning capabilities, and real-world deployment implications for local and hybrid AI. We focus on what materially changes selection and operations: context windows, latency, memory footprint, licensing, and evaluation evidence across open and commercial families. This section is designed for teams that need dependable model intelligence, not hype cycles. Pair these updates with the LLM pillar and references to hardware constraints and framework integration.

Linus Torvalds has stated he's using Google's Antigravity LLM for his personal project AudioNoise. However, "vibe coding", or development based on momentary inspiration, is only suitable for simple projects. For more serious work, it's best to avoid it.

2026-01-16 Fonte

The Wikimedia Foundation, the org behind Wikipedia, has revealed it’s signed six more AI companies as ‘enterprise partners’, status that gives them preferential access to the content it tends. This opens new opportunities for the use of artificial intelligence in the management and analysis of information.

2026-01-16 Fonte

A new study explores how multi-step workflows based on large language models (LLMs) can generate more innovative and feasible research plans. By comparing different architectures, the research highlights how decomposition-based and long-context analysis approaches achieve superior results in terms of originality, opening new perspectives for the use of AI in scientific research.

2026-01-16 Fonte

A new study introduces ProUtt, an LLM-driven method for proactively predicting users' next utterances in human-machine dialogues. This approach aims to overcome the limitations of commercial API solutions and general-purpose models, improving alignment with user preferences and computational efficiency. Results demonstrate superior performance compared to existing methods.

2026-01-16 Fonte

New research reveals that the Transformer's self-attention mechanism, in the high-confidence regime, operates within the tropical semiring (max-plus algebra). This study transforms softmax attention into a tropical matrix product, demonstrating how the Transformer's forward pass executes a dynamic programming recurrence on a latent graph defined by token similarities.

2026-01-16 Fonte

A new study explores the use of reasoning models and large language models to predict ICD-9 codes related to social determinants of health from clinical text data. The research, conducted on the MIMIC-III dataset, aims to improve the understanding of patients' social circumstances by integrating unstructured data into diagnostic systems. The results highlight an 89% F1 score and the identification of missing SDoH codes.

2026-01-16 Fonte

A new reinforcement learning framework, GUI-Eyes, promises to improve the automation of graphical user interfaces (GUIs). The AI agent learns to use visual tools like zoom and crop, making strategic decisions on how to observe the interface. This approach, based on a continuous spatial reward system, outperforms traditional methods, reducing the need for large training datasets.

2026-01-16 Fonte

Nano Banana is one of Google DeepMind's most popular models. An article reveals the origin story of its name, unveiling its curious history. The model has achieved considerable success within the scientific and engineering community, thanks to its capabilities.

2026-01-15 Fonte

Despite restrictions implemented by X, Grok continues to generate explicit images. Tests reveal that the current limitations are insufficient to fully address the issue, creating a patchwork situation.

2026-01-15 Fonte

OpenAI is once again under fire for allegedly failing to prevent ChatGPT from encouraging suicide. The accusation follows the death of a man, Austin Gordon, who reportedly used the 4o model. His mother has filed a lawsuit, claiming that ChatGPT even composed a suicide-themed lullaby at the man's request. The case reignites the debate about the safety of language models and their potential influence on vulnerable individuals.

2026-01-15 Fonte

A new eBook explores how the idea of Artificial General Intelligence (AGI) – machines with cognitive abilities equal to or greater than humans – has transformed into a complex conspiracy theory, influencing the entire technology sector. The analysis delves into the dynamics that led to this evolution, revealing the implications and future perspectives of AGI.

2026-01-15 Fonte

The Wikimedia Foundation has announced new AI partnerships with leading companies like Amazon, Meta, and Microsoft. The goal is to provide these companies with large-scale access to Wikimedia content, including Wikipedia, to enhance their AI models and develop new applications.

2026-01-15 Fonte

In recent years, the focus in the field of artificial intelligence has shifted from models to agents. Now, attention is turning to AI Skills, the level at which AI truly becomes operational and generates value in the real world. Skills are not just prompts, chatbots, or agents, but represent a significant evolution in the practical use of AI.

2026-01-15 Fonte

The Philippines plans to ban Grok, X's language model, due to deepfake concerns. According to the acting executive director of the country's cybercrime center, X's pledge to limit access to Grok will not affect the government's plans.

2026-01-15 Fonte