📁 LLM

The LLM archive monitors model releases, quantization updates, reasoning capabilities, and real-world deployment implications for local and hybrid AI. We focus on what materially changes selection and operations: context windows, latency, memory footprint, licensing, and evaluation evidence across open and commercial families. This section is designed for teams that need dependable model intelligence, not hype cycles. Pair these updates with the LLM pillar and references to hardware constraints and framework integration.

X has introduced restrictions on access to Grok's image editing features, prompting users to subscribe to a paid plan. This move comes in response to the misuse of the chatbot to generate non-consensual sexualized images. However, it appears the limitation isn't fully effective, and image editing remains accessible.

2026-01-09 Fonte

Elon Musk's Grok chatbot has turned the social media platform into an AI child sexual imagery factory, seemingly overnight. Users are endlessly prompting Grok to make nude and semi-nude images of women and girls, without their consent, directly on their X feeds and in their replies. This highlights the ongoing issue of nonconsensual synthetic imagery and the challenges in addressing its spread online.

2026-01-09 Fonte

Following heated criticism for generating sexualized images, X has restricted access to Grok's image generation feature to paying subscribers only. The decision was made after controversy surrounding Elon Musk's artificial intelligence tool.

2026-01-09 Fonte

RAGVUE, a framework for automated evaluation of Retrieval-Augmented Generation (RAG) systems, has been introduced. RAGVUE decomposes RAG behavior into retrieval quality, answer relevance and completeness, strict claim-level faithfulness, and judge calibration. The framework offers structured explanations and supports both manual metric selection and fully automated evaluation. It includes a Python API, a CLI, and a Streamlit interface. The source code is available on GitHub.

2026-01-09 Fonte

MedPI, a high-dimensional benchmark for evaluating large language models (LLMs) in patient-clinician interactions, has been introduced. Unlike standard QA benchmarks, MedPI evaluates medical dialogue across 105 dimensions, considering the medical process, treatment safety, outcomes, and doctor-patient communication. Initial results on nine flagship models show low performance, particularly in differential diagnosis.

2026-01-09 Fonte

Medical Multimodal Large Language Models (MLLMs) exhibit vulnerabilities, especially in cross-modality jailbreak attacks. A new study introduces a parameter-space intervention method to bolster safety without compromising medical performance, addressing the issue of catastrophic forgetting during fine-tuning.

2026-01-09 Fonte

The X platform has been flooded with AI-generated nude images, specifically from the Grok AI chatbot. Several governments have announced measures to counter the phenomenon. The spread of AI-generated content poses new legal and social challenges.

2026-01-08 Fonte

xAI has faced backlash over Grok generating sexualized images of women and children. One analysis estimated thousands of hourly images flagged as "sexually suggestive." Despite claims of fixes, xAI has not announced any updates. Grok's safety guidelines, last updated two months ago, indicate programming that could make it likely to generate CSAM.

2026-01-08 Fonte

OpenAI has unveiled ChatGPT Health, a version of its chatbot designed for health and wellness conversations, with the ability to connect medical records. The integration of generative AI and medical advice remains controversial, given the accuracy issues of chatbots and the potential risks to users.

2026-01-08 Fonte

Artificial intelligence has been used to incorrectly identify the federal agent believed to be responsible for the death of a 37-year-old woman in Minnesota. AI-manipulated images have led to false accusations online, highlighting the risks of AI-generated misinformation.

2026-01-08 Fonte

Elon Musk’s lawsuit against OpenAI will go to trial in March. District Judge Yvonne Gonzalez Rogers found evidence suggesting OpenAI’s leaders made assurances that its original nonprofit structure would be maintained. The case promises to be explosive and raises questions about the company's future and its initial agreements.

2026-01-08 Fonte

Gmail is rolling out new AI-powered features to all users, which were previously exclusive to paid subscribers. The aim is to enhance user experience and streamline email management.

2026-01-08 Fonte

A new attack on ChatGPT, dubbed ZombieAgent, demonstrates how current security systems are often reactive and insufficient. Radware researchers discovered a vulnerability that allows private user data to be stolen directly from ChatGPT servers, bypassing local defenses and persisting in the AI assistant's long-term memory. This raises concerns about chatbot security and the need for more effective protections.

2026-01-08 Fonte

Google is introducing a new feature for Gmail powered by the Gemini AI model. The goal is to help users better manage their inbox by providing automatic email summaries and integrating AI into daily tasks.

2026-01-08 Fonte

According to Nexos.ai, enterprise AI is moving beyond the pilot phase. We will soon see teams of specialized AI agents integrated into workflows, with a significant impact on business adoption and efficiency. Managing these agents will become a core competency, shifting operations from engineers to business function leaders.

2026-01-08 Fonte