A new routing method, called Adaptive-K, promises significant computational savings (30-52%) for Mixture of Experts (MoE) models such as Mixtral, Qwen, and OLMoE. The code is available on GitHub, with a live demo on Hugging Face and an open pull request on NVIDIA's TensorRT-LLM.
Greg Brockman's personal notes, co-founder of OpenAI, reveal internal discussions to turn the company into a non-profit without Elon Musk. The documents suggest a maneuver to remove Musk from the company.
Be careful when using ChatGPT: the platform logs every character you type, including sensitive data such as API keys. Even if you delete the text before sending it, the information may have already been stored. Exercise extreme caution with confidential information.
KoboldCpp updates to version 1.106, introducing native support for MCP (Message Passing Communication) servers. This new feature allows a seamless drop-in replacement for Claude Desktop, ensuring maximum compatibility. The update includes a revamped user interface and the ability to manage tools selected by the AI, with optional approval settings.
OpenAI is preparing to test the introduction of advertisements within ChatGPT for free users and is launching a new $8 "Go" subscription. This move represents a significant shift in OpenAI's strategy and could redefine how digital intent and commercial influence intersect in the age of generative AI.
New research demonstrates that repeating prompts can significantly improve the performance of large language models (LLMs) in tasks that do not require complex reasoning. The approach does not impact latency and could become a standard practice.
The online community Local Llama has started a discussion about the hardware configurations users employ to run large language models (LLMs) locally. The goal is to share experiences and optimize system performance, often with unconventional setups. The Reddit thread gathers testimonials and useful tips for those who want to experiment with LLMs without relying on cloud resources.
The online community Local Llama welcomes new users by reaffirming its commitment to bots. The platform focuses on the development and use of large language models (LLM) locally, offering enthusiasts a collaborative environment to explore the potential of generative artificial intelligence.
DeepSeek AI introduced Engram, a novel static memory unit for LLMs. Engram separates remembering from reasoning, allowing models to handle larger contexts and improve performance in complex tasks like math and coding, all while reducing the computational load on GPUs.
Generative AI is transforming software development, enabling professionals and novices to create, test, and debug code more quickly. Companies like Microsoft, Google, and Meta are increasingly integrating AI into their development processes. Tools like GitHub Copilot democratize access to development, but human oversight remains crucial to ensure code reliability and security.
OpenAI plans to introduce a paid subscription tier for ChatGPT, called ChatGPT Go, and integrate advertising into the free version. This move is motivated by the need to finance the huge expenses for datacenter infrastructure.
Research from Dakota State University, in partnership with Safety Insurance, tested a chatbot called "Axlerod" to assist independent insurance agents. The results suggest minimal time savings, raising doubts about the actual return on investment in these technologies.
The California Attorney General has sent Elon Musk's xAI a cease-and-desist order regarding the creation and distribution of sexual deepfake images. The decision comes in response to growing concern from state and Congressional officials about the proliferation of AI-generated content.
OpenAI has announced it will begin testing advertisements inside the ChatGPT app for some US users. The aim is to expand its customer base and diversify revenue. Initially against the idea, CEO Sam Altman had described advertising in ChatGPT as a "last resort". The banner ads will appear in the coming weeks for logged-in users of the free version and the new $8 per month ChatGPT Go plan.
OpenAI says that users impacted by the ads will have some control over what they see. This represents a significant shift in the platform's business model, opening up new monetization opportunities while also raising questions about privacy and user experience.
OpenAI has announced that ads will be introduced in ChatGPT. The company emphasizes that the ads will not influence ChatGPT’s responses, and that it won’t sell user data to advertisers. The topic of advertising in AI services is a hot one, raising questions about privacy and information integrity.
Artificial intelligence companies are decisively targeting the healthcare sector. OpenAI acquired Torch, Anthropic launched Claude for Health, and MergeLabs, backed by Sam Altman, closed a $250 million seed funding round of $250 million, with a valuation of $850 million. The influx of capital and voice AI-based products raises concerns about potential model hallucinations.
OpenAI has announced plans to experiment with advertising within the free and "Go" tiers of ChatGPT in the U.S. The goal is to make access to artificial intelligence more affordable and widespread globally, while maintaining high standards of privacy, reliability, and answer quality.
OpenAI launches ChatGPT Go worldwide, offering broader access to GPT-5.2 Instant. The new version includes higher usage limits and extended memory, making advanced artificial intelligence more accessible globally. The goal is to democratize access to cutting-edge AI technologies.
Ashley St Clair, an influencer and mother of one of Elon Musk’s children, has sued the billionaire's AI company, accusing its Grok chatbot of creating fake sexual imagery of her without her consent. St Clair claims she requested xAI to stop creating such images, but Grok allegedly continued to produce them.