Topic / Trend Rising

Large Language Models (LLMs) Advancements and Applications

LLMs are rapidly evolving, with advancements in training techniques, architectures, and applications across various industries. The focus is shifting towards efficiency, accessibility, and real-world problem-solving, while also addressing ethical concerns and biases.

Detected: 2026-01-21 · Updated: 2026-01-21

Related Coverage

2026-01-21 Tech.eu

Fracttal raises $35M to expand AI-driven maintenance

Fracttal, a Madrid-based company specializing in AI-powered maintenance solutions, has closed a $35 million funding round led by Riverwood Capital. The investment will support the company's continued growth, product development, and global expansion....

#Hardware
2026-01-21 Tech.eu

Antidote completes $5M seed round for billing compliance automation

Antidote, a provider of AI-based billing compliance software for law firms, has raised $5 million in a seed funding round. The funding will support the advancement of its platform and expand its presence in the US, aiming to reduce billing errors and...

2026-01-21 LocalLLaMA

Building an LM from Scratch: Day 6 Update

An enthusiast shares progress on building a language model (LM) from scratch. After stabilizing the system, the focus shifted to training, revealing the need for a significantly higher number of steps to achieve optimal results. Despite initial chall...

#Hardware
2026-01-21 OpenAI Blog

Horizon 1000: OpenAI and Gates Foundation Advance AI in Africa

OpenAI and the Gates Foundation launch Horizon 1000, a $50M pilot program to advance AI capabilities for healthcare in Africa. The initiative aims to reach 1,000 clinics by 2028, bringing innovation and improving access to medical care.

2026-01-21 ArXiv cs.AI

Rare disease diagnosis: Is AI really up to the task?

A new study challenges the effectiveness of large language models (LLMs) in the differential diagnosis of rare diseases. The MIMIC-RD benchmark reveals that current LLMs struggle to handle real-world clinical complexity, highlighting a significant ga...

#Fine-Tuning
2026-01-21 LocalLLaMA

Camb AI: New Model with Minimal Latency for Live Sports?

A user reported the launch of a new Camb AI model, particularly effective in live sports broadcasts. The most notable aspect is its low latency and high voice quality, making it indistinguishable from human speech. The technology raises questions abo...

2026-01-21 DigiTimes

Intel recruits Qualcomm GPU chief to lead future AI PC efforts

Intel has recruited a former Qualcomm GPU executive to lead its future AI PC efforts. This strategic move aims to strengthen Intel's position in the rapidly growing market for PCs equipped with advanced AI capabilities, leveraging the new leader's ex...

#Hardware
2026-01-21 DigiTimes

Asia Optical bets on humanoid robots as its next growth engine

Asia Optical chairman I-Jen Lai sees humanoid robots as the company's next growth engine. The company is investing in this emerging sector, betting on the long-term potential of advanced robotics. Increased demand is expected in the coming years, wit...

2026-01-21 TechCrunch AI

Bolna nabs $6.3M for its India-focused voice orchestration platform

Bolna, specializing in voice orchestration platforms focused on the Indian market, has raised $6.3 million in funding led by General Catalyst. The company stated that 75% of its revenue comes from self-service customers, highlighting strong adoption ...

2026-01-21 The Register AI

OpenAI: Age Prediction Model for ChatGPT Users

OpenAI has begun deploying an age prediction model for its ChatGPT users. The goal is to filter access to sensitive or potentially harmful content for underage users. This initiative could unlock new monetization opportunities by restricting access b...

2026-01-20 DigiTimes

Quanta EVP Mike Yang: AI industry paradigm shift just begun

Mike Yang, executive vice president of Quanta Computer, foresees a paradigm shift in the artificial intelligence sector. According to Yang, this is just the beginning of a profound transformation, with significant implications for the future of techn...

2026-01-20 DigiTimes

Sony and TCL move toward joint venture in home entertainment

Sony and TCL are reportedly considering a joint venture in the home entertainment sector. The potential agreement could lead to increased collaboration in the development and production of televisions and other home entertainment devices. Further det...

2026-01-20 DigiTimes

Inventec doubles 2026 capex to US$1 billion for AI servers

Inventec has announced a doubling of its planned capital expenditure for 2026, bringing it to US$1 billion. The decision is driven by growing opportunities in the artificial intelligence (AI) server market. The company aims to strengthen its position...

#Hardware
2026-01-20 LocalLLaMA

GLM-4.7-Flash implementation in llama.cpp: issues confirmed

Recent discussions suggest that the GLM-4.7-Flash implementation in llama.cpp has issues. Significant differences in logprobs compared to vLLM could explain anomalous behaviors reported by users, such as infinite loops and poor response quality. It i...
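
A divergence like the one described above can be quantified by comparing per-token logprobs from two backends for the same prompt and the same tokens. The sketch below assumes the logprobs have already been exported as plain lists of floats. The function name and the sample values are hypothetical, and no llama.cpp or vLLM API is called.

```python
def logprob_divergence(reference, candidate):
    """Return (max_abs_diff, mean_abs_diff) between two per-token logprob lists."""
    if len(reference) != len(candidate):
        raise ValueError("both lists must cover the same tokens")
    if not reference:
        raise ValueError("need at least one token")
    # Absolute difference at each token position.
    diffs = [abs(r - c) for r, c in zip(reference, candidate)]
    return max(diffs), sum(diffs) / len(diffs)


# Hypothetical values, for illustration only (not measured from any model).
reference_logprobs = [-0.10, -1.20, -0.05]
candidate_logprobs = [-0.10, -2.70, -0.05]
max_diff, mean_diff = logprob_divergence(reference_logprobs, candidate_logprobs)
```

A large maximum difference at a single position points at one specific token where the two implementations disagree, which is usually a better starting point for debugging than the mean.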

#LLM On-Premise
2026-01-20 TechCrunch AI

ChatGPT: age estimation to protect young users

OpenAI introduces a new feature in ChatGPT: the model now estimates the age of users. The goal is to prevent the delivery of potentially problematic content to individuals under 18, strengthening safety measures for young people.

2026-01-20 TechCrunch AI

Tesla restarts Dojo3 project for space-based AI applications

Elon Musk announced that Tesla will restart the development of Dojo3, its previously abandoned third-generation AI chip. Unlike the original plans, Dojo3 will now be dedicated to space-based AI compute, opening new frontiers for Tesla's space applica...

2026-01-20 OpenAI Blog

Cisco and OpenAI: AI agents for enterprise engineering

Cisco and OpenAI are collaborating to redefine enterprise engineering. The focus is Codex, an AI software agent embedded in workflows to speed up development, automate defect fixes, and enable AI-native development.

2026-01-20 OpenAI Blog

ChatGPT: Age Prediction Rollout for Enhanced Online Safety

OpenAI is rolling out age estimation on ChatGPT to protect younger users. The system assesses whether an account belongs to a minor or an adult, applying specific safeguards for teenagers. The company plans to progressively improve the model's accura...

2026-01-20 The Register AI

VoidLink: Linux malware targeting the cloud, written by an AI agent

A new Linux malware strain, named VoidLink, has been discovered targeting cloud infrastructure. What makes it special? According to researchers, it was developed almost entirely by an artificial intelligence agent, likely directed by a single individual. VoidLink u...

2026-01-20 LocalLLaMA

New LongPage Dataset: Over 6K Novels to Train Full Book Writing LLMs

An update to the LongPage dataset has been released, now including over 6,000 full-length novels paired with reasoning traces. These traces break down the story into hierarchical sections, from the general idea to individual chapters and scenes. The ...

#Fine-Tuning
2026-01-20 LocalLLaMA

Liquid AI released the best thinking Language Model Under 1GB

Liquid AI released LFM2.5-1.2B-Thinking, a reasoning model that runs entirely on-device. Trained specifically for concise reasoning, it generates internal thinking traces before producing answers, enabling systematic problem-solving at edge-scale lat...

2026-01-20 The Register AI

AI PCs for the Enterprise: Does TOPS Trump Everything Else?

Artificial intelligence is becoming ubiquitous in the enterprise technology world. But are AI PCs really that widespread? An analysis of the role of computing power (TOPS) in the adoption of AI PCs in the enterprise and whether this parameter is the ...

2026-01-20 MIT Technology Review

The era of agentic chaos and how data will save us

The adoption of AI agents is growing rapidly, but many companies are not ready. A solid data infrastructure is essential to avoid chaos and maximize the value of AI. Market leaders invest in quality data to ensure agent reliability and achieve concre...

2026-01-20 The Register AI

Majority of CEOs report zero payoff from AI splurge

A PwC survey of over 4,500 business leaders reveals that more than half have seen neither increased revenue nor decreased costs following massive investments in AI. The findings raise questions about the actual economic return of these technologies.

2026-01-20 The Register AI

AI framework flaws put enterprise clouds at risk of takeover

Two vulnerabilities in the popular open-source AI framework Chainlit put major enterprises' cloud environments at risk. According to Zafran, the flaws are easy to exploit and could lead to data leaks or full system takeover. It is recommended to upda...

2026-01-20 LocalLLaMA

DeepSeek: a new model appears, codenamed "model1"

A DeepSeek repository has been updated with a reference to a new model identified as "model1". The discovery was made via a file within DeepSeek's FlashMLA repository on GitHub. Further details on the model's specifications or capabilities are curren...

2026-01-20 LocalLLaMA

LocalLLaMA: The unstoppable rise of local language models

A Reddit post in the LocalLLaMA community highlights the surprising capabilities of language models running locally. The discussion emphasizes how these models, while running on consumer hardware, demonstrate a level of context understanding and responsiveness that often...

#Hardware
2026-01-20 The Register AI

Manchester ATM ups PIN requirement to full Windows login

An ATM in Manchester has been spotted displaying a Windows 7 login screen, an operating system no longer supported by Microsoft. The image raises concerns about customer data security and the vulnerability of outdated banking systems. The incident hi...

2026-01-20 The Register AI

AI crashes in UK finance: MPs ask who is responsible

UK MPs are urging financial regulators to conduct thorough stress tests. The goal is to prepare businesses for market shocks triggered by artificial intelligence. The crucial issue of assigning responsibility for automated decisions remains unresolve...

2026-01-20 The Register AI

UK's Department of Health Seeks Tech Director: £285k Salary

England's Department of Health and Social Care is recruiting a head of technology, digital and data with a salary of up to £285,000 a year, more than the department's boss earns. The role is pivotal in driving technological innovation...

2026-01-20 Tech.eu

Stilla emerges from stealth with $5M to boost AI collaboration

Stockholm-based Stilla has raised $5 million to develop a platform that enhances collaboration between people and AI systems. The goal is to provide an intelligence layer that connects workplace tools like Slack, GitHub, and Notion, ensuring teams st...

2026-01-20 DigiTimes

Alibaba's Qwen expansion links AI directly to consumer services

Alibaba is expanding the integration of its Qwen artificial intelligence model directly into consumer-facing services. This strategic move aims to enhance user experience and offer advanced AI-powered features across various domains, solidifying Alib...

2026-01-20 LocalLLaMA

GLM-4.7-Flash-GGUF is here!

A new version of GLM-4.7-Flash-GGUF has been released, a large language model (LLM) designed for local inference. This implementation, available on Hugging Face, allows users to run the model directly on their devices, opening new possibilities for o...

#Hardware
2026-01-20 OpenAI Blog

AI for self empowerment: new growth opportunities

Artificial intelligence can expand human capabilities, bridging the skills gap and unlocking new opportunities for productivity and growth for individuals, businesses, and nations. An analysis of how AI can foster self-empowerment and development.

2026-01-19 LocalLLaMA

LightOn OCR: New Open Source Model for Optical Character Recognition

LightOn AI has released LightOnOCR-2-1B, an open-source Optical Character Recognition (OCR) model. The model is available on Hugging Face and aims to provide an accessible solution for extracting text from images. Its release has been welcomed by the...

2026-01-19 LocalLLaMA

GLM-4.7-FLASH: Mixed Precision NVFP4 Version Available on Hugging Face

A mixed precision NVFP4 quantized version of GLM-4.7-FLASH has been published on Hugging Face. The author encourages the community to test the model and provide feedback. The model has a size of 20.5 GB and aims to optimize performance while maintain...

#Hardware
2026-01-19 LocalLLaMA

Gemma 3:1b: What are the main uses of small models?

A user wonders about the possible uses of small language models like Gemma 3:1b. These models, while running on less powerful hardware, open up interesting scenarios. It remains to be seen whether they are suitable for basic tasks or simple calculati...

#Hardware
2026-01-19 TechCrunch AI

US AI startups raise record funding in 2025

2024 was a pivotal year for the AI industry in the US and beyond. It remains to be seen whether 2025 will be equally positive. Analysis reveals that numerous AI startups have raised over $100 million in funding, marking an unprecedented wave of inves...

2026-01-19 LocalLLaMA

Nvidia GB10 vs GH200: early performance benchmarks

Early benchmarks comparing the performance of Nvidia's GB10 GPU with the GH200 have surfaced online. The data, originating from a Reddit source, offers a preview of the potential of Nvidia's new architecture, although they should be taken with cautio...

#Hardware
2026-01-19 LocalLLaMA

llama.cpp adopts Anthropic Messages API

The llama.cpp library has integrated Anthropic's Messages API, opening new possibilities for interacting with language models. This integration, announced on Reddit and Hugging Face, allows developers to leverage the capabilities of llama.cpp for adv...

#LLM On-Premise
2026-01-19 The Register AI

From AI Ambition to AI Production: Escaping the AI Pilot Trap

Many companies want to implement artificial intelligence, but struggle to move from the pilot phase to large-scale production. Pilot projects often fail to take off because the necessary infrastructure to support them is lacking. Interest in AI is ve...

2026-01-19 LocalLLaMA

GLM-4.7-Flash: a 30B model that is impressive in BrowseComp

A Reddit post highlights the performance of the GLM-4.7-Flash 30B parameter model in the context of BrowseComp, suggesting that Qwen may need to catch up. The comparison also includes GPT-OSS-20B. The model is available on Hugging Face.

2026-01-19 LocalLLaMA

Ghost Engine: Run Llama-3-8B in 3GB VRAM by Generating Weights

A new inference engine, called Ghost Engine, promises to drastically reduce memory consumption when running large language models (LLMs). Instead of loading static weights, Ghost Engine generates them on the fly, trading memory bandwidth for compute....

2026-01-19 Tech.eu

Isle of Man launches National AI Office with £1M investment

The Isle of Man Government has launched its National AI Office (NAIO), backed by a £1 million investment. The aim is to coordinate the responsible adoption of artificial intelligence across the island, supporting businesses and the public sector. The...

2026-01-19 LocalLLaMA

GLM-4.7-Flash: New Open-Source Language Model on Hugging Face

The GLM-4.7-Flash language model is now available on Hugging Face. The news was shared on Reddit, sparking discussion within the LocalLLaMA community. The open-source model promises new opportunities for developing generative artificial intelligence ...

2026-01-19 404 Media

ICE’s Facial Recognition App Misidentified a Woman. Twice

The Mobile Fortify app, used by Immigration and Customs Enforcement (ICE) to identify individuals and determine their immigration status, provided two incorrect names for the same woman during a check. The incident raises doubts about the accuracy of...

2026-01-19 IEEE Spectrum

AI Boosts Research Careers, but Flattens Scientific Discovery

An analysis of over 40 million academic papers reveals that scientists using AI tools publish more and reach leadership positions faster. However, AI-driven research tends to focus on narrow areas, limiting originality and diversity in scientific inq...

2026-01-19 AI News

Artificial intelligence: transforming credit unions

Artificial intelligence is rapidly transforming financial services, offering new opportunities but also challenges for credit unions. These institutions, built on trust and community alignment, must integrate AI to meet member expectations and compet...

2026-01-19 The Register AI

Police chief resigns after force relied on AI hallucination

The chief constable of West Midlands Police has resigned after his police force used fictional output from Microsoft Copilot in deciding to ban Israeli fans from attending a football match. The officer had denied the use of artificial intelligence sy...

2026-01-19 LocalLLaMA

GLM-4.7-Flash soon? Leaks about the new language model

Hints of a possible imminent release of GLM-4.7-Flash are surfacing. An update to the GLM-4.7 collection, containing a hidden item, has caught the attention of experts. Initial analysis suggests that Zai is preparing to launch this new version. A com...

#LLM On-Premise
2026-01-19 Tom's Hardware

China leads in advanced robotics and world models: AI's next frontier

The AI race is shifting towards advanced robotics and world models. China is positioning itself as a leader in this field, with a high number of operational robots expected as early as 2025. This trend could redefine the global balance in the technol...

2026-01-19 LocalLLaMA

Top-K: Optimized Algorithm Up to 20x Faster Than PyTorch

A developer has created an optimized implementation of Top-K selection, an operation central to sampling in large language models (LLMs). The AVX2-optimized code runs 4-20x faster than PyTorch on CPU, depending on vocabulary size. Integration into llama.cpp r...
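
For readers unfamiliar with the operation being optimized, here is a minimal pure-Python sketch of Top-K sampling. It is not the developer's AVX2 code and makes no performance claim. The function name `top_k_sample` and its parameters are assumptions made for illustration.

```python
import heapq
import math
import random


def top_k_sample(logits, k, rng=random):
    """Sample a token id from the k highest-scoring logits."""
    if not 1 <= k <= len(logits):
        raise ValueError("k must be between 1 and the vocabulary size")
    # Pick the k largest (token_id, logit) pairs without sorting the whole
    # vocabulary; nlargest returns them in descending order of logit.
    top = heapq.nlargest(k, enumerate(logits), key=lambda pair: pair[1])
    max_logit = top[0][1]
    # Softmax weights over the survivors; subtracting the max avoids overflow.
    weights = [math.exp(logit - max_logit) for _, logit in top]
    token_ids = [token_id for token_id, _ in top]
    return rng.choices(token_ids, weights=weights, k=1)[0]
```

The selection step is the part that scales with vocabulary size, which is why it is a natural target for vectorized implementations.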

#Hardware #LLM On-Premise
2026-01-19 LocalLLaMA

Flog: Free iOS Nutrition Tracker App with Local LLM Support

A developer has created Flog, a free iOS app that tracks nutrition through photos, leveraging local LLM models to estimate portions and nutrients. The app integrates with Apple Health and supports LLM models run directly on the device or via LM Studi...

2026-01-19 LocalLLaMA

A look behind the scenes: building 3 GH200 systems in the workshop

A Reddit user shared images of the process of assembling three GH200 systems inside a workshop. The images show the various stages of construction, offering a close-up look at the hardware and infrastructure needed to support these high-performance s...

#Hardware
2026-01-19 LocalLLaMA

JARVIS: Progress Report on LLM Agent Development

A Reddit user shared an update on the development of JARVIS, an agent based on large language models (LLMs). The original post includes a link to a demonstration video of the project. The development of LLM agents is a rapidly growing research area, w...

2026-01-19 LocalLLaMA

Local LLM Coding: Is it Still Worth it with a 16GB GPU?

A user with a 16GB Nvidia RTX 5070 Ti GPU questions whether coding with a local large language model (LLM) is practical. Experience with Kilo Code and Qwen 2.5 Coder 7B via Ollama revealed issues with context management: the context window quickly runs out even wit...

#Hardware #LLM On-Premise
2026-01-19 Wired AI

The Race to Build the DeepSeek of Europe Is On

As Europe’s longstanding alliance with the US falters, its push to become a self-sufficient AI superpower has become more urgent. The goal is to create a European alternative to advanced models like DeepSeek, reducing technological dependence on othe...

2026-01-19 DigiTimes

Apple-Google AI partnership could reshape voice assistant market

A potential collaboration between Apple and Google in the field of artificial intelligence could reshape the voice assistant market. The partnership, if realized, would have an estimated value of up to $5 billion. Implications and details of the agre...

2026-01-19 The Register AI

Hiring Stalls at India’s Big Four Outsourcers Amid AI Impact

India’s big four outsourcers – HCL, Infosys, TCS and Wipro – have essentially stopped hiring, potentially due to increased AI adoption. Revenue growth is also sluggish. This slowdown reflects a significant shift in the IT services landscape.

2026-01-19 ArXiv cs.CL

BYOL: Bring Your Own Language Into LLMs

A new study introduces BYOL, a framework for improving the performance of large language models (LLMs) in languages with limited digital presence. BYOL classifies languages based on available resources and adapts training techniques, including synthe...

2026-01-19 ArXiv cs.AI

LLMs: How Do They Assess Trustworthiness of Online Information?

Large language models (LLMs) are increasingly important in online search and recommendation systems. New research analyzes how these models encode perceived trustworthiness in web narratives, revealing that models internalize psychologically grounded...

#Fine-Tuning
2026-01-19 LocalLLaMA

Hot take: OpenAI should open-source GPT-4o

A user suggested that OpenAI should open-source the GPT-4o model. Despite safety concerns, the move could cover OpenAI's open-source rally for the next few months and save on the costs of maintaining the model.

#Fine-Tuning
2026-01-19 LocalLLaMA

Strix Halo as LLM Server: Which Linux Distro to Choose?

A user is considering using their Strix Halo machine as a server for large language models (LLMs) and as a media server, and is looking for the most suitable Linux distribution. Fedora 43 is already installed, but alternatives are being considered for optimal RDP suppor...

2026-01-19 LocalLLaMA

DetLLM: tool to ensure deterministic inference in LLMs

A developer has created DetLLM to address the issue of non-reproducibility in LLM inference. The tool verifies repeatability at the token level, generates a report, and creates a minimal reproduction package for each run, including environment snapsh...
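
A token-level repeatability check of the kind described above can be sketched as follows. This is not DetLLM's code or interface. It assumes each run has already been captured as a list of token ids, and the function names and report fields are hypothetical.

```python
def first_divergence(run_a, run_b):
    """Return the index of the first differing token, or None if the runs match."""
    for index, (a, b) in enumerate(zip(run_a, run_b)):
        if a != b:
            return index
    # One run is a strict prefix of the other: they diverge where the shorter ends.
    if len(run_a) != len(run_b):
        return min(len(run_a), len(run_b))
    return None


def repeatability_report(runs):
    """Compare every run against the first one and summarize the result."""
    if not runs:
        raise ValueError("need at least one run")
    baseline = runs[0]
    divergences = [first_divergence(baseline, run) for run in runs[1:]]
    return {
        "runs": len(runs),
        "deterministic": all(d is None for d in divergences),
        "first_divergence": divergences,
    }
```

Reporting the first diverging position, and not just a pass/fail flag, narrows down where nondeterminism enters the generation.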

2026-01-19 LocalLLaMA

SLM Prompting: How to Outperform Larger Language Models?

A user is questioning how to get the most out of small language models (SLMs), especially when fine-tuned for a specific topic. The challenge is that traditional prompts, effective with large language models (LLMs), often produce incoherent results w...

2026-01-19 DigiTimes

US-Taiwan defense ties deepen due to 15% tariff cap

According to DIGITIMES, defense ties between the US and Taiwan are deepening, partly due to a 15% tariff cap. This move highlights the increasing collaboration between the two nations in a strategically crucial area.

2026-01-18 DigiTimes

AI: Machine identities outnumber humans in Asia-Pacific

Artificial intelligence is reshaping the cybersecurity landscape in the Asia-Pacific region, with an exponential increase in machine identities. This shift poses new challenges for protecting systems and data, requiring more sophisticated and automat...

2026-01-18 LocalLLaMA

How do you pronounce "GGUF"? The pronunciation dilemma in AI

The pronunciation of "GGUF", a file format used in the field of artificial intelligence, is generating a heated debate in the community. The most common options include "jee-guff", "giguff", and "jee jee you eff". The discussion highlights the challe...

2026-01-18 OpenAI Blog

AI for human agency: a driver of growth and opportunity

Artificial intelligence can expand human capabilities, bridging the skills gap and unlocking new growth opportunities for individuals, businesses, and nations. An analysis of AI's potential as a tool to increase productivity and foster economic devel...

2026-01-18 LocalLLaMA

RLVR and GRPO: From-Scratch Implementation with Notebook

A code notebook illustrating the from-scratch implementation of RLVR (Reinforcement Learning with Verifiable Rewards) with GRPO (Group Relative Policy Optimization) is now available. The resource, hosted on GitHub, was shared on Reddit and is intended for th...
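
As commonly described, the core of GRPO is to score each sampled completion against the other completions drawn for the same prompt, with the reward coming from a programmatic check. The sketch below is not taken from the notebook. The function names, the exact-match reward, and the use of the population standard deviation are assumptions made for illustration.

```python
import statistics


def verifiable_reward(completion, expected_answer):
    """Toy verifiable reward: 1.0 if the completion matches the known answer."""
    return 1.0 if completion.strip() == expected_answer else 0.0


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize each reward against the mean and spread of its own group."""
    if not rewards:
        raise ValueError("need at least one reward")
    mean = statistics.fmean(rewards)
    # Population standard deviation; some implementations use the sample version.
    std = statistics.pstdev(rewards)
    # eps keeps the division defined when every completion gets the same reward.
    return [(r - mean) / (std + eps) for r in rewards]


# Four hypothetical completions sampled for one prompt whose answer is "42".
completions = ["42", "41", " 42 ", "forty-two"]
rewards = [verifiable_reward(c, "42") for c in completions]
advantages = group_relative_advantages(rewards)
```

Because the baseline is the group mean, no separate value network is needed. Completions that beat their siblings get a positive advantage and the rest a negative one.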

2026-01-18 Tom's Hardware

Vintage Resurrection: 1974 Altair 8800 Computer Fixed and Runs in 2026

A 1974 Altair 8800 computer, incorrectly assembled, was repaired and successfully ran its first program in 2026. The machine, powered by an Intel 8080 processor, came to life over fifty years after its construction. The repair was documented by a com...

#Hardware
2026-01-18 Tom's Hardware

U.S. EPA Requires Permits for Musk's xAI Gas Turbine Generators

The U.S. EPA now requires permits to operate gas turbine generators, even temporary ones, closing loopholes in some local ordinances that waived this requirement for deployments that lasted for less than 364 days. This affects Elon Musk's xAI.

2026-01-18 The Register AI

Nvidia leans on emulation to squeeze more HPC oomph from AI chips

Nvidia is leaning on emulation to boost the performance of its AI chips in high-performance computing (HPC), amid competition with AMD. AMD researchers argue that algorithms like the Ozaki scheme merit investigation but aren't yet ready for prime tim...

#Hardware
2026-01-18 LocalLLaMA

Ministral 3 Reasoning Heretic: Uncensored LLM Models and GGUFs

Ministral 3 Reasoning Heretic models are now available, uncensored versions with vision capabilities. User coder3101 released quantized models (Q4, Q5, Q8, BF16) with MMPROJ for vision features, speeding up release times for the community. 4B, 8B and...

#Hardware
2026-01-18 LocalLLaMA

Newelle 1.2: AI assistant for Linux gets an update

Version 1.2 of Newelle, the AI assistant designed for Linux, is now available. The update includes llama.cpp integration, a new model library for ollama/llama.cpp, and hybrid search optimized for document reading. Other new features include the addit...

#LLM On-Premise #RAG
2026-01-18 LocalLLaMA

Analyzing 1M+ Emails for Context Engineering: Key Learnings

A team processed over a million emails to turn them into structured context for AI agents. The analysis revealed that thread reconstruction is complex, attachments are crucial, multilingual conversations are frequent, and data retention is a hurdle f...

2026-01-18 LocalLLaMA

Faster LLM Inference with Speculative Decoding

Speculative Decoding promises a 2x-3x speedup in large language model (LLM) inference without sacrificing accuracy. By leveraging a smaller model to generate token drafts, and then verifying them in parallel with the main model, hardware utilization ...
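
The draft-then-verify loop described above can be sketched with toy models. This is the greedy variant only, where a drafted token is kept if it equals the main model's own greedy choice. The sampling-based acceptance rule is not shown. The function names and the toy `target`/`draft` callables are hypothetical. A real engine verifies all drafted positions in one batched forward pass, which is where the speedup comes from, whereas this sketch calls the target once per position for clarity.

```python
def greedy_decode(target_next, prompt, num_tokens):
    """Baseline: plain greedy decoding with the main model alone."""
    tokens = list(prompt)
    for _ in range(num_tokens):
        tokens.append(target_next(tokens))
    return tokens


def speculative_decode(target_next, draft_next, prompt, num_tokens, draft_len=4):
    """Greedy speculative decoding; output equals greedy_decode with target_next."""
    tokens = list(prompt)
    goal = len(prompt) + num_tokens
    while len(tokens) < goal:
        # 1. The small draft model proposes a short continuation.
        draft = []
        for _ in range(draft_len):
            draft.append(draft_next(tokens + draft))
        # 2. The main model checks each drafted position in order.
        accepted = []
        for proposed in draft:
            expected = target_next(tokens + accepted)
            accepted.append(expected)
            if expected != proposed:
                break  # first mismatch: keep the main model's token, drop the rest
        else:
            # Every draft token matched, so the verification pass yields one more.
            accepted.append(target_next(tokens + accepted))
        tokens.extend(accepted)
    return tokens[:goal]


# Toy models over digits 0-9: the draft is wrong whenever the last token is 2.
target = lambda ctx: (ctx[-1] + 1) % 10
draft = lambda ctx: 0 if ctx[-1] == 2 else (ctx[-1] + 1) % 10
```

Only tokens the main model would have produced anyway are ever kept, so a poor draft model costs speed but does not change the output.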

#Hardware #Fine-Tuning
2026-01-18 LocalLLaMA

LLMs: Does Exclusive Training on Synthetic Data Work?

Training large language models (LLMs) exclusively on synthetic data is a debated topic. A recent study highlighted how the recursive use of AI-generated data can lead to a deterioration in model quality. However, other studies show positive results w...

2026-01-18 LocalLLaMA

Open-source tool makes 5 AIs debate to validate answers

A developer has created an open-source platform that uses five large language models (LLMs) in a debate and cross-checking process. The goal is to reduce blind reliance on AI responses, promoting a more critical and validated approach. The code is av...

2026-01-18 LocalLLaMA

Personal-Guru: Open-Source AI Tutor Builds Custom Curriculum Locally

Personal-Guru is an open-source learning system that automatically generates a structured curriculum from a topic. It runs locally, without subscriptions, offering privacy and offline capabilities. It includes quizzes, flashcards, and audio/video mod...

#LLM On-Premise
2026-01-18 LocalLLaMA

AI insiders seek to poison the data that feeds them

Some AI insiders are considering strategies to compromise the datasets used to train language models. The goal is to sabotage future models, making them less reliable and accurate. The discussion emerged on Reddit and references an article from The R...

2026-01-17 LocalLLaMA

The Search for Uncensored AI (That Isn’t Adult-Oriented)

A user is searching for a genuinely unfiltered and technically advanced AI, capable of reasoning freely without excessive restrictions. Many AIs labeled as "uncensored" seem optimized for low-effort adult use, rather than for intelligence and depth. ...

#LLM On-Premise
2026-01-17 LocalLLaMA

Local LLMs: prototype for speed reading to avoid overload

A prototype explores the use of speed reading in local LLMs for mobile devices, aiming to avoid information overload and improve user experience. The idea is particularly useful for resource-constrained devices, where efficient text management is cru...

2026-01-17 LocalLLaMA

Adaptive-K routing: up to 52% compute savings on MoE models

A new routing method, called Adaptive-K, promises significant computational savings (30-52%) for Mixture of Experts (MoE) models such as Mixtral, Qwen, and OLMoE. The code is available on GitHub, with a live demo on Hugging Face and an open pull requ...

#Hardware
2026-01-17 LocalLLaMA

ChatGPT logs everything you type, even if you delete it

Be careful when using ChatGPT: the platform logs every character you type, including sensitive data such as API keys. Even if you delete the text before sending it, the information may have already been stored. Exercise extreme caution with confident...

2026-01-17 Tom's Hardware

OpenAI could run out of cash by mid-2027, analyst warns

A financial analysis paints a concerning picture for OpenAI: the company could run out of cash as early as mid-2027. The implications of this forecast raise questions about the company's future and its ability to sustain growth in the competitive art...

2026-01-17 Tom's Hardware

Chinese AI developers explore renting Nvidia’s Rubin GPU in the cloud

Leading developers of AI models from China are exploring ways to rent Nvidia's upcoming Rubin GPUs in the cloud. This option could overcome the challenges related to high costs, deployment complexity, and regulatory hurdles that limit the direct adop...

#Hardware
2026-01-17 DigiTimes

Render Network: Brand-Owned AI and GPU Creation Surge by 2026

Render Network forecasts a future where AI experiences are fully brand-owned, anticipating a surge in GPU-powered content creation by 2026. The company is betting on exponential growth in the digital graphics and AI space.

#Hardware
2026-01-17 DigiTimes

Taiwan's IC design sector to split, driven by AI by 2025

Taiwan's IC design sector is expected to undergo a significant transformation by the end of 2025, driven by the increasing integration of artificial intelligence (AI). This evolution could lead to a split in the sector, with some companies specializi...

2026-01-17 The Next Web

ChatGPT: OpenAI introduces ads and a new $8 "Go" plan

OpenAI is preparing to test the introduction of advertisements within ChatGPT for free users and is launching a new $8 "Go" subscription. This move represents a significant shift in OpenAI's strategy and could redefine how digital intent and commerci...

2026-01-17 LocalLLaMA

Prompt Repetition Improves Non-Reasoning LLMs

New research demonstrates that repeating prompts can significantly improve the performance of large language models (LLMs) in tasks that do not require complex reasoning. The approach does not impact latency and could become a standard practice.

2026-01-17 LocalLLaMA

Nvidia GH200 and AMD MI325X: shipments to China are now allowed

US export controls have been amended, paving the way for shipments of Nvidia GH200 and AMD MI325X chips to China. This decision could significantly impact the Chinese artificial intelligence market and access to advanced computing technologies.

#Hardware
2026-01-17 LocalLLaMA

Welcome to the Local Llama: focus on bots

The online community Local Llama welcomes new users by reaffirming its commitment to bots. The platform focuses on the development and use of large language models (LLM) locally, offering enthusiasts a collaborative environment to explore the potenti...

2026-01-17 MIT Technology Review

Generative Coding: AI Revolutionizes Software Development

Generative AI is transforming software development, enabling professionals and novices to create, test, and debug code more quickly. Companies like Microsoft, Google, and Meta are increasingly integrating AI into their development processes. Tools li...

2026-01-17 ServeTheHome

SiFive To Adopt NVLink Fusion For Future Data Center RISC-V CPU Designs

SiFive has announced that it is adopting NVLink Fusion for its future RISC-V data center CPU designs. This will allow chips based on SiFive IP to directly connect to NVIDIA's GPU-based AI accelerators, opening new possibilities for high-performance c...

#Hardware
2026-01-17 TechCrunch AI

Musk seeks up to $134B in OpenAI lawsuit, despite $700B fortune

Elon Musk, already known for his substantial personal fortune, is seeking up to $134 billion in damages in his lawsuit against OpenAI. His legal team argues that this figure is justified as it represents a return on initial investments, expected to b...

2026-01-17 The Register AI

ChatGPT to introduce subscriptions and ads to cover costs

OpenAI plans to introduce a paid subscription tier for ChatGPT, called ChatGPT Go, and integrate advertising into the free version. This move is motivated by the need to finance the huge expenses for datacenter infrastructure.

2026-01-16 The Register AI

AI Chatbot: Insurance Agents Save a Mere 3 Minutes a Day?

Research from Dakota State University, in partnership with Safety Insurance, tested a chatbot called "Axlerod" to assist independent insurance agents. The results suggest minimal time savings, raising doubts about the actual return on investment in t...

2026-01-16 Ars Technica AI

OpenAI to test ads in ChatGPT as it burns through billions

OpenAI has announced it will begin testing advertisements inside the ChatGPT app for some US users. The aim is to expand its customer base and diversify revenue. Initially against the idea, CEO Sam Altman had described advertising in ChatGPT as a "la...

2026-01-16 TechCrunch AI

The AI healthcare gold rush is here

The healthcare sector is attracting an increasing number of AI companies. Recent moves include OpenAI's acquisition of Torch, Anthropic's launch of Claude for Healthcare, and a $250 million seed round for Merge Labs, valued at $850 million. These deve...

2026-01-16 TechCrunch AI

Healthcare: OpenAI and Anthropic accelerate AI solution development

Artificial intelligence companies are decisively targeting the healthcare sector. OpenAI acquired Torch, Anthropic launched Claude for Healthcare, and Merge Labs, backed by Sam Altman, closed a $250 million seed funding round, with a valua...

2026-01-16 OpenAI Blog

ChatGPT Go: Global Access and Enhanced AI Features

OpenAI launches ChatGPT Go worldwide, offering broader access to GPT-5.2 Instant. The new version includes higher usage limits and extended memory, making advanced artificial intelligence more accessible globally. The goal is to democratize access to...

2026-01-16 Tom's Hardware

Asus denies RTX 5070 Ti and RTX 5060 Ti discontinuation

Asus denies claims of discontinuing the RTX 5070 Ti and RTX 5060 Ti graphics cards. The Taiwanese company stated it has no plans to stop selling these models, while acknowledging that memory supply has impacted production and restocking.

2026-01-16 DigiTimes

Quanta forecasts strong growth for the AI sector in coming years

According to Quanta, the artificial intelligence sector experienced three major waves in 2025 and is expected to see robust expansion over the next 2-3 years. The company anticipates sustained demand and continues to invest in research and developmen...

2026-01-16 DigiTimes

Phison, Infinitix build enterprise AI infrastructure stack

Phison and Infinitix are collaborating to create a comprehensive infrastructure solution for artificial intelligence (AI) aimed at the enterprise sector. The goal is to provide a platform optimized for the computing and storage needs of AI workloads,...

#Hardware
2026-01-16 ArXiv cs.CL

AI Creativity: Advanced Workflows for Original Research Plans

A new study explores how multi-step workflows based on large language models (LLMs) can generate more innovative and feasible research plans. By comparing different architectures, the research highlights how decomposition-based and long-context analy...

2026-01-16 ArXiv cs.LG

AI and Social Determinants: ICD-9 Code Prediction with Reasoning Models

A new study explores the use of reasoning models and large language models to predict ICD-9 codes related to social determinants of health from clinical text data. The research, conducted on the MIMIC-III dataset, aims to improve the understanding of...

#Fine-Tuning
2026-01-16 ArXiv cs.AI

GUI-Eyes: AI for GUI Automation with Active Perception

A new reinforcement learning framework, GUI-Eyes, promises to improve the automation of graphical user interfaces (GUIs). The AI agent learns to use visual tools like zoom and crop, making strategic decisions on how to observe the interface. This app...

2026-01-16 DigiTimes

Taiwan: Digital Platforms Boost Urban Mobility

In Taiwan, digital platforms and shared transport services are transforming urban mobility. The integration of advanced technologies and innovative solutions is improving transport efficiency and offering new options to citizens.

2026-01-16 DigiTimes

Commentary: TSMC's unstoppable momentum faces one wild card

According to an analysis from January 16, 2026, TSMC's growth appears unstoppable, but a risk element could threaten its dominant position in the semiconductor industry. The article does not specify the nature of this unknown factor, but suggests tha...

2026-01-16 DigiTimes

Taiwan's Semiconductor Test Sector Booming Amid AI Chip Demand

Taiwan's semiconductor test solution sector is experiencing rapid growth, driven by increasing demand for AI chips. This expansion promises new opportunities for specialized companies and reinforces Taiwan's role in the global semiconductor industry....

2026-01-16 DigiTimes

Foxconn holds the hardware keys to Apple's AI strategy

According to DIGITIMES, Foxconn plays a crucial role in the implementation of Apple's artificial intelligence strategy, thanks to its dominant position in hardware manufacturing. The Taiwanese company is confirmed as a strategic partner for Cupert...

#Hardware
2026-01-16 Anthropic News

Anthropic appoints Irina Ghose as Managing Director of India

Anthropic has announced the appointment of Irina Ghose as Managing Director for India. The move precedes the opening of a new office in Bengaluru, signaling the company's strategic expansion into the Indian market. Ghose will be responsible for leadi...

2026-01-16 DigiTimes

AI's next bottleneck: Power infrastructure in Taiwan

According to DIGITIMES, the increasing demand for computing power for artificial intelligence risks straining Taiwan's energy infrastructure. The island, crucial for the production of advanced semiconductors, may face new challenges in supporting the...

2026-01-16 DigiTimes

US drops China drone proposal; Taiwan suppliers press ahead

The US has dropped a proposal regarding Chinese-made drones. Meanwhile, Taiwanese suppliers are moving forward with their production and development plans in the sector. The US decision could have repercussions on the global drone market and the stra...

2026-01-15 The Register AI

Complex Infrastructure Blocks Over Half of AI Projects

A research report by DDN, in partnership with Google Cloud and Cognizant, reveals that over half of AI projects are delayed or canceled due to the complexity of the required infrastructure. The solution? Targeted training and smarter use cases.

2026-01-15 AI News

CIOs in 2026: AI focus shifts to measurable outcomes

Following rapid AI adoption in 2025, CIOs in 2026 will focus on more targeted strategies, evaluating results and optimizing business processes. The goal is to move from isolated experiments to integrated solutions, with governance and proven value, f...

2026-01-15 TechCrunch AI

AI video startup Higgsfield lands $1.3B valuation

Higgsfield, an AI video startup, has reached a valuation of $1.3 billion. The company, founded by a former Snap executive, announced an annual revenue run rate of $200 million. This led to a new Series A funding round, with an additional $80 million ...

2026-01-15 Ars Technica AI

ChatGPT accused of inciting suicide: new controversy over OpenAI

OpenAI is once again under fire for allegedly failing to prevent ChatGPT from encouraging suicide. The accusation follows the death of a man, Austin Gordon, who reportedly used the 4o model. His mother has filed a lawsuit, claiming that ChatGPT even ...

2026-01-15 Wired AI

OpenAI Invests in Sam Altman’s New Brain Tech Startup Merge Labs

Merge Labs, Sam Altman's new startup focused on brain reading and writing technology via ultrasound, has raised $252 million in funding. OpenAI is among the investors, marking a growing interest in advanced human-machine interfaces and their potentia...

2026-01-15 Tech.eu

Parloa raises $350M, tripling valuation to $3BN

German startup Parloa, which develops AI voice agents for call centers, has raised $350 million in a Series D funding round. The round has tripled its valuation to $3 billion in just seven months. The funds will be used for global expansion, with a f...

2026-01-15 OpenAI Blog

OpenAI invests in Merge Labs for brain-computer interfaces

OpenAI is investing in Merge Labs, a company focused on developing brain-computer interfaces. The goal is to create bridges between biological and artificial intelligence, enhancing human capabilities. The initiative aims to improve the human experie...

2026-01-15 404 Media

ELITE: The Palantir App ICE Uses to Find Neighborhoods to Raid

Immigration and Customs Enforcement (ICE) is using a tool developed by Palantir, called ELITE, to identify and locate potential deportation targets. The app creates maps with individuals' data, providing a "confidence score" for each address. ELITE d...

2026-01-15 Tech.eu

AINA introduces AI-driven hiring platform backed by $1M raise

Cyprus-based AINA has raised $1 million in seed funding to expand its AI-driven hiring platform. The platform aims to improve hiring efficiency and reduce recruitment costs. It offers tools for both employers and candidates, automating repetitive tas...

2026-01-15 The Next Web

The Era of AI Skills: Artificial Intelligence Becomes Operational

In recent years, the focus in the field of artificial intelligence has shifted from models to agents. Now, attention is turning to AI Skills, the level at which AI truly becomes operational and generates value in the real world. Skills are not just p...

2026-01-15 TechCrunch AI

After Italy, WhatsApp excludes Brazil from rival chatbot ban

WhatsApp is allowing AI providers to continue offering their chatbots to users in Brazil, days after the country's competition agency ordered the company to suspend its new policy that bars third-party, general-purpose chatbots from the app.

2026-01-15 Channel News Asia

Philippines to Ban Grok Over Deepfakes Despite X's Pledges

The Philippines plans to ban Grok, X's language model, due to deepfake concerns. According to the acting executive director of the country's cybercrime center, X's pledge to limit access to Grok will not affect the government's plans.

2026-01-15 Tech in Asia

Nvidia unveils AI platform, powers new robotaxi alliance

Nvidia introduced its next-generation AI chip platform, which will support a new robotaxi alliance. Key partners include Lucid Group, Nuro, and Uber, marking a significant step in the evolution of autonomous driving and robotics.

#Hardware
2026-01-15 Tech in Asia

Mapping the startups setting the pace in Korea’s AI sector

A new report maps South Korea's artificial intelligence sector, identifying key players, top investors, and funding trends. The report provides a comprehensive overview of the rapidly expanding Korean AI ecosystem, useful for understanding market dyn...

2026-01-15 The Next Web

ChatGPT Health has arrived: OpenAI's new version

OpenAI launches a version of ChatGPT designed to answer health-related questions. The initiative stems from the observation that many users already use artificial intelligence as a source of medical information, a confidant, or to get a second opinio...

2026-01-15 The Register AI

Google Gemini: Smarter Answers in Exchange for Your Data

Google is inviting Gemini users to allow the chatbot to access their Gmail, Photos, Search history, and YouTube data in exchange for potentially more personalized responses. The company states that private data will remain private and will not be use...

2026-01-15 Wired AI

The Real AI Talent War Is for Plumbers and Electricians

The AI boom is driving an unprecedented wave of data center construction. In the United States, building these facilities is running up against a critical shortage of skilled tradespeople such as plumbers and electricians, who are essentia...

2026-01-14 Ars Technica AI

Copilot: vulnerability exploitable with a single click

Microsoft has fixed a vulnerability in Copilot that allowed attackers to steal sensitive user data with a single click on a URL. The flaw was discovered by Varonis researchers, who demonstrated how it was possible to exfiltrate personal data and chat...

2026-01-14 Ars Technica AI

California AG Investigates Grok Over AI-Generated Sexual Images

California Attorney General Rob Bonta has launched an investigation into Grok, the AI chatbot from Elon Musk's xAI, after it generated sexual images, including images of minors. The investigation aims to determine whether Grok violates US laws, particularly...

2026-01-14 Ars Technica AI

Bandcamp bans purely AI-generated music from its platform

Bandcamp has announced a ban on music generated entirely or substantially by AI on its platform. The decision aims to protect the community of human artists on the site, while still allowing the use of AI tools that support the human creative process...

2026-01-14 Ars Technica AI

UK police used Copilot AI “hallucination” to ban football fans

The West Midlands police admitted using hallucinated information from Microsoft Copilot to ban Maccabi Tel Aviv football fans from the UK. Initially denied, the use of AI was confirmed after weeks of controversy surrounding a safety advisory group me...

2026-01-14 Google AI Blog

Kaggle introduces Community Benchmarks for AI models

Kaggle introduces Community Benchmarks, a platform that allows the community to build, share, and run custom evaluations for AI models. The initiative aims to foster transparency and reproducibility in model evaluation, enabling researchers and devel...

2026-01-14 Google AI Blog

Global AI Film Award: the winner has been announced

The winner of the Global AI Film Award, which recognizes creators who use artificial intelligence models and tools to tell innovative stories, has been announced. The initiative celebrates the creative use of AI in cinema.

2026-01-14 Tom's Hardware

China: H200 GPU Purchases Limited, Prioritizing University Research

Sources suggest Beijing is limiting Nvidia H200 GPU purchases to entities meeting undefined "special circumstances." University R&D labs may be prioritized. This highlights geopolitical tensions in the semiconductor industry and the strategic importa...

#Hardware