🗄️ News Archive

Complete history of AI signals, ordered by date.
Total Articles: 1518

Jan 20 2026
Hardware

Micron Acquires $1.8bn DRAM Chip Plant in Taiwan

Micron has announced the acquisition of a DRAM chip manufacturing campus from Powerchip Semiconductor Manufacturing Corporation (PSMC) in Taiwan for $1.8 billion. This acquisition will allow Micron to quickly increase its DRAM manufacturing capacity. PSMC had opened the fab just 19 months prior, after a $9.5 billion investment.

Jan 20 2026
LLM

Unsloth Releases GLM-4.7-Flash in GGUF Format

Unsloth has released the GLM-4.7-Flash language model in GGUF (GPT-Generated Unified Format). This format facilitates the use of the model on various hardware platforms, making it accessible to a wider audience of developers and researchers interested in large language model inference locally.

Jan 20 2026
LLM

GLM-4.7-Flash-GGUF is here!

A new version of GLM-4.7-Flash-GGUF, a large language model (LLM) designed for local inference, has been released. Available on Hugging Face, it lets users run the model directly on their own devices, opening new possibilities for offline and customized applications.

Jan 20 2026
Other

AI for self empowerment: new growth opportunities

Artificial intelligence can expand human capabilities, bridging the skills gap and unlocking new opportunities for productivity and growth for individuals, businesses, and nations. An analysis of how AI can foster self-empowerment and development.

Jan 19 2026
Frameworks

GLM 4.7 Flash: Official Support Merged into llama.cpp

Official support for GLM 4.7 Flash has been merged into llama.cpp. This integration, reported on Reddit, allows developers to leverage the capabilities of GLM 4.7 Flash within the llama.cpp environment, opening up new possibilities for inference and other language model applications.

Jan 19 2026
LLM

GLM 4.7 Flash: A Reliable LLM Agent for Lower-End GPUs?

A user reports excellent performance of GLM 4.7 Flash as an LLM agent, even on systems with lower-end GPUs. The model appears to handle complex tasks such as cloning GitHub repositories and editing files without errors, opening new possibilities for those with limited computing resources. Whether these results hold up on other local setups remains to be seen.

Jan 19 2026
Hardware

Valve: Power Management Improvements for AMD GCN 1.0 GPUs

A Valve contractor has significantly improved the AMDGPU driver for older GCN 1.0 and GCN 1.1 GPUs. With Linux 6.19, AMDGPU is now the default for these GPUs, offering better performance and RADV Vulkan support. New patches focus on optimizing power management, extending the lifespan of these graphics cards.

Jan 19 2026
LLM

LightOn OCR: New Open Source Model for Optical Character Recognition

LightOn AI has released LightOnOCR-2-1B, an open-source Optical Character Recognition (OCR) model. The model is available on Hugging Face and aims to provide an accessible solution for extracting text from images. Its release has been welcomed by the open-source community, which appreciates its potential utility in various application contexts.

Jan 19 2026
LLM

GLM-4.7-FLASH: Mixed Precision NVFP4 Version Available on Hugging Face

A mixed precision NVFP4 quantized version of GLM-4.7-FLASH has been published on Hugging Face. The author encourages the community to test the model and provide feedback. The model has a size of 20.5 GB and aims to optimize performance while maintaining a good level of accuracy.

Jan 19 2026
LLM

Gemma 3:1b: What are the main uses of small models?

A user wonders about the possible uses of small language models like Gemma 3:1b. These models, while running on less powerful hardware, open up interesting scenarios. It remains to be seen whether they are suitable for basic tasks or simple calculations, or whether they can tackle more complex challenges.

Jan 19 2026
Other

Linux 7.0: OPEN_TREE_NAMESPACE for Enhanced Container Security and Performance

The upcoming Linux 7.0 kernel cycle is expected to include an OPEN_TREE_NAMESPACE flag for the open_tree() system call. This option offers notable performance gains and enhanced security, particularly for containerized workloads. The feature aims to streamline container management, an increasingly critical aspect of modern IT infrastructure.

Jan 19 2026
Market

Musk seeks $134B from OpenAI, accused of 'making up math'

Elon Musk is suing OpenAI, seeking damages between $79 billion and $134 billion. Musk accuses OpenAI of abandoning its nonprofit mission and "making a fool out of him" as an early investor. The amount is based on an expert's estimate that Musk's early contributions generated 50 to 75 percent of OpenAI's current value.

Jan 19 2026
LLM

GLM-4.7 flash: how to run it with llama.cpp?

A user inquires about the possibility of running the new GLM 4.7 flash model with llama.cpp or similar tools. The question was posted on a forum dedicated to local language models (LocalLLaMA), awaiting responses from the community of developers and enthusiasts.

Jan 19 2026
Market

US AI startups raise record funding in 2025

2024 was a pivotal year for the AI industry in the US and beyond. It remains to be seen whether 2025 will be equally positive. Analysis reveals that numerous AI startups have raised over $100 million in funding, marking an unprecedented wave of investment.

Jan 19 2026
Hardware

Nvidia GB10 vs GH200: early performance benchmarks

Early benchmarks comparing the performance of Nvidia's GB10 GPU with the GH200 have surfaced online. The data, which comes from a Reddit post, offers a preview of the potential of Nvidia's new architecture, although it should be treated with caution given its unofficial nature. Interest in the new Nvidia GPUs is very high, given their importance in the field of artificial intelligence.

Jan 19 2026
Frameworks

llama.cpp adopts Anthropic Messages API

The llama.cpp library has integrated Anthropic's Messages API, opening new possibilities for interacting with language models. This integration, announced on Reddit and Hugging Face, allows developers to leverage the capabilities of llama.cpp for advanced generative artificial intelligence applications.

Jan 19 2026
LLM

Z-AI (GLM): Devs Woke Up And Chose Violence

Z-AI (GLM) developers have reportedly adopted an 'aggressive' development strategy. A Reddit post highlights this choice, suggesting direct competition with other teams, particularly those at Qwen. The online discussion focuses on the implications of this approach and its potential impact on the language model ecosystem.

Jan 19 2026
Market

From AI Ambition to AI Production: Escaping the AI Pilot Trap

Many companies want to implement artificial intelligence, but struggle to move from the pilot phase to large-scale production. Pilot projects often fail to take off because the necessary infrastructure to support them is lacking. Interest in AI is very high, but a strategy is needed to translate ambition into concrete results.

Jan 19 2026
Hardware

Eric Demers leaves for Intel after 14 years at Qualcomm

Eric Demers, a key figure in the development of Radeon and Adreno GPUs, leaves Qualcomm after 14 years to join Intel. This move represents a significant reinforcement for Intel's team, led by Lip-Bu Tan, in the dedicated graphics card sector.

Jan 19 2026
Market

Lakestar-backed German drone maker Twentyfour Industries emerges from stealth

German startup Twentyfour Industries, specializing in drone production for defense, has emerged from stealth after raising $11.8 million from investors including Lakestar, OTB Ventures, and 468 Capital. The company focuses on rapid production and deployment of drones in Europe, aiming to bridge the capability gap in the unmanned systems sector and reduce reliance on foreign suppliers. Its drones are already in daily use by European soldiers.

Jan 19 2026
LLM

GLM-4.7-Flash: a 30B model that is impressive in BrowseComp

A Reddit post highlights the performance of the GLM-4.7-Flash 30B parameter model in the context of BrowseComp, suggesting that Qwen may need to catch up. The comparison also includes GPT-OSS-20B. The model is available on Hugging Face.

Jan 19 2026
Market

Is the Metaverse Doomed? VR Overshadowed by Artificial Intelligence

The metaverse appears to be declining, with virtual reality giving way to artificial intelligence. Meta's ambitions in the VR sector are taking a hit. The future of the metaverse is uncertain, with new challenges and competitors on the horizon.

Jan 19 2026
LLM

GLM 4.7 Flash Released: Massive Benchmark Gains?

GLM 4.7 Flash has been released. The open-source community is questioning the potential performance gains compared to Qwen 30b, with a focus on benchmarks. Currently, there is no objective data to support this.

Jan 19 2026
LLM

Ghost Engine: Run Llama-3-8B in 3GB VRAM by Generating Weights

A new inference engine, called Ghost Engine, promises to drastically reduce memory consumption when running large language models (LLMs). Instead of loading static weights, Ghost Engine generates them on the fly, trading memory bandwidth for compute. Early tests on Llama-3-8B show promising results in terms of compression and fidelity.

Jan 19 2026
Other

Isle of Man launches National AI Office with £1M investment

The Isle of Man Government has launched its National AI Office (NAIO), backed by a £1 million investment. The aim is to coordinate the responsible adoption of artificial intelligence across the island, supporting businesses and the public sector. The Activate AI program had already generated significant savings, demonstrating the value of a coordinated approach. The NAIO will focus on national strategy, AI literacy, practical adoption, guidelines, improvement of public services and workforce reskilling.

Jan 19 2026
LLM

GLM-4.7-Flash: New Open-Source Language Model on Hugging Face

The GLM-4.7-Flash language model is now available on Hugging Face. The news was shared on Reddit, sparking discussion within the LocalLLaMA community. The open-source model promises new opportunities for developing generative artificial intelligence applications and for research in natural language processing.

Jan 19 2026
Hardware

NVIDIA GB10 vs GH200: Dell Pro Max Benchmarks Compared

A performance comparison pits the NVIDIA GB10 GPU, integrated into the Dell Pro Max, against the better-known GH200. The tests highlight the performance differences between the two solutions, providing useful data for evaluating their respective capabilities in professional usage scenarios.

Jan 19 2026
Other

ICE’s Facial Recognition App Misidentified a Woman. Twice

The Mobile Fortify app, used by Immigration and Customs Enforcement (ICE) to identify individuals and determine their immigration status, provided two incorrect names for the same woman during a check. The incident raises doubts about the accuracy of the app, which ICE claims provides a "definitive" determination of immigration status and should be considered more reliable than a birth certificate. The incident occurred in Oregon during a raid.

Jan 19 2026
Other

AI Boosts Research Careers, but Flattens Scientific Discovery

An analysis of over 40 million academic papers reveals that scientists using AI tools publish more and reach leadership positions faster. However, AI-driven research tends to focus on narrow areas, limiting originality and diversity in scientific inquiry. This creates a tension between individual advancement and the collective progress of science, raising concerns about a potential feedback loop of conformity and declining innovation.

Jan 19 2026
LLM

On-device browser agent with Qwen: local demo on Chrome

A new demo showcases a local browser agent, powered by WebGPU with Liquid LFM and Alibaba's Qwen models, running as a Chrome extension. In the demo, the agent opens 'All in Podcast' on YouTube. The source code is available on GitHub for those interested in exploring and developing this technology further.

Jan 19 2026
Market

Artificial intelligence: transforming credit unions

Artificial intelligence is rapidly transforming financial services, offering new opportunities but also challenges for credit unions. These institutions, built on trust and community alignment, must integrate AI to meet member expectations and compete with fintech and digital banks. Personalization, customer service, and fraud prevention are key areas where AI can generate tangible value, but scalability remains complex due to data issues, transparency, and integration with existing systems.

Jan 19 2026
LLM

Police chief resigns after AI hallucination

The chief constable of West Midlands Police has resigned after his force relied on fabricated output from Microsoft Copilot in deciding to ban Israeli fans from attending a football match. He had denied that artificial intelligence systems were used, only to learn later that they had been.

Jan 19 2026
LLM

GLM-4.7-Flash soon? Leaks about the new language model

Hints of a possible imminent release of GLM-4.7-Flash are surfacing. An update to the GLM-4.7 collection, containing a hidden item, has caught the attention of experts. Initial analysis suggests that Zai is preparing to launch this new version. A commit on GitHub and an image shared on Reddit fuel speculation, suggesting upcoming news for the GLM family of language models.

Jan 19 2026
Market

TeamFeePay announces £9M funding round and European expansion plans

Belfast-based sports technology company TeamFeePay has completed a £9 million equity funding round to support expansion into new markets and planned recruitment. The round was led by investments from YFM Equity Partners and the Investment Fund for Northern Ireland (IFNI). TeamFeePay plans to create up to 75 new roles over the next two years across the UK and Europe.

Jan 19 2026
Market

China leads in advanced robotics and world models: AI's next frontier

The AI race is shifting towards advanced robotics and world models. China is positioning itself as a leader in this field, with a high number of operational robots expected as early as 2025. This trend could redefine the global balance in the technology sector, with significant implications for industrial automation and AI research.

Jan 19 2026
Hardware

RADV Vulkan Driver Now Implements HPLOC For Faster Ray-Tracing

Valve's RADV Vulkan driver continues to improve ray tracing performance on Linux. The latest implementation, HPLOC, promises a further performance boost for games that leverage this technology. Mesa 26.0 will include this update, bringing tangible benefits to AMD Radeon graphics card users.

Jan 19 2026
Frameworks

Intel LLM-Scaler-Omni Update Brings ComfyUI & SGLang Improvements On Arc Graphics

Intel has released an update to LLM Scaler Omni, focused on image, audio, and video generation via Omni Studio and Omni Serving. This release follows last week's update of Intel LLM-Scaler-vLLM, designed to improve the use of vLLM on Intel Arc graphics cards, offering new opportunities for developers in the field of generative artificial intelligence.

Jan 19 2026
Market

Price, battery life, performance drive PC sales; on-device AI lags

In Q4, commercial resellers primarily shipped AI-capable PCs to enterprise customers. However, the key drivers for purchase were price, battery life, and performance. Integrated artificial intelligence, at least for now, appears to play a less significant role in the choices of business clients.

Jan 19 2026
Frameworks

SPDX SBOM Generation Tool Proposed For The Linux Kernel

Proposed patches to the Linux kernel introduce an SPDX SBOM Generation Tool. The goal is to increase the transparency of software components, improve vulnerability management, ensure license compliance, and secure the software supply chain.

Jan 19 2026
Market

Ananda Impact Ventures secures €73M first close for fifth Core Impact Fund

Ananda Impact Ventures has completed a €73 million first close of its fifth Core Impact Fund, exceeding its €50 million target. The fund focuses on early-stage, technology-driven startups addressing social and environmental challenges in Europe. Backers include the European Investment Fund (EIF) and over 40 family offices.

Jan 19 2026
LLM

Top-K: Optimized Algorithm Up to 20x Faster Than PyTorch

A developer has created an optimized Top-K implementation, an operation crucial for sampling in large language models (LLMs). The AVX2-optimized implementation runs 4-20x faster than PyTorch on CPU, depending on vocabulary size. Integration into llama.cpp resulted in a 63% speedup in prompt processing on a 120B MoE model.
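
For readers unfamiliar with the operation, here is a minimal pure-Python sketch of Top-K sampling. It is not the developer's AVX2 code and makes no claim about its design; the function names `top_k_indices` and `top_k_sample` are hypothetical and chosen only for illustration.

```python
import heapq
import math
import random


def top_k_indices(logits, k):
    """Return the indices of the k largest logits, highest first."""
    if not 1 <= k <= len(logits):
        raise ValueError("k must be between 1 and len(logits)")
    # heapq.nlargest avoids a full sort: roughly O(n log k) for n logits.
    best = heapq.nlargest(k, enumerate(logits), key=lambda pair: pair[1])
    return [index for index, _ in best]


def top_k_sample(logits, k, rng=random):
    """Sample one token index from a softmax restricted to the top k logits."""
    indices = top_k_indices(logits, k)
    # Subtract the largest logit before exponentiating for numerical stability.
    max_logit = logits[indices[0]]
    weights = [math.exp(logits[i] - max_logit) for i in indices]
    return rng.choices(indices, weights=weights, k=1)[0]
```

With a vocabulary of tens of thousands of tokens, the selection step runs once per generated token, which is why a faster Top-K can matter for end-to-end throughput.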

Jan 19 2026
Market

Europe invests €307 million in AI projects

The European Commission has allocated €307.3 million to fund artificial intelligence and related technology projects under the Horizon Europe program. The initiative aims to promote trustworthy AI and European digital autonomy, focusing on data services, robotics, quantum technologies, and photonics. While not a massive figure compared to global private investments, the funding is considered strategic for European technological development.

Jan 19 2026
Hardware

Free GPU Credits to Test LLM Training Platform

A small team is offering free compute credits for its GPU platform, in exchange for usage feedback. Available GPUs include RTX 5090 and Pro 6000, suitable for LLM inference, fine-tuning, or other machine learning workloads.

Jan 19 2026
LLM

Flog: Free iOS Nutrition Tracker App with Local LLM Support

A developer has created Flog, a free iOS app that tracks nutrition through photos, leveraging local LLM models to estimate portions and nutrients. The app integrates with Apple Health and supports LLM models run directly on the device or via LM Studio. The developer does not plan to monetize the application and ensures that user data remains on the device.

Jan 19 2026
Market

Open source's new mission: Rebuild a continent's tech stack

Europe, known for its tightly regulated tech sector, could find in open source a way to rebuild and strengthen its technological infrastructure. The adoption of open solutions could foster innovation and reduce dependence on external suppliers, promoting more autonomous and sustainable technological growth on the continent.

Jan 19 2026
Market

Anzen Industries raises $2.2M for chemical production innovation

UK-based startup Anzen Industries has raised $2.2 million in pre-seed funding. The company focuses on producing high-value chemicals using cell-free enzyme systems, aiming to improve the scalability and resilience of global supply chains. The funding will be used to relocate operations to the United States, establish a manufacturing facility, and expand industrial collaborations.

Jan 19 2026
Hardware

CoolSem Technologies raises pre-seed funding for wafer-level thermal innovation

CoolSem Technologies, based in the Netherlands, has closed a pre-seed funding round led by High-Tech Gründerfonds (HTGF). The company develops advanced wafer-level thermal management solutions, aiming to improve energy efficiency and extend the lifespan of components in semiconductors and photonic devices. The funding will be used to advance its WaLTIS technology and validate it with leading customers in RF, power, and photonics.

Jan 19 2026
Hardware

Tesla accelerates AI chip development even with safety and software challenges

Tesla is accelerating its efforts in AI chip development. This move comes at a crucial time as the company faces significant challenges related to the safety and software of its vehicles. The goal is to improve self-driving capabilities and other advanced features, but questions remain about the timing and effectiveness of these initiatives.

Jan 19 2026
Hardware

TSMC eyes rapid 2nm growth in 2026

Taiwanese giant TSMC anticipates strong expansion of its 2nm production starting in 2026, backed by substantial investments and the expansion of its manufacturing capabilities in both Taiwan and the United States. This strategic move aims to solidify TSMC's leadership in the advanced semiconductor sector.

Jan 19 2026
Market

Taiwan carves robotics niche as humanoids proliferate

Taiwan is positioning itself as a key player in the robotics sector, particularly in the development of humanoids. The island aims to leverage its technological and industrial expertise to compete in this growing market, with a focus on applications ranging from healthcare to industrial automation. Its strategic position could lead to new business opportunities and international partnerships.

Jan 19 2026
Market

Taiwan-US tariff pact sets stage for machinery industry recovery, currency challenge persists

A new tariff agreement between Taiwan and the United States promises to revitalize Taiwan's machinery industry. The agreement is expected to boost exports and the competitiveness of local companies. However, currency fluctuations remain a significant obstacle, potentially eroding the benefits of the trade agreement. Analysts are closely monitoring the combined impact of these factors on Taiwan's economy.

Jan 19 2026
Hardware

A look behind the scenes: building 3 GH200 systems in the workshop

A Reddit user shared images of the process of assembling three GH200 systems inside a workshop. The images show the various stages of construction, offering a close-up look at the hardware and infrastructure needed to support these high-performance systems. The sharing sparked interest in the community, with many users curious about the technical details and applications of such systems.

Jan 19 2026
Hardware

Micron sets 1γ as mainstream node for 2026, HBM and SOCAMM2 ramp

Micron plans to make the 1γ node its mainstream technology by 2026, alongside increasing production of HBM (High Bandwidth Memory) and SOCAMM2 modules. This strategic move underscores Micron's commitment to advanced memory solutions for high-performance applications.

Jan 19 2026
Market

Tariffs reshuffle global supply chains, but US manufacturing revival remains elusive

Tariffs are reshaping global supply chains, but the revival of the US manufacturing sector remains a difficult goal to achieve. Despite efforts and protectionist policies, American industry is struggling to regain ground and compete effectively in the international landscape.

Jan 19 2026
Market

Sinpex raises €10M Series A to redefine KYB automation for Europe’s AML era

Sinpex, an AI-powered platform for KYB/KYC lifecycle management, announced a €10 million Series A financing round. The company aims to streamline business client onboarding and continuous KYB compliance, empowering companies to meet the regulatory demands of the 2027 EU Anti-Money Laundering (AML) Regulation.

Jan 19 2026
LLM

JARVIS: Progress Report on LLM Agent Development

A Reddit user shared an update on the development of JARVIS, an agent based on large language models (LLM). The original post includes a link to a demonstration video of the project. The development of LLM agents is a rapidly growing research area, with the goal of creating systems capable of automating complex tasks by interacting with the external world.

Jan 19 2026
LLM

Local LLM Coding: Is it Still Worth it with a 16GB GPU?

A user with a 16GB Nvidia RTX 5070 Ti GPU questions whether coding with local large language models (LLMs) is still worthwhile. Experience with Kilo Code and Qwen 2.5 Coder 7B via Ollama revealed issues with context management: the context window fills up quickly even with moderately sized project files. The question is: how do other developers with similar setups address this challenge?

Jan 19 2026
Market

Quanta rushing to hire and expand as AI server demand holds strong

Quanta Computer is ramping up hiring and expanding its operations to meet the sustained demand for AI servers. The company aims to strengthen its position in a rapidly expanding market, where the ability to meet customer demands has become crucial.

Jan 19 2026
Hardware

Glass fiber supply crunch hits AI hardware and Apple's 2026 devices

A glass fiber supply crunch is straining the production of AI hardware and may delay the release of new Apple devices slated for 2026. The scarcity of this key material risks having broad repercussions on the tech sector.

Jan 19 2026
Market

Europe moves to reinforce its satellite internet ambitions with new OneWeb order

Europe is intensifying efforts to consolidate its satellite internet infrastructure through a new order for the OneWeb constellation. This strategic move aims to ensure greater autonomy and resilience in the communications sector, reducing dependence on external suppliers and enhancing broadband connectivity in remote and underserved areas of the continent.

Jan 19 2026
LLM

The Race to Build the DeepSeek of Europe Is On

As Europe’s longstanding alliance with the US falters, its push to become a self-sufficient AI superpower has become more urgent. The goal is to create a European alternative to advanced models like DeepSeek, reducing technological dependence on other nations.

Jan 19 2026
Market

Global power grids emerge as strategic choke points in AI and industrial competition race

Global power grids are emerging as strategic choke points in the race for AI and industrial competitiveness. The increasing demand for energy to power data centers and digital infrastructure makes the stability and security of power grids an increasingly critical factor for economic growth and national security.

Jan 19 2026
Market

US-Taiwan investment MOU brings clarity on future auto tariffs

A memorandum of understanding (MOU) between the US and Taiwan outlines the future of automotive tariffs. The agreement aims to promote bilateral investments and establish clearer trade conditions, particularly in the automotive sector. The initiative is expected to have a significant impact on companies operating between the two markets, offering greater predictability and stability for their long-term investment strategies.

Jan 19 2026
Market

Apple-Google AI partnership could reshape voice assistant market

A potential collaboration between Apple and Google in the field of artificial intelligence could reshape the voice assistant market. The partnership, if realized, would have an estimated value of up to $5 billion. Implications and details of the agreement remain unknown at this time, but the potential is enormous.

Jan 19 2026
Hardware

US-Taiwan trade pact clears path for tech supply chain hubs in America

A new trade agreement between the United States and Taiwan could foster the creation of tech supply chain hubs in America. The initiative aims to strengthen supply chain resilience and reduce dependence on foreign suppliers, amid growing global competition in the semiconductor and emerging technology sectors.

Jan 19 2026
Market

Hiring Stalls at India’s Big Four Outsourcers Amid AI Impact

India’s big four outsourcers – HCL, Infosys, TCS and Wipro – have essentially stopped hiring, potentially due to increased AI adoption. Revenue growth is also sluggish. This slowdown reflects a significant shift in the IT services landscape.

Jan 19 2026
LLM

Conversational Agents: Does Conciseness Reduce Expertise?

A new study analyzes the unexpected side effects of using specific stylistic features in prompts for conversational agents based on large language models (LLMs). The research reveals how prompting for conciseness can compromise the perceived expertise of the agent, highlighting the interdependence between different stylistic traits and the need for more sophisticated approaches for effective and safe stylistic control.

Jan 19 2026
LLM

BYOL: Bring Your Own Language Into LLMs

A new study introduces BYOL, a framework for improving the performance of large language models (LLMs) in languages with limited digital presence. BYOL classifies languages based on available resources and adapts training techniques, including synthetic text generation and refinement via machine translation, to optimize results. Early tests on Chichewa, Maori, and Inuktitut show significant improvements over existing multilingual models.

Jan 19 2026
Frameworks

Multi-Source Transfer Learning: New Framework Optimizes Source Weights

A new study introduces UOWQ, a theoretical framework for multi-source transfer learning. UOWQ jointly optimizes source weights and transfer quantities, addressing the issue of negative transfer. The analysis demonstrates that using all available source samples is optimal with properly adjusted weights and provides solutions for determining the optimal weights. Experiments on real-world benchmarks confirm the framework's effectiveness.

Jan 19 2026
LLM

Analytic Bijections for Smooth and Interpretable Normalizing Flows

A new study introduces three families of analytic functions for normalizing flows, offering more efficient and interpretable alternatives to existing approaches. The advantages include increased training stability and the ability to drastically reduce the number of parameters required, opening new perspectives for complex problems in physics and other fields.

Jan 19 2026
LLM

LLMs: How Do They Assess Trustworthiness of Online Information?

Large language models (LLMs) are increasingly important in online search and recommendation systems. New research analyzes how these models encode perceived trustworthiness in web narratives, revealing that models internalize psychologically grounded trust signals without explicit supervision. This study paves the way for more credible and transparent AI systems.

Jan 19 2026
LLM

Japanese AI Agent System on Human Papillomavirus Vaccination: System Design

A new AI agent system has been developed in Japan to address hesitancy regarding human papillomavirus (HPV) vaccination. The system provides verified information through a conversational interface and generates analytical reports for medical institutions, monitoring public discourse on social media. Initial tests show promising results in terms of relevance, correctness, and completeness of the information provided.

Jan 19 2026
Hardware

Optics manufacturers strengthen ties with semiconductor firms

Optics manufacturers are strengthening ties with semiconductor firms in the silicon photonics race. Asia Optical is among the companies targeted for these strategic partnerships. Asia Optical chairman I-Jen Lai is leading the company through this crucial phase of technological convergence.

Jan 19 2026
Frameworks

cuda-nn: Custom MoE inference engine in Rust/CUDA without PyTorch

cuda-nn, a MoE (Mixture of Experts) inference engine developed in Rust, Go, and CUDA, has been introduced. This open-source project stands out for its ability to handle models with 6.9 billion parameters without PyTorch, thanks to manually optimized CUDA kernels. It supports MoE and MQA architectures, offering Python bindings for increased flexibility.

Jan 19 2026
LLM

Hot take: OpenAI should open-source GPT-4o

A user suggested that OpenAI should open-source the GPT-4o model. Despite safety concerns, the move could cover OpenAI's open-source rally for the next few months and save on the costs of maintaining the model.

Jan 19 2026
LLM

Strix Halo as LLM Server: Which Linux Distro to Choose?

A user is evaluating using their Strix Halo as a server for large language models (LLM) and a media server, looking for the most suitable Linux distribution. Fedora 43 is already installed, but alternatives are being considered for optimal RDP support and efficient LLM management.

Jan 19 2026
Frameworks

Chatterbox: Memory Spikes During PDF Conversion?

A user reports excessive memory consumption with Chatterbox-TTS-Server while converting a PDF to an audiobook. The process, based on a FastAPI wrapper, increases memory usage from 3GB to over 8GB while processing small chunks of the book.

Jan 19 2026
LLM

DetLLM: tool to ensure deterministic inference in LLMs

A developer has created DetLLM to address the issue of non-reproducibility in LLM inference. The tool verifies repeatability at the token level, generates a report, and creates a minimal reproduction package for each run, including environment snapshots and configuration. The code is available on GitHub and open to community feedback.
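As an illustration of what a token-level repeatability check can look like, here is a minimal sketch in Python. It is not DetLLM's code or API: the function names `first_divergence` and `check_repeatability` are hypothetical, and the sketch assumes each run has already been reduced to a list of token IDs.

```python
from typing import Optional, Sequence


def first_divergence(a: Sequence[int], b: Sequence[int]) -> Optional[int]:
    """Return the index of the first differing token, or None if identical."""
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    # One sequence is a strict prefix of the other: they diverge
    # at the position where the shorter one ends.
    if len(a) != len(b):
        return min(len(a), len(b))
    return None


def check_repeatability(runs: Sequence[Sequence[int]]) -> dict:
    """Compare every run against the first one (hypothetical report layout)."""
    reference = runs[0]
    mismatches = []
    for idx, run in enumerate(runs[1:], start=1):
        pos = first_divergence(reference, run)
        if pos is not None:
            mismatches.append({"run": idx, "first_divergence": pos})
    return {"deterministic": not mismatches, "mismatches": mismatches}
```

Calling `check_repeatability` on the token IDs from several runs of the same prompt reports whether all runs match the first one and, if not, where each one first diverges.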

Jan 19 2026
LLM

SLM Prompting: How to Outperform Larger Language Models?

A user is questioning how to get the most out of small language models (SLMs), especially when fine-tuned for a specific topic. The challenge is that traditional prompts, effective with large language models (LLMs), often produce incoherent results with SLMs, even if the prompt relates to the model's area of expertise. Will it be necessary to fundamentally rethink prompting techniques?

Jan 19 2026
Market

US and Taiwan finalize tariff deal, securing favorable terms for semiconductor exports

The United States and Taiwan have finalized a tariff agreement that will secure favorable terms for semiconductor exports. The deal aims to strengthen economic and technological cooperation between the two nations in the strategic semiconductor sector. Details of the agreement have not been disclosed, but a positive impact on bilateral trade and the global supply chain is expected.

Jan 19 2026
Market

TSMC drives tariff talks as Taiwan eyes 40% chip capacity shift to US

According to Digitimes, TSMC is influencing tariff discussions as Taiwan considers shifting up to 40% of its chip manufacturing capacity to the United States. This strategic move could have significant implications for the global semiconductor industry and trade dynamics between Taiwan and the United States.

Jan 19 2026
Market

US-Taiwan defense ties deepen due to 15% tariff cap

According to DIGITIMES, defense ties between the US and Taiwan are deepening, partly due to a 15% tariff cap. This move highlights the increasing collaboration between the two nations in a strategically crucial area.

Jan 19 2026
Hardware

OpenAI taps Cerebras for US$10 billion AI chip buildout

OpenAI has tapped Cerebras for a US$10 billion AI chip buildout. The collaboration aims to enhance the computing capabilities required for large language models (LLMs).

Jan 19 2026
Hardware

Hardware setup with 3 V620 GPUs for 96GB of VRAM

A user has shared their new hardware setup online, which includes three V620 graphics cards for a total of 96GB of VRAM. This configuration is designed for applications that require high video memory capacity, such as training machine learning models or rendering complex graphics. The share has generated interest in the online community.

Jan 19 2026
LLM

GFN v2.5.0: Verified O(1) Memory Inference and 500x Length Extrapolation

Version 2.5.0 of GFN (Geodesic Flow Networks) has been released, an architecture that reformulates sequence modeling as particle dynamics. GFN offers O(1)-memory inference and stability through symplectic integration. Zero-shot generalization on algorithmic tasks with sequences up to 10,000 tokens has been demonstrated, maintaining a memory footprint of approximately 60MB. Compared to Transformers, GFN reduces memory overhead by 234x at L=1,000.

Jan 18 2026
Market

AI: Machine identities outnumber humans in Asia-Pacific

Artificial intelligence is reshaping the cybersecurity landscape in the Asia-Pacific region, with an exponential increase in machine identities. This shift poses new challenges for protecting systems and data, requiring more sophisticated and automated security strategies to manage the complexity of emerging threats.

Jan 18 2026
Market

Taiwan polarizer firms pivot to medical, semiconductor, and niche markets

Taiwanese polarizer manufacturers are diversifying their business, shifting from traditional markets to more specialized sectors such as medical and semiconductors. This strategy is a response to the oversupply from China, which has eroded profit margins in the standard polarizer sector.

Jan 18 2026
Market

Taiwan PCB industry gearing up for record investment driven by AI cloud computing

Taiwan's printed circuit board (PCB) industry is gearing up for record investments, driven by the increasing demand for cloud computing and artificial intelligence solutions. This influx of capital is expected to further strengthen Taiwan's position as a global leader in high-tech PCB manufacturing.

Jan 18 2026
Hardware

OpenAI and Cerebras reach US$10 billion agreement to reduce Nvidia dependence

OpenAI has reached a US$10 billion agreement with Cerebras. The main goal is to reduce OpenAI's strong dependence on Nvidia chips, thereby diversifying its hardware resources for training large language models (LLMs). This strategic move could have a significant impact on the semiconductor market and innovation in the field of artificial intelligence.

Jan 18 2026
Market

Advantest ATE lead times remain tight

Lead times for Advantest's automated test equipment (ATE) remain tight due to strong demand in the AI and memory markets. This situation reflects the growth of these sectors and the pressure on the semiconductor supply chain. Advantest's ability to meet demand will be crucial to supporting the expansion of these key markets.

Jan 18 2026
LLM

How do you pronounce "GGUF"? The pronunciation dilemma in AI

The pronunciation of "GGUF", a file format used in the field of artificial intelligence, is generating a heated debate in the community. The most common options include "jee-guff", "giguff", and "jee jee you eff". The discussion highlights the challenges of standardization in technical terminology.

Jan 18 2026
General

LLMOnPremise Major Update

The LLMOnPremise M2 update introduces a decision framework for enterprise AI deployment, replacing prescriptive recommendations with constraint analysis. It features strategic decision tools, deep-dive scenarios, an expanded hardware matrix, and a scenario-aware "Ask" mode, catering to solution architects, DevOps engineers, and compliance leads.

Jan 18 2026
LLM

Are LLM Agents Mostly Markdown Todo List Processors?

A user has raised an interesting question regarding the internal architecture of major agents based on large language models (LLMs). It appears that many of these agents break down complex tasks into simple todo lists, executing them sequentially. This implementation, if confirmed, raises questions about the actual intelligence and reasoning capabilities of such systems.

Jan 18 2026
Hardware

ROCm+Linux Support on Strix Halo: January 2026 Stability Update

A user on Reddit reported the future release of a stability update for ROCm and Linux support on Strix Halo. The delivery, expected in January 2026, aims to improve the integration of these technologies. Strix Halo is an AMD hardware platform designed to deliver high graphics performance in mobile environments. This initiative could open new opportunities for Linux developers.

Jan 18 2026
Hardware

AMD Strix Halo: ROCm+Linux Stable Configurations in January 2026

A video and a reference table on Reddit showcase the stable ROCm+Linux configurations for AMD Strix Halo, tested in January 2026. The documentation includes troubleshooting of initial issues. Details are available on GitHub, providing an overview of the working configurations.

Jan 18 2026
Other

AI for human agency: a driver of growth and opportunity

Artificial intelligence can expand human capabilities, bridging the skills gap and unlocking new growth opportunities for individuals, businesses, and nations. An analysis of AI's potential as a tool to increase productivity and foster economic development.

Jan 18 2026
LLM

RLVR and GRPO: From-Scratch Implementation with Notebook

A code notebook illustrating a from-scratch implementation of RLVR (Reinforcement Learning with Verifiable Rewards) with GRPO (Group Relative Policy Optimization) is now available. The resource, hosted on GitHub, was shared on Reddit and is intended for those who want to deepen their practical understanding of these algorithms.
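For readers unfamiliar with GRPO, its core idea is to score each sampled completion relative to the other completions drawn for the same prompt, rather than against a learned value function. The sketch below shows only that advantage step. It is not taken from the notebook: the function name is hypothetical, and the use of the population standard deviation and the `eps` value are assumptions.

```python
from statistics import mean, pstdev
from typing import List, Sequence


def group_relative_advantages(rewards: Sequence[float], eps: float = 1e-8) -> List[float]:
    """Normalize each reward against its own group of completions.

    Assumption: population standard deviation; eps avoids division by zero
    when every completion in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Verifiable rewards are often binary (for example, 1.0 if the final answer
# passes a checker, 0.0 otherwise); this group has two passes and two failures.
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
```

Completions that beat the group average get a positive advantage and the rest a negative one. If every completion in a group earns the same reward, all advantages are zero and that group contributes no learning signal.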

Jan 18 2026
Hardware

Linux 6.19: USB Issues Fixed for Apple M1/M2 Macs

Linux 6.19-rc6 brings two USB fixes specifically for Apple Macs with M1 and M2 chips. The patches, intended for the mainline kernel, will be back-ported to stable Linux versions. This should improve hardware compatibility for those using Linux on these devices.

Jan 18 2026
Market

OpenAI: A Business Model Scaling with Intelligence

OpenAI's business model scales with the value of intelligence. The company leverages subscriptions, APIs, advertising, commerce, and compute, all driven by the increasing adoption of ChatGPT. This strategy allows OpenAI to grow efficiently, adapting to market evolution and new opportunities offered by AI.

Jan 18 2026
Hardware

Tesla: New AI Chips Every Nine Months, Challenging Nvidia and AMD

Elon Musk aims for a faster development and release cycle for new AI accelerators compared to Nvidia and AMD. The goal is to produce chips in extremely high volumes, but the engineering challenge is significant. Tesla intends to accelerate its roadmap in the field of artificial intelligence.
