Topic / Trend Rising

AI Model Development & Agentic AI

The rapid evolution of Large Language Models (LLMs) and the emergence of AI agents are transforming how AI interacts with users and automates complex tasks. Focus is on advanced capabilities, optimization, and integration into various platforms.

Detected: 2026-04-23 · Updated: 2026-04-23

Related Coverage

2026-04-23 ArXiv cs.CL

Locating and Preventing Stereotypes in Large Language Models

A recent study investigates the internal mechanisms of LLMs like GPT 2 Small and Llama 3.2 to locate stereotypes. The research explores identifying specific neuronal activations and "attention heads" that contribute to biased outputs. The goal is to ...

#LLM On-Premise #Fine-Tuning #DevOps
2026-04-23 ArXiv cs.LG

A Transparent Framework for Evaluating LLM Impact: Comparability and TCO

A new study introduces a transparent screening framework for estimating the inference and training impacts of Large Language Models. The methodology, based on natural-language descriptions, generates bounded environmental estimates and supports a com...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-23 ArXiv cs.LG

WorkflowGen: An Adaptive Framework for Optimizing LLM Workflows

WorkflowGen is a new framework addressing LLM agent inefficiencies such as high token consumption and instability. Proposed as an adaptive, experience-driven solution, it reduces token consumption by over 40% and improves success rates by 20% on medi...

#LLM On-Premise #Fine-Tuning #DevOps
2026-04-23 ArXiv cs.AI

Algorithm Selection with Zero Domain Knowledge via Text Embeddings

A new study introduces ZeroFolio, an innovative approach to algorithm selection that leverages pretrained text embeddings. This method, free from hand-crafted features, analyzes raw instance files as plain text to identify the most effective algorith...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-23 ArXiv cs.AI

The LLM Tool-Overuse Illusion: Optimizing Efficiency

New research highlights a critical phenomenon in LLMs: tool overuse. Models tend to employ external tools even when internal knowledge would suffice, slowing down operations. The study identifies two key mechanisms: a "knowledge epistemic illusion" a...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-22 TechCrunch AI

Google Workspace Enhances with AI: New Automated Productivity Functions

Google has introduced new automated functionalities within its Workspace suite, all powered by "Workspace Intelligence," its proprietary artificial intelligence system. This integration aims to streamline daily tasks, offering users advanced tools to...

#Hardware #LLM On-Premise #DevOps
2026-04-22 OpenAI Blog

ChatGPT Images 2.0: New Capabilities for Image Generation and Visual Reasoning

OpenAI has introduced ChatGPT Images 2.0, a state-of-the-art image generation model that brings significant improvements. Key enhancements include more accurate text rendering within images, extended multilingual support, and advanced visual reasonin...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-22 OpenAI Blog

AI Agents for Workflow Automation: Building, Using, and Scaling

The integration of AI agents into enterprise workflows represents a strategic lever for automating and optimizing operations. These tools, capable of connecting various platforms and streamlining repeatable tasks, offer companies the ability to build...

#Hardware #LLM On-Premise #DevOps
2026-04-22 OpenAI Blog

ChatGPT Introduces Cloud-Based Workspace Agents for Automated Workflows

OpenAI has integrated new Codex-powered agents into ChatGPT, designed to automate complex workflows. These agents operate entirely in the cloud, offering teams the ability to securely scale operations across various tools. Their introduction raises r...

#Hardware #LLM On-Premise #DevOps
2026-04-22 TechCrunch AI

Google Launches AI Agent Platform: Focus on IT and Technical Users

Google has introduced the Gemini Enterprise Agent Platform, a new solution for building LLM-based agents. The platform stands out for its specific orientation towards IT and technical users, suggesting a focus on control, integration, and customizati...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-22 TechCrunch AI

AI Summaries in Enterprise Gmail: Implications for Businesses

Google is introducing AI Overviews to enterprise Gmail accounts, a feature that will generate instant summaries from multiple emails. This development raises questions about data management strategies and underlying infrastructure, especially for org...

#Hardware #LLM On-Premise #DevOps
2026-04-22 Microsoft Research

AutoAdapt: Automating LLM Adaptation for Critical Scenarios

Microsoft Research introduces AutoAdapt, an Open Source Framework that automates the adaptation of Large Language Models to specialized, high-stakes domains. The system addresses challenges of reproducibility, cost, and time, transforming manual proc...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-22 Ars Technica AI

Generative AI as a Source of Income: The Case of an Indian Student

An Indian medical student explored an unconventional path to supplement his finances. Using Google Gemini’s Nano Banana Pro, he generated images of a girl through artificial intelligence and then commercialized them online. This initiative, undertake...

#Hardware #LLM On-Premise #DevOps
2026-04-22 The Register AI

Grafana: Free AI Assistant for On-Premise and Open Source Deployments

Grafana has announced the free availability of its AI assistant, specifically targeting Open Source communities and users managing on-premise deployments. The initiative, unveiled at the Barcelona user conference, strengthens the company's commitment...

#Hardware #LLM On-Premise #DevOps
2026-04-22 TechCrunch AI

Generative AI Integrates into Google Maps

Google has announced the integration of generative artificial intelligence into its popular Google Maps application. This move reflects the growing adoption of LLMs in consumer services and raises questions about the technical and strategic implicati...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-22 The Next Web

Bond: The 'Post-Feed' Social Network Leveraging AI for Real Experiences

Bond, a new social network launched on April 21 by former Index Ventures and Google DeepMind executives, aims to combat "doomscrolling." The application eschews infinite scrolls and traditional algorithmic feeds, instead employing artificial intellig...

#Hardware #LLM On-Premise #DevOps
2026-04-22 The Next Web

NeoCognition Secures $40M Seed for Experience-Driven AI Agents

NeoCognition, a startup, has raised $40 million in seed funding to develop AI agents that specialize through direct experience rather than relying solely on pre-training. The company, a spin-off from Ohio State University, aims to close the reliabili...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-22 ArXiv cs.CL

LLMs and Cognition: Model Representations Predict Human Reading Times

New research investigates whether the internal representations of Large Language Models (LLMs) capture cognitive signals related to human reading times. The study revealed that early LLM layers outperform scalar predictors in predicting early-pass re...

#Hardware #LLM On-Premise #DevOps
2026-04-22 ArXiv cs.CL

2D Early Exit Optimization: New Horizons for On-Premise LLM Inference

A two-dimensional early exit strategy revolutionizes LLM inference by coordinating layer-wise and sentence-wise exiting. This incremental method generates multiplicative computational savings, surpassing single optimizations. Tested on 3B-8B paramete...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-22 ArXiv cs.LG

Self-Evolving LLMs: EasyRL Optimizes Fine-tuning with Less Data

A new study introduces EasyRL, an innovative approach for LLM post-training that aims to overcome the limitations of existing methods, such as high annotation costs and model collapse issues. Inspired by cognitive learning theory, EasyRL utilizes a p...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-22 ArXiv cs.LG

LLMs and Theorem Proving: Compilation Reduces Computational Costs

A new approach leverages compiler outputs to enhance the efficiency of LLMs in formal theorem proving. The method addresses the computational bottleneck typical of current solutions that demand prohibitive resources. Through a learning-to-refine fram...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-22 ArXiv cs.AI

Visualizing LLM Distributions: A New Approach Beyond Single Outputs

Evaluating Large Language Models (LLMs) based on single outputs overlooks the richness of possible response distributions, leading to erroneous generalizations. New research introduces GROVE, an interactive visualization tool that represents multiple...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-21 TechCrunch AI

ChatGPT Images 2.0: OpenAI's Model Surprisingly Good at Text Generation

OpenAI has introduced ChatGPT Images 2.0, a new image-generation model that demonstrates an unexpected ability to produce text. This evolution highlights the rapid advancements in AI capabilities, presenting new challenges and opportunities for on-pr...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-21 TechCrunch AI

NeoCognition Secures $40M Seed Funding for AI Agents That Learn Like Humans

NeoCognition, a startup founded by an OSU researcher, has closed a $40 million seed funding round. The company aims to develop AI agents capable of autonomous learning, emulating human abilities, to become experts in any domain. This approach promise...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-21 Wired AI

OpenAI Updates ChatGPT's Image Generation Model

OpenAI has released ChatGPT Images 2.0, a new version of its image generation model. Preliminary tests show improvements in detail rendering and text generation, though challenges persist with languages other than English. This update highlights the ...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-21 TechCrunch AI

GRAI: AI for Music Remixing, Not Artist Replacement

AI music startup GRAI advocates for an approach to artificial intelligence in the music industry that emphasizes collaboration and human creativity. The company believes AI should empower fans to remix existing tracks rather than generate original co...

#Hardware #LLM On-Premise #DevOps
2026-04-21 ArXiv cs.LG

UniMamba: A Unified Framework for Complex Time Series Forecasting

UniMamba addresses multivariate time series forecasting challenges by integrating the efficiency of state-space models with the pattern recognition capabilities of attention mechanisms. This new framework overcomes the limitations of current methods,...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-21 ArXiv cs.LG

BASIS: Optimizing Activation Memory in LLM Training

A new algorithm, BASIS, promises to overcome the activation memory bottleneck in training deep neural networks, including Large Language Models. By decoupling memory from batch and sequence dimensions, BASIS significantly reduces VRAM requirements wh...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-21 Phoronix

AMD GAIA: Portable AI Agents for Local Deployments

AMD is enhancing GAIA, its cross-platform software solution built around the Lemonade SDK, for running local AI agents on AMD hardware (CPUs, GPUs, NPUs). The latest update introduces portability for custom AI agents, facilitating easy import and exp...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-20 TechCrunch AI

Google Expands Gemini in Chrome to Seven New Markets

Google has announced the expansion of its Gemini model within the Chrome browser to seven countries, including Australia, Japan, and South Korea. This move highlights the growing integration of generative AI into everyday tools, raising crucial quest...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-20 Tech in Asia

Morgan Stanley: Agentic AI to Drive CPU Demand

A Morgan Stanley analysis predicts that agentic artificial intelligence, capable of planning and executing tasks with reduced human intervention, will generate significant demand for CPUs. This trend could add between $32.5 billion and $60 billion to...

#Hardware #LLM On-Premise #DevOps
2026-04-20 The Next Web

Semrush Introduces a Framework for Brand Visibility in the AI Search Era

Semrush has unveiled a new Brand Visibility Framework, introducing the discipline of “Agentic Search Optimisation” (ASO). This tool aims to measure brand presence in AI-generated answers, traditional search, and through autonomous AI agents, based on...

#LLM On-Premise #Fine-Tuning #DevOps
2026-04-20 TechCrunch AI

Recognizing AI-Generated Text: A Revealing Stylistic Clue

The widespread use of a specific syntactic construction in text generated by Large Language Models (LLMs) is becoming an almost certain indicator of its artificial origin. This phenomenon raises crucial questions about content authenticity verificati...

#LLM On-Premise #Fine-Tuning #DevOps
2026-04-20 ArXiv cs.CL

Multilingual LLMs: A Data-Efficient Framework for Code-Switching

A new fine-tuning framework aims to enhance code-switching capabilities in Large Language Models (LLMs), making them more effective in multilingual reasoning. The research introduces a data-efficient approach to identify and teach beneficial code-swi...

#LLM On-Premise #Fine-Tuning #DevOps
2026-04-20 ArXiv cs.LG

Spectral Geometry Unveils Reasoning Mechanisms in LLMs

New research reveals that Large Language Models (LLMs) exhibit "spectral phase transitions" during reasoning, distinguishing it from factual recall. The study, conducted on 11 models from 5 different architectures, identified seven key phenomena, inc...

#LLM On-Premise #Fine-Tuning #DevOps
2026-04-19 TechCrunch AI

The Strategic Window: Foundation LLMs and the Future of AI Startups

Many AI startups thrive by exploiting niches not yet covered by foundation Large Language Models. However, this situation is temporary. The expansion of these general models forces specialized companies to rapidly rethink their differentiation and de...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-18 TechCrunch AI

The App Store Resurgence: Is AI Driving the Mobile Software Boom?

New data from Appfigures forecasts a significant increase in new mobile app launches by 2026. This market rebound, particularly within the App Store, suggests that artificial intelligence tools are playing a crucial role, fueling a wave of innovation...

#Hardware #LLM On-Premise #DevOps
2026-04-17 Anthropic News

Anthropic Labs Introduces Claude Design: A New Tool for On-Premise AI

Anthropic Labs has announced Claude Design, a new tool poised to redefine interaction with artificial intelligence in the design field. For enterprises considering self-hosted deployments, this development raises crucial questions about hardware requ...

#Hardware #LLM On-Premise #DevOps
2026-04-17 TechCrunch AI

Anthropic Unveils Claude Design for Rapid Visual Creation

Anthropic has launched Claude Design, a new tool designed to facilitate the rapid creation of visual content. The product targets individuals such as founders and product managers who lack specific design skills, aiming to simplify the sharing of the...

#LLM On-Premise #DevOps
2026-04-17 The Next Web

DeepL Launches Real-Time Voice-to-Voice Translation in 40+ Languages

DeepL, the Cologne-based company known for its text translation tools, has unveiled a comprehensive suite for real-time voice-to-voice translation, supporting over 40 languages. The solution includes features for meetings and conversations, as well a...

#Hardware #LLM On-Premise #DevOps
2026-04-17 The Next Web

OpenAI Unveils GPT-Rosalind: A Specialized LLM for Life Sciences

OpenAI has launched GPT-Rosalind, its first domain-specific Large Language Model (LLM). Designed for drug discovery and life sciences research, it has been Fine-tuned for biochemistry, genomics, and protein engineering. Access is limited to a trusted...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-17 ArXiv cs.AI

SciFi: A Safe, Autonomous Agentic Framework for Scientific Automation

SciFi, a new agentic framework for autonomous scientific task automation, has been introduced. Designed to be safe, lightweight, and user-friendly, it integrates an isolated execution environment, a three-layer agent loop, and a self-assessing do-unt...

#LLM On-Premise #DevOps
2026-04-16 TechCrunch AI

Luma Launches AI-Powered Production Studio with "Wonder Project"

Luma has inaugurated an AI-powered production studio, unveiling its first project, the "Wonder Project," centered on Moses and starring Ben Kingsley. The initiative highlights the integration of AI into creative processes and its related infrastructu...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-16 TechCrunch AI

OpenAI Enhances Agentic Coding Tool with Expanded Desktop Control

OpenAI has revamped its agentic coding tool, introducing a range of new features and capabilities. This update aims to extend the tool's control and abilities directly to users' desktop environments, offering greater autonomy and potential for softwa...

#LLM On-Premise #DevOps
2026-04-16 OpenAI Blog

OpenAI Introduces GPT-Rosalind: A New LLM for Life Sciences Research

OpenAI has announced GPT-Rosalind, a frontier reasoning model designed to accelerate drug discovery, genomics analysis, and protein reasoning. This Large Language Model (LLM) aims to optimize scientific workflows, offering new capabilities to process...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-16 Ars Technica AI

OpenAI Codex Updates: Background Processing for Desktop Productivity

OpenAI has released a new version of its Codex desktop application, introducing advanced features ranging from development to knowledge work. The most significant new capability is the ability to perform tasks on the PC in the background, without int...

#Hardware #LLM On-Premise #Fine-Tuning
2026-04-16 OpenAI Blog

Codex Evolves: New Features to Accelerate Development on macOS and Windows

The updated Codex app for macOS and Windows introduces advanced features such as direct computer use, in-app browsing, image generation, memory management, and plugin support. These enhancements aim to optimize and accelerate developer workflows, off...

#Hardware #LLM On-Premise #DevOps
2026-04-16 The Next Web

Google Gemini: Image Generation Enhanced with Personal Data

Google has integrated a new image generation feature into Gemini, leveraging users' personal data from services like Gmail and Google Drive. This capability, powered by "Nano Banana," aims to create more relevant visual content. The initial rollout i...

#Hardware #LLM On-Premise #DevOps
2026-04-16 TechCrunch AI

Roblox's AI Assistant Gains Agentic Tools for Game Development

Roblox introduces new agentic functionalities for its AI assistant, aiming to support creators through every stage of game development. These tools promise to optimize planning, building, and testing, offering deeper automation and greater autonomy w...

#Hardware #LLM On-Premise #DevOps
2026-04-16 The Register AI

Visual Studio 18.5: AI Debugging Arrives with a Price, Devs Remain Unhappy

Visual Studio 2026 18.5 introduces a smarter code suggestion system and an AI-powered debugger, which comes with an implicit cost. Despite these innovations, developer frustration persists over issues like color contrast and forced updates. This rele...

#Hardware #LLM On-Premise #DevOps
2026-04-16 Tech.eu

SpAItial: 3D AI Models Are Where ChatGPT Was Five Years Ago

Matthias Niessner, CEO of SpAItial, a European 3D AI foundation model startup, states that this technology is in an early stage, comparable to ChatGPT five years ago. The startup, which raised $13 million in a seed round, aims to develop 3D AI models...

#Hardware #LLM On-Premise #DevOps
2026-04-16 TechCrunch AI

Canva's AI Assistant Now Creates Editable Designs from Text Prompts

The latest version of Canva's AI assistant introduces the ability to generate fully editable designs from simple text descriptions. This evolution allows users to create visual content more intuitively, integrating artificial intelligence directly in...

#Hardware #LLM On-Premise #DevOps
← Back to All Topics