Google releases new factuality benchmark for enterprise AI
Google's new FACTS benchmark measures how accurately AI models generate information in enterprise settings.
The LLM archive monitors model releases, quantization updates, reasoning capabilities, and real-world deployment implications for local and hybrid AI. We focus on what materially changes selection and operations: context windows, latency, memory footprint, licensing, and evaluation evidence across open and commercial families. This section is designed for teams that need dependable model intelligence, not hype cycles. Pair these updates with the LLM pillar and references to hardware constraints and framework integration.
Major tech companies join forces with Linux Foundation to establish a standard for AI agent development
The FACTS Benchmark Suite is a system developed to evaluate the factuality of large language models, providing a standardized metric to measure the performance of these models.
We are strengthening our partnership with the UK government to support prosperity and security in the AI era.
Google DeepMind and UK AI Security Institute strengthen collaboration on critical AI safety and security research
The Commonwealth Bank of Australia has rolled out ChatGPT Enterprise to improve customer service and fight fraud.
The Agentic AI Foundation, co-founded by OpenAI under the Linux Foundation, aims to promote the integrity and safety of agentic AI. The foundation has received AGENTS.md as a donation to support interoperable standards development.
OpenAI is investing in stronger safeguards and defensive capabilities as AI models become more powerful in cybersecurity. We explain how we assess risk, limit misuse, and work with the security community to strengthen cyber resilience.
Scout24 has created a GPT-5 powered conversational assistant that reimagines real-estate search, guiding users with clarifying questions, summaries, and tailored listing recommendations.
Process Intelligence technology is revolutionizing how public administrations manage funds and make decisions, ensuring greater transparency and efficiency.
Learn how our new certifications and AI Foundations courses help people build real-world AI skills, boost career opportunities, and prepare for the future of work.
Google has denied rumors about the presence of ads in Gemini, stating that there will be no advertising on the platform.
Deutsche Telekom is collaborating with OpenAI to bring advanced, multilingual AI experiences to millions of people across Europe. ChatGPT Enterprise will also be used by Deutsche Telekom to improve workflows and accelerate innovation.
Chrome's AI agents pose security risks. Google explains how it will protect users with Gemini control models.
The release of GLM-4.6V represents a significant advancement in the field of large language models, offering integration of visual tools and structured multimodal generation.
Virgin Atlantic uses AI to speed up development, improve decision-making, and elevate customer experience. CFO Oliver Byers shares how the airline is using data and advanced technologies to deliver a personalized travel experience.
Anthropic has signed a deal with Slack to offer its AI coding agent Claude to software development teams, improving collaborative work and increasing productivity.
Experts debate how to predict AI's impact through 2030, with divergent opinions, promising futures, and a need to make the technologies more accessible.
The rise of AI is changing the way small businesses create their brands. With tools like Design.com, entrepreneurs can explore and personalize their ideas interactively and accessibly from day one.
Booking.com has developed a flexible and scalable agent strategy, combining natural language models and personalization techniques to improve request accuracy and reduce reliance on human operations.