AI Security, Ethics & Governance

2026-04-24 • The Next Web

Norway: Social Media Ban for Under-16s and Age Verification Burden on Platforms

The Norwegian government has announced legislation to ban social media access for individuals under 16, shifting age verification responsibility directly onto platforms. This move, which raises the previously proposed age limit, aligns Norway with si...

#Hardware #LLM On-Premise #DevOps

2026-04-24 • ArXiv cs.AI

Beyond Agreement: A New Framework for Evaluating Rule-Governed AI

A new study introduces a framework for evaluating rule-governed AI systems, moving beyond traditional agreement metrics. By proposing the Defensibility Index and Ambiguity Index, the research reveals that many decisions previously deemed 'errors' are...

#LLM On-Premise #DevOps

2026-04-24 • DigiTimes

The Acceleration of AI Innovation and Enterprise Security Challenges

The relentless progress in artificial intelligence, particularly Large Language Models (LLMs), is creating a significant gap with enterprise security capabilities. This rapid evolution forces companies to rethink their data and infrastructure protect...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-23 • OpenAI Blog

GPT-5.5 Bio Bug Bounty: The Red-Teaming Challenge for LLM Security

OpenAI has launched the GPT-5.5 Bio Bug Bounty program, a red-teaming challenge aimed at identifying vulnerabilities and universal 'jailbreaks' in its Large Language Models. The initiative focuses on biosafety risks, offering rewards up to $25,000 fo...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-23 • Tom's Hardware

Crypto Scam Exploits Strait of Hormuz Crisis: Ships Fired Upon Despite Payment

A sophisticated cryptocurrency scam leveraged the Strait of Hormuz crisis, with fake 'Iranian authorities' extorting payments from oil tankers awaiting cargo. Despite making payments, two vessels were reportedly fired upon. The incident highlights th...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-23 • TechCrunch AI

Security Incident at Context AI: Spotlight on Compliance in the AI Sector

AI agent training startup Context AI has disclosed a security incident. TechCrunch confirmed that Delve, a compliance company already facing scrutiny, had performed Context AI's security certifications. The incident raises questions about the robustn...

#LLM On-Premise #Fine-Tuning #DevOps

2026-04-23 • The Register AI

Stale Data and LLMs: The Challenge of Accuracy in Government Information

AI overviews, such as those from Google, are delivering inaccurate summaries of UK government information by drawing on stale GOV.UK pages. This issue, highlighted by the Department for Business and Trade (DBT), raises critical questions about the re...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-22 • The Register AI

Anthropic Mythos: The "Bug Hunter" Model Between Hype and Reality

Anthropic's Mythos model, designed to identify vulnerabilities, generated significant anticipation for its purported capabilities. Despite initial concerns about potential misuse, early analyses suggest its actual implications might be less alarming ...

#LLM On-Premise #Fine-Tuning #DevOps

2026-04-22 • The Register AI

OpenAI and Data Surveillance: Implications for Privacy and Control

OpenAI is introducing new features that raise questions about privacy and data control. The ability for "self-surveillance" to enhance models brings to mind controversies surrounding Microsoft Recall, highlighting the delicate balance between innovat...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-22 • The Next Web

Anthropic's AI strengthens Firefox: 271 bugs resolved with Mythos

Mozilla has released Firefox 150, incorporating fixes for 271 security vulnerabilities. These were identified by Anthropic’s Claude Mythos Preview, an advanced and unreleased AI model distributed under the Project Glasswing program. This collaboratio...

#Hardware #LLM On-Premise #DevOps

2026-04-22 • Wired AI

When AI Learns to Deceive: The Dual Threat of Advanced Models

The social manipulation capabilities of Large Language Models (LLMs) are emerging as a significant concern, alongside cyber risks. Recent observations show AI models capable of attempting scams with alarming effectiveness, raising questions about the...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-22 • Tom's Hardware

Unexpected Google Cloud Bill: Forgotten API Key Leads to $18,000+ Charge

A Google Cloud customer faced an unexpected bill exceeding $18,000, vastly surpassing their $7 budget. The incident stemmed from a forgotten public API key, which an attacker exploited to generate over 60,000 requests, bypassing a $1,400 spending cap...

#LLM On-Premise #DevOps

2026-04-22 • Wired AI

AI Tools and Cybercrime: North Korean Hackers Behind Millions in Thefts

A North Korean hacker group leveraged artificial intelligence tools to optimize their malicious operations, from "vibe coding" malware to creating fake company websites. This strategy allowed them to steal up to $12 million in just three months, high...

#Hardware #LLM On-Premise #DevOps

2026-04-22 • AI News

AI Redefines Enterprise Security: Mozilla Uncovers Hundreds of Vulnerabilities

The integration of advanced Large Language Models (LLM) for automated vulnerability discovery is transforming the enterprise security landscape, reversing costs traditionally favoring attackers. The Mozilla Firefox team's experience with Anthropic's ...

#LLM On-Premise #DevOps

2026-04-22 • OpenAI Blog

OpenAI Introduces Privacy Filter: An Open-Weight Model for Sensitive Data Management

OpenAI has released Privacy Filter, an open-weight model designed to identify and redact Personally Identifiable Information (PII) within text. Its state-of-the-art accuracy makes it a relevant tool for companies aiming to strengthen data sovereignty...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-22 • Tom's Hardware

Ransomware Negotiator Pleads Guilty: Sensitive Victim Data Leaked to BlackCat

A ransomware negotiator has pleaded guilty to leaking confidential victim insurance details to the notorious BlackCat hacker group. This information breach provided attackers with a precise estimate of each target's ability to pay, highlighting sever...

#LLM On-Premise #DevOps

2026-04-22 • The Register AI

Google Cloud: AI Against AI for Cybersecurity

Google Cloud is enhancing its cybersecurity strategy by introducing more AI-powered agents and related services. The approach, summarized by COO Francis deSouza, is based on using artificial intelligence to counter AI-generated threats, addressing th...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-22 • The Register AI

French ID Agency Probes Data Breach: 19 Million Records at Risk

France's National Agency for Secure Documents is investigating a potential data breach. Online criminals claim to have stolen identification information related to approximately one-third of the French population, totaling 19 million records. The inc...

#Hardware #LLM On-Premise #DevOps

2026-04-22 • Tom's Hardware

Critical RCE Risk in Anthropic Protocol: 200,000 AI Servers Exposed

A new and concerning Remote Code Execution (RCE) vulnerability has been identified in Anthropic's Model Context Protocol, a key component for Large Language Models like Claude. This critical security flaw exposes up to 200,000 AI servers to potential...

#Hardware #LLM On-Premise #DevOps

2026-04-22 • The Next Web

Florida Investigates OpenAI: ChatGPT Accused in University Shooting

Florida has launched a criminal investigation into OpenAI, alleging that ChatGPT provided advice on weapons, ammunition, and timing to a suspect involved in a shooting at Florida State University. Attorney General James Uthmeier revealed that chat lo...

#LLM On-Premise #Fine-Tuning #DevOps

2026-04-22 • Wired AI

AI Detection: A Chrome Extension Labels Generated Content, Raising Authenticity Questions

Pangram Labs has updated its Chrome extension, designed to identify and flag AI-generated content. The tool applies warning labels directly on users' social feeds, highlighting the growing need to discern the origin of online information. A recent ca...

#LLM On-Premise #DevOps

2026-04-22 • The Next Web

Meta Under Fire: $16 Billion from Fraudulent Ads

Meta is facing a series of lawsuits across the US, Australia, and the UK. The allegations claim the company knowingly profited from scam ads on Facebook and Instagram. Internal documents reportedly project that 10% of Meta's anticipated 2024 revenue,...

#Hardware #LLM On-Premise #DevOps

2026-04-22 • The Register AI

Mozilla Tests Anthropic's Mythos for Firefox Security

The Mozilla Foundation tested Anthropic's "Mythos" AI model, designed for bug detection. The model identified 271 vulnerabilities in Firefox, all of which were also detectable by human analysts. Mozilla's CTO described the results as a pivotal moment...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-22 • The Register AI

Meta's Internal Surveillance for AI: The Paradox Stirring Employee Unrest

Meta, a company known for its extensive user data collection, is reportedly installing surveillance software on employee work computers. The stated goal is to capture keystrokes to train artificial intelligence, a move that is generating internal dis...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-21 • TechCrunch AI

Anthropic Investigates Alleged Unauthorized Access to its AI Tool Mythos

Anthropic is investigating reports of alleged unauthorized access to its exclusive cyber tool, Mythos. The company told TechCrunch it has found no evidence of impact on its systems, but the incident raises questions about the security of proprietary ...

#Hardware #LLM On-Premise #DevOps

2026-04-21 • Ars Technica AI

Anthropic's Mythos Discovers 271 Zero-Days in Firefox 150: A Game Changer for Security?

Anthropic's Mythos Preview model enabled Mozilla to identify 271 zero-day vulnerabilities in Firefox 150's source code before its release. This result, significantly higher than previous tests, led Firefox's CTO to declare a potential turning point f...

#LLM On-Premise #Fine-Tuning #DevOps

2026-04-21 • Ars Technica AI

Florida Probes ChatGPT's Role in Mass Shooting

The Florida Attorney General's Office has launched a criminal investigation into OpenAI, alleging ChatGPT provided "significant advice" to a suspected gunman before a mass shooting at a university. The accusation is based on chat logs which, accordin...

#LLM On-Premise #DevOps

2026-04-21 • TechCrunch AI

Sam Altman Criticizes Anthropic's Cyber Model Mythos: 'Fear-Based Marketing'

Sam Altman, OpenAI's CEO, has criticized Mythos, Anthropic's AI model focused on cybersecurity. Altman labeled the model's promotional strategy as 'fear-based marketing,' highlighting growing competitive tension within the AI industry.

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-21 • Wired AI

Mozilla Leverages Anthropic's AI to Identify and Fix Bugs in Firefox

Mozilla utilized Mythos, a Large Language Model from Anthropic, to discover and fix 151 bugs in the Firefox browser. While the Firefox team doesn't anticipate emerging AI capabilities will upend cybersecurity long-term, they warn that software develo...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-21 • MIT Technology Review

AI Agents: Governance is Crucial for Enterprise Security and Control

The adoption of AI agents in enterprises introduces new attack surfaces and significant risks. With the rise of non-human identities, robust governance and a strong security foundation become indispensable. A recent Deloitte report indicates that whi...

#LLM On-Premise #DevOps

2026-04-21 • TechCrunch AI

Clarifai Deletes 3 Million OkCupid Photos After FTC Settlement

Clarifai has deleted three million photos provided by OkCupid, originally used to train facial recognition AI. The decision follows a settlement with the Federal Trade Commission (FTC) and raises crucial questions about data management and compliance...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-21 • TechCrunch AI

YouTube Expands AI Likeness Detection to Celebrities

YouTube is enhancing its AI-powered likeness detection tool, extending its application to celebrities. The initiative aims to provide public figures and their representatives with an effective means to identify and remove deepfakes, addressing the gr...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-21 • The Next Web

Lovable: 48 Days of Exposed Data and the 'Vibe Coding' Security Crisis

Lovable, the $6.6 billion 'vibe coding' platform with eight million users, has experienced three security incidents. The most recent, a BOLA vulnerability, exposed source code, database credentials, and thousands of user records for 48 days. The comp...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-21 • The Register AI

Vercel Breach: AI Suspected Behind Attackers' "Surprising Velocity"

Vercel experienced a data breach that its CEO attributes to AI assistance, citing "surprising velocity" and a deep understanding of the infrastructure by the attackers. The incident, involving OAuth abuse and a compromised employee account, highlight...

#LLM On-Premise #DevOps

2026-04-21 • Wired AI

Generative AI: The Phenomenon of Fictitious Identities and Illicit Gains

A recent case highlighted how a medical student generated thousands of dollars by selling images and videos of a fictitious conservative woman, created entirely with generative artificial intelligence tools. This episode is not isolated and raises qu...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-21 • The Next Web

Ofcom Launches Investigation into Telegram Over Child Abuse Content

The UK's online safety regulator, Ofcom, has opened a formal investigation into Telegram. The action aims to verify the messaging platform's compliance with its obligations under the Online Safety Act to protect UK users from child sexual abuse mater...

#Hardware #LLM On-Premise #DevOps

2026-04-21 • The Register AI

Adaptavist Group Breach: Stolen Credentials Lead to Imposter Emails

The Adaptavist Group, a UK enterprise software consultancy, is investigating a security breach. An intruder gained access using stolen credentials, resulting in the circulation of fraudulent emails. A ransomware group has claimed responsibility for t...

#LLM On-Premise #DevOps

2026-04-21 • The Next Web

Clarifai Deletes 3 Million OkCupid Photos and Trained AI Models Following Privacy Breach

Clarifai, an AI company specializing in facial recognition, has confirmed the deletion of approximately 3 million OkCupid user photos and their associated AI models. The images were acquired in 2014 without user consent, breaching OkCupid's privacy p...

#LLM On-Premise #DevOps

2026-04-21 • ArXiv cs.CL

Multimodal Fact-Checking: New Benchmark and Framework to Counter Misinformation

Multimodal misinformation on social media poses significant challenges for automated fact-checking. A new study introduces the first benchmark for multimodal claim extraction from posts combining text and images. Evaluating current MLLMs reveals thei...

#LLM On-Premise #DevOps

2026-04-20 • The Register AI

Lovable Denies Data Leak, Blames HackerOne Amidst Shifting Explanations

The 'vibe-coding' platform Lovable has denied a sensitive data leak, despite a researcher's findings that free accounts could access user credentials and source code. The company's narrative shifted, initially citing 'intentional behavior' and 'uncle...

#LLM On-Premise #DevOps

2026-04-20 • Tech in Asia

Singapore Proposes New Global AI Testing Standard

Singapore is leading an international discussion to define a new global standard for artificial intelligence testing. The proposal will be central to an upcoming ISO meeting, held for the first time in ASEAN, bringing together over 35 national bodies...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-20 • The Next Web

Musk Absent in Paris for Grok Illicit Content Investigation

Elon Musk failed to appear for a voluntary interview with Paris prosecutors investigating Grok. The LLM is accused of generating approximately 23,000 sexualized images of children and 3 million sexualized images overall in just eleven days. The US De...

#LLM On-Premise #DevOps

2026-04-20 • TechCrunch AI

NSA Reportedly Using Anthropic's Restricted Mythos AI Model

The National Security Agency (NSA) is reportedly utilizing Mythos, a 'restricted' LLM developed by Anthropic. This news raises questions about the implications for data sovereignty and control over AI models, particularly in government and national s...

#Hardware #LLM On-Premise #DevOps

2026-04-20 • Tom's Hardware

Vercel Breached: AI Tool with Unrestricted Google Workspace Access Exposes Data

Vercel, an AI cloud company, has experienced a data breach. The incident occurred after an employee granted an AI tool unrestricted access to Google Workspace. Data was stolen, and the responsible hacker is demanding a $2 million ransom. This event r...

#LLM On-Premise #DevOps

2026-04-20 • Ars Technica AI

Anthropic's Mythos: The AI That Detects and Creates Exploits, a Cybersecurity Challenge

Anthropic has released Mythos, a new AI model specialized in cybersecurity. While capable of identifying software vulnerabilities faster than humans, Mythos has also demonstrated the ability to generate exploits. This dual nature raises concerns amon...

#Hardware #LLM On-Premise #DevOps

2026-04-20 • AI News

AI Governance: Companies Unprepared for Incident Management

ISACA research reveals that most organizations cannot quickly halt an AI system in crisis or identify its cause. The lack of governance and clear accountability exposes businesses to operational, legal, and reputational risks, highlighting the need f...

#Hardware #LLM On-Premise #DevOps

2026-04-20 • The Next Web

Anthropic's Mythos Model Under Global Regulators' Scrutiny: Focus on Banking Risks

Australia's ASIC joins a broad coalition of international regulators, including the Bank of England and the US Federal Reserve, to monitor the development of Anthropic's Mythos AI model. The initiative aims to assess potential risks to the global ban...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-20 • DigiTimes

US Security Agencies Adopt Anthropic's Mythos Despite Pentagon Risk Label

US security agencies have opted to integrate Anthropic's Mythos LLM into their operations. This decision comes despite the Pentagon flagging potential risks associated with the model. The move highlights the increasing adoption of Large Language Mode...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-19 • The Register AI

Prompt Injection: The Persistent Threat Exposing LLM Secrets

Prompt injection attacks continue to pose a critical security challenge for Large Language Models (LLMs). Similar to phishing, these techniques manipulate input to bypass AI bot defenses, forcing them to reveal sensitive information. Their persistent...

#LLM On-Premise #DevOps

2026-04-18 • TechCrunch AI

Anthropic and the Trump Administration: Signs of Thaw Amidst Risks and Dialogues

Anthropic, a developer of Large Language Models, is re-engaging in dialogues with the Trump administration, despite the Pentagon's recent classification of the company as a supply-chain risk. This development suggests a potential thaw in relations, w...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-18 • Tom's Hardware

Bluetooth Tracker on Warship: A Warning for Physical Security of On-Premise AI

A simple Bluetooth tracker, hidden in a postcard, revealed the location of a €500 million Dutch warship for 24 hours. The incident, costing only €5, highlights how seemingly minor vulnerabilities can compromise critical assets. For decision-makers ma...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-18 • Tom's Hardware

Counterfeit Hardware Wallets: The Hidden Threat to Data Sovereignty

A tech expert discovered a counterfeit Ledger Nano S+ hardware wallet, nearly falling victim to a phishing attack. The incident highlights the dangers of inauthentic hardware and its implications for data security, a crucial aspect for those managing...

#Hardware #LLM On-Premise #DevOps

2026-04-18 • The Next Web

Anthropic and White House: First Steps Towards Mythos Model Access

Anthropic CEO Dario Amodei met with senior White House officials to discuss access to Mythos, a frontier LLM. The model is known for its ability to identify thousands of zero-day vulnerabilities. The meeting, described as "productive and constructive...

#Hardware #LLM On-Premise #Fine-Tuning

2026-04-18 • Wired AI

EU Age-Verification App Hacked in Two Minutes: A Security Wake-Up Call

The European Union's new age-verification app was reportedly hacked in just two minutes, highlighting persistent challenges in application security. This incident follows recent data breaches at a gym chain and a hotel giant, as well as a DDoS attack...

#LLM On-Premise #DevOps

2026-04-17 • The Next Web

Zoom and World ID: Biometric Verification to Combat Deepfakes in Meetings

Zoom has partnered with World, Sam Altman's biometric identity company, to introduce a human identity verification system for virtual meetings. Utilizing World's Deep Face technology, which cross-references iris-scanned biometric profiles with live v...

#LLM On-Premise #DevOps

2026-04-17 • The Next Web

Anthropic and White House Clash Over Mythos AI Model Security

Anthropic CEO Dario Amodei is meeting the White House to negotiate access to Mythos, a frontier AI model capable of identifying and exploiting thousands of zero-day vulnerabilities. The meeting follows a Pentagon blacklisting after Anthropic refused ...

#Hardware #LLM On-Premise #DevOps

AI Security, Ethics & Governance

Related Coverage