Response-Based Knowledge Distillation: Multilingual LLM Safety Compromised?
A new study explores response-based knowledge distillation as a way to improve the safety of large language models (LLMs) in multilingual settings. Its results show that fine-tuning on "safe" data can paradoxically increase a model's vulnerability to jailbreak attacks, highlighting...
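For readers unfamiliar with the term, here is a minimal sketch of what response-based (sequence-level) knowledge distillation looks like in practice: the student is fine-tuned with ordinary cross-entropy on responses generated by the teacher, rather than on the teacher's logits. This is not the study's code; it assumes a PyTorch / Hugging Face-style causal LM interface, and the model, tokenizer, and prompt set are hypothetical placeholders.

```python
# Hedged sketch of response-based knowledge distillation.
# `teacher`, `student`, `tokenizer`, `prompts`, and `optimizer` are
# hypothetical placeholders assuming a Hugging Face-style causal LM API.
import torch
import torch.nn.functional as F

def distill_on_responses(teacher, student, tokenizer, prompts, optimizer):
    """Fine-tune `student` on text responses generated by `teacher`.

    The student never sees the teacher's logits, only its outputs, so this
    reduces to supervised fine-tuning where the teacher's (presumed safe)
    responses serve as the training data.
    """
    student.train()
    for prompt in prompts:
        # 1. Teacher generates a response to the prompt (no gradients needed).
        with torch.no_grad():
            prompt_ids = tokenizer.encode(prompt, return_tensors="pt")
            response_ids = teacher.generate(prompt_ids, max_new_tokens=128)

        # 2. Student is trained with next-token cross-entropy on that response.
        inputs = response_ids[:, :-1]
        targets = response_ids[:, 1:]
        logits = student(inputs).logits
        loss = F.cross_entropy(
            logits.reshape(-1, logits.size(-1)), targets.reshape(-1)
        )

        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```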