Topic / Trend Stable

AI Ethics and Safety Concerns

Ethical considerations and safety measures are becoming increasingly important as AI models become more powerful. This includes addressing biases, preventing misuse, and ensuring responsible development.

Detected: 2026-02-13 · Updated: 2026-02-13

Related Coverage

2026-02-13 ArXiv cs.CL

Response-Based Knowledge Distillation: Multilingual LLM Safety Compromised?

A new study explores knowledge distillation to improve the safety of large language models (LLMs) in multilingual contexts. Results show that fine-tuning on "safe" data can paradoxically increase model vulnerability to jailbreak attacks, highlighting...

#LLM On-Premise #Fine-Tuning #DevOps
2026-02-12 404 Media

AI Abuse: Nude AI Images Created, OnlyFans Opened in Her Name

A woman was victimized by AI-generated images. Strangers created nude images from her profile and opened an OnlyFans account in her name. The incident occurred during a surge in the generation of sexual images via AI, raising questions about the misu...

2026-02-12 AI News

State-sponsored hackers exploit AI for advanced cyberattacks

State-sponsored hackers are exploiting AI models like Gemini to refine phishing attacks and develop malware. Groups from Iran, North Korea, China, and Russia are leveraging AI for reconnaissance, social engineering, and malware development, increasin...

2026-02-11 TechCrunch AI

OpenAI reorganizes mission alignment team focused on AI safety

OpenAI has disbanded its mission alignment team, which focused on developing 'safe' and 'trustworthy' artificial intelligence. The team's leader will become OpenAI's Chief Futurist, with other members reassigned within the company.

#DevOps
2026-02-11 Ars Technica AI

OpenAI researcher quits over fears that ChatGPT ads could manipulate users

Zoë Hitzig, an economist and researcher, resigned from OpenAI due to disagreements over ChatGPT's advertising strategy. She fears that the use of personal data shared by users could lead to manipulation, repeating past mistakes. Hitzig criticizes Ope...

#LLM On-Premise #DevOps
2026-02-11 MIT Technology Review

Is a secure AI assistant possible?

AI assistants equipped with autonomous action capabilities raise serious concerns about data security. The article examines the risks associated with tools like OpenClaw, which offer extensive customization options but expose users to potential promp...

2026-02-11 404 Media

Ring Under Scrutiny: Surveillance and Privacy Concerns

A podcast analyzes Ring's new features and raises concerns about mass surveillance. It also discusses how Apple's Lockdown Mode prevented the FBI from accessing a Washington Post reporter's iPhone, highlighting the importance of device security.

#LLM On-Premise #DevOps
2026-02-10 The Register AI

AI agents spill secrets just by previewing malicious links

Researchers warn: a zero-click prompt injection vulnerability can leak data when AI agents meet messaging apps. An attacker can trick an AI agent into generating a data-leaking URL, which link previews may fetch automatically, exposing sensitive info...

#LLM On-Premise #DevOps
2026-02-09 The Register AI

Single Prompt Bypasses LLM Safety Guardrails

Microsoft Azure researchers discovered that a single, unlabeled training prompt can disable the safety mechanisms built into several large language models (LLMs). The finding raises concerns about the robustness of current safeguards.

#LLM On-Premise #Fine-Tuning #DevOps
2026-02-07 LocalLLaMA

Prompt injection: critical vulnerability for self-hosted LLMs

A user reports a severe prompt injection vulnerability in a self-hosted LLM system. During testing, a malicious prompt exposed the entire system prompt, highlighting the lack of adequate defenses against this type of attack. Traditional Web Applicati...

#LLM On-Premise #DevOps
2026-02-07 The Next Web

Anthropic challenges OpenAI with Super Bowl ads: AI advertising

Anthropic invested millions of dollars in Super Bowl commercials to highlight its strategy, which rejects the insertion of advertising in chatbots, in contrast to other companies in the sector. The campaign aims to highlight a different approach to t...

← Back to All Topics