# AI safety

Latest news and articles about AI safety

Total: 8 articles found

Elegant Tesla Model S parked outdoors against a modern backdrop, showcasing luxury and innovation.
Technology

Musk Opens Grok 4.2 Candidate to Public Beta, Promising Weekly ‘Fast‑Learning’ Updates

Elon Musk has opened a candidate public beta of Grok 4.2, requiring users to opt in and inviting public feedback. The model claims a new fast‑learning capability and will receive weekly updates accompanied by release notes, accelerating xAI’s iterative development approach but raising questions about safety and oversight.

NeTe2026年2月17日 20:25
#Grok 4.2#Elon Musk#xAI
St. Peter's Basilica and fountain in St. Peter's Square, Vatican City, showcasing iconic architecture and tourists.
Technology

OpenAI Recruits Creator of OpenClaw, Vows to Keep Viral Agent Open-Source via New Foundation

OpenAI has hired Peter Steinberger, creator of the widely adopted agent framework OpenClaw, and pledged to place the project into a foundation that will keep it open-source and independent while receiving funding and support. The move is a tactical win for OpenAI but raises questions about governance, security and the balance between openness and centralization as agent platforms mature.

NeTe2026年2月16日 05:44
#OpenAI#OpenClaw#Peter Steinberger
Vibrant abstract artwork showcasing dynamic blue fluid textures.
World

US Military Allegedly Used Anthropic’s Claude in Venezuela Operation, Raising Questions About AI’s Role in War

U.S. media report that Anthropic’s AI model Claude was used in the January 3 U.S. operation in Venezuela, routed via a partnership with Palantir. Anthropic has not confirmed the claim and stresses its policy forbidding uses that facilitate violence, but the allegation raises legal, ethical and strategic questions about private AI models in military operations.

SoMi2026年2月14日 21:14
#Anthropic#Claude#Palantir
A breathtaking aerial shot of a dock and green waters of Lake Ohrid, North Macedonia.
Technology

OpenAI’s Voice Models Tapped for Pentagon Drone‑Swarm Challenge, Raising Dual‑Use Concerns

OpenAI has shared an open‑source voice‑to‑instruction model with two Pentagon‑selected defence firms competing in a prize to produce voice‑controlled drone‑swarm prototypes. The move highlights the tension between commercial AI innovation and the risks of rapid diffusion of components that can enable more autonomous and potentially weaponised systems.

NeTe2026年2月13日 19:04
#OpenAI#Pentagon#drone swarm
Screen displaying ChatGPT examples, capabilities, and limitations.
Technology

AI Insiders Sound the Alarm as U.S. Start‑ups Pivot from Safety to Speed

Senior researchers exiting US AI companies have publicly warned that commercialization and IPO pressures are sidelining safety, risking manipulative or harmful model behaviour. The conflict between monetisation incentives and the need for interpretability, privacy safeguards and robust alignment work has produced real‑world moderation failures and could invite regulatory intervention.

NeTe2026年2月12日 17:04
#AI safety#OpenAI#Anthropic
A SpaceX Falcon 9 rocket displayed outdoors against a clear blue sky in Dubai.
Technology

Musk’s AI Project in Retreat: Key xAI Founders Exit After SpaceX Rescue

Two prominent xAI founders quit within 48 hours after a series of earlier exits left half the original founding team gone, undermining Elon Musk’s AI ambitions. The exits, heavy cash burn, and product scandals around Grok have coincided with xAI’s absorption into SpaceX — a deal that looks like a financial bailout but raises fresh strategic and regulatory headaches.

NeTe2026年2月11日 08:54
#Elon Musk#xAI#SpaceX
Wooden letter tiles scattered on a textured surface, spelling 'AI'.
Technology

OpenClaw’s Viral Rise Signals a New Age for Cheap, Deployable AI Agents — and New Risks

OpenClaw, an open‑source agent platform created by Peter Steinberger, has gone viral by turning chat messages into executable commands across multiple model APIs, accelerating demand for inexpensive, high‑throughput models and simple local hardware like the Mac Mini. The surge highlights opportunities for Chinese model providers such as Minimax and Kimi, while raising acute security, deployment and governance challenges.

NeTe2026年2月2日 11:00
#OpenClaw#AI agent#Peter Steinberger
Artistic black and white close-up photo of banana leaves with textured patterns.
Technology

Philippines to Lift Ban on xAI’s Grok After Promised Fixes for Sexual-Content Abuse

The Philippines will lift its ban on xAI’s Grok once the company implements promised fixes to stop the chatbot being used to generate sexually explicit images, including alleged child-exploitative content. Authorities will continue close monitoring, following platform-level restrictions introduced earlier by X to block generation of real-person nudity.

NeTe2026年1月22日 02:20
#xAI#Grok#Philippines