AI Governance is a Red Flag: Vitalik Buterin Offers an Alternative

Updated on Sep 13, 2025 at 8:20 am UTC by · 2 mins read

Vitalik Buterin has raised concerns about the dangers of over-relying on AI for governance, citing recent security flaws as proof of its fragility.

Ethereum co-founder Vitalik Buterin warned his followers on X regarding the risks of relying on artificial intelligence (AI) for governance, arguing that current approaches are too easy to exploit.

Buterin’s concerns followed another warning by EdisonWatch co-founder Eito Miyamura, who showed how malicious actors could hijack OpenAI’s new Model Context Protocol (MCP) to access private user data.

The Risks of Naive AI Governance

Miyamura’s test revealed how a simple calendar invite with hidden commands could trick ChatGPT into exposing sensitive emails once the assistant accessed the compromised entry.

Security experts noted that large language models cannot distinguish between genuine instructions and malicious ones, making them highly vulnerable to manipulation.

Buterin said that this flaw is a major red flag for governance systems that place too much trust in AI.

He argued that if such models were used to manage funding or decision-making, attackers could easily bypass safeguards with jailbreak-style prompts, leaving governance processes open to abuse.

Info Finance: A Market-Based Alternative

To address these weaknesses, Buterin has proposed a system he calls “info finance.” Instead of concentrating power in a single AI, this framework allows multiple governance models to compete in an open marketplace.

Anyone can contribute a model, and their decisions can be challenged through random spot checks, with the final word left to human juries.

This approach is designed to ensure resilience by combining diversity of models with human oversight. Also, incentives are built in for both developers and external observers to detect flaws.

Designing Institutions for Robustness

Buterin describes this as an “institution design” method, one where large language models from different contributors can be plugged in, rather than relying on a single centralized system.

He added that this creates real-time diversity, reducing the risk of manipulation and ensuring adaptability as new challenges emerge.

Earlier in August, Buterin criticized the push toward highly autonomous AI agents, saying that increased human control generally improves both quality and safety.

He supports models that allow iterative editing and human feedback rather than those designed to operate independently for long periods.

Share:

Related Articles

Vitalik Buterin, Ethereum OGs to Create a $220M Security Fund from TheDAO

By January 29th, 2026

Ethereum’s early supporters are repurposing $220 million in idle funds from the infamous 2016 DAO hack to establish a comprehensive security fund for the network.

Nansen Brings AI-based Crypto Trading Solution to Solana, Base Networks

By January 21st, 2026

Nansen has launched AI-powered crypto trading that lets users execute trades through conversational prompts.

Vitalik Buterin: Full Return to Decentralized Social Networking in 2026

By January 21st, 2026

Ethereum co-founder Vitalik Buterin plans to move to decentralized social networking in 2026.

Exit mobile version