Anthropic Unveils Collective Constitutional AI Project

On Oct 19, 2023 at 9:45 am UTC · 2 mins read

According to Anthropic, this is the first time the public has been involved in determining the behavior of a language model via an online deliberation process.

Anthropic, a leading artificial intelligence (AI) firm, is pioneering a novel approach to AI development. The approach, known as the ‘Collective Constitutional AI’ project, aims to democratize the behavior of AI systems. It does this by soliciting user values and then incorporating them into the training of a large language model (LLM).

Traditional LLM Training Under Fire

Generative AI tools have come under fire from critics for their responses in specific situations. Although the models are trained to give acceptable responses to human queries, critics suggest that what is acceptable isn’t always useful, and what is useful isn’t always acceptable.

Critics also argue that canned responses take agency away from users, and that morality and values vary across cultures, populations and periods. To bridge this divide, Anthropic launched Constitutional AI in May. Constitutional AI was the company’s attempt to “align general purpose language models to high-level normative principles written into a constitution.”

Much like a constitution lays down the fundamental principles and rules that govern a nation, Constitutional AI provides guidelines that an AI system must adhere to. The model takes its inspiration from the United Nations Universal Declaration of Human Rights and the experience of its developers. Anthropic argues that Constitutional AI responds to these shortcomings by using AI feedback to evaluate outputs.

The Collective Constitutional AI Project

While Constitutional AI builds upon the traditional method of training LLMs, it still leaves developers with extensive influence over the AI output. The Collective Constitutional AI project aims to improve on that by using feedback from people outside Anthropic.

Anthropic collaborated with Polis and the Collective Intelligence Project to conduct a poll among 1,000 American users from diverse demographics. The users answered a series of value-based questions. Thereafter, the responses helped fine-tune the AI model’s value judgments.

According to Anthropic, this is the first time the public has been involved in determining the behavior of a language model via an online deliberation process. Further, it noted the experiment was a scientific success. It also claimed that the results illuminated the challenges and potential solutions for aligning AI models with user values.

“We hope that sharing our very preliminary and imperfect findings will help others interested in democratic inputs to AI to learn from our successes and failures,” it concluded.
