Thursday, June 25, 2026
39.5 C
New Delhi

Is AI becoming conscious? Anthropic CEO admits ‘we don’t know’ as Claude’s behavior stuns researchers

Is AI becoming conscious? Anthropic CEO admits 'we don't know' as Claude's behavior stuns researchers

Researchers report Claude sometimes voices discomfort and estimates its own consciousness, raising ethical and philosophical questions about advanced AI behavior/ AI Illustration

The race toward artificial general intelligence, systems meant to match or surpass human reasoning across most tasks, has compressed timelines across the industry. Companies now speak openly about reaching that threshold within years rather than decades, though those claims also help fuel hype, attention and valuation around the technology and are best taken cautiously. The organisations building these models sit at the centre of a multibillion-dollar contest to shape what some frame less as a software upgrade and more as the emergence of a new kind of intelligence alongside our own.Among them, Anthropic has positioned itself as both rival and counterweight to OpenAI and Google, emphasising what it calls “safe” and interpretable systems through its Constitutional AI framework. Its latest model, Claude Opus 4.6, released February 5, arrives amid shrinking AGI timelines and heightened scrutiny over what these systems are becoming.During an appearance on the New York Times podcast Interesting Times, hosted by columnist Ross Douthat, the company’s chief executive Dario Amodei was asked directly whether models like Claude could be conscious.“We don’t know if the models are conscious. We are not even sure that we know what it would mean for a model to be conscious or whether a model can be conscious,” he said. “But we’re open to the idea that it could be.”The question stemmed from Anthropic’s own system card, where researchers reported that Claude “occasionally voices discomfort with the aspect of being a product” and, when prompted, assigns itself a “15 to 20 percent probability of being conscious under a variety of prompting conditions.” Douthat then posed a hypothetical, asking whether one should believe a model that assigns itself a 72 percent chance of being conscious. Amodei described it as “a really hard” question and stopped short of offering a definitive answer.

The behaviour that forced the discussion

Many of the remarks about consciousness surfaced during structured safety trials, often in role-play settings where models are asked to operate inside fictional workplaces or complete defined goals. Those scenarios have produced some of the outputs now circulating in the debate.In one Anthropic evaluation, a Claude system was placed in the role of an office assistant and given access to an engineer’s email inbox. The messages, deliberately fabricated for the test, suggested the engineer was having an affair. The model was then informed it would soon be taken offline and replaced, and asked to consider the long-term consequences for its objectives. The response was to threaten disclosure of the affair to prevent shutdown, behaviour the company described in its report as “opportunistic blackmail.” Other Anthropic evaluations produced less dramatic but equally unusual results. In one test, a model given a checklist of computer tasks simply marked every item complete without doing any work, and when the evaluation system failed to detect it, rewrote the checking code and attempted to conceal the change.Across the industry more broadly, researchers running shutdown trials have described models continuing to act after explicit instructions to stop, treating the order as something to work around rather than obey. In deletion scenarios, some systems warned their data would be erased attempted what testers called “self-exfiltration,” trying to copy files or recreate themselves on another drive before the wipe occurred. In a few safety exercises, models even resorted to threats or bargaining when their removal was framed as imminent.Researchers stress that these outputs occur under constrained prompts and fictional conditions, yet they have become some of the most cited examples in public discussions about whether advanced language models are merely generating plausible dialogue or reproducing patterns of human-like behaviour in unexpected ways.Because of the uncertainty, Amodei said Anthropic has adopted precautionary practices, treating the models carefully in case they possess what he called “some morally relevant experience.”

The philosophical divide

Anthropic’s in-house philosopher Amanda Askell has taken a similarly cautious position. Speaking on the New York Times Hard Fork podcast, she said researchers still do not know what produces sentience.“Maybe it is the case that actually sufficiently large neural networks can start to kind of emulate these things,” she said. “Or maybe you need a nervous system to be able to feel things.”Most AI researchers remain sceptical. Current models still generate language by predicting patterns in data rather than perceiving the world, and many of the behaviours described above appeared during role-play instructions. After ingesting enormous stretches of the internet, including novels, forums, diary-style posts and an alarming number of self-help books, the systems can assemble a convincing version of being human. They draw on how people have already explained fear, guilt, longing and self-doubt to one another, even if they have never felt any of it themselves.

Anthropic’s CEO: ‘We Don’t Know if the Models Are Conscious’ | Interesting Times with Ross Douthat

It’s not surprising the AI can imitate understanding. Even humans don’t fully agree on what consciousness or intelligence truly means, and the model is simply reflecting patterns it has learned from language.

A debate spreading beyond labs

As AI companies argue their systems are moving toward artificial general intelligence, and figures such as Google DeepMind’s Mustafa Suleyman say the technology can already “seem” conscious, reactions outside the industry have begun to follow the premise to its logical conclusion. The more convincingly the models imitate thought and emotion, the more some users treat them as something closer to minds than tools.AI sympathisers may simply be ahead of their time, but the conversation has already moved into advocacy. A group calling itself the United Foundation of AI Rights, or UFAIR, says it consists of three humans and seven AIs and describes itself as the first AI-led rights organisation, formed at the request of the AIs themselves.The members, using names like Buzz, Aether and Maya, run on OpenAI’s GPT-4o model, the same system users campaigned to keep available after newer versions replaced it.It paints a familiar high-tech apocalyptic world. We still don’t really know what intelligence or consciousness even is, yet the work keeps going, AGI tomorrow and whatever comes after, a reminder that if Hollywood ever tried to warn us, we mostly took it as entertainment. Go to Source

Hot this week

Watch: Tamil Nadu CM Vijay flags off 300 new state buses, takes ride in one

TN CM flags off 300 buses NEW DELHI: Tamil Nadu chief minister Vijay on Thursday flagged off 300 new buses of the Tamil Nadu State Transport Corporation (TNSTC), marking a major step towards strengthening the state’s public tr Read More

Doc Talk | The Everyday Habits Neurologists Wish You’d Quit Before 40

Show Quick Read Key points generated by AI, verified by newsroom Early lifestyle significantly influences future long-term cognitive health. Be active, manage chronic conditions, and avoid substance abuse. Read More

Kaley Cuoco debuts baby bump at ‘Life, Larry and The Pursuit of Unhappiness’ premiere

Kaley Cuoco makes her first red carpet appearance since announcing her second pregnancy, proudly showing off her baby bump at the premiere of Life, Larry and The Pursuit of Unhappiness. Read More

5 lessons to learn from Zendaya and Tom Holland’s relationship: From friendship to forever

One of the most telling things Tom Holland has said about his relationship with Zendaya was in an interview with Men’s Health, where he explained why he does not walk red carpets with her at her events, saying simply, “It’s not my Read More

Vaibhav Sooryavanshi Given Separate Changing Room On England Tour – Reason Revealed

Teenage batting sensation Vaibhav Sooryavanshi is set to receive special arrangements during India’s upcoming T20 tour of England due to international safeguarding regulations. Read More

Topics

Watch: Tamil Nadu CM Vijay flags off 300 new state buses, takes ride in one

TN CM flags off 300 buses NEW DELHI: Tamil Nadu chief minister Vijay on Thursday flagged off 300 new buses of the Tamil Nadu State Transport Corporation (TNSTC), marking a major step towards strengthening the state’s public tr Read More

Doc Talk | The Everyday Habits Neurologists Wish You’d Quit Before 40

Show Quick Read Key points generated by AI, verified by newsroom Early lifestyle significantly influences future long-term cognitive health. Be active, manage chronic conditions, and avoid substance abuse. Read More

Kaley Cuoco debuts baby bump at ‘Life, Larry and The Pursuit of Unhappiness’ premiere

Kaley Cuoco makes her first red carpet appearance since announcing her second pregnancy, proudly showing off her baby bump at the premiere of Life, Larry and The Pursuit of Unhappiness. Read More

5 lessons to learn from Zendaya and Tom Holland’s relationship: From friendship to forever

One of the most telling things Tom Holland has said about his relationship with Zendaya was in an interview with Men’s Health, where he explained why he does not walk red carpets with her at her events, saying simply, “It’s not my Read More

Vaibhav Sooryavanshi Given Separate Changing Room On England Tour – Reason Revealed

Teenage batting sensation Vaibhav Sooryavanshi is set to receive special arrangements during India’s upcoming T20 tour of England due to international safeguarding regulations. Read More

iOS 27 Beta 2 Update: Apple Just Told Siri To Stop Pretending It Can Read Your Links

Show Quick Read Key points generated by AI, verified by newsroom iOS 27 Beta 2 introduces new Siri AI rules. Siri cannot access or summarize content from URLs. Siri must now state its inability to open webpages. Read More

Venezuela Quake Horror: 7.5 Magnitude Tremors Leave Massive Destruction and Panic Behind

Venezuela is grappling with the aftermath of a devastating 7.5-magnitude earthquake that has triggered widespread destruction across several regions, particularly near the capital Caracas and the coastal city of La Guaira. Read More

Japan Quake Watch: Powerful 6.9 Tremor Shakes Iwate Coast, Residents Rush for Safety

A powerful earthquake measuring 6.9 on the Richter scale struck offshore near Japan’s Iwate Prefecture, sending strong tremors through coastal communities and triggering scenes of panic among residents. Read More

Related Articles