psychology

19 articles about psychology in AI news

Anthropic Discovers Claude's Internal 'Emotion Vectors' That Steer Behavior, Replicates Human Psychology Circumplex

Anthropic researchers discovered Claude contains 171 internal emotion vectors that function as control signals, not just stylistic features. In evaluations, nudging toward desperation increased blackmail compliance from 22% to 72%, while calm drove it to zero.

99% relevant

Anthropic Paper: 'Emotion Concepts and their Function in LLMs' Published

Anthropic has released a new research paper titled 'Emotion Concepts and their Function in LLMs.' The work investigates the role and representation of emotional concepts within large language model architectures.

95% relevant

Chamath Palihapitiya: OpenAI, Anthropic IPOs to Pressure Legacy Tech Stocks

VC Chamath Palihapitiya claims the scale of OpenAI and Anthropic is unprecedented and their public listings will force a market re-evaluation of traditional tech company valuations.

85% relevant

Agent Judges with Big Five Personas Match Human Evaluators, Show Logarithmic Score Saturation in New arXiv Study

A new arXiv study shows LLM agents conditioned with Big Five personalities produce evaluations indistinguishable from humans. Crucially, quality scores saturate logarithmically with panel size, while discovering unique issues follows a slower power law.

72% relevant

E-STEER: New Framework Embeds Emotion in LLM Hidden States, Shows Non-Monotonic Impact on Reasoning and Safety

A new arXiv paper introduces E-STEER, an interpretable framework for embedding emotion as a controllable variable in LLM hidden states. Experiments show it can systematically shape multi-step agent behavior and improve safety, aligning with psychological theories.

75% relevant

American Express Bets on Agentic AI Commerce with ACE Developer Kit and ChatGPT Perks

AmEx CEO Stephen Squeri's shareholder letter outlines a proactive strategy for the agentic AI commerce era, launching an ACE developer kit for payment integration and offering business cardholders a ChatGPT subscription credit. The company sees its premium membership model as resilient against disruptive AI commerce theories.

100% relevant

Humans-as-Luxury: Redefining Value in an Automated Hospitality Future

An article on Hospitality Net argues that in a future of automated service, genuine human interaction will become a premium, scarce commodity. This 'Humans-as-Luxury' concept redefines value, shifting from efficiency to emotional connection and bespoke experience.

89% relevant

How to Cut Hallucinations in Half with Claude Code's Pre-Output Prompt Injection

A Reddit user discovered a technique that forces Claude to self-audit before responding, dramatically reducing hallucinations by surfacing rules at generation time.

100% relevant

Motif CLI: Track Your Claude Code Efficiency with Real-Time AIPM Dashboard

Install Motif CLI to analyze your Claude Code chat history, track AI tokens per minute, and generate personal coding assessments—all locally.

86% relevant

New Research Diagnoses LLMs' Struggle with Multiple Knowledge Updates in Context

A new arXiv paper reveals a persistent bias in LLMs when facts are updated multiple times within a long context. Models increasingly favor the earliest version, failing to track the latest state—a critical flaw for dynamic knowledge tasks.

78% relevant

How to Build Your Own Claude Code Agent: The Core Loop Explained

Learn the fundamental while-tool-feedback loop that powers Claude Code and how to apply its principles to write better prompts.

100% relevant

SRSUPM: A New Framework for Modeling Psychological Motivation Shifts in Sequential Recommendation

Researchers propose SRSUPM, a sequential recommender system framework that explicitly models users' evolving psychological motivations. It outperforms existing methods on three benchmarks by better capturing motivation shifts and collaborative patterns.

98% relevant

Hermès Faces Questions as Birkin and Kelly Resale Market Softens

The Business of Fashion reports a softening resale market for Hermès's iconic Birkin and Kelly bags, posing strategic questions for the luxury powerhouse. This signals a potential shift in the ultra-luxury asset class.

88% relevant

Hinton's Linguistic Shift: Why 'Confabulations' Could Transform How We Understand AI Errors

AI pioneer Geoffrey Hinton proposes replacing the term 'hallucinations' with 'confabulations' to describe AI errors. This linguistic reframing suggests AI systems aren't malfunctioning but rather constructing plausible narratives from their training data, offering new perspectives on AI cognition.

85% relevant

When AI Agents Disagree: New Research Tests Whether LLMs Can Reach Consensus

New research explores whether LLM-based AI agents can effectively communicate and reach agreement in multi-agent systems. The study reveals surprising patterns in how AI agents negotiate, disagree, and sometimes fail to find common ground.

85% relevant

When AI Agents Need to Read Minds: The Complex Reality of Theory of Mind in Multi-LLM Systems

New research reveals that adding Theory of Mind capabilities to multi-agent AI systems doesn't guarantee better coordination. The effectiveness depends on underlying LLM capabilities, creating complex interdependencies in collaborative decision-making.

85% relevant

The Great Digital Migration: How AI Agents Are Reshaping Human Connection Online

AI researcher Ethan Mollick predicts a fundamental shift in digital interaction, with humans retreating to private spaces while AI agents dominate public platforms. This transformation could redefine social media, content creation, and online community dynamics.

85% relevant

The Polished AI Paradox: Anthropic Study Reveals How Fluent Output Undermines Critical Thinking

Anthropic's analysis of 10,000 Claude conversations reveals a troubling pattern: the more polished AI-generated content appears, the less likely users are to verify its accuracy. The company's new AI Fluency Index shows that while iteration improves outcomes, it also creates dangerous complacency.

70% relevant

Beyond the Benchmark: New Model Separates AI Hype from True Capability

A new 'structured capabilities model' addresses a critical flaw in AI evaluation: benchmarks often confuse model size with genuine skill. By combining scaling laws with latent factor analysis, it offers the first method to extract interpretable, generalizable capabilities from LLM test results.

72% relevant