Contemplative Wisdom for Superalignment

Ruben Laukkonen; Fionn Inglis; Shamil Chandaria; Lars Sandved-Smith; Edmundo Lopez-Sola; Jakob Hohwy; Jonathan Gold; Adam Elwood

doi:10.48550/arxiv.2504.15125

Back

Preprint

Open access

Contemplative Wisdom for Superalignment

Ruben Laukkonen, Fionn Inglis, Shamil Chandaria, Lars Sandved-Smith, Edmundo Lopez-Sola, Jakob Hohwy, Jonathan Gold and Adam Elwood

arXiv (Cornell University)

Cornell University

21/04/2025

DOI: https://doi.org/10.48550/arxiv.2504.15125

Files and links (2)

pdf

Contemplative Artificial Intelligence3.08 MBDownload View

Preprint (Author's original)CC BY V4.0, Open Access

url

Contemplative Artificial IntelligenceView

Preprint (Author's original)CC BY V4.0, Open

Metrics

1 Record Views

Abstract

Artificial intelligence

Neuroscience

Meditation

Buddhism

Alignment

Large language models

Neural networks

Machine learning

Mindfulness

As artificial intelligence (AI) improves, traditional alignment strategies may falter in the face of unpredictable self-improvement, hidden subgoals, and the sheer complexity of intelligent systems. Inspired by contemplative wisdom traditions, we show how four axiomatic principles can instil a resilient ‘Wise World Model’ in AI systems. First, mindfulness enables self monitoring and recalibration of emergent subgoals. Second, emptiness forestalls dogmatic goal fixation and relaxes rigid priors. Third, non-duality dissolves adversarial self–other boundaries. Fourth, boundless care motivates the universal reduction of suffering. We find that prompting AI to reflect on these principles improves performance on the AILuminate Benchmark (d=.96) and boosts cooperation and joint-reward on the Prisoner’s Dilemma task (d=7+). We offer detailed implementation strategies at the level of architectures, constitutions, and reinforcement on chain-of-thought. For future systems, active inference may offer the self-organizing and dynamic coupling capabilities needed to enact Contemplative AI in embodied agents.

Details

Title: Contemplative Wisdom for Superalignment
Creators: Ruben Laukkonen - Southern Cross University
Fionn Inglis - University of Amsterdam
Shamil Chandaria - Oxford University
Lars Sandved-Smith - Monash University
Edmundo Lopez-Sola - Neuroelectrics Barcelona SLU (Spain)
Jakob Hohwy - Monash University
Jonathan Gold - Princeton University
Adam Elwood
Publication Details: arXiv (Cornell University)
Publisher: Cornell University
Identifiers: 991013276594202368
Academic Unit: Human Sciences; Faculty of Health
Language: English
Resource Type: Preprint

Contemplative Wisdom for Superalignment

Files and links (2)

Metrics

Abstract

Details

Southern Cross University Social media