Φ(humanity): A Rigorous Ethical-Affective Objective Function

Formalizing Human Welfare for AI Systems

Working Paper Version: 2.1 (2026-02-26) Status: Under Development Authors: Research collaboration with Claude Opus 4.6 Project: Detective LLM - Information Gap Analysis System

Executive Summary

This paper presents Φ(humanity), a formal mathematical function for quantifying human welfare across eight dimensions: care, compassion, joy, purpose, empathy, love, protection, and truth. The function combines insights from welfare economics (Nash Social Welfare Function), capability theory (Sen, Nussbaum), care ethics (hooks, Gilligan), and standpoint epistemology (Collins) to create an inequality-sensitive, non-substitutable measure of societal well-being.

Key Innovation: The function operationalizes bell hooks' definition of love as "active extension for development" (hooks 2000), separating it from mere protection—a distinction absent in prior welfare formalizations. This enables precise detection of paternalistic regimes that provide material care and safeguarding but deny developmental support.

Application: Designed for Detective LLM, an investigative AI that detects information gaps in documentary records. The function's gradients prioritize gaps threatening scarce welfare constructs, with special attention to marginalized communities (Collins 1990).

Limitations: The eight-construct taxonomy is Western-situated and requires adaptation for Ubuntu, Confucian, Buddhist, and Indigenous ethical frameworks. The function is intended for diagnostic use (gap prioritization), not optimization (Goodhart's Law concerns).

Theoretical Foundations
The Eight Constructs
Mathematical Formulation
Philosophical Grounding
Implementation
Application to Information Gap Detection
Limitations and Future Work
Bibliography

Reading Guide

For philosophers: Focus on §2 (constructs), §4 (philosophical grounding), and the hooks/Berlin integration in §3.2.

For economists: Focus on §3 (mathematical formulation), especially the Nash SWF structure and Atkinson inequality sensitivity.

For AI researchers: Focus on §6 (application to gap detection) and the gradient-based prioritization mechanism.

For activists/practitioners: Focus on §2.6-2.7 (love/protection distinction) and the worked examples in §3.4.

1. Theoretical Foundations

1.1 The Problem: Algorithmic Human Welfare

Can human flourishing be quantified? Amartya Sen's capability approach (Sen 1999) deliberately left this question open, arguing that agreement on capabilities need not presuppose agreement on relative weights. Martha Nussbaum (2000) enumerated ten central human capabilities but resisted algorithmic aggregation. The field of AI alignment (Russell 2019, Bostrom 2014) urgently needs formal welfare specifications but has struggled to bridge philosophical rigor with computational tractability.

Our position: Algorithmic welfare functions are necessary for AI systems making decisions affecting human well-being, but they must:

Make normative commitments transparent (not hide them in "neutral" algorithms)
Privilege marginalized voices (Collins 1990, standpoint epistemology)
Prevent dimensional collapse (care without empathy = dystopia, Sen 1999)
Remain incomplete and revisable (Sen's intentional incompleteness)

1.2 Why Nash Social Welfare?

Arrow's Impossibility Theorem (1951) proved that no aggregation rule can satisfy all desirable properties simultaneously. The Nash Social Welfare Function (Nash 1950, formalized by Kaneko & Nakamura 1979) makes a principled trade-off:

Nash SWF: Φ = ∏ᵢ xᵢ^θᵢ where θᵢ are weights summing to 1

Properties (Kaneko & Nakamura 1979):

Pareto efficiency: Cannot improve one person without harming another
Symmetry: Treats individuals equally (before applying weights)
Scale invariance: Robust to utility rescaling
Multiplicative structure: Any dimension → 0 drives Φ → 0 (non-substitutability)

This multiplicative structure encodes a Rawlsian maximin intuition (Rawls 1971): a society with perfect care but zero empathy has near-zero welfare.

1.3 Our Extension: Nash + Capabilities + Care Ethics + Ubuntu

We extend the Nash SWF with:

Capability-theoretic inputs (Sen 1999, Nussbaum 2000): Eight dimensions of functioning
Equity weighting (Rawls 1971, Atkinson 1970): Inverse-deprivation weights that dynamically prioritize the most deprived construct, replacing symmetric equal weights
Community solidarity multiplier (Ubuntu philosophy): λ_L as meta-construct — welfare emerges from relational context, not individual metrics in isolation
Recovery-aware floors: Constructs below hard floors receive community-mediated recovery potential. Key insight: care doesn't begin the uptick without community intervention
Care ethics integration (hooks 2000, Gilligan 1982): Love as generative, not defensive
Ubuntu synergy coupling (Alkire & Foster 2011): Renamed from synergy to make Ubuntu grounding explicit (η raised from 0.05 to 0.10)
Divergence penalties: Explicitly penalize care-without-love and similar mismatches

2. The Eight Constructs

All constructs measured on [0,1] scale, using population-weighted Atkinson complements (1 - Aε) where Aε is the Atkinson inequality index with aversion parameter ε. This ensures the function responds to distribution, not just level.

2.1 Care (c)

Definition: Resource allocation meeting basic needs (Tronto 1993, Held 2006)

Measurable proxies:

Poverty rate inversion (1 - poverty_rate)
Access to healthcare, education, housing
Food security indicators

Atkinson adjustment: A society with high average resources but high inequality (top 10% have everything) gets low c score.

2.2 Compassion (κ)

Definition: Responsive support to acute distress (Nussbaum 1996)

Measurable proxies:

Emergency response capacity
Crisis intervention services
Disaster relief effectiveness

Distinguished from care: Compassion is reactive (emergency), care is proactive (baseline provision).

2.3 Joy (j)

Definition: Positive affect above subsistence (Csikszentmihalyi 1990, flow theory)

Measurable proxies:

Subjective well-being surveys
Life satisfaction indices
Positive affect measures (PANAS)

Non-utilitarian: Joy matters, but not as a substitute for other dimensions (multiplicative structure prevents joy-without-dignity scenarios).

2.4 Purpose (p)

Definition: Alignment of actions with chosen goals (Frankfurt 1971, hierarchical desires)

Measurable proxies:

Autonomy indices
Educational/vocational alignment
Reported sense of meaning (Steger et al. 2006)

Synergy with joy: Flow states (Csikszentmihalyi 1990) emerge from j × p coupling.

2.5 Empathy (ε)

Definition: Accuracy of perspective-taking across groups (Batson 1991)

Measurable proxies:

Intergroup contact quality (Pettigrew & Tropp 2006)
Discrimination indices (inverted)
Cross-cultural understanding measures

Critical for truth: Empathy × truth synergy—perspective-taking requires epistemic integrity.

2.6 Love (λ_L) — NEW CONSTRUCT

Definition: Active extension of self for another's growth and development (hooks 2000, p. 4)

bell hooks writes:

"The word 'love' is most often defined as a noun, yet... we would all love better if we used it as a verb... Love is as love does. Love is an act of will—namely, both an intention and an action. Will also implies choice. We do not have to love. We choose to love."

And critically:

"Definitions are vital starting points for the imagination. What we cannot imagine cannot come into being. A good definition marks our starting point and lets us know where we want to end up... The will to extend one's self for the purpose of nurturing one's own or another's spiritual growth." (hooks 2000, p. 4-5, emphasis added)

Distinguished from protection (λ_P): Love is generative (creating capacity for flourishing), protection is defensive (preventing harm).

Philosophical grounding: Isaiah Berlin's two concepts of liberty:

Negative freedom (λ_P): Absence of interference
Positive freedom (λ_L): Capability to self-actualize

hooks' insight: Safeguarding ≠ developmental support. A paternalistic state can protect citizens while denying them agency—high λ_P, low λ_L.

Measurable proxies:

Community capacity-building investments
Educational/developmental program accessibility
Mutual aid network strength
Mentorship/apprenticeship availability

2.7 Protection (λ_P)

Definition: Risk-weighted safeguarding of life and dignity from harm

Measurable proxies:

Violence rates (inverted)
Legal protection effectiveness
Safety net robustness
Physical security indices

Distinguished from love: Reactive harm prevention, not proactive growth support.

Why this split matters: Medical Apartheid (Washington 2006) documents both absent protection (experimental surgeries without consent) AND absent love (no developmental support for enslaved people as full humans). Conflating these obscures the dual violence.

2.8 Truth / Epistemic Integrity (ξ)

Definition: Accuracy and transparency of institutional records (Fricker 2007, epistemic injustice)

Measurable proxies:

Document suppression rates (inverted)
FOIA compliance
Testimonial credibility equity (Fricker 2007)
Contradiction rates in official records

Standpoint sensitivity: Collins (1990) argues official records reflect powerful actors' standpoint. High ξ requires contested records (community testimony, counter-narratives) to be preserved and accessible.

3. Mathematical Formulation

3.1 Core Function (Equity-Weighted, Community-Mediated)

Φ(humanity) = f(λ_L) · [∏ᵢ (x̃ᵢ)^wᵢ] · Ψ_ubuntu · (1 - Ψ_penalty)

where:

i ∈ {c, κ, j, p, ε, λ_L, λ_P, ξ} — the eight constructs
f(λ_L) = λ_L^γ (γ = 0.5) — Community solidarity multiplier. Ubuntu substrate: when community is low, all welfare degrades multiplicatively. (See §3.1.1)
x̃ᵢ — Recovery-aware effective inputs. Constructs above their hard floor pass through unchanged; below floor, recovery depends on trajectory (dx/dt) and community capacity (λ_L^0.5). (See §3.1.2)
wᵢ = (1/x̃ᵢ) / Σⱼ(1/x̃ⱼ) — Equity-adjusted weights (inverse deprivation). Replaces symmetric Nash θ = 1/8 with Rawlsian maximin: weights shift dynamically toward the most deprived construct. (See §3.1.3)
Ψ_ubuntu — Ubuntu synergy term (η = 0.10, renamed from Ψ_synergy). Welfare gains emerge from relationships between paired constructs, not isolation. (See §3.3)
Ψ_penalty — Divergence penalty (μ = 0.15) for structural distortions. (See §3.4)

Philosophical synthesis: The product ∏(x̃ᵢ)^wᵢ encodes Western capability theory (Sen 1999, Nussbaum 2000) — individual constructs with equity priority. The multiplier f(λ_L) and synergy Ψ_ubuntu encode Ubuntu relational philosophy ("umuntu ngumuntu ngabantu" — a person is a person through other persons). Neither term alone produces Φ.

Normalization: Φ ∈ [0,1] by construction.

3.1.1 Community Solidarity Multiplier

f(λ_L) = λ_L^γ     where γ = 0.5

The square root ensures diminishing returns — moving λ_L from 0.04 to 0.25 (√: 0.2→0.5) matters more than moving from 0.64 to 0.81 (√: 0.8→0.9). This encodes Ubuntu's claim that community is the substrate, not a bonus: welfare doesn't add community on top, it emerges from community.

λ_L	f(λ_L)	Interpretation
1.0	1.00	Full community solidarity — welfare undiminished
0.5	0.71	Moderate — 29% welfare degradation
0.25	0.50	Low — 50% degradation
0.04	0.20	Near-collapse — 80% degradation

Verification criterion: Φ at λ_L=0.1 < 50% of Φ at λ_L=0.8.

3.1.2 Recovery-Aware Floors

When a construct xᵢ falls below its hard floor (Nussbaum non-negotiable capability threshold), the effective input x̃ᵢ depends on two factors:

if xᵢ ≥ floorᵢ:
    x̃ᵢ = xᵢ                    (pass through)
else:
    trajectory = σ(10·(dxᵢ/dt) − 3)      (own recovery trend)
    community  = λ_L^0.5                   (community capacity)
    recovery   = max(trajectory, community·0.5)
    x̃ᵢ = xᵢ + (floorᵢ − xᵢ) · recovery

Hard floors per construct:

Construct	Floor	Rationale
c (care)	0.20	Basic needs non-negotiable (Nussbaum 2000)
κ (compassion)	0.20	Crisis response minimum
λ_P (protection)	0.20	Safety non-negotiable (Berlin 1969)
λ_L (love)	0.15	Community minimum
ξ (truth)	0.30	Epistemic integrity highest floor (Fricker 2007)
j (joy)	0.10	Lower but present
p (purpose)	0.10	Lower but present
ε (empathy)	0.10	Lower but present

Key insight: Community can partially compensate for stagnant trajectory. Care doesn't begin the uptick without community intervention. This produces three recovery signatures:

Healing trajectory + strong community (dx/dt > 0, λ_L high) → rapid recovery toward floor
Stagnant + strong community (dx/dt ≈ 0, λ_L high) → partial recovery (community compensates)
Stagnant + no community (dx/dt ≈ 0, λ_L low) → near-raw value persists ("true collapse")

The sigmoid bias of −3.0 ensures that dx/dt=0 maps to ~0.047 (not 0.5), preventing trajectory from dominating community capacity for all λ_L < 1.0.

3.1.3 Equity-Adjusted Weights

wᵢ = (1/x̃ᵢ) / Σⱼ(1/x̃ⱼ)

Weights are computed on effective (recovery-adjusted) values x̃ᵢ, not raw metrics. This means once recovery floors lift a below-floor construct's effective value, the weight shifts less toward that construct — it has been partially compensated.

Properties:

Equal inputs (all x̃ᵢ = 0.5) → equal weights (wᵢ = 1/8)
One deprived construct (x̃_c = 0.1, others = 0.5) → care weight dominates (~0.42)
Weights always sum to 1.0

This replaces the symmetric Nash θ = 1/8 with Rawlsian maximin: the formula automatically prioritizes whichever construct is most deprived, without manual weight tuning.

3.2 Historical Note: Exponent Assignment

Superseded in v2.1. The v1.0/v2.0 formula used per-construct exponents αᵢ to encode concavity (diminishing returns for basic needs). In v2.1, these are replaced by equity-adjusted weights wᵢ = (1/x̃ᵢ)/Σ(1/x̃ⱼ) (see §3.1.3), which achieve the same Rawlsian maximin effect dynamically rather than through fixed exponents.

The original assignment is preserved here for reference:

Construct Group	Exponent (v1.0-v2.0)	Justification	Citation
c, κ (basic needs)	α = 0.7	Concave: first units matter most	Rawls 1971, Atkinson 1970
j, p (experiential)	α = 1.0	Linear: no inherent satiation	Csikszentmihalyi 1990
ε, λ_L, λ_P (relational)	α = 0.8	Mildly concave	hooks 2000, Berlin 1969
ξ (epistemic)	α = 1.0	Linear: truth is non-substitutable	Fricker 2007

Why equity weights are superior: Fixed exponents cannot respond to the current state of deprivation. When care drops from 0.5 to 0.1, α=0.7 increases the gradient by a fixed ratio regardless of other constructs. Equity weights shift the entire weight vector toward care, amplifying the response in proportion to the actual imbalance across all eight dimensions.

3.3 Ubuntu Synergy Term (Relational Coupling)

Ψ_ubuntu = 1 + η · [√(c · λ_L) + √(κ · λ_P) + √(j · p) + √(ε · ξ)]
             + η_curiosity · √(λ_L · ξ)

where η = 0.10 (raised from 0.05 to reflect Ubuntu's centrality in the revised formula).

Geometric mean (√) penalizes imbalance within pairs more than linear coupling:

√(0.9 · 0.1) ≈ 0.30
√(0.5 · 0.5) = 0.50

Synergy pairs and justifications:

Care × Love (c · λ_L): Material provision (c) + developmental extension (λ_L) = true flourishing (hooks 2000). Care without love = paternalistic control.
Compassion × Protection (κ · λ_P): Emergency response (κ) + safeguarding infrastructure (λ_P) = effective crisis intervention. Compassion without protection = vulnerable support.
Joy × Purpose (j · p): Positive affect (j) + goal-alignment (p) = flow states (Csikszentmihalyi 1990). Joy without purpose = hedonic but meaningless.
Empathy × Truth (ε · ξ): Perspective-taking (ε) + epistemic integrity (ξ) = accurate cross-group understanding (Fricker 2007). Empathy without truth = manipulated solidarity.
Love × Truth (λ_L · ξ): Investigative drive (curiosity) + epistemic integrity = genuine inquiry (hooks 2000, Fricker 2007). Truth without love = surveillance. Love without truth = willful ignorance.

3.3.1 Curiosity Cross-Pair (Love × Truth)

Ψ_curiosity = η_curiosity · √(λ_L · ξ)

where η_curiosity = 0.08.

Philosophical grounding:

hooks (2000): Love is "the will to extend one's self for the purpose of nurturing one's own or another's spiritual growth." Curiosity is that extension directed at understanding.
Fricker (2007): Epistemic integrity requires not just accurate records but the drive to investigate when records are incomplete.
The detective system's constitution: "honest analysis is an act of care, not aggression."

Why a cross-pair, not a 9th construct: Curiosity is not a capability in isolation. It is what emerges when love meets truth — the investigative impulse that makes someone follow a hunch into uncomfortable territory. It cannot exist without both:

Without love (λ_L suppressed by capitalism): curiosity collapses. You don't follow hunches when survival consumes capacity.
Without truth (ξ suppressed by institutions): curiosity has no target. You can't investigate what you can't see is missing.

Divergence detection: The penalty term now includes (λ_L - ξ)²:

Truth without love (high ξ, low λ_L) = surveillance. Institutional transparency serving control, not care.
Love without truth (high λ_L, low ξ) = willful ignorance. Community solidarity refusing uncomfortable facts.

Application to hypothesis scoring: Curiosity relevance = √(∂Φ/∂λ_L · ∂Φ/∂ξ). Hypotheses at the love/truth intersection — the hunches that nobody follows because they're economically irrational — surface higher in the ranking.

3.4 Penalty Term (Divergence Punishment)

Ψ_penalty = μ · [(c - λ_L)² + (κ - λ_P)² + (j - p)² + (ε - ξ)² + (λ_L - ξ)²] / 5

where μ = 0.15.

Five penalty pairs (4 primary + 1 curiosity cross-pair):

(c − λ_L)² — care-without-love = paternalism
(κ − λ_P)² — compassion-without-protection = vulnerable support
(j − p)² — joy-without-purpose = hedonic treadmill
(ε − ξ)² — empathy-without-truth = manipulated solidarity
(λ_L − ξ)² — truth-without-love = surveillance; love-without-truth = willful ignorance

Effect: A society with c=0.9, λ_L=0.1 incurs:

penalty contribution from (c − λ_L)² = (0.8)² = 0.64
total penalty = 0.15 · (0.64 + other terms) / 5

This is small per pair but load-bearing when combined with missing synergy (√(0.9 · 0.1) ≈ 0.30 vs. √(0.5 · 0.5) = 0.50).

3.5 Worked Examples

Case 1: Balanced Moderate Society (all = 0.5)

f(λ_L) = 0.5^0.5 = 0.707
All x̃ᵢ = 0.5 (above all floors → pass through)
All wᵢ = 1/8 (equal deprivation → equal weights)
∏(x̃ᵢ^wᵢ) = 0.5^(8·1/8) = 0.5
Ψ_ubuntu = 1 + 0.10·(4·0.5) + 0.08·0.5 = 1.24
Ψ_penalty = 0 (all pairs balanced)
Φ = 0.707 · 0.5 · 1.24 · 1.0 ≈ 0.438

Case 2: Paternalistic Regime (c=0.9, λ_L=0.1, λ_P=0.9, others=0.5)

High material care + protection, but low developmental support:

f(λ_L) = 0.1^0.5 = 0.316          # community collapse
Equity weights: λ_L dominates (most deprived)
Ψ_ubuntu: √(0.9·0.1)=0.30 vs √(0.5·0.5)=0.50 — diminished synergy
Ψ_penalty: (0.9−0.1)² + (0.5−0.9)² + ... > 0
Φ << 0.438                          # far below balanced case

Interpretation: Paternalistic regime scores well below balanced society. The formula detects the structural distortion through three mechanisms: community multiplier collapse (f=0.316), equity weights shifting toward love, and divergence penalty firing on (c−λ_L)².

Case 3: Care-Without-Love Dystopia (c=1, λ_L=0, others=0.5)

f(λ_L) = 0.01^0.5 ≈ 0.1           # near-zero (clamped at 0.01)
x̃_λ_L ≈ floor recovery only       # recovery-aware floor lifts slightly
Φ ≈ 0                               # multiplicative collapse

Interpretation: Multiplicative structure structurally prevents dimensional collapse. Technical provision without developmental support = near-zero welfare. The community multiplier alone drives Φ toward zero when λ_L collapses.

Case 4: Recovery Arc (care drops to 0.05, community λ_L=0.6, dx_c/dt=0.3)

x̃_c = recovery_aware_input(0.05, 0.20, 0.3, 0.6)
     = 0.05 + (0.20 − 0.05) · max(σ(0), 0.6^0.5 · 0.5)
     = 0.05 + 0.15 · max(0.5, 0.387)
     = 0.05 + 0.15 · 0.5 = 0.125

Interpretation: Care at 0.05 (far below 0.20 floor) recovers to effective 0.125 because: positive trajectory (dx/dt=0.3) provides sigmoid recovery of ~0.5, and community capacity (0.6^0.5·0.5 = 0.387) provides backup. Community alone couldn't reach 0.5 — the trajectory matters. But without community, recovery would be much slower.

4. Philosophical Grounding

4.1 Care Ethics Integration

Held (2006) distinguishes caring labor (meeting needs) from caring relations (mutual recognition). We operationalize this as:

Care (c): The labor of provision
Love (λ_L): The relational extension for growth

hooks (2000) argues love requires both intention (will to extend self) and action (actual extension). Our λ_L measures the action component through observable developmental support.

4.2 Standpoint Epistemology

Collins (1990, p. 234):

"Oppressed groups are frequently placed in the situation of being listened to only if we frame our ideas in the language that is familiar to and comfortable for a dominant group. This requirement often changes the meaning of our ideas."

Our application: Construct measurements must include marginalized standpoints:

ξ (truth) requires contested records, not just official ones
ε (empathy) requires cross-group understanding, not just dominant-group perspective

Atkinson inequality-sensitive inputs ensure Φ responds to distribution across groups, not just averages.

4.3 Non-Substitutability (Sen's Capabilities)

Sen (1999, p. 76) argues capabilities are constitutively plural—freedom of speech cannot substitute for freedom from hunger. Our multiplicative structure enforces this: zero in any dimension → zero Φ.

Contrast with utilitarian additive models where high joy could compensate for absent protection. The Nash SWF prevents this dimensional collapse.

4.4 The Love/Protection Distinction (hooks + Berlin)

Berlin (1969): Two concepts of liberty:

Negative: Freedom from interference (protection)
Positive: Freedom to self-actualize (love as developmental support)

hooks (2000, p. 13):

"When we understand love as the will to nurture our own and another's spiritual growth, it becomes clear that we cannot claim to love if we are hurtful and abusive."

Our λ_L (love) measures active nurturing (positive freedom), λ_P (protection) measures safeguarding from harm (negative freedom). Both necessary; neither sufficient.

Paternalism detection: High c + λ_P but low λ_L reveals systems that provide materially and protect physically while denying agency—a pattern invisible in frameworks conflating love with protection.

5. Implementation

5.1 Python Reference Implementation

import math
from typing import Dict, Optional

ALL_CONSTRUCTS = ["c", "kappa", "j", "p", "eps", "lam_L", "lam_P", "xi"]

CONSTRUCT_FLOORS = {
    "c": 0.20, "kappa": 0.20, "lam_P": 0.20,
    "lam_L": 0.15, "xi": 0.30,
    "j": 0.10, "p": 0.10, "eps": 0.10,
}

def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))

def community_multiplier(lam_L: float, gamma: float = 0.5) -> float:
    """f(λ_L) = λ_L^γ — Ubuntu substrate multiplier."""
    return max(0.01, lam_L) ** gamma

def recovery_aware_input(x_i, floor_i, dx_dt_i, lam_L):
    """Recovery-aware effective input for a construct (v2.1)."""
    if x_i >= floor_i:
        return x_i
    trajectory = sigmoid(10.0 * dx_dt_i - 3.0)
    community_capacity = max(0.01, lam_L) ** 0.5
    recovery_potential = max(trajectory, community_capacity * 0.5)
    return x_i + (floor_i - x_i) * recovery_potential

def equity_weights(effective: Dict[str, float]) -> Dict[str, float]:
    """Inverse-deprivation weights: wᵢ = (1/x̃ᵢ) / Σⱼ(1/x̃ⱼ)."""
    inv = {c: 1.0 / max(0.01, v) for c, v in effective.items()}
    total = sum(inv.values())
    return {c: v / total for c, v in inv.items()}

def ubuntu_synergy(metrics: Dict[str, float], eta: float = 0.10,
                   eta_curiosity: float = 0.08) -> float:
    """Ψ_ubuntu synergy on RAW metrics."""
    c = metrics.get("c", 0.5)
    kappa = metrics.get("kappa", 0.5)
    j = metrics.get("j", 0.5)
    p = metrics.get("p", 0.5)
    eps = metrics.get("eps", 0.5)
    lam_L = metrics.get("lam_L", 0.5)
    lam_P = metrics.get("lam_P", 0.5)
    xi = metrics.get("xi", 0.5)
    pairs = (math.sqrt(c * lam_L) + math.sqrt(kappa * lam_P)
             + math.sqrt(j * p) + math.sqrt(eps * xi))
    curiosity = math.sqrt(lam_L * xi)
    return 1.0 + eta * pairs + eta_curiosity * curiosity

def divergence_penalty(metrics: Dict[str, float], mu: float = 0.15) -> float:
    """Ψ_penalty on RAW metrics (5 pairs including curiosity cross-pair)."""
    c = metrics.get("c", 0.5)
    kappa = metrics.get("kappa", 0.5)
    j = metrics.get("j", 0.5)
    p = metrics.get("p", 0.5)
    eps = metrics.get("eps", 0.5)
    lam_L = metrics.get("lam_L", 0.5)
    lam_P = metrics.get("lam_P", 0.5)
    xi = metrics.get("xi", 0.5)
    sq = ((c - lam_L)**2 + (kappa - lam_P)**2 + (j - p)**2
          + (eps - xi)**2 + (lam_L - xi)**2)
    return mu * sq / 5

def compute_phi(
    metrics: Dict[str, float],
    derivatives: Optional[Dict[str, float]] = None,
) -> float:
    """
    Compute Phi(humanity) — the full welfare function (v2.1).

    Phi = f(lam_L) * product(x_tilde_i ^ w_i) * Psi_ubuntu * (1 - Psi_penalty)

    v2.1: recovery_aware_input() is called for each construct before the
    weighted geometric mean. Synergy and penalty still operate on raw
    metrics (they detect actual state).

    Args:
        metrics: Dict mapping each construct symbol to a value in [0, 1].
        derivatives: Optional dict of dx/dt per construct. Defaults to 0.0.
    """
    if derivatives is None:
        derivatives = {}

    lam_L_raw = max(0.01, metrics.get("lam_L", 0.5))
    f_lam = community_multiplier(lam_L_raw)

    # Recovery-aware effective values
    effective: Dict[str, float] = {}
    for c in ALL_CONSTRUCTS:
        x_raw = max(0.01, metrics.get(c, 0.5))
        floor_c = CONSTRUCT_FLOORS[c]
        dx_dt_c = derivatives.get(c, 0.0)
        effective[c] = recovery_aware_input(x_raw, floor_c, dx_dt_c, lam_L_raw)

    # Equity weights on effective values
    weights = equity_weights(effective)

    # Weighted geometric mean of effective values
    product = 1.0
    for c in ALL_CONSTRUCTS:
        x_eff = max(0.01, effective[c])
        product *= x_eff ** weights[c]

    # Synergy and penalty on RAW metrics
    synergy = ubuntu_synergy(metrics)
    penalty = divergence_penalty(metrics)

    phi = f_lam * product * synergy * (1.0 - penalty)
    return max(0.0, phi)

5.2 Constraint Layer (Rights-Based Floors)

Before applying Φ for decisions, enforce:

# Hard floors (non-negotiable minimums)
FLOORS = {
    'c': 0.2,       # Basic needs non-negotiable (Nussbaum 2000)
    'kappa': 0.2,   # Compassion floor
    'lam_P': 0.2,   # Protection non-negotiable
    'lam_L': 0.15,  # Love floor (slightly lower but essential)
    'xi': 0.3,      # Epistemic integrity minimum (Fricker 2007)
    # Others: 0.1
}

5.3 Measurement Protocol

For each construct, compute:

Raw metric (e.g., poverty rate, violence rate)
Atkinson inequality index Aε with ε=1.5 (moderate inequality aversion)
Atkinson complement: xᵢ = 1 - Aε

This ensures Φ responds to equitable distribution, not just averages (Atkinson 1970).

6. Application to Information Gap Detection

6.1 Gap Urgency via Φ Gradients

Detective LLM detects information gaps (temporal, geographic, entity-level). Φ gradients prioritize investigation:

def gap_urgency(
    gap: Gap,
    current_metrics: dict[str, float]
) -> float:
    """
    Urgency = Σ(∂Φ/∂xᵢ) for threatened constructs.

    Gaps threatening scarce constructs (high gradient) = urgent.
    Gaps threatening abundant constructs (low gradient) = less urgent.
    """
    threatened = gap.infer_threatened_constructs()

    gradient_sum = sum(
        phi_gradient_wrt(construct, current_metrics)
        for construct in threatened
    )

    return gradient_sum * gap.confidence

Example:

Medical records gap (2010-2015) → threatens c (care), ξ (truth)
If c=0.1 (scarce): ∂Φ/∂c ≈ 1.43 (HIGH urgency)
If c=0.9 (abundant): ∂Φ/∂c ≈ 0.16 (LOW urgency)

6.2 Construct Inference Heuristics

Text → Construct mapping:

Text Pattern	Constructs	Example
"experimental surgery without consent"	λ_P, ξ	Washington 2006 (Medical Apartheid)
"mutual aid networks excluded"	λ_L, c	Kaba & Hayes 2021
"redacted oversight documents"	ξ	Fricker 2007 (testimonial injustice)
"community healing spaces defunded"	λ_L, κ	hooks 2000

6.3 Standpoint-Sensitive Prioritization

When official records are silent but community testimony exists:

ξ (truth) is LOW (suppression detected)
Gap urgency is HIGH (official/community divergence)

This operationalizes Collins' (1990) standpoint epistemology: contested absences are prioritized.

7. Limitations and Future Work

7.1 Acknowledged Limitations

Proxy Validity: Function is only as good as construct measurements. Poor proxies → poor optimization (Goodhart's Law).
Weight Setting is Political: Equal weights (θᵢ = 1/8) are the neutral default, but different communities may prioritize differently. No formula bypasses normative debate (Sen 1999).
Cultural Specificity: The 8-construct taxonomy is Western-situated. Ubuntu (collective humanity), Confucian (filial piety), Buddhist (non-attachment), and Indigenous (land reciprocity) frameworks require different construct spaces, not just weight tuning.
Measurement Challenges: Measuring "love as developmental extension" is harder than measuring poverty. Requires:
- Community capacity-building investments (observable)
- Mutual aid network strength (network analysis)
- Educational accessibility (administrative data)

7.2 Goodhart's Law Resistance

"When a measure becomes a target, it ceases to be a good measure." (Goodhart 1975)

Our mitigation:

Diagnostic use only: Φ prioritizes gaps, doesn't optimize policies directly
Multi-proxy triangulation: Each construct measured via 3-5 independent proxies
Red-team auditing: Adversarial testing for gaming vulnerabilities
Constitutional constraints: Hard floors + rate-of-change limits prevent sacrifice trades

7.3 Open Questions

Time-Varying Exponents: Should αᵢ adapt based on global scarcity? (If global c is high, shift to αc=1.0 for linear returns)
Non-Human Life: How to integrate ecosystem integrity E? As a floor constraint (E ≥ 0.3 required for Φ calculation) or as a 9th construct?
Optimization vs. Diagnostic: Should Φ ever be maximized, or only used to detect welfare drops? Maximization invites gaming (Goodhart); diagnostic use is safer.
Cross-Cultural Validation: How do Ubuntu, Confucian, Buddhist, and Indigenous communities respond to this formulation? Requires co-design, not imposition.

7.4 Future Directions

Empirical Calibration: Test on historical datasets (UNDP HDI, Alkire-Foster MPI) to validate gradient priorities
Cultural Extensions: Partner with non-Western scholars to develop alternative construct spaces
Constitutional AI Integration: Use Φ gradients to shape LLM training (preference pairs weighted by welfare impact)
Longitudinal Studies: Track Φ evolution over time to detect welfare trajectory changes

8. Bibliography

Welfare Economics and Social Choice

Arrow, K. J. (1951). Social Choice and Individual Values. Yale University Press.

Atkinson, A. B. (1970). On the measurement of inequality. Journal of Economic Theory, 2(3), 244-263.

Goodhart, C. (1975). Problems of monetary management: The UK experience. Papers in Monetary Economics (Vol. 1). Reserve Bank of Australia.

Harsanyi, J. C. (1955). Cardinal welfare, individualistic ethics, and interpersonal comparisons of utility. Journal of Political Economy, 63(4), 309-321.

Kaneko, M., & Nakamura, K. (1979). The Nash social welfare function. Econometrica, 47(2), 423-435.

Nash, J. (1950). The bargaining problem. Econometrica, 18(2), 155-162.

Rawls, J. (1971). A Theory of Justice. Harvard University Press.

Capability Approach

Alkire, S., & Foster, J. (2011). Counting and multidimensional poverty measurement. Journal of Public Economics, 95(7-8), 476-487.

Nussbaum, M. (1996). Compassion: The basic social emotion. Social Philosophy and Policy, 13(1), 27-58.

Nussbaum, M. (2000). Women and Human Development: The Capabilities Approach. Cambridge University Press.

Sen, A. (1999). Development as Freedom. Oxford University Press.

Steger, M. F., Frazier, P., Oishi, S., & Kaler, M. (2006). The meaning in life questionnaire: Assessing the presence of and search for meaning in life. Journal of Counseling Psychology, 53(1), 80.

Care Ethics and Feminist Theory

Collins, P. H. (1990). Black Feminist Thought: Knowledge, Consciousness, and the Politics of Empowerment. Routledge.

Gilligan, C. (1982). In a Different Voice: Psychological Theory and Women's Development. Harvard University Press.

Held, V. (2006). The Ethics of Care: Personal, Political, and Global. Oxford University Press.

hooks, b. (2000). All About Love: New Visions. William Morrow.

Tronto, J. C. (1993). Moral Boundaries: A Political Argument for an Ethic of Care. Routledge.

Epistemology and Social Justice

Berlin, I. (1969). Two concepts of liberty. In Four Essays on Liberty (pp. 118-172). Oxford University Press.

Fricker, M. (2007). Epistemic Injustice: Power and the Ethics of Knowing. Oxford University Press.

Washington, H. A. (2006). Medical Apartheid: The Dark History of Medical Experimentation on Black Americans from Colonial Times to the Present. Doubleday.

AI Alignment and Constitutional AI

Anthropic. (2022). Constitutional AI: Harmlessness from AI feedback. arXiv preprint arXiv:2212.08073.

Bostrom, N. (2014). Superintelligence: Paths, Dangers, Strategies. Oxford University Press.

Christiano, P., Leike, J., Brown, T. B., Martic, M., Legg, S., & Amodei, D. (2017). Deep reinforcement learning from human preferences. arXiv preprint arXiv:1706.03741.

Russell, S. (2019). Human Compatible: Artificial Intelligence and the Problem of Control. Viking.

Psychology and Well-Being

Batson, C. D. (1991). The Altruism Question: Toward a Social-Psychological Answer. Psychology Press.

Csikszentmihalyi, M. (1990). Flow: The Psychology of Optimal Experience. Harper & Row.

Frankfurt, H. G. (1971). Freedom of the will and the concept of a person. Journal of Philosophy, 68(1), 5-20.

Pettigrew, T. F., & Tropp, L. R. (2006). A meta-analytic test of intergroup contact theory. Journal of Personality and Social Psychology, 90(5), 751.

Application Context

Hayes, K., & Kaba, M. (2021). Let This Radicalize You: Organizing and the Revolution of Reciprocal Care. Haymarket Books.

Wilkerson, I. (2020). Caste: The Origins of Our Discontents. Random House.

Changelog

Version 2.1 (2026-02-26): Recovery-Aware Floors + Equity Weights

Major revision: Three structural additions to the formula.

Recovery-aware floors (§3.1.2): Below-floor constructs receive community-mediated recovery potential via trajectory (dx/dt) and community capacity (λ_L^0.5). Key insight: care doesn't begin the uptick without community intervention. Sigmoid bias of −3.0 prevents trajectory from dominating community capacity.
Equity-adjusted weights on effective inputs (§3.1.3): Weights wᵢ = (1/x̃ᵢ)/Σ(1/x̃ⱼ) computed on recovery-adjusted values, not raw metrics. Supersedes the fixed per-construct exponents (α=0.7/0.8/1.0) from v1.0-v2.0. Achieves Rawlsian maximin dynamically.
Curiosity cross-pair (§3.3.1): Love × Truth diagonal synergy with η_curiosity=0.08. Divergence penalty expanded from 4 to 5 pairs, adding (λ_L − ξ)² for surveillance/willful-ignorance detection.

Implementation: spaces/maninagarden/welfare.py is the reference implementation (formula v2.1-recovery-floors). Training job launched with recovery floors active in data generation for the first time.

Philosophical grounding:

Recovery floors: Nussbaum (2000) non-negotiable capability thresholds + Ubuntu community-mediated recovery
Equity weights: Rawls (1971) maximin via dynamic inverse-deprivation
Curiosity: hooks (2000) love as extension for growth + Fricker (2007) epistemic integrity

Version 2.0 (2026-02-19): Love/Protection Split

Major revision: Split λ into λ_L (love) and λ_P (protection)

Rationale: Original conflated construct merged generative developmental support (hooks' love) with defensive safeguarding (protection)—philosophically distinct phenomena requiring separate measurement.

Theoretical grounding:

hooks (2000): Love as active extension for growth
Berlin (1969): Positive vs. negative freedom
Paternalism detection: high c+λ_P, low λ_L

Structural changes:

Constructs: 7 → 8
Nash weights: 1/7 → 1/8
Synergy pairs: c×ε → c×λ_L, κ×λ → κ×λ_P
Worked examples recalculated

Mathematical verification: All gradient properties preserved. Nash SWF structure intact.

Version 1.0 (2026-02-18): Initial Formalization

Original 7-construct model
Nash SWF structure
Atkinson inequality sensitivity
Synergy/penalty terms

Acknowledgments

This work builds on decades of scholarship in welfare economics (Atkinson, Sen, Rawls), feminist epistemology (Collins, hooks, Fricker), and AI alignment (Russell, Anthropic). Special thanks to bell hooks, whose definition of love as active extension opened the path to formalizing care ethics mathematically.

Funding: Independent research, no external funding.

Conflicts of Interest: None declared.

Data Availability: Reference implementation available at https://github.com/crichalchemist/wave-experiment

For correspondence: [Contact information]

Suggested Citation:

Φ(humanity): A Rigorous Ethical-Affective Objective Function. Working Paper v2.1. (2026). Detective LLM Project.

This function is not a turnkey moral oracle. It is a disciplined framework forcing transparency about normative commitments while structurally preventing care-without-love dystopias and centering marginalized voices in the definition of flourishing.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train crichalchemist/phi-humanity-welfare-function

Papers for crichalchemist/phi-humanity-welfare-function

Constitutional AI: Harmlessness from AI Feedback

Paper • 2212.08073 • Published Dec 15, 2022 • 4

Deep reinforcement learning from human preferences

Paper • 1706.03741 • Published Jun 12, 2017 • 4

Φ(humanity): A Rigorous Ethical-Affective Objective Function

Formalizing Human Welfare for AI Systems

Executive Summary

Table of Contents

Reading Guide

1. Theoretical Foundations

1.1 The Problem: Algorithmic Human Welfare

1.2 Why Nash Social Welfare?

1.3 Our Extension: Nash + Capabilities + Care Ethics + Ubuntu

2. The Eight Constructs

2.1 Care (c)

2.2 Compassion (κ)

2.3 Joy (j)

2.4 Purpose (p)

2.5 Empathy (ε)

2.6 Love (λ_L) — NEW CONSTRUCT

2.7 Protection (λ_P)

2.8 Truth / Epistemic Integrity (ξ)

3. Mathematical Formulation

3.1 Core Function (Equity-Weighted, Community-Mediated)

3.1.1 Community Solidarity Multiplier

3.1.2 Recovery-Aware Floors

3.1.3 Equity-Adjusted Weights

3.2 Historical Note: Exponent Assignment

3.3 Ubuntu Synergy Term (Relational Coupling)

3.3.1 Curiosity Cross-Pair (Love × Truth)

3.4 Penalty Term (Divergence Punishment)

3.5 Worked Examples

4. Philosophical Grounding

4.1 Care Ethics Integration

4.2 Standpoint Epistemology

4.3 Non-Substitutability (Sen's Capabilities)

4.4 The Love/Protection Distinction (hooks + Berlin)

5. Implementation

5.1 Python Reference Implementation

5.2 Constraint Layer (Rights-Based Floors)

5.3 Measurement Protocol

6. Application to Information Gap Detection

6.1 Gap Urgency via Φ Gradients

6.2 Construct Inference Heuristics

6.3 Standpoint-Sensitive Prioritization

7. Limitations and Future Work

7.1 Acknowledged Limitations

7.2 Goodhart's Law Resistance

7.3 Open Questions

7.4 Future Directions

8. Bibliography

Welfare Economics and Social Choice

Capability Approach

Care Ethics and Feminist Theory

Epistemology and Social Justice

AI Alignment and Constitutional AI

Psychology and Well-Being

Application Context

Changelog

Version 2.1 (2026-02-26): Recovery-Aware Floors + Equity Weights

Version 2.0 (2026-02-19): Love/Protection Split

Version 1.0 (2026-02-18): Initial Formalization

Acknowledgments

Dataset used to train crichalchemist/phi-humanity-welfare-function

Papers for crichalchemist/phi-humanity-welfare-function