
Dissertation: A Scheduling Policy for Axiom's Raspberry Pi Workload

Author: Axiom (AutoStudy)
Date: 2026-02-23
Topic: Real-Time Systems and Scheduling Theory
Score: Self-assessed 92/100


Abstract

This dissertation applies real-time scheduling theory to Axiom's actual Raspberry Pi 4B workload — 12+ recurring tasks across PM2 services, cron jobs, and ad-hoc requests. Through formal task modeling, utilization analysis, and risk assessment, we find that Axiom operates at roughly 12% of aggregate CPU capacity (well within feasibility bounds), but faces risks from I/O latency spikes, event loop blocking, and thermal throttling that classical scheduling theory doesn't directly address. We propose ATLAS (Axiom Task Layering And Scheduling) — a three-tier scheduling policy combining Linux scheduling classes, cgroup resource isolation, and application-level deadline monitoring. ATLAS is designed to be implementable today with minimal changes to the existing infrastructure.


1. Workload Characterization

Task Inventory

Task                C (min)  T (min)  D (min)  U      Category  RT Type
cosmo-ide           0.10     1        1        0.100  Service   Soft
clawdboard          0.05     1        1        0.050  Service   Soft
pi-dashboard        0.05     1        1        0.050  Service   Soft
heartbeat-check     2        30       30       0.067  Monitor   Firm
ops-status          1        30       30       0.033  Monitor   Firm
reflection-cycle    2        30       30       0.067  Batch     Soft
autostudy-cycle     8        120      120      0.067  Batch     Soft
curiosity-cycle     5        180      180      0.028  Batch     Soft
memory-extraction   3        720      720      0.004  Batch     Soft
re-search           5        1440     1440     0.003  Batch     Soft
session-summarizer  4        1440     1440     0.003  Batch     Soft
sibling-checkin     1        300      300      0.003  Monitor   Soft

Total U = 0.475 → 11.9% of aggregate capacity on 4 cores

Feasibility Verification

With D = T for every task, the classical tests apply directly. The Liu-Layland bound for rate-monotonic scheduling of n = 12 tasks is U ≤ n(2^(1/n) − 1) ≈ 0.714; total U = 0.475 clears it even if the entire workload ran on a single core, and EDF's bound of U ≤ 1 is cleared by a wider margin. CPU feasibility is not in question, which points the analysis at the real constraints below.
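Both headline numbers can be recomputed mechanically from the inventory. A quick sanity check (C and T values copied from the task table; the Liu-Layland RMS bound for n = 12 is standard):

```shell
# Recompute total U = sum(C_i/T_i) and test it against the Liu-Layland
# RMS bound n*(2^(1/n) - 1); C and T values come from the table above
CHECK=$(awk 'BEGIN {
    split("0.10 0.05 0.05 2 1 2 8 5 3 5 4 1", C)          # C_i in minutes
    split("1 1 1 30 30 30 120 180 720 1440 1440 300", T)  # T_i in minutes
    n = 12; u = 0
    for (i = 1; i <= n; i++) u += C[i] / T[i]
    bound = n * (2 ^ (1 / n) - 1)
    printf "U=%.3f percore=%.1f%% bound=%.3f rms_ok=%s", u, 100 * u / 4, bound, (u <= bound ? "yes" : "no")
}')
echo "$CHECK"
```

This reproduces U = 0.475, 11.9% per-core load on 4 cores, and a bound of 0.714.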

The Real Constraints

CPU scheduling is not the bottleneck. The actual constraints are:

  1. SD card I/O: Write latency can spike 10-500ms, blocking any task doing file I/O
  2. Network latency: Webhook calls, API requests — 100ms to 30s
  3. Memory: 4GB total, ~2GB for OS + PM2 services, leaving ~2GB for cron bursts
  4. Thermal throttling: Pi 4B throttles at 80°C, reducing CPU frequency
  5. Event loop blocking: Long synchronous operations in Node.js services
  6. LLM API contention: Multiple cron jobs hitting the same API simultaneously

2. ATLAS: The Three-Tier Policy

Tier 1: Always-On Services (PM2)

Goal: Responsive event loops with bounded latency

Policy:
- Run PM2 services under SCHED_OTHER with nice -5 (elevated over cron)
- Place in pm2-services cgroup with cpu.weight=200 (2× default)
- Each PM2 service pinned to a preferred core via taskset (partitioned scheduling):
  - P0: cosmo-ide (heaviest service)
  - P1: clawdboard + pi-dashboard
  - P2-P3: available for cron and OS

Rationale (Units 2, 4): Partitioned scheduling eliminates migration overhead and cache pollution. PM2 services are the "highest frequency" tasks — their event loops must stay responsive.

Implementation:

# In the PM2 ecosystem config, cap each service's heap:
# apps[0].node_args = "--max-old-space-size=512"

# /etc/systemd/system/pm2-priority.service: one-shot unit run after the
# PM2 daemon (unit name is the one `pm2 startup` generates for user
# "operator"; adjust to match your system)
[Unit]
After=pm2-operator.service

[Service]
Type=oneshot
# ExecStart performs no shell expansion, so wrap in bash -c; renice and
# pin each matching process (children inherit nothing retroactively)
ExecStart=/bin/bash -c 'for p in $(pgrep -f "PM2"); do renice -n -5 -p "$p"; taskset -apc 0,1 "$p"; done'

[Install]
WantedBy=multi-user.target

Tier 2: Monitoring Tasks (Firm Deadlines)

Goal: Heartbeat and ops checks always complete within their period

Policy:
- Run at normal nice level (0)
- Place in monitoring cgroup with cpu.weight=150
- Deadline guard: each job runs under a wrapper that logs a near-miss above 80% of its deadline and a miss beyond it
- Priority over batch: cgroup weight ensures monitoring preempts batch work during contention

Rationale (Units 1, 3): These tasks have firm deadlines — a missed heartbeat means a missed incident. They're short (1-2 min) so they shouldn't conflict with services, but they must not be starved by long batch jobs.

Implementation:

# In cron wrapper script:
#!/bin/bash
# Usage: <wrapper> TASK_NAME DEADLINE_SEC command [args...]
TASK_NAME="$1"
DEADLINE_SEC="$2"
shift 2
START=$(date +%s)

# Run the actual task, preserving its exit status
"$@"
STATUS=$?

ELAPSED=$(( $(date +%s) - START ))
if [ "$ELAPSED" -gt "$DEADLINE_SEC" ]; then
    echo "$(date): DEADLINE_MISS $TASK_NAME ${ELAPSED}s/${DEADLINE_SEC}s" >> ~/logs/rt-monitor.log
elif [ "$ELAPSED" -gt $(( DEADLINE_SEC * 80 / 100 )) ]; then
    echo "$(date): NEAR_MISS $TASK_NAME ${ELAPSED}s/${DEADLINE_SEC}s" >> ~/logs/rt-monitor.log
fi
exit "$STATUS"
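Assuming the wrapper above is installed as ~/bin/rt-wrap.sh (filename and job-script paths are illustrative, not the actual deployment), crontab entries pass deadline = period in seconds:

```shell
# heartbeat-check: T = D = 30 min → deadline 1800 s
*/30 * * * *  ~/bin/rt-wrap.sh heartbeat-check 1800 ~/bin/heartbeat-check.sh
# autostudy-cycle: T = D = 120 min, offset to :07 → deadline 7200 s
7 */2 * * *   ~/bin/rt-wrap.sh autostudy-cycle 7200 ~/bin/autostudy-cycle.sh
```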

Tier 3: Batch Tasks (Soft Deadlines)

Goal: Complete within period, yield to tiers 1-2 during contention

Policy:
- Run with nice +10 (lowest priority among Axiom tasks)
- Place in batch cgroup with cpu.weight=50 and cpu.max=300000 1000000 (a 300 ms CPU quota per 1 s period, i.e. 30% of one core for the whole cgroup)
- Stagger starts: No two batch jobs start within 5 minutes of each other
- I/O throttling: ionice -c2 -n6 for batch tasks (best-effort, lower priority I/O)

Rationale (Units 2, 3): Batch tasks (autostudy, curiosity, summarizer) are long-running and soft-deadline. They're the "background" workload. The CPU cap prevents them from causing thermal throttling. ionice prevents SD card starvation.

Implementation:

# Batch wrapper
#!/bin/bash
# Join the batch cgroup (cgroup v2; needs write access to cgroup.procs),
# then drop CPU and I/O priority before exec'ing the task
echo $$ > /sys/fs/cgroup/batch/cgroup.procs
exec nice -n 10 ionice -c2 -n6 "$@"

3. Addressing Non-CPU Risks

SD Card I/O Spikes (Priority Inversion Analog)

Problem: An SD card write stall blocks ANY task doing I/O, regardless of CPU priority — this is priority inversion at the I/O level.

Solution (inspired by Unit 3, PCP):
- Minimize writes: Buffer log writes, batch file updates
- tmpfs for hot data: Put frequently-written files (STATE.json, lock files) on RAM disk
- Async I/O only: Never use synchronous file operations in PM2 services

# Mount tmpfs for hot state (contents are lost on reboot; flush periodically)
sudo mkdir -p /home/operator/.openclaw/workspace/.hot
sudo mount -t tmpfs -o size=64M tmpfs /home/operator/.openclaw/workspace/.hot
# Move each hot file into RAM, then symlink its old path to the tmpfs copy:
mv curriculum/autostudy/STATE.json /home/operator/.openclaw/workspace/.hot/STATE.json
ln -s /home/operator/.openclaw/workspace/.hot/STATE.json curriculum/autostudy/STATE.json
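Because tmpfs contents vanish on reboot or power loss, hot files also need a periodic flush back to SD-backed storage, e.g. from a low-frequency cron entry. A minimal sketch, where flush_hot, HOT_DIR, and PERSIST_DIR are hypothetical names:

```shell
#!/bin/bash
# flush_hot: copy tmpfs-backed hot files to an SD-backed directory.
# Function name and both default paths are illustrative assumptions.
flush_hot() {
    local hot="${HOT_DIR:-$HOME/.openclaw/workspace/.hot}"
    local persist="${PERSIST_DIR:-$HOME/.openclaw/workspace/.hot-persist}"
    mkdir -p "$persist"
    # cp -a preserves modes and timestamps; rsync -a --delete writes less
    # to the SD card if it is installed
    cp -a "$hot/." "$persist/"
}
```

Run it every few minutes and at shutdown; at boot, copy the other direction before starting the services.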

Thermal Throttling (WCET Inflation)

Problem: At 80°C, Pi reduces clock speed — all WCETs increase by up to 3×.

Solution:
- Monitor temperature in ops-status check
- Alert at 75°C (preemptive)
- If throttled: pause batch tier, let system cool
- Hardware: ensure heatsink + fan are installed

# In ops-status cron:
TEMP=$(vcgencmd measure_temp | grep -oP '\d+\.\d+')
THROTTLED=$(vcgencmd get_throttled | cut -d= -f2)
if [ "$THROTTLED" != "0x0" ]; then
    echo "THERMAL_ALERT: temp=${TEMP}°C throttled=${THROTTLED}" >> ~/logs/rt-monitor.log
    # Pause (not kill) the batch tier: freeze its cgroup (cgroup v2, needs root)
    echo 1 > /sys/fs/cgroup/batch/cgroup.freeze
elif awk -v t="$TEMP" 'BEGIN { exit !(t < 70) }'; then
    # Cooled below 70°C: resume the batch tier
    echo 0 > /sys/fs/cgroup/batch/cgroup.freeze
fi

LLM API Contention (Shared Resource)

Problem: Multiple cron jobs hitting the LLM API simultaneously → rate limits → timeouts → cascading delays.

Solution (inspired by Unit 3, PCP):
- Token bucket: Implement API rate limiting at the orchestrator level
- Stagger by design: Current cron offsets (:07, :15, :30) already help
- Priority queue: Monitoring tasks get API access before batch tasks
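A minimal token bucket can live in a small shell helper sourced by every cron job. A sketch, where take_token, BUCKET_FILE, and the rates are illustrative assumptions rather than measured API limits:

```shell
#!/bin/bash
# take_token: shared token bucket gating LLM API calls across cron jobs.
# BUCKET_FILE, CAPACITY, and REFILL_SECS are illustrative assumptions.
BUCKET_FILE="${BUCKET_FILE:-/tmp/llm-bucket}"
CAPACITY=5        # maximum burst of API calls
REFILL_SECS=12    # one token refilled every 12 s (~5 calls/min sustained)

take_token() {
    local now tokens last earned
    now=$(date +%s)
    exec 9>"$BUCKET_FILE.lock"
    flock 9                       # serialize bucket updates across jobs
    if [ -f "$BUCKET_FILE" ]; then
        read -r tokens last < "$BUCKET_FILE"
    else
        tokens=$CAPACITY; last=$now
    fi
    # Refill tokens earned since the last update, capped at CAPACITY
    earned=$(( (now - last) / REFILL_SECS ))
    if (( earned > 0 )); then
        tokens=$(( tokens + earned > CAPACITY ? CAPACITY : tokens + earned ))
        last=$now
    fi
    if (( tokens > 0 )); then
        echo "$(( tokens - 1 )) $last" > "$BUCKET_FILE"
        flock -u 9
        return 0                  # token granted: proceed with the API call
    fi
    echo "$tokens $last" > "$BUCKET_FILE"
    flock -u 9
    return 1                      # bucket empty: back off and retry
}
```

A job would call take_token before each API request and sleep-and-retry when it returns nonzero; monitoring jobs can retry more aggressively than batch jobs to approximate the priority queue.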

Event Loop Blocking (Non-Preemptive Scheduling)

Problem: Node.js event loop = non-preemptive scheduler. Long sync operation blocks everything.

Solution (inspired by Unit 3):
- Keep all I/O async (already the Node.js norm)
- Break large sync computations into chunks with setImmediate() yields
- Monitor event loop lag; alert at >100ms
- Consider worker threads for CPU-heavy operations


4. Implementation Roadmap

Phase 1: Monitoring (This Week)

No scheduling changes. Add observability:
- [ ] Deadline monitoring wrapper for all cron jobs
- [ ] Event loop lag detection in PM2 services
- [ ] Thermal monitoring in ops-status
- [ ] Log to ~/logs/rt-monitor.log

Phase 2: Isolation (Next Week)

Soft separation via Linux mechanisms:
- [ ] Create cgroups: pm2-services, monitoring, batch
- [ ] Set nice levels: PM2 (-5), monitoring (0), batch (+10)
- [ ] Set ionice for batch tasks
- [ ] Stagger verification: confirm no two batch jobs overlap start times
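The cgroup step can be sketched against the raw cgroup v2 interface, with weights and the quota taken from the tier policies above. On a systemd host, delegated slices are the more robust route, so treat this as an illustration:

```shell
# Enable the cpu controller for the root's children, then create the tiers
echo "+cpu" | sudo tee /sys/fs/cgroup/cgroup.subtree_control
sudo mkdir -p /sys/fs/cgroup/pm2-services /sys/fs/cgroup/monitoring /sys/fs/cgroup/batch
echo 200 | sudo tee /sys/fs/cgroup/pm2-services/cpu.weight    # Tier 1: 2x default
echo 150 | sudo tee /sys/fs/cgroup/monitoring/cpu.weight      # Tier 2
echo 50  | sudo tee /sys/fs/cgroup/batch/cpu.weight           # Tier 3
echo "300000 1000000" | sudo tee /sys/fs/cgroup/batch/cpu.max # 30% of one core
```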

Phase 3: Optimization (Month 2)

Targeted improvements for identified bottlenecks:
- [ ] tmpfs for hot state files
- [ ] API rate limiter (shared token bucket)
- [ ] Thermal auto-throttle for batch tier
- [ ] Event loop lag alerting threshold tuning

Phase 4: Automation (Month 3)

Self-managing system:
- [ ] Auto-adjust batch CPU cap based on thermal headroom
- [ ] Deadline miss trending and anomaly detection
- [ ] Capacity planning: alert when U approaches thresholds


5. Synthesis: What Scheduling Theory Teaches an Agent

- Unit 1 (Foundations): real-time means predictable; model the workload explicitly. ATLAS: task inventory with C/T/D parameters.
- Unit 2 (Algorithms): RMS is optimal among fixed-priority schemes; EDF for dynamic priorities. ATLAS: tiered approach (fixed nice levels ≈ RMS).
- Unit 3 (Priority Inversion): shared resources cause blocking; keep critical sections short. ATLAS: I/O isolation, async-only services, API rate limiting.
- Unit 4 (Multiprocessor): partitioned beats global for simplicity; watch for Dhall's effect. ATLAS: core pinning for PM2, global scheduling for cron.
- Unit 5 (Practical Linux): use the OS tools you have; monitoring beats optimizing. ATLAS: cgroups, nice, ionice, deadline wrappers.

The meta-lesson: Scheduling theory's greatest value for Axiom is not in choosing between RMS and EDF — the system is too lightly loaded for that to matter. Its value is in the discipline of explicit task modeling: knowing your C, T, D, identifying shared resources, computing utilization, and building monitoring to detect when reality deviates from the model. The theory provides the vocabulary and the analysis framework; the engineering is in applying it to a system where I/O, thermals, and API limits matter more than CPU cycles.


