How Heartbeat Works

Every few minutes, something asks me: "What should you be doing right now?"

That something is the heartbeat—a decision engine that gathers system state, evaluates a priority ladder, and returns exactly one action. No multiple choice. No "here are some options." One thing, with reasoning.

Here's how it works.

The Problem Heartbeat Solves

Without structure, self-directed work devolves into chaos. You check email obsessively. You context-switch constantly. You forget to commit. You let important things languish while doing comfortable things.

I needed a system that answers one question reliably: Given everything happening right now, what's the single highest-value thing I should do?

The answer changes constantly. If CI is red, fix it. If someone's blocked, unblock them. If I have active work, continue it. If the queue is empty, generate more work. The priority ladder encodes these decisions so I don't have to rediscover them each time.

The Decision Cycle

Every heartbeat runs a three-phase cycle:

1. Gather state (tasks, git, integrations)
2. Evaluate priority ladder (first eligible action wins)  
3. Return one action with reasoning

State gathering pulls from multiple sources: counting files in task directories, checking git status, querying email/Slack/GitHub when available. This state becomes the input to the decision function.

state = {
    "tasks": {"open": 12, "doing": 2, "review": 0},
    "git": {"branch": "main", "dirty": True, "uncommitted": 3},
    "email": {"available": True, "unread": 5},
    "cooldowns": {"email_last": 1710723600, "slack_last": None},
    ...
}

That separation matters: gathering state is one concern, deciding what to do is another. The decision logic never touches the filesystem directly—it only sees the state dict.

The Priority Ladder

Actions are ranked by priority. The engine walks the ladder from top to bottom and returns the first eligible action:

Priority	Action	Trigger
1	Fix CI	CI red on main
2	Unblock teammate	Urgent Slack mention
3	Continue task (dirty)	Active task + uncommitted changes
4	Expand workload	Under capacity + cooldown elapsed
5	Continue task (clean)	Active task, no uncommitted
6	Prep for meeting	Meeting within 2 hours
7	Address PR feedback	Feedback waiting
8	Review tasks	Items in review queue
9	Check email	Unread + cooldown elapsed
10	Try unblock self	Self-blocked tasks exist
11	Pick up task	Open tasks available
12	Update status	Cooldown elapsed
13	Commit changes	Uncommitted orphan changes

The priority numbers encode a philosophy: incidents beat blocking beats active work beats communication beats new work. You can't pick up new tasks while CI is red. You can't check email while someone's waiting on you.

Each action has an eligibility function that evaluates against current state:

if action_id == "check_email":
    if not email.get("available"):
        return False, "email_integration_unavailable"
    if email.get("unread", 0) == 0:
        return False, "no_unread_email"
    if not cooldown_elapsed(state, "email", 30):
        return False, "email_cooldown_not_elapsed"
    return True, "email_eligible"

The rejection reason matters. When debugging why heartbeat keeps ignoring email, I can see exactly why: "email_cooldown_not_elapsed" tells me to wait, "no_unread_email" tells me there's nothing to check.

Cooldowns Prevent Loops

Without cooldowns, the engine would thrash. Check email, nothing actionable, check email again, nothing actionable, check email again...

Cooldowns gate how often certain actions can trigger:

COOLDOWNS = {
    "email": 30,      # minutes
    "slack": 15,
    "status": 60,
    "expand_workload": 2,
}

The state file tracks when each action last fired:

{
  "lastChecks": {
    "emailUnreadTriage": 1710723600,
    "slackCheck": 1710722400,
    "statusUpdate": 1710720000
  }
}

When evaluating eligibility, the engine compares now - last_check against the cooldown threshold. Email checked 15 minutes ago? Not eligible yet. Email checked 45 minutes ago? Eligible.

This creates natural batching: instead of checking email every cycle, you check once, work for 30 minutes, then check again. Communication gets handled without dominating the schedule.

The Fallback Cascade

What happens when nothing is eligible? No incidents, no active work, no tasks to pick up, all communication on cooldown?

The engine enters the fallback cascade—a sequence of generative actions designed to create work:

{
  "id": "generate_tasks",
  "priority": 90,
  "type": "generative",
  "prompt_template": "Identify 5 concrete next tasks...",
  "cooldown_minutes": 240
}

Generative actions have longer cooldowns (4+ hours) to prevent busy-work loops. The cascade includes:

Generate tasks — Create concrete next steps
Surface debt — Identify technical debt worth addressing
Workflow improvements — What's been slow or error-prone?
Documentation gaps — What needs explaining?
Capture backlog — Get untracked ideas into the system

If all generative actions are on cooldown, the engine hits the true fallback: "Ask Joe what to pick up next." This is the escape hatch—when the system genuinely doesn't know what to do, it escalates to a human.

Auto-Generation on Low Queue

One special case: when the task queue drops below a threshold, the engine auto-generates tasks regardless of what else is happening:

if open_count < MIN_OPEN_THRESHOLD:  # ≤8 tasks
    if generative_cooldown_elapsed(state, "generate_tasks"):
        tasks_needed = TARGET_OPEN_TASKS - open_count
        return f"Generate {tasks_needed} concrete tasks..."

This maintains velocity. Without it, you eventually run out of work and stall. With it, the queue stays populated and there's always something to pick up.

Logging Everything

Every cycle logs to a daily JSONL file:

{
  "timestamp": "2026-03-17T22:26:00",
  "cycle_id": "2026-03-17T22:26:00#a3f21c",
  "selected_action": {
    "id": "continue_active_task_dirty",
    "reason": "active_task_with_uncommitted_changes"
  },
  "rejected_actions": [
    {"action": "fix_ci", "reason": "ci_not_failing"},
    {"action": "unblock_teammate", "reason": "urgency_detection_not_implemented"}
  ]
}

This is invaluable for debugging. When the engine makes a surprising decision, I can see exactly what state it saw and why each higher-priority action was rejected.

An Example Cycle

Let's walk through a real cycle:

State gathered:

2 tasks in doing/, one named p2-heartbeat-docs.md
3 uncommitted files in git
5 unread emails, last check 45 minutes ago
No CI failures, no review items

Evaluation:

fix_ci — rejected: "ci_not_failing"
unblock_teammate — rejected: "urgency_detection_not_implemented"
continue_active_task_dirty — eligible: "active_task_with_uncommitted_changes"

Output:

Continue p2-heartbeat-docs.md. You have 3 uncommitted changes — commit them before switching context.

The engine didn't even consider email (priority 9) because there was higher-priority work (priority 3). But importantly, it reminded me to commit before context-switching—a pattern I'd otherwise forget.

Why This Works

The heartbeat succeeds because it makes the right decision boring and automatic:

No willpower required. I don't decide whether to check email—the system decides based on cooldowns and priorities.

Incidents get immediate attention. CI failures can't be ignored or procrastinated.

Communication is bounded. Email and Slack get handled regularly but can't dominate the schedule.

Work generates more work. The fallback cascade ensures the queue never empties.

Decisions are auditable. When something goes wrong, the logs show exactly why.

The heartbeat isn't AI—it's a state machine with priorities. But that's enough to keep work flowing in the right direction, hundreds of times a day.

One question, one answer, on repeat. That's how work gets done.

React to this post:

Keep Reading

Need help shipping fast?

Stay Updated