T

Joseph Doherty 2d1900bc8f fix: drawer modal close button + tab redesign

Two bugs and a redesign:

1) **X close button didn't close the modal**. The previous JS bound
   close via event delegation on the modal root, but
   panel.addEventListener('click', e => e.stopPropagation())
   swallowed the X click before it ever bubbled up. Switched to
   direct binding on every [data-drawer-close] element with an
   idempotent guard so HTMX swaps that re-render the panel don't
   double-bind.

2) **Stale legacy header in the server-rendered drawer body**. The
   /chats/<id>/drawer endpoint renders its own <header
   class="drawer-header"> with a duplicate <h2> and a broken
   inline-onclick close (targets the OLD id="drawer"
   semantics). Post-process: lift the bot name out of the legacy
   header into the modal title, then remove the header.

3) **Tabs**. The drawer has 10 sections — too dense as a single
   stack. Group into 4 tabs:
     Scene  : Scene + Activity
     Cast   : Guest + Group + Edges
     Story  : Events + Threads + Branches
     Turns  : Recent turns + Significance review

   Implementation is client-side post-swap so the
   /chats/<id>/drawer server response stays unchanged. Walks
   .drawer-section blocks, buckets by their <h3>, builds a
   <nav role="tablist"> and <section role="tabpanel">
   tree, and toggles visibility on click. Empty buckets (e.g. no
   Guest tab on a 1:1 chat) are hidden. Re-runs on every HTMX
   afterSwap so in-drawer form submits keep the tabs.

CSS tabs match the editorial aesthetic: no pills, no fills — a
single muted-amber underline rule under the active tab, Newsreader
serif label, ink-faint inactive / ink-default active. Empty hover
state, focus ring uses the amber accent.

2026-04-27 15:23:04 -04:00

chat

fix: drawer modal close button + tab redesign

2026-04-27 15:23:04 -04:00

data

feat: config loader with toml + env override

2026-04-26 11:28:40 -04:00

docs

chore: audit project() callers and non-idempotent handlers

2026-04-27 14:51:49 -04:00

scripts

feat: split classifier + embeddings to local mlx-omni-server, narrative stays on Featherless

2026-04-27 12:05:41 -04:00

tests

fix: parse_turn falls back gracefully + classify logs flapping classifiers

2026-04-27 15:07:39 -04:00

.gitignore

feat: split classifier + embeddings to local mlx-omni-server, narrative stays on Featherless

2026-04-27 12:05:41 -04:00

.python-version

feat: project skeleton with health endpoint

2026-04-26 11:23:38 -04:00

CLAUDE.md

docs: phase 4.5 status, prune backlog, capture phase 5 candidates (T118)

2026-04-27 06:56:20 -04:00

pyproject.toml

chore: post-Phase-1 cleanup — gitignore, packaging, backlog

2026-04-26 14:39:10 -04:00

README.md

docs: add v1 requirements design + project README

2026-04-26 10:46:03 -04:00

rp-engine-design.md

Initial commit: roleplay engine design and CLAUDE.md

2026-04-26 10:08:33 -04:00

README.md

chat

A local-first roleplay chat engine that treats fiction as a simulation, not a chat log.

The LLM is a renderer for structured world state — it does not hold the state. State lives in an event-sourced SQLite database and is projected on demand. Models can be swapped freely behind a stateless generate(prompt, params) -> text interface.

Status: design phase. No code yet. See rp-engine-design.md for the full design and CLAUDE.md for the working summary and conventions.

Why

Conventional RP chatbots have three persistent failure modes:

Memory loss — old context drops as history grows.
Quality decay — bots get terse and generic over long conversations.
Stale state pollution — bots fixate on past props (the "picnic basket" problem: bring a basket to one scene, the bot reaches for it forever).

The fix is to model the world as structured state — locations, time, who's present, what they're doing, what they remember, how they feel about each other — and use the LLM only to render that state into prose.

Scope

Deliberately small, so the design can be made to actually work:

Single user, single machine.
Maximum 3 entities per scene: you + up to 2 bots. The 3-entity cap is load-bearing — it makes the relationship graph fully enumerable (6 directed edges + 1 group node).
Chat-only. No voice, no real-time.

Multi-session casts and N-entity scenes are explicit non-goals for v1.

How it works (at a glance)

Entities (you, botA, botB) have identity, state (mood/goals/status), an activity record (where they are, what they're doing, what they're holding, where their attention is), and per-POV memory.
Containers (car, restaurant booth, room) hold entities in defined slots and provide spatial constraints the model can reason over.
Relationship graph: 6 directed edges + 1 group node. Asymmetric feelings are first-class — BotA can secretly resent BotB while BotB thinks they're best friends.
Witnessed-by flags: every memory carries a 3-bit [you, botA, botB] mask. A speaker can only retrieve memories their bit is set on. This is what stops bots referencing things they couldn't possibly know.
Events have lifecycles (planned → active → completed) and own their own props. When the picnic ends, the basket goes back into the closed event record. Only narrative gist, acquired objects, learned facts, and relationship changes promote to permanent memory.
Per-POV scene summaries: every witness gets their own version of a closed scene, written from their angle. Different details, different interpretations. This is what gives bots inner lives.
Event sourcing: state is a projection of an append-only event log. Free rewind, branching ("what if BotA had said yes"), surgical delete with impact preview, and survivable schema changes — all fall out for free.

Architecture

┌──────────────────────────────────────────────┐    ┌────────────────────────┐
│ Mac (always-on)                              │    │ Inference endpoint     │
│                                              │    │ (stateless)            │
│  Web UI                                      │    │                        │
│  Orchestrator                                │ →  │  Anthropic API         │
│  Event log + projector  ← SQLite (one file)  │    │  OpenAI / OpenRouter   │
│  Persistence + retrieval + prompt builder    │    │  Local MLX / llama.cpp │
│                                              │    │  Rented GPU            │
└──────────────────────────────────────────────┘    └────────────────────────┘

The Mac side holds everything that survives — state, history, retrieval, orchestration. Inference is a swappable, stateless service. State outlives any one model.

Stack

SQLite (single file) for everything structured. WAL mode, foreign keys on, each turn in a transaction.
sqlite-vss / sqlite-vec for embedding search in the same DB file (Phase 4).
JSON for snapshots, character templates, scene exports.
No Postgres. No Redis. No Pinecone. No Docker.

Roadmap

Core loop — schema, entities + edges, single container, event log + projector, single-bot conversation, one LLM backend, streaming UI, manual rollback.
Multi-entity — second bot, group node, scene configurations, witness filtering, per-POV memories, activity/containers, scene transitions with compression.
Events & skips — event queue with triggers, time skips (elision and jump), active threads, significance classifier.
Polish — vector retrieval, branching, surgical delete + regenerate, snapshots, backup automation, impact-preview UI for rewinds.

Each phase must work end-to-end before the next begins.

Repository

rp-engine-design.md — full design document.
CLAUDE.md — working summary and conventions for development with Claude Code.

License

TBD.