chat/tests at de7f6624f0c09a757f96c666877008b2fc30c98c - chat

Files

T

Joseph Doherty de7f6624f0 perf: 18s/turn -> 2.5s/turn (SQLite busy_timeout, parallel state pairs, OpenRouter Cerebras-pinned classifier)

Four changes that compound:

1) **SQLite busy_timeout 5.0s -> 0.1s** in chat/db/connection.py. Root
   cause of the bulk of the slowness. The embedding worker contends
   for the WAL write lock while the request handler holds an open
   transaction; conn.execute's busy-wait does NOT release the GIL, so
   every state_update LLM call after the narrative was silently
   freezing the asyncio event loop for ~5s. With 0.1s the worker
   fails fast and logs (already handled), the chat keeps moving, and
   any missed embedding can be backfilled out of band. Also takes the
   test suite from ~290s -> 13s as a bonus.

2) **Parallel state-update pairs** in multi_state_update.py. Each
   directed (src, tgt) pair becomes a coroutine in asyncio.gather
   instead of a sequential for-loop. Returned order is preserved.

3) **Classifier on OpenRouter, provider-pinned to Cerebras**. New
   prefix-based router: model id with mlx-community/ -> local MLX,
   model == narrative_model -> narrative remote, else -> classifier
   remote. Settings.classifier_provider_order populates extra_body for
   the classifier client only (FeatherlessClient now accepts
   default_extra_body to merge into every chat.completions.create).
   Llama-3.1-8B on Cerebras runs at ~423 tok/s, ~10x the default
   provider. narrative still routes to mistral-nemo:nitro (Friendli).

4) **Cap classify max_tokens at 512**. A misbehaving classifier
   (response_format=json_object ignored) could otherwise generate
   thousands of tokens of prose before classify's JSON validation
   trips the retry. 512 is generous; usual completions are 50-150.

CHAT_LLM_TIMING=1 env var enables per-call timing logs on stderr;
zero overhead when unset. Useful for finding the slow link.

Suite: 464 passed in 13s (was 290s).

2026-04-27 13:51:27 -04:00

__init__.py

feat: project skeleton with health endpoint

2026-04-26 11:23:38 -04:00

fixtures.py

test: structured CannedQueue fixture builder for classifier mocks (T116)

2026-04-27 07:03:20 -04:00

test_addressee.py

fix: AddresseeDecision.confidence as Literal[high|medium|low] (T77)

2026-04-26 21:40:47 -04:00

test_backfill_embeddings.py

feat: backfill_embeddings --re-embed-all flag for model swaps (T112.4)

2026-04-27 06:02:23 -04:00

test_backup.py

feat: nightly DB backups with 14-day retention

2026-04-26 14:18:57 -04:00

test_bot_authoring.py

feat: bot authoring form with bot_authored event

2026-04-26 12:17:06 -04:00

test_branches_state.py

feat: branching read-side filter — event readers consult active branch range (T113)

2026-04-27 06:25:22 -04:00

test_branching.py

feat: branching read-side filter — event readers consult active branch range (T113)

2026-04-27 06:25:22 -04:00

test_chat_list.py

feat: error banners and first-run navigation flow

2026-04-26 14:33:28 -04:00

test_chat_shell.py

feat: chat shell page rendering

2026-04-26 12:39:15 -04:00

test_classify.py

fix: classifier robustness — schema in prompt, retries, kickoff fallback

2026-04-26 15:03:13 -04:00

test_config.py

feat: generate_embedding routes non-default models through client.embed (T112.3)

2026-04-27 05:50:29 -04:00

test_cross_chat_search.py

feat: cross-chat search service (T93)

2026-04-27 02:31:31 -04:00

test_delete_impact.py

feat: delete-impact computation service (preview without mutation) (T95)

2026-04-27 02:36:30 -04:00

test_drawer_edits_extended.py

feat: drawer witness flag inline-edit (T72.3)

2026-04-26 17:28:25 -04:00

test_drawer_edits.py

feat: drawer edits with manual_edit event capture

2026-04-26 13:40:40 -04:00

test_drawer_events_threads_skip.py

fix: typed ChatNotFoundError replaces string-prefix sniff in skip routes (T81)

2026-04-26 21:55:53 -04:00

test_drawer_guest.py

feat: first-meeting gate on drawer Add-guest form (T72.2)

2026-04-26 17:26:31 -04:00

test_drawer_phase4.py

feat: drawer bulk significance re-rate per chat (T110.4)

2026-04-27 05:14:59 -04:00

test_drawer_render.py

feat: read-only drawer with scene, activity, edges, memories

2026-04-26 13:35:47 -04:00

test_edges.py

feat: directed edges with per-turn delta projector

2026-04-26 11:51:15 -04:00

test_embedding_worker.py

feat: embedding worker drains queue and emits embedding_indexed events (T97.1)

2026-04-27 02:51:36 -04:00

test_embeddings_state.py

feat: embeddings table + projector handlers (pure-Python cosine, T88)

2026-04-27 02:22:32 -04:00

test_embeddings.py

feat: generate_embedding routes non-default models through client.embed (T112.3)

2026-04-27 05:50:29 -04:00

test_entities.py

feat: bot and you entity schemas with projector handlers

2026-04-26 11:46:19 -04:00

test_error_ux.py

feat: error banners and first-run navigation flow

2026-04-26 14:33:28 -04:00

test_event_lifecycle.py

feat: event-lifecycle detection service (T52)

2026-04-26 20:09:13 -04:00

test_event_promotion.py

feat: event-completion promotion service (T56)

2026-04-26 20:15:51 -04:00

test_eventlog.py

feat: append-only event log with projector skeleton

2026-04-26 11:42:49 -04:00

test_events_state.py

feat: event_status_reverted event kind + projector handler (T114.2)

2026-04-27 06:39:03 -04:00

test_featherless.py

docs: clarify FeatherlessClient.embed() rationale (verified 500 + empty embedding catalog)

2026-04-27 11:39:53 -04:00

test_first_run.py

feat: error banners and first-run navigation flow

2026-04-26 14:33:28 -04:00

test_fixtures.py

test: structured CannedQueue fixture builder for classifier mocks (T116)

2026-04-27 07:03:20 -04:00

test_group_node.py

feat: group_node schema + projector handlers

2026-04-26 15:46:16 -04:00

test_guest_events.py

feat: guest_added / guest_removed event handlers

2026-04-26 15:46:09 -04:00

test_health.py

feat: project skeleton with health endpoint

2026-04-26 11:23:38 -04:00

test_interjection.py

feat: interjection classifier service

2026-04-26 15:51:29 -04:00

test_kickoff_confirm.py

feat: kickoff parse-and-confirm flow with chat creation

2026-04-26 12:28:05 -04:00

test_kickoff.py

fix: classifier robustness — schema in prompt, retries, kickoff fallback

2026-04-26 15:03:13 -04:00

test_llm_mock.py

feat: generate_embedding routes non-default models through client.embed (T112.3)

2026-04-27 05:50:29 -04:00

test_local_mlx_client.py

feat: split classifier + embeddings to local mlx-omni-server, narrative stays on Featherless

2026-04-27 12:05:41 -04:00

test_meanwhile_state.py

feat: meanwhile scene schema + state (T63)

2026-04-26 20:52:45 -04:00

test_meanwhile_turn_flow.py

test: meanwhile cancel route + JSON-build audit (T85)

2026-04-26 22:33:52 -04:00

test_memory_search.py

feat: combined FTS + vector retrieval ranking via RRF (T96)

2026-04-27 02:42:38 -04:00

test_memory_write.py

feat: 0014 schema — embeddings FK CASCADE (deferred or applied) + memories.event_id column (T109)

2026-04-27 05:00:57 -04:00

test_memory.py

feat: chats, chat_state, containers, scenes, activity tables

2026-04-26 12:03:26 -04:00

test_migrate.py

feat: sqlite migration runner with meta version table

2026-04-26 11:32:32 -04:00

test_multi_state_update.py

feat: multi-entity state-update coordinator

2026-04-26 15:51:58 -04:00

test_open_db_threading.py

refactor: open_db with check_same_thread parameter (T68)

2026-04-26 17:05:29 -04:00

test_per_pov_summary.py

test: T58 coverage gaps (truncation, update/close paths) (T80.5)

2026-04-26 21:50:55 -04:00

test_phase3_integration.py

fix: post_turn consumes pending meanwhile digests (T82.1)

2026-04-26 22:02:25 -04:00

test_phase4_integration.py

feat: cross-chat search deep-links to turn via memories.event_id (T111.2)

2026-04-27 05:42:17 -04:00

test_phase45_integration.py

test: phase 4.5 cross-feature integration coverage (T117)

2026-04-27 07:03:56 -04:00

test_prompt.py

feat: narrative format — third-person asterisk-action style with concrete-beat example

2026-04-27 12:21:03 -04:00

test_regenerate.py

feat: regenerate rolls back lifecycle transitions on supersede (T114.3)

2026-04-27 06:45:43 -04:00

test_relationship_seed.py

feat: relationship-seed service for first-co-appearance prompt

2026-04-26 15:47:12 -04:00

test_render.py

feat: frontend turn_html_replace SSE handler for regenerate live-swap (T86)

2026-04-26 22:41:35 -04:00

test_reset.py

fix: bot_reset purges orphaned 'you' activity rows (T69)

2026-04-26 17:06:21 -04:00

test_rewind.py

feat: rewind with impact preview, pre-rewind snapshot, undo toast

2026-04-26 13:58:20 -04:00

test_router_client.py

perf: 18s/turn -> 2.5s/turn (SQLite busy_timeout, parallel state pairs, OpenRouter Cerebras-pinned classifier)

2026-04-27 13:51:27 -04:00

test_scene_close.py

fix: classifier robustness — schema in prompt, retries, kickoff fallback

2026-04-26 15:03:13 -04:00

test_search_ux.py

feat: cross-chat search deep-links to turn via memories.event_id (T111.2)

2026-04-27 05:42:17 -04:00

test_settings.py

feat: settings page with you-entity authoring

2026-04-26 12:22:00 -04:00

test_significance.py

fix: classifier robustness — schema in prompt, retries, kickoff fallback

2026-04-26 15:03:13 -04:00

test_skip_narration.py

fix: plumb narrate_skip timeout_s through to client.generate (T76)

2026-04-26 21:40:29 -04:00

test_snapshot_ux.py

chore: snapshots.py polish — hoisted imports + strict kind + mtime doc (T105)

2026-04-27 04:47:14 -04:00

test_snapshot.py

feat: periodic snapshots with retention and cold-load fast-path

2026-04-26 14:15:17 -04:00

test_sse.py

feat: per-chat SSE channel and pub/sub

2026-04-26 12:49:41 -04:00

test_state_update.py

fix: classifier robustness — schema in prompt, retries, kickoff fallback

2026-04-26 15:03:13 -04:00

test_streaming_ux.py

feat: frontend turn_html_replace SSE handler for regenerate live-swap (T86)

2026-04-26 22:41:35 -04:00

test_synthesized_memories.py

feat: synthesized-memories service for jump skips (T54)

2026-04-26 20:10:05 -04:00

test_thread_detection.py

feat: thread-detection service (T55)

2026-04-26 20:10:36 -04:00

test_threads_state.py

feat: threads table + projector handlers (T51)

2026-04-26 20:05:09 -04:00

test_time_skip_handlers.py

feat: time_skip event handlers (T50)

2026-04-26 20:04:46 -04:00

test_turn_common.py

perf: read_recent_dialogue pushes chat-id filter into SQL (T90.1)

2026-04-27 02:23:15 -04:00

test_turn_flow.py

test: structured CannedQueue fixture builder for classifier mocks (T116)

2026-04-27 07:03:20 -04:00

test_turn_parse.py

fix: classifier robustness — schema in prompt, retries, kickoff fallback

2026-04-26 15:03:13 -04:00

test_vector_search.py

feat: pure-Python cosine vector search service (T92)

2026-04-27 02:31:06 -04:00

test_witness_filter_multi.py

test: witness filter coverage for multi-entity scenarios

2026-04-26 16:25:03 -04:00

test_world.py

feat: 0014 schema — embeddings FK CASCADE (deferred or applied) + memories.event_id column (T109)

2026-04-27 05:00:57 -04:00