dohertj2/chat - chat - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Joseph Doherty	49be3cf4b9	fix: parse_turn falls back gracefully + classify logs flapping classifiers The turn endpoint was 500ing in multi-bot scenes whenever the classifier provider hiccuped on parse_turn — particularly visible after a guest was added and bots started exchanging turns. The traceback was 'classify failed for schema ParsedTurn with no default' because parse_turn was the only classify caller without a default. Two changes: - chat/services/turn_parse.py: parse_turn now passes a default that wraps the whole prose as one 'dialogue' segment. The narrative still fires on the prose; we lose finer-grained segment kinds (action vs dialogue vs ooc) on this turn, but the request returns cleanly. Updated the existing test that pinned the old RuntimeError contract. - chat/llm/classify.py: when retries are exhausted, log a WARNING with the schema name, last error type, and a snippet of the last raw text the model returned. Surfaces flapping classifiers in the uvicorn log for diagnosis without taking down the request. Suite: 471 passed in 11.7s.	2026-04-27 15:07:39 -04:00
Joseph Doherty	3a81e540a1	chore: audit project() callers and non-idempotent handlers Same defect class as `0f8bf94`: routes that ``append_event`` then ``project(conn)`` 500 once any prior event makes the full-log replay hit a raw-INSERT handler (chat_created, container_created, scene_opened, memory_written, meanwhile_scene_started, etc.). Fixes the two remaining live-path callers: - chat/web/bots.py (bot_create) — bot_authored - chat/web/settings.py (settings_post) — you_authored Both swap ``append_event`` + ``project`` → ``append_and_apply`` so only the new event is applied through its registered handler. Unused imports of ``append_event`` and ``project`` removed from each file. The rewind path (chat/services/rewind.py) intentionally calls ``project()`` after wiping every projected table — that's the canonical "rebuild from log against an empty DB" entry point and is left unchanged. Inventory of every projector handler that uses raw INSERT (chat_created, container_created, scene_opened, memory_written, meanwhile_scene_started, meanwhile_digest_created, edge_update) is documented with the trade-offs of why we don't blindly switch them to INSERT OR REPLACE — for autoincrement-id rows there is no key to match on, and for chat_created a lossy overwrite would silently clobber chat_state mutations from later events. The handler layer stays correctly non-idempotent under event-sourcing semantics; the rule is enforced at the call site. Adds a regression test (tests/test_chat_created_non_idempotent.py) that pins the contract: appending two chat_created events for the same id and then ``project()``ing a second time MUST raise ``IntegrityError`` on chats.id. Any future "make it idempotent" change must update the test, forcing a deliberate review. Suite: 471 passed in 11.82s (was 470 + this regression test). Report: docs/audits/2026-04-27-project-callers.md	2026-04-27 14:51:49 -04:00
Joseph Doherty	3b83786b8b	feat: cap narrative output at 2-3 beats via trim_to_max_beats post-processor Verbose roleplay-tuned narrators (Cydonia, Magnum, etc.) reliably ignore prompt-level beat-count instructions and ramble for 6-12 asterisk-action beats per turn — even with HARD CAP language and worked examples in the closing instruction. The fix is a deterministic post-stream trimmer: - New trim_to_max_beats(text, max_beats) in chat/services/prompt.py. Counts * characters in the streamed output (each beat = 2 asterisks: open + close), trims at the start of the (max_beats+1)th asterisk action, strips trailing whitespace. Idempotent and safe on under-cap input. - Wired into post_turn for both the primary stream (3-beat cap) and the optional interjection stream (2-beat cap — interjections are by definition shorter chime-ins). - Tightened the closing instruction: explicit "HARD CAP: 2-3 beats" with "After the third beat, STOP". Helps the well-behaved models self-cap; the post-processor catches the rest. - max_tokens: 250 -> 160 (lets the 3rd beat finish naturally before hitting the physical cap; trim_to_max_beats handles 4+ beat overflow). temperature: 0.85 -> 0.7 (Cydonia is more compliant with format instructions at slightly cooler sampling). - Test budgets bumped (closing grew ~15 tokens with the new wording). 6 new tests for trim_to_max_beats covering passthrough, exact-cap, 4-beat trim, 6-beat runaway, lower caps, zero cap. Verified live: 4-turn bench against chat_maya, every response is 2-3 beats consistently. Suite: 470 passed in 11.7s.	2026-04-27 14:19:21 -04:00
Joseph Doherty	de7f6624f0	perf: 18s/turn -> 2.5s/turn (SQLite busy_timeout, parallel state pairs, OpenRouter Cerebras-pinned classifier) Four changes that compound: 1) SQLite busy_timeout 5.0s -> 0.1s in chat/db/connection.py. Root cause of the bulk of the slowness. The embedding worker contends for the WAL write lock while the request handler holds an open transaction; conn.execute's busy-wait does NOT release the GIL, so every state_update LLM call after the narrative was silently freezing the asyncio event loop for ~5s. With 0.1s the worker fails fast and logs (already handled), the chat keeps moving, and any missed embedding can be backfilled out of band. Also takes the test suite from ~290s -> 13s as a bonus. 2) Parallel state-update pairs in multi_state_update.py. Each directed (src, tgt) pair becomes a coroutine in asyncio.gather instead of a sequential for-loop. Returned order is preserved. 3) Classifier on OpenRouter, provider-pinned to Cerebras. New prefix-based router: model id with mlx-community/ -> local MLX, model == narrative_model -> narrative remote, else -> classifier remote. Settings.classifier_provider_order populates extra_body for the classifier client only (FeatherlessClient now accepts default_extra_body to merge into every chat.completions.create). Llama-3.1-8B on Cerebras runs at ~423 tok/s, ~10x the default provider. narrative still routes to mistral-nemo:nitro (Friendli). 4) Cap classify max_tokens at 512. A misbehaving classifier (response_format=json_object ignored) could otherwise generate thousands of tokens of prose before classify's JSON validation trips the retry. 512 is generous; usual completions are 50-150. CHAT_LLM_TIMING=1 env var enables per-call timing logs on stderr; zero overhead when unset. Useful for finding the slow link. Suite: 464 passed in 13s (was 290s).	2026-04-27 13:51:27 -04:00
Joseph Doherty	d656ee8805	feat: narrative format — third-person asterisk-action style with concrete-beat example Rewrites the closing instruction in assemble_narrative_prompt to enforce the asterisk-action / interleaved-beat format: actions wrapped in asterisks in third person, dialogue as plain text between beats (no quote marks), 2-4 short concrete beats per response, no inner monologue or stage-direction adverbs. Includes a one-line worked example so the model has a concrete target. Was producing first-person prose blocks like 'I stare at you... "Well, that's direct," I murmur'. Target style is short interleaved beats: 'She turns with soapy hands to cup your face That's how I know it's real... She kisses you softly You love me when I'm messy...' Drops narrative_max_tokens 400 -> 250 so the model can't drift into multi-paragraph monologue. Bumps three test budgets to fit the larger closing (closing grew ~80 -> ~200 tokens; tests still exercise the same trim-order behavior, just with proportionally larger budgets).	2026-04-27 12:21:03 -04:00
Joseph Doherty	fe9c497038	feat: split classifier + embeddings to local mlx-omni-server, narrative stays on Featherless Adds RoutedLLMClient that dispatches by model name: requests matching Settings.narrative_model go to Featherless, everything else (classifier calls, embed) goes to a local MLX server. The local server is mlx-omni-server (separate venv at .mlx-venv) and exposes the standard OpenAI surface at http://127.0.0.1:10240/v1. LocalMLXClient mirrors FeatherlessClient (AsyncOpenAI under the hood) but with a working embed() — Featherless's /v1/embeddings always returns 500 with completions_error, so the router unconditionally sends embed traffic to the local backend. Production deployment overrides via data/config.toml: - classifier_model = mlx-community/Hermes-3-Llama-3.1-8B-8bit (~8 GB) - embedding_model = mlx-community/bge-small-en-v1.5-bf16 (~150 MB, 384 dim — matches existing schema, no migration) Defaults stay remote / pseudo so fresh installs and tests need no external infra. Smoke-tested live: classifier returns expected output, BGE produces correctly-clustering 384-dim vectors (cat-on-mat closer to cat-on-rug than to quantum-mechanics). scripts/start_mlx_server.sh starts the daemon (foreground or --daemon). .mlx-venv/ added to .gitignore. Suite: 464 passed (was 457 → +7 new across LocalMLXClient + Router).	2026-04-27 12:05:41 -04:00
Joseph Doherty	b3d78c1603	docs: clarify FeatherlessClient.embed() rationale (verified 500 + empty embedding catalog) Updates the docstring + test docstring for the NotImplementedError stub shipped in T112 (Phase 4.5). Original wording said Featherless 'does not expose /v1/embeddings'; verified the endpoint actually responds but always returns HTTP 500 with type='completions_error' for every model tried (text-embedding-3-small, BAAI/bge-small-en-v1.5, sentence-transformers/all-MiniLM-L6-v2, etc.) and /v1/models has no embedding-class entries. Stub behavior unchanged.	2026-04-27 11:39:53 -04:00
Joseph Doherty	5c4356e4e2	merge: T117 phase 4.5 cross-feature integration tests	2026-04-27 07:04:01 -04:00
Joseph Doherty	f71613786b	test: phase 4.5 cross-feature integration coverage (T117)	2026-04-27 07:03:56 -04:00
Joseph Doherty	4afaf01de7	test: structured CannedQueue fixture builder for classifier mocks (T116) Phase 4.5 carry-over from Phase 3. Tests across test_turn_flow.py, test_meanwhile_turn_flow.py, and the phase3/4 integration suites built positional canned-response arrays for MockLLMClient — adding a new classifier call to a code path required updating the array index in many places. This change ships tests/fixtures.py with a fluent CannedQueue builder that lets tests declare classifier expectations by name and call order instead of by index. Each method appends one item to an internal queue and returns self for chaining; build() emits the flat list[str] queue that MockLLMClient(canned=...) already consumes. The mock's contract is unchanged. Builder methods cover: parse_turn, detect_addressee, state_update (with zero_state alias), detect_interjection, detect_interjection_targeted, detect_scene_close, detect_event_transitions, summarize_scene_pov, detect_threads, meanwhile_digest, score_significance, and a narrative() helper for streaming bot beats. raw() is a passthrough escape hatch. Migration scope: ship the builder + 2 sanity tests + migrate 3 representative tests in test_turn_flow.py as proof of concept (test_single_bot_turn_no_guest_regression, test_turn_with_event_transition_appends_started_event, test_turn_with_no_active_events_skips_classifier). The remaining positional-array tests stay as-is; the builder docstring documents the migration template for Phase 5 work.	2026-04-27 07:03:20 -04:00
Joseph Doherty	80ce891bd8	feat: regenerate rolls back lifecycle transitions on supersede (T114.3) Closes the T83.4 gap: when ``regenerate_assistant_turn`` supersedes an assistant_turn that already produced lifecycle transitions, it now emits an ``event_status_reverted`` (T114.2) for each transition tagged with ``triggered_by_assistant_turn_id == original_assistant_event_id`` (T114.1 back-reference) before the regenerated narrative is reclassified. Mapping from forward kind to ``prior_status`` lives in ``_PRIOR_STATUS_MAP``: - event_started → planned - event_completed → active - event_cancelled → active (best-effort default; cancellation can fire from either planned or active, but detect_event_transitions only surfaces currently-active rows so 'active' is the realistic prior) Backward compatibility: lifecycle rows authored before T114.1 lack the back-reference field. Those are skipped (DEBUG log per row) and collected into a legacy WARNING that preserves the T83.4 observability contract — operators still see un-rolled-back transitions, just from older logs. The classify-and-emit pass below the rollback now operates against an events projection that has already been reverted, so re-firing ``event_started``/``event_completed``/``event_cancelled`` for the regenerated narrative is safe — no double-emit of promotion artifacts. Spec tests: - ``test_regenerate_rolls_back_event_started_from_superseded_turn`` - ``test_regenerate_rolls_back_event_completed_to_active`` (also exercises the multi-rollback loop: a turn that fired both a start and a completion gets two event_status_reverted rows in id order, with active as the final projection — matching the per-row replay semantics of the projector) - ``test_regenerate_skips_events_without_back_reference`` (pins the legacy compatibility path with both DEBUG and WARNING expectations)	2026-04-27 06:45:43 -04:00
Joseph Doherty	6d4ad86e33	feat: event_status_reverted event kind + projector handler (T114.2) Adds the inverse projection used by T114.3's regenerate rollback. The new ``event_status_reverted`` event kind carries ``{event_id, prior_status}`` and the handler unconditionally sets ``events.status = prior_status`` for the row. Unlike the forward transitions (event_started / event_completed / event_cancelled), this handler does NOT guard against terminal statuses — its entire purpose is to reverse a transition, including walking back from a terminal status to a non-terminal one. Without that, rolling back an event_completed (status='completed' is terminal for the forward handlers) would silently no-op and leave the row in the post-superseded state. The handler registers via the existing ``@on(kind)`` decorator pattern in chat/eventlog/projector.py, so future replays of an event_log that contains event_status_reverted rows pick it up automatically. Test exercises completed→active, active→planned, and cancelled→active round-trips.	2026-04-27 06:39:03 -04:00
Joseph Doherty	7370f68bdf	feat: lifecycle events carry triggered_by_assistant_turn_id back-reference (T114.1) Phase 3.5 T83.4 surfaced un-rolled-back lifecycle transitions on regenerate; T114 wires up the actual rollback. Step 1 is the back- reference: every event_started / event_completed / event_cancelled emitted by post_turn (chat/web/turns.py) and regenerate (chat/services/regenerate.py) now carries ``triggered_by_assistant_turn_id`` in its payload, set to the id of the assistant_turn event that produced the transition. Schema decision (Option A from the plan): no migration. The field is a payload convention only — older event_log rows lack it and rollback will skip them with a debug log when T114.3 lands. Forward-only. The post_turn lifecycle block already runs AFTER the assistant_turn event is appended (step 8a vs step 7), so ``primary_assistant_event_id`` is in scope. Same for regenerate: the lifecycle classification (step 9a) runs after step 6's append. No emission-order reorder was needed in either flow. Updates ``test_turn_with_event_transition_appends_started_event`` to assert the new field is present in the emitted event_started payload and points at the assistant_turn id.	2026-04-27 06:38:48 -04:00
Joseph Doherty	456f50d334	feat: branching read-side filter — event readers consult active branch range (T113) Wire the active branch's [origin_event_id, head_event_id] window into every user-facing event/memory reader so switching branches actually changes what dialogue and memories the user sees. Phase 4 T89/T94 shipped branches as metadata-only — this closes the loop. Helper: - chat/state/branches.py: add `active_branch_event_ids(conn)` returning the active branch's id range, with two defensive fall-throughs to `(0, BIG_INT)`: (a) no active branch row at all, and (b) the bootstrap "main" sentinel (name="main", origin=0, head=0). Production never bumps main's head_event_id today, so this preserves existing reader behaviour for every test that doesn't explicitly switch. Readers updated (all user-facing dialogue / retrieval surfaces): - chat/services/turn_common.py::read_recent_dialogue — chat-history prompt context + the chat-view template path (via web/turns.py + web/chat.py). - chat/services/scene_summarize.py::_read_recent_dialogue — scene-close per-POV summary input. - chat/state/memory.py::search_memories — FTS leg filters via m.event_id (T109's column); legacy NULL event_id rows are included unconditionally so the filter doesn't break pre-0014 retrieval. The fused (FTS + RRF + vector) path also drops vector hits whose event_id falls outside the branch window. - chat/web/meanwhile.py::_read_recent_meanwhile_dialogue — meanwhile prompt context. Projector queries (chat/state/world.py et al.) and admin/management surfaces (drawer hide-panel, cross-chat search, regenerate's row lookups by id) are intentionally NOT branch-filtered: projection must see the full log to build state correctly, and the admin surfaces operate across branches by design. Tests (10 new, 446 total): - tests/test_branches_state.py: 3 tests for `active_branch_event_ids` itself (bootstrap-main, no-active-branch, non-main literal range). - tests/test_branching.py: 7 cross-feature tests covering the spec's five required scenarios plus scene_summarize and meanwhile readers.	2026-04-27 06:25:22 -04:00
Joseph Doherty	9b7a6d459f	feat: backfill_embeddings --re-embed-all flag for model swaps (T112.4) Adds two new flags to the backfill script: * --re-embed-all walks every memory (not just those without an existing embeddings row) and re-emits embedding_indexed events. The projector is INSERT OR REPLACE, so re-emitting an event for an existing memory replaces the prior vector. Use this when swapping embedding models — the default mode still keeps the Phase 4 gap-fill behavior. * --model M overrides Settings.embedding_model for this run. The script also gains a small _build_client helper that returns None for the pseudo path (no client needed) and a FeatherlessClient otherwise; tests monkeypatch this to inject a Mock with canned embeddings. Adds tests/test_backfill_embeddings.py with three integration tests: re-embed-all walks every memory, default mode skips existing rows, and --model overrides the configured model end-to-end.	2026-04-27 06:02:23 -04:00
Joseph Doherty	e0a28abbcd	feat: generate_embedding routes non-default models through client.embed (T112.3) When model != DEFAULT_EMBEDDING_MODEL, generate_embedding now calls client.embed(text, model=model) and wraps the returned vector in an EmbeddingResult tagged with the requested model. On any exception (NotImplementedError from providers without an embeddings endpoint, transient network errors, etc.), the existing T107 warning fires and the function falls back to the zero-vector sentinel — callers detect model == 'fallback' and skip indexing. Adds: - MockLLMClient accepts a canned_embeddings queue mirroring the existing canned pattern. embed() pops from the front; empty queue raises IndexError so misconfigured tests fail loudly. - Settings.embedding_model defaults to "pseudo-sha256-384" so existing zero-config installs keep Phase 4 behavior. The app lifespan now passes this through to EmbeddingWorker.model. The public signature of generate_embedding is unchanged: (client, *, text, model=DEFAULT_EMBEDDING_MODEL, dim=..., timeout_s=...).	2026-04-27 05:50:29 -04:00
Joseph Doherty	ac6e74ab4c	feat: FeatherlessClient.embed() against /v1/embeddings (T112.2) Implements embed() on FeatherlessClient. Featherless's OpenAI- compatible surface does NOT expose /v1/embeddings at the time of writing, so this implementation raises NotImplementedError rather than issuing a request that would 404. The chat.services.embeddings.generate_embedding wrapper (T112.3) catches the exception and degrades to the zero-vector fallback path (plus the existing T107 warning) — misconfigured callers fail loudly in logs while the request path keeps working. If/when Featherless ships embeddings, swap the body for self._client.embeddings.create(model=..., input=...) guarded by the existing 2-conn semaphore (mirrors generate/stream). The Protocol seam in T112.1 is already wired so no other code needs to change. Adds tests/test_featherless.py pinning the NotImplementedError contract.	2026-04-27 05:48:34 -04:00
Joseph Doherty	9987da2c07	feat: cross-chat search deep-links to turn via memories.event_id (T111.2) Add ``m.event_id`` (T109's nullable column from migration 0014) to ``search_all_memories``'s SELECT, propagate it through the route's template context, and have ``search.html`` build result links as ``/chats/{chat_id}#turn-{event_id}`` — matching the ``id="turn-{event_id}"`` anchor that Phase 3.5 T86 stamps on each turn DOM node so the chat page scrolls to the originating turn on load. Memory rows projected before the 0014 migration ran read NULL ``event_id``; the template falls back to a chat-level link in that case so we never emit ``#turn-None``. Pre-existing tests that asserted on the bare ``href="/chats/{chat_id}"`` contract are updated to assert on the ``href="/chats/{chat_id}#turn-`` prefix to reflect the new deep-link.	2026-04-27 05:42:17 -04:00
Joseph Doherty	fa87ab8c55	feat: cross-chat search FTS snippet highlighting (T111.1) Replace the ``pov_summary`` column in ``search_all_memories``'s SELECT with ``snippet(memories_fts, 0, '<mark>', '</mark>', '…', 32)`` so each match in a result row is wrapped in ``<mark>`` for the search-results UI. The original ``pov_summary`` is still returned alongside as a non-highlighted fallback. Template renders ``r.snippet\|safe`` — the only HTML in the snippet output is the configured ``<mark>`` markers, so it is safe to bypass Jinja's auto-escape.	2026-04-27 05:30:32 -04:00
Joseph Doherty	2ab8fcbdf0	feat: drawer bulk significance re-rate per chat (T110.4) The drawer's Significance review panel previously only supported per-memory edits. Adds a bulk control: pick ``level_from`` and ``level_to``, and every memory in the chat at ``level_from`` is moved to ``level_to``. Implementation emits one ``manual_edit`` event per matching memory (not a single bulk event) so the §6.4 per-row audit trail stays intact — each affected memory carries its own ``prior_value -> new_value`` snapshot, so an inverse edit can restore an individual row without needing to inspect a bulk payload's member list. Reuses the existing ``memory_significance`` ``manual_edit`` projector branch (T25), so no state-layer changes are required. The route rejects no-op submissions (``level_from == level_to``) with 400 to avoid padding the event log with empty edits, and clamps both levels to 0..3 (matching ``edit_memory_significance``). UI: a small ``<details>`` block in the Significance review section with two number inputs and a submit button. Test: tests/test_drawer_phase4.py::test_bulk_significance_re_rate_emits_manual_edit_per_memory.	2026-04-27 05:14:59 -04:00
Joseph Doherty	5d5c888acf	refactor: drawer delete-impact modal extracted to Jinja partial (T110.3) The modal HTML was assembled via raw f-string concatenation in ``delete_preview``. Move it to a dedicated Jinja2 partial (``chat/templates/_delete_impact_modal.html``) and render via ``TEMPLATES.TemplateResponse``. Jinja2 autoescape now handles HTML safety automatically — the explicit ``html.escape()`` calls added in T110.2 (and the ``import html``) become redundant and are removed in this commit. Net behavioural change: attribute quoting style flips from single to double quotes (Jinja default) — the existing T98.4 substring-based assertions are unaffected, and the new T110.3 test pins the double-quoted shape so future regressions surface. Test: tests/test_drawer_phase4.py::test_delete_impact_modal_uses_jinja_partial.	2026-04-27 05:13:36 -04:00
Joseph Doherty	a45a33534f	fix: drawer delete-impact modal HTML escapes user-controllable fields (T110.2) The delete-impact modal is built via raw f-string concatenation from the ImpactReport — item.kind / item.description / report.notes ultimately embed user-controllable content (turn prose, scene timestamps). A turn with prose like "<script>alert(1)</script>" would reach the rendered HTML verbatim. Currently safe (the fields embedded today are bounded strings) but defense-in-depth — wrap with html.escape() so future description changes can't smuggle markup through. Test: tests/test_drawer_phase4.py::test_delete_impact_modal_escapes_user_controllable_strings.	2026-04-27 05:12:28 -04:00
Joseph Doherty	f3827706df	fix: drawer delete_turn guards event_id <= 0 (T110.1) A stale tab or hand-crafted request posting event_id=0 to the surgical delete route would compute after_event_id=-1 and silently truncate the entire log. Now rejected with 400. SQLite assigns event_log ids starting at 1, so any legitimate id is always >= 1 — non-positive values can only indicate a client bug. Test: tests/test_drawer_phase4.py::test_delete_turn_with_event_id_zero_returns_400.	2026-04-27 05:11:39 -04:00
Joseph Doherty	1f8b4d2078	feat: 0014 schema — embeddings FK CASCADE (deferred or applied) + memories.event_id column (T109)	2026-04-27 05:00:57 -04:00
Joseph Doherty	3f1a284acb	merge: T108 scene-close-on-cancel strengthen test + rationale	2026-04-27 04:48:00 -04:00
Joseph Doherty	87f93f00b5	merge: T107 embeddings.py fallback warning	2026-04-27 04:48:00 -04:00
Joseph Doherty	d1e2902655	merge: T106 search.py N+1 batching + k constant	2026-04-27 04:48:00 -04:00
Joseph Doherty	54dfa8d611	merge: T105 snapshots.py polish	2026-04-27 04:48:00 -04:00
Joseph Doherty	baffeb3a44	chore: scene-close-on-cancel — strengthen regression test + document rationale (T108) Investigation surfaced a transactional bug in the cancel path: when the primary stream raises asyncio.CancelledError mid-stream, post_turn re-raises at end-of-function, and open_db's dependency teardown skips conn.commit() — rolling back ALL post-cancel writes including the scene_closed event. The existing T74.3 regression test only passes because asyncio is not imported at module scope, so CancelledError becomes NameError (caught by except Exception, leaves cancelled=False). Documented in turns.py + test docstring; deferred for triage.	2026-04-27 04:47:26 -04:00
Joseph Doherty	29b7c90b29	chore: embeddings.py warns on fallback for non-default models (T107)	2026-04-27 04:47:17 -04:00
Joseph Doherty	64c9ca634a	chore: snapshots.py polish — hoisted imports + strict kind + mtime doc (T105)	2026-04-27 04:47:14 -04:00
Joseph Doherty	374a76c867	chore: branches polish — global-leak docs + unknown-name warning (T103)	2026-04-27 04:34:32 -04:00
Joseph Doherty	996a16cfb5	perf: search.py N+1 batching + k constant extraction (T106)	2026-04-27 04:34:18 -04:00
Joseph Doherty	228f9abb19	test: phase 4 cross-feature integration coverage (T101)	2026-04-27 04:08:25 -04:00
Joseph Doherty	3b4c7b9cef	merge: T100 cross-chat search UX (top-bar + results page)	2026-04-27 03:48:06 -04:00
Joseph Doherty	0a2c5924f9	feat: cross-chat search UX (top-bar + results page) (T100) Wires T93's `search_all_memories` service into a small read-only HTML surface so users can find a memory across every chat in the database. * `chat/web/search.py` (new): GET `/search?q=...` runs the FTS service with k=50, hydrates each row with bot name + scene timestamp, and renders `search.html`. Empty `q` short-circuits to no results so the top-bar form can submit even with an empty input. * `chat/templates/search.html` (new): empty-state placeholder, results list with chat-level "Open chat" links (`/chats/{chat_id}` — memories don't carry an event_id today, so no per-turn anchor). * `chat/templates/layout.html`: append a small `<form>` to the rail nav, additive only. * `chat/app.py`: register `search_router` (additive import + include). * `tests/test_search_ux.py`: 3 tests — multi-chat results, empty-query placeholder, chat link.	2026-04-27 03:46:52 -04:00
Joseph Doherty	a5f0e69d44	feat: snapshot UX (manual trigger + list + restore + preview) (T99)	2026-04-27 03:46:49 -04:00
Joseph Doherty	4546bc0d9c	feat: drawer remaining v1 field edits (T98.5) Audit of chat/state/manual_edit.py target_kind dispatch found two §6.4 fields without drawer affordances despite being already-projected text columns: chat_state.narrative_anchor and chat_state.weather. Both land via new manual_edit branches (target_kind chat_narrative_anchor and chat_weather) plus paired drawer routes and Scene-section text inputs. The container properties_json blob is intentionally deferred — bounded JSON edits aren't wired through manual_edit and the drawer never surfaces multiple containers at once, so v1 leaves it out.	2026-04-27 03:35:54 -04:00
Joseph Doherty	c4fa11fe78	feat: drawer surgical delete with cascade preview (T98.4)	2026-04-27 03:29:07 -04:00
Joseph Doherty	461d441078	feat: drawer hide-from-view toggle + turn_hidden manual_edit branch (T98.3)	2026-04-27 03:27:59 -04:00
Joseph Doherty	b25007eb44	feat: drawer significance review panel (T98.2)	2026-04-27 03:25:40 -04:00
Joseph Doherty	d39d31479d	feat: drawer branching UI (T98.1)	2026-04-27 03:24:02 -04:00
Joseph Doherty	177e39d59c	feat: wire embedding worker call sites in turns/meanwhile/skip/regenerate (T97.5)	2026-04-27 03:08:36 -04:00
Joseph Doherty	64a07aa87f	feat: memory_write enqueues embedding job after each memory_written (T97.2)	2026-04-27 02:51:40 -04:00
Joseph Doherty	6674f9475c	feat: embedding worker drains queue and emits embedding_indexed events (T97.1)	2026-04-27 02:51:36 -04:00
Joseph Doherty	b8b4aed6d9	feat: combined FTS + vector retrieval ranking via RRF (T96)	2026-04-27 02:42:38 -04:00
Joseph Doherty	5ff107574c	merge: T95 delete-impact computation service	2026-04-27 02:37:28 -04:00
Joseph Doherty	28e13d416f	feat: delete-impact computation service (preview without mutation) (T95)	2026-04-27 02:36:30 -04:00
Joseph Doherty	296e8fdddd	feat: branching service (branch_from_event + switch + metadata) (T94)	2026-04-27 02:35:58 -04:00
Joseph Doherty	013b563f21	merge: T93 cross-chat search service	2026-04-27 02:32:53 -04:00

1 2 3 4

186 Commits