dohertj2/chat - chat - Gitea: Git with a cup of tea

dohertj2/chat

Fork 0

Commit Graph

Author	SHA1	Message	Date
Joseph Doherty	e0a28abbcd	feat: generate_embedding routes non-default models through client.embed (T112.3) When model != DEFAULT_EMBEDDING_MODEL, generate_embedding now calls client.embed(text, model=model) and wraps the returned vector in an EmbeddingResult tagged with the requested model. On any exception (NotImplementedError from providers without an embeddings endpoint, transient network errors, etc.), the existing T107 warning fires and the function falls back to the zero-vector sentinel — callers detect model == 'fallback' and skip indexing. Adds: - MockLLMClient accepts a canned_embeddings queue mirroring the existing canned pattern. embed() pops from the front; empty queue raises IndexError so misconfigured tests fail loudly. - Settings.embedding_model defaults to "pseudo-sha256-384" so existing zero-config installs keep Phase 4 behavior. The app lifespan now passes this through to EmbeddingWorker.model. The public signature of generate_embedding is unchanged: (client, *, text, model=DEFAULT_EMBEDDING_MODEL, dim=..., timeout_s=...).	2026-04-27 05:50:29 -04:00
Joseph Doherty	29b7c90b29	chore: embeddings.py warns on fallback for non-default models (T107)	2026-04-27 04:47:17 -04:00
Joseph Doherty	caa17b4174	feat: embedding generation service (Phase 4 pseudo-embedding) (T91)	2026-04-27 02:31:07 -04:00

Author

SHA1

Message

Date

Joseph Doherty

e0a28abbcd

feat: generate_embedding routes non-default models through client.embed (T112.3)

When model != DEFAULT_EMBEDDING_MODEL, generate_embedding now
calls client.embed(text, model=model) and wraps the returned
vector in an EmbeddingResult tagged with the requested model.
On any exception (NotImplementedError from providers without an
embeddings endpoint, transient network errors, etc.), the existing
T107 warning fires and the function falls back to the zero-vector
sentinel — callers detect model == 'fallback' and skip indexing.

Adds:
- MockLLMClient accepts a canned_embeddings queue mirroring
  the existing canned pattern. embed() pops from the front;
  empty queue raises IndexError so misconfigured tests fail
  loudly.
- Settings.embedding_model defaults to "pseudo-sha256-384"
  so existing zero-config installs keep Phase 4 behavior. The app
  lifespan now passes this through to EmbeddingWorker.model.

The public signature of generate_embedding is unchanged:
(client, *, text, model=DEFAULT_EMBEDDING_MODEL, dim=..., timeout_s=...).

2026-04-27 05:50:29 -04:00

Joseph Doherty

29b7c90b29

chore: embeddings.py warns on fallback for non-default models (T107)

2026-04-27 04:47:17 -04:00

Joseph Doherty

caa17b4174

feat: embedding generation service (Phase 4 pseudo-embedding) (T91)

2026-04-27 02:31:07 -04:00

3 Commits