docs(rag): note bluge full-text index (built via KWONDoc source)

Corpus indexed into KWONDoc's bluge index (~/.kwondoc/search-index,
category five-rag) so bluge_search surfaces it; README documents the
re-index command (cmd/ragindex online upsert, doesn't wipe other docs).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
This commit is contained in:
CharlesKWON
2026-06-15 16:35:00 +09:00
parent 59d7e490b4
commit b8a19bd350

View File

@@ -32,8 +32,12 @@ Every file has YAML frontmatter (`doc`, `title`, `keywords`, `summary`) for rank
- **Keyword retrieval (built-in):** run `./search.sh <terms>` — a dependency-free
ripgrep/grep ranker over the corpus (frontmatter `keywords` weighted ×3 + body),
printing ranked docs with the matching `##` section headers. No index to build.
e.g. `./search.sh session token csprng``06-security.md §2`. `INDEX.md` is the
hand-curated routing table; a bluge/embeddings index can ingest the same `.md` files.
e.g. `./search.sh session token csprng``06-security.md §2`.
- **bluge full-text index (KWONDoc):** this corpus is indexed into KWONDoc's bluge
index (`~/.kwondoc/search-index`, category `five-rag`) so `bluge_search` finds it.
Re-index after edits: `cd ~/kwondoc && go run ./cmd/ragindex <abs path to rag> five-rag`
(online upsert — keyed by file path, does not wipe other docs).
- `INDEX.md` is the hand-curated routing table; an embeddings index can ingest the same `.md`.
- **Embedding RAG:** chunk by `##` headers (each section is self-contained). Frontmatter
`summary` makes a good chunk preamble.