Knowledge

Overview

The Knowledge module provides unified semantic search across all knowledge sources in a project: documents and memory entries. A single endpoint searches these sources simultaneously, ranks results by vector similarity, and returns an interleaved list in which each result is tagged with its source type.

Each result carries a source_type discriminant ("document" or "memory") so callers know where each piece of knowledge came from.

See the Permissions Reference for the IAM action strings for this module.

Data Model

KnowledgeResult

A KnowledgeResult is a discriminated union on source_type. All results share common fields; source-specific fields are only present for the matching type.

Common fields (all source types)

| Field | Type | Description |
| --- | --- | --- |
| `source_type` | `"document"` \| `"memory"` | Discriminant for the knowledge source type |
| `content` | `string\|null` | Text content of the result |
| `score` | `number` | Relevance score (0–1); only present when `query` is used |
| `created_at` | `string` | ISO 8601 creation timestamp |
| `updated_at` | `string` | ISO 8601 last-updated timestamp |

Document result (`source_type: "document"`)

| Field | Type | Description |
| --- | --- | --- |
| `document_id` | `string` | Public document ID (`doc_` prefix) |
| `file_id` | `string` | ID of the underlying File record |
| `project_id` | `string` | ID of the owning project |
| `path` | `string\|null` | Logical path within the project (e.g. `/reports/q1.txt`) |
| `filename` | `string` | Original filename |
| `size` | `number` | File size in bytes |
| `title` | `string\|null` | Document title (if set) |
| `metadata` | `object\|null` | Arbitrary JSON metadata |
| `tags` | `object` | Key-value tags associated with the document |

Memory result (`source_type: "memory"`)

| Field | Type | Description |
| --- | --- | --- |
| `entry_id` | `string` | Public memory entry ID (`me_` prefix) |
| `memory_id` | `string` | Public ID of the parent memory (`mem_` prefix) |
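
The discriminated union described above could be modelled in TypeScript roughly as follows. This is a minimal sketch derived from the field tables, not the SDK's published types: the interface names and the `describeResult` helper are hypothetical, and `score` is marked optional because it is only present when `query` is used.

```typescript
// Fields shared by every result, per the "Common fields" table.
interface CommonFields {
  content: string | null;
  score?: number; // only present when `query` is used
  created_at: string; // ISO 8601
  updated_at: string; // ISO 8601
}

interface DocumentResult extends CommonFields {
  source_type: "document";
  document_id: string;
  file_id: string;
  project_id: string;
  path: string | null;
  filename: string;
  size: number;
  title: string | null;
  metadata: Record<string, unknown> | null;
  tags: Record<string, unknown>;
}

interface MemoryResult extends CommonFields {
  source_type: "memory";
  entry_id: string;
  memory_id: string;
}

type KnowledgeResult = DocumentResult | MemoryResult;

// Hypothetical helper: checking `source_type` narrows the union, so the
// source-specific fields are only accessible in the matching branch.
function describeResult(result: KnowledgeResult): string {
  switch (result.source_type) {
    case "document":
      return `document ${result.document_id} (${result.path ?? result.filename})`;
    case "memory":
      return `memory entry ${result.entry_id} in ${result.memory_id}`;
  }
}
```

Switching on `source_type` lets the compiler reject, for example, an attempt to read `entry_id` on a document result.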

Key Concepts

Search Modes

The POST /knowledge/search endpoint accepts the following filters. At least one must be provided.

| Parameter | Type | Description |
| --- | --- | --- |
| `query` | `string` | Semantic search query; ranks results by vector similarity |
| `memory_ids` | `string[]` | Search entries within these specific memories |
| `memory_tags` | `string[]` | Search entries in memories whose tags match any of these patterns (supports glob: `user*`) |
| `document_paths` | `string[]` | Filter document results to paths starting with these prefixes |
| `document_ids` | `string[]` | Filter document results to specific document IDs |
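
A client can check the "at least one filter" rule before sending a request. The sketch below is illustrative only: `KnowledgeSearchRequest` and `hasSearchFilter` are hypothetical names, and treating an empty string or an empty array as "not provided" is an assumption rather than documented server behavior.

```typescript
// Request shape assembled from the parameters named in this section.
interface KnowledgeSearchRequest {
  project_id?: string;
  query?: string;
  memory_ids?: string[];
  memory_tags?: string[];
  document_paths?: string[];
  document_ids?: string[];
  min_score?: number;
  limit?: number;
}

// Hypothetical client-side check: true when at least one of the five
// filters is set. Assumes empty strings and empty arrays do not count.
function hasSearchFilter(req: KnowledgeSearchRequest): boolean {
  const hasQuery = typeof req.query === "string" && req.query.trim().length > 0;
  const lists = [req.memory_ids, req.memory_tags, req.document_paths, req.document_ids];
  return hasQuery || lists.some((list) => Array.isArray(list) && list.length > 0);
}
```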

When query is set, results include a score field and are ordered by descending relevance. min_score and limit further restrict the result set: a minimum relevance score and a cap on the number of results returned, respectively.
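
The ordering and trimming described above can be illustrated with a small client-side function. This is a sketch of the idea, not the server's implementation: `rankByScore` is a hypothetical helper, it assumes every input carries a `score` (as results of a `query` search do), and whether `min_score` is inclusive is an assumption.

```typescript
// Hypothetical helper: keep results at or above `minScore`, order them by
// descending score, and return at most `limit` of them.
function rankByScore<T extends { score: number }>(
  results: T[],
  minScore = 0,
  limit = results.length,
): T[] {
  return results
    .filter((r) => r.score >= minScore) // filter() copies, so the input is not mutated
    .sort((a, b) => b.score - a.score)
    .slice(0, limit);
}
```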

memory_ids and memory_tags can be combined: the search includes entries from memories matching either one (union semantics).
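
The union semantics can be sketched as follows. The `Memory` shape, `globToRegExp`, and `selectMemories` are hypothetical; the sketch assumes memory tags are plain strings and that `*` (matching any run of characters) is the only glob wildcard, neither of which is stated here.

```typescript
// Hypothetical minimal memory shape for illustration.
interface Memory {
  id: string;
  tags: string[];
}

// Convert a glob pattern to an anchored RegExp; only `*` is treated as a wildcard.
function globToRegExp(pattern: string): RegExp {
  const escaped = pattern
    .split("*")
    .map((part) => part.replace(/[.*+?^${}()|[\]\\]/g, "\\$&"))
    .join(".*");
  return new RegExp(`^${escaped}$`);
}

// A memory is selected if its ID is listed OR any of its tags matches any pattern.
function selectMemories(
  memories: Memory[],
  memoryIds: string[] = [],
  memoryTags: string[] = [],
): Memory[] {
  const ids = new Set(memoryIds);
  const patterns = memoryTags.map(globToRegExp);
  return memories.filter(
    (m) => ids.has(m.id) || m.tags.some((tag) => patterns.some((p) => p.test(tag))),
  );
}
```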

If neither memory_ids nor memory_tags is provided, memory entries are excluded and only documents are searched. Similarly, if neither document_paths nor document_ids is provided, only memories are searched, except when query is the only parameter given. This lets callers control exactly which sources to include.

Project Scoping

project_id is optional. When omitted, the server resolves accessible projects from the caller's identity (API key project scope, admin wildcard, or explicit project memberships).
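
In practice, omitting project_id means leaving the key out of the request body. A minimal sketch, in which `SearchBody` and `buildSearchBody` are hypothetical names:

```typescript
interface SearchBody {
  query: string;
  project_id?: string;
}

// Hypothetical helper: only include `project_id` when the caller supplies one,
// so the server can fall back to resolving accessible projects itself.
function buildSearchBody(query: string, projectId?: string): SearchBody {
  const body: SearchBody = { query };
  if (projectId !== undefined) {
    body.project_id = projectId;
  }
  return body;
}
```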

Configuration

| Environment Variable | Required | Description |
| --- | --- | --- |
| `FILES_STORAGE_DIR` | Yes | Directory where `.txt` files are stored (shared with Files) |
| `EMBEDDING_PROVIDER` | Yes | Embedding backend; only `ollama` is supported |
| `EMBEDDING_MODEL` | Yes | Model name, e.g. `qwen3-embedding:0.6b` |
| `EMBEDDING_DIMENSIONS` | Yes | Vector dimensions; must match the model output, e.g. `1024` |
| `OLLAMA_BASE_URL` | No | Ollama server URL, defaults to `http://localhost:11434` |
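
The table above can be read as a set of startup checks. The following sketch shows one way to validate these variables. It is not the module's actual loader: `KnowledgeConfig`, `requireVar`, and `loadKnowledgeConfig` are hypothetical names, and requiring `EMBEDDING_DIMENSIONS` to be a positive integer is an assumption.

```typescript
interface KnowledgeConfig {
  filesStorageDir: string;
  embeddingProvider: "ollama";
  embeddingModel: string;
  embeddingDimensions: number;
  ollamaBaseUrl: string;
}

type Env = Record<string, string | undefined>;

// Return the variable's value, or throw if it is missing or empty.
function requireVar(env: Env, name: string): string {
  const value = env[name];
  if (value === undefined || value === "") {
    throw new Error(`Missing required environment variable: ${name}`);
  }
  return value;
}

// Hypothetical loader: validates required variables and applies the
// documented default for OLLAMA_BASE_URL.
function loadKnowledgeConfig(env: Env): KnowledgeConfig {
  const provider = requireVar(env, "EMBEDDING_PROVIDER");
  if (provider !== "ollama") {
    throw new Error(`Unsupported EMBEDDING_PROVIDER: ${provider}`);
  }
  const dimensions = Number(requireVar(env, "EMBEDDING_DIMENSIONS"));
  if (!Number.isInteger(dimensions) || dimensions <= 0) {
    throw new Error("EMBEDDING_DIMENSIONS must be a positive integer");
  }
  return {
    filesStorageDir: requireVar(env, "FILES_STORAGE_DIR"),
    embeddingProvider: provider,
    embeddingModel: requireVar(env, "EMBEDDING_MODEL"),
    embeddingDimensions: dimensions,
    ollamaBaseUrl: env["OLLAMA_BASE_URL"] ?? "http://localhost:11434",
  };
}
```

In a Node.js process this would typically be called once at startup as `loadKnowledgeConfig(process.env)`, so that a misconfigured deployment fails early rather than on the first search.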

Examples

Semantic search across documents and memories

CLI

soat search-knowledge \
  --project-id proj_ABC \
  --query "quarterly revenue" \
  --memory-ids mem_xyz \
  --limit 5

SDK

import { SoatClient } from '@soat/sdk';

const soat = new SoatClient({
  baseUrl: 'https://api.example.com',
  token: 'sk_...',
});

const { data, error } = await soat.knowledge.searchKnowledge({
  body: {
    project_id: 'proj_ABC',
    query: 'quarterly revenue',
    memory_ids: ['mem_xyz'],
    limit: 5,
  },
});
if (error) throw new Error(JSON.stringify(error));
console.log(data.results);

curl

curl -X POST https://api.example.com/api/v1/knowledge/search \
  -H "Authorization: Bearer <token>" \
  -H "Content-Type: application/json" \
  -d '{
    "project_id": "proj_ABC",
    "query": "quarterly revenue",
    "memory_ids": ["mem_xyz"],
    "limit": 5
  }'

Memory-only search by tag

CLI

soat search-knowledge \
  --project-id proj_ABC \
  --query "customer communication" \
  --memory-tags "customer*"

curl

curl -X POST https://api.example.com/api/v1/knowledge/search \
  -H "Authorization: Bearer <token>" \
  -H "Content-Type: application/json" \
  -d '{
    "project_id": "proj_ABC",
    "query": "customer communication",
    "memory_tags": ["customer*"]
  }'

Document-scoped retrieval

CLI

soat search-knowledge \
  --project-id proj_ABC \
  --query "quarterly revenue" \
  --limit 5

SDK

import { SoatClient } from '@soat/sdk';

const soat = new SoatClient({
  baseUrl: 'https://api.example.com',
  token: 'sk_...',
});

const { data, error } = await soat.knowledge.searchKnowledge({
  body: { project_id: 'proj_ABC', query: 'quarterly revenue', limit: 5 },
});
if (error) throw new Error(JSON.stringify(error));
console.log(data.results);

curl

curl -X POST https://api.example.com/api/v1/knowledge/search \
  -H "Authorization: Bearer <token>" \
  -H "Content-Type: application/json" \
  -d '{
    "project_id": "proj_ABC",
    "query": "quarterly revenue",
    "limit": 5
  }'

Path-scoped document retrieval

CLI

soat search-knowledge \
  --project-id proj_ABC \
  --document-paths /docs/products/

curl

curl -X POST https://api.example.com/api/v1/knowledge/search \
  -H "Authorization: Bearer <token>" \
  -H "Content-Type: application/json" \
  -d '{
    "project_id": "proj_ABC",
    "document_paths": ["/docs/products/"]
  }'
