Skip to main content

Vector search entries with full content and auto-embedding (SECONDARY)

POST /api/v1/vector/entries

Secondary vector search endpoint - Returns full entry objects with S3 content.

This is the COMPREHENSIVE search method that fetches complete entry content from S3. Use this when you need full documents, not just snippets.

Auto-Embedding Support:

Provide query (text) for automatic embedding generation
OR provide query_vector (pre-computed embeddings)
If both provided, query_vector takes precedence

Flow:

If query provided: Generate embeddings using configured provider
Query vector is sent to Qdrant
Matching chunks are grouped by entry_id
Entry records are fetched from PostgreSQL
Full content is retrieved from S3 (parallel batch operation)
Results combine entries with their matching chunks

Performance:

With query_vector: 200-500ms
With query (auto-embed): 700ms-2.5s (includes embedding generation)

Use cases:

Retrieving full documents
Document-level results
When you need complete content
Lower-throughput, higher-quality scenarios

Request

Responses

200
422

Successful Response