Vector search entries with full content and auto-embedding (SECONDARY)
POST/api/v1/vector/entries
Secondary vector search endpoint - Returns full entry objects with S3 content.
This is the COMPREHENSIVE search method that fetches complete entry content from S3. Use this when you need full documents, not just snippets.
Auto-Embedding Support:
- Provide
query(text) for automatic embedding generation - OR provide
query_vector(pre-computed embeddings) - If both provided,
query_vectortakes precedence
Flow:
- If
queryprovided: Generate embeddings using configured provider - Query vector is sent to Qdrant
- Matching chunks are grouped by entry_id
- Entry records are fetched from PostgreSQL
- Full content is retrieved from S3 (parallel batch operation)
- Results combine entries with their matching chunks
Performance:
- With
query_vector: 200-500ms - With
query(auto-embed): 700ms-2.5s (includes embedding generation)
Use cases:
- Retrieving full documents
- Document-level results
- When you need complete content
- Lower-throughput, higher-quality scenarios
Request
Responses
- 200
- 422
Successful Response
Validation Error