Vector search for similar chunks with auto-embedding (PRIMARY)
POST/api/v1/vector/chunks
Primary vector search endpoint - Returns matching text chunks directly from Qdrant.
This is the FAST search method that returns chunks without fetching full entry content. Use this when you want quick results with snippet-level granularity.
Auto-Embedding Support:
- Provide
query(text) for automatic embedding generation - OR provide
query_vector(pre-computed embeddings) - If both provided,
query_vectortakes precedence
Flow:
- If
queryprovided: Generate embeddings using configured provider - Query vector is sent to Qdrant
- Matching chunks are returned with scores and metadata
- No database or S3 lookups (very fast!)
Performance:
- With
query_vector: < 100ms - With
query(auto-embed): 500ms-2s (includes embedding generation)
Use cases:
- Quick similarity search
- Preview/snippet display
- Finding relevant passages
- High-throughput scenarios
Request
Responses
- 200
- 422
Successful Response
Validation Error