Vector search for similar chunks (PRIMARY)
POST/api/v1/vector/chunks
Primary vector search endpoint - Returns matching text chunks directly from Qdrant.
This is the FAST search method that returns chunks without fetching full entry content. Use this when you want quick results with snippet-level granularity.
Flow:
- Query vector is sent to Qdrant
- Matching chunks are returned with scores and metadata
- No database or S3 lookups (very fast!)
Performance: Typically < 100ms for most queries
Use cases:
- Quick similarity search
- Preview/snippet display
- Finding relevant passages
- High-throughput scenarios
Request
Responses
- 200
- 422
Successful Response
Validation Error