Skip to main content

Vector search for similar chunks (PRIMARY)

POST 

/api/v1/vector/chunks

Primary vector search endpoint - Returns matching text chunks directly from Qdrant.

This is the FAST search method that returns chunks without fetching full entry content. Use this when you want quick results with snippet-level granularity.

Flow:

  1. Query vector is sent to Qdrant
  2. Matching chunks are returned with scores and metadata
  3. No database or S3 lookups (very fast!)

Performance: Typically < 100ms for most queries

Use cases:

  • Quick similarity search
  • Preview/snippet display
  • Finding relevant passages
  • High-throughput scenarios

Request

Responses

Successful Response