AI Agent Integration
The Alien Intelligence platform exposes your document collections to AI assistants — Claude, GPT, custom agents — through the Model Context Protocol (MCP). AI agents can search, read, and analyze your documents using the same data that powers the platform's search features, without any custom integration code.
What Is MCP?
MCP is an open protocol that lets AI assistants call structured tools provided by external servers. Instead of the AI guessing or hallucinating, it can call a tool like datacluster_vector_search_chunks to search your actual data and get real results.
The platform hosts MCP servers that wrap your data cluster's search and retrieval capabilities as tools that AI agents can invoke directly.
Available MCP Servers
| MCP Server | Tools | Data Source | Description |
|---|---|---|---|
| mcp-datacluster | 7 | Your data clusters | Search and read your documents — the primary integration point |
We can build customized MCP servers for your specific data sources, similar to what we have done for research intelligence and cultural heritage organizations. Contact us at support@alien.club to discuss your needs.
mcp-datacluster Tools
This is the MCP server deployed on your data cluster. It provides tools for the complete document access workflow:
| Tool | What It Does |
|---|---|
datacluster_list_datasets | List available datasets with metadata |
datacluster_get_dataset | Get full details of a specific dataset including schema |
datacluster_keyword_search | Full-text keyword search with typo tolerance and filters |
datacluster_vector_search_chunks | Semantic search — find chunks by meaning similarity |
datacluster_get_entry_content | Read the processed content of a document (paginated) |
datacluster_get_entry_documents | List files attached to an entry |
datacluster_get_entry_file | Download a specific file (including extracted figures as viewable images) |
An AI agent can use these tools to autonomously navigate your document collection: discover datasets, search for relevant documents, read their content, and examine their figures.
Connecting an AI Agent
Authentication
MCP servers use OAuth for authentication. When an AI agent connects to an MCP server, the following flow occurs:
The agent inherits the authenticated user's permissions — it can only access data that the user is authorized to see.
Enterprise Authentication Bypass
For programmatic or server-to-server MCP access, you can bypass the interactive OAuth flow by injecting an API token directly in the request headers. This is useful for automated pipelines, backend integrations, and scenarios where interactive login is not possible.
The MCP server supports two token formats:
- Platform API tokens (
oat_*format) — Validated against the platform backend via the/users/meendpoint. The server extracts user identity, organization context, and token abilities from the backend response. - JWT tokens — Validated via OIDC introspection against the identity provider. Used when your application authenticates directly with the identity provider.
Configuration with API Token
To connect an AI tool (such as Claude Desktop or Claude Code) using an API token, configure the MCP server with an Authorization header:
{
"mcpServers": {
"my-data-cluster": {
"type": "http",
"url": "https://YOUR_MCP_SERVER_URL/mcp",
"headers": {
"Authorization": "Bearer oat_YOUR_API_TOKEN"
}
}
}
}
This configuration goes in:
- Claude Desktop:
claude_desktop_config.json(Settings > MCP Servers) - Claude Code:
.mcp.jsonin your project root
The MCP server URL is available from the MCP section of the platform dashboard after deployment. See Deploy MCP for details on deploying an MCP server and finding the connection URL.
When using API token authentication, the token's abilities determine what the agent can do. Ensure the token has at least CLUSTER_READ and DATASET_READ abilities for search operations.
Connecting Claude Desktop
To connect Claude Desktop to your document collection:
- Deploy an MCP server on your data cluster if you have not already.
- Navigate to MCP in the left sidebar of the platform dashboard.
- Find your cluster and expand its MCP connection details.
- Copy the
claude_desktop_config.jsonsnippet shown in the connection card. - In Claude Desktop, go to Settings > MCP Servers and paste the configuration.
- Claude will either use the embedded API token (if present) or prompt you to authenticate via OAuth.
Once connected, you can ask Claude to search and analyze your documents directly:
"Search my research papers for studies about CRISPR gene editing in agricultural applications and summarize the key findings."
Claude will use the MCP tools to search your data cluster, read the relevant documents, and provide a grounded answer based on your actual data.
Connecting Other AI Agents
Any AI agent or application that supports the MCP protocol can connect to the platform's MCP servers. The connection requires:
- Server URL — Available from the MCP section of the platform dashboard after deploying an MCP server.
- Authentication — OAuth flow for interactive use, or API token header for programmatic access.
The MCP servers expose a standard /.well-known/oauth-authorization-server endpoint for OAuth discovery and a /health endpoint for connectivity verification.
Per-Tool Access Control
Each MCP tool declares the permissions it requires. For example, datacluster_keyword_search requires both dataset:read and entry:read abilities. The platform enforces these checks on every tool invocation — an agent cannot access data that the authenticated user does not have permission to see.
Organization-level roles determine what an agent can do:
| Role | Can Use |
|---|---|
| Viewer | All read-only tools (list, search, get content) |
| Writer | Read-only tools + tools that modify data (if available) |
| Owner | All tools |
Example: Research Workflow
Here is how an AI agent might use MCP tools to answer a research question:
1. datacluster_list_datasets
→ Discovers "biomedical-papers" dataset (5,000 entries)
2. datacluster_vector_search_chunks
query: "CRISPR applications in crop disease resistance"
→ Returns 10 relevant chunks with similarity scores
3. datacluster_get_entry_content
entry_id: 4521, page: 1
→ Reads the full processed content of the top result
4. datacluster_get_entry_documents
entry_id: 4521
→ Lists files: original PDF, 3 extracted figures
5. datacluster_get_entry_file
entry_id: 4521, file_key: "processed/figures/figure_2.png"
→ Returns the figure image for the AI to analyze
The agent performs all of these steps autonomously, following the data to build a comprehensive answer.
Managing MCP Servers
The MCP section in the left sidebar shows all MCP servers available to your organization. From this page you can:
- Deploy or undeploy MCP servers on your clusters.
- View the connection URL and configuration snippet for each server.
- Check server health status.
- See which tools each server provides.
For step-by-step deployment instructions, see Deploy MCP.
Next Steps
- Deploy MCP — Deploy an MCP server on your data cluster
- Search and Query — The search capabilities that MCP tools wrap
- Manage Your Organization — Roles and API token management
- Security Model — Authentication and authorization details