RAG (Retrieval-Augmented Generation)

Upload documents, generate embeddings, and query them semantically — all through client.rag.

Quick upload

The upload_file convenience method handles both the init call and the PUT to the signed URL in one step:

1 from meshapi import MeshAPI
2 
3 client = MeshAPI(base_url="https://api.meshapi.ai", token="rsk_...")
4 
5 with open("handbook.pdf", "rb") as f:
6     upload = client.rag.upload_file(
7         file_name="handbook.pdf",
8         mime_type="application/pdf",
9         content=f.read(),
10     )
11 
12 print("File ID:", upload.file_id)

Async

1 from meshapi import AsyncMeshAPI
2 
3 async with AsyncMeshAPI(base_url="https://api.meshapi.ai", token="rsk_...") as client:
4     with open("handbook.pdf", "rb") as f:
5         upload = await client.rag.upload_file(
6             file_name="handbook.pdf",
7             mime_type="application/pdf",
8             content=f.read(),
9         )

Two-step upload

Use init_upload when you want to control the PUT yourself:

1 from meshapi import InitUploadRequest
2 import httpx
3 
4 upload = client.rag.init_upload(
5     InitUploadRequest(file_name="handbook.pdf", mime_type="application/pdf", embed=False)
6 )
7 
8 # PUT bytes directly to the signed URL (no auth header needed)
9 httpx.put(upload.signed_url, content=pdf_bytes, headers={"Content-Type": "application/pdf"}).raise_for_status()

Trigger embedding

1 from meshapi import BulkEmbedRequest
2 
3 resp = client.rag.embed(BulkEmbedRequest(file_ids=[upload.file_id]))
4 for r in resp.results:
5     print(r.file_id, r.embedding_status)

Poll until ready

1 import time
2 
3 while True:
4     status = client.rag.get(upload.file_id)
5     if status.embedding_status == "ready":
6         break
7     if status.embedding_status == "failed":
8         raise RuntimeError(f"Embedding failed: {status.last_error_code}")
9     time.sleep(3)

Search

1 from meshapi import SearchRequest
2 
3 results = client.rag.search(
4     SearchRequest(
5         query="What is the refund policy?",
6         top_k=5,
7         file_ids=[upload.file_id],  # omit to search all files
8     )
9 )
10 
11 for r in results.results:
12     print(f"[{r.score:.3f}] {r.text}")

Search options

Field	Type	Notes
`query`	str	Plain-language question
`top_k`	int	Results to return (1–50, default 5)
`file_ids`	list[str]	Restrict to specific files
`filter`	dict	Match on metadata key-value pairs
`date_from`	int	Unix timestamp — only chunks created after
`date_to`	int	Unix timestamp — only chunks created before

List files

1 page = client.rag.list(limit=50)
2 print(f"{page.total} total files")
3 for f in page.files:
4     print(f.file_id, f.upload_status, f.embedding_status)

RAG chat

Combine search results with a chat completion:

1 from meshapi import ChatCompletionParams, ChatMessage
2 
3 results = client.rag.search(SearchRequest(query="What is the refund policy?", top_k=3))
4 context = "\n\n".join(r.text for r in results.results)
5 
6 reply = client.chat.completions.create(
7     ChatCompletionParams(
8         model="openai/gpt-4o-mini",
9         messages=[
10             ChatMessage(role="system", content=f"Answer using only the context below.\n\n{context}"),
11             ChatMessage(role="user",   content="What is the refund policy?"),
12         ],
13     )
14 )
15 print(reply.choices[0].message.content)