Supabase | The Open Source Firebase Alternative

Open-source backend for RAG pipelines - Feedback and Collaboration

Integrations · Edge Functions · Auth · Storage · FastAPI · Python · React · JavaScript

I kept rebuilding the same RAG pipeline for different projects (chunking -> embeddings -> retrieval -> prompt injection), so I tried to turn it into a reusable backend instead.

Ended up building IntelliChat — an open-source, async FastAPI backend for spinning up RAG systems without wiring everything from scratch.

I structured it like a SaaS platform mainly to explore multi-tenant architecture (per-chatbot vector isolation, API key encryption, etc.). I'm curious whether this design is actually useful for collaborative chatbot development.

Core ideas:

define a chatbot - upload LLM + embedding model API keys
upload docs
build the system prompt with an AI assistant
it handles indexing, retrieval, and prompt injection
you just call an API
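To make the flow above concrete, here is a minimal, dependency-free sketch of chunking -> embeddings -> retrieval -> prompt building. It is not IntelliChat's code: the function names are hypothetical, and a toy bag-of-words vector stands in for a real embedding model and for Qdrant.

```python
import math
import re
from collections import Counter


def chunk_text(text: str, size: int = 8, overlap: int = 2) -> list[str]:
    """Split text into windows of `size` words that overlap by `overlap` words."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than size")
    words = text.split()
    chunks = []
    for start in range(0, len(words), size - overlap):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last window already reached the end of the text
    return chunks


def embed(text: str) -> Counter:
    """Toy embedding (assumption): lowercase word counts instead of a real model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query: str, chunks: list[str], k: int = 2) -> list[tuple[float, str]]:
    """Return the k chunks most similar to the query, best first."""
    query_vec = embed(query)
    scored = [(cosine(query_vec, embed(c)), c) for c in chunks]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]


def build_prompt(question: str, context_chunks: list[str]) -> str:
    """Place the retrieved context ahead of the user's question."""
    context = "\n".join(f"- {c}" for c in context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


docs = (
    "Qdrant stores vectors for similarity search. "
    "Redis caches frequent responses to cut latency. "
    "Supabase handles authentication and row level security."
)
question = "Which service handles authentication?"
hits = retrieve(question, chunk_text(docs), k=1)
print(build_prompt(question, [chunk for _, chunk in hits]))
```

In a real deployment each stage would be swapped for the provider SDK and vector store; the sketch only shows how the stages hand data to each other.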

Stack:

FastAPI (async-first), leaning on asyncio for background tasks
LangChain - mainly for routing AI calls to the correct client SDK
Official LLM and embedding model SDKs (preferred over LangChain's wrappers)
Qdrant for vector search
Redis for caching
BYOK (OpenAI / other providers)

Platforms:

Google Cloud Run - deployed server instance
Google Cloud Tasks - background tasks with retries
Google Cloud Storage - storing file bytes
Supabase - storing user data and authentication with RLS

A few things I focused on:

isolating vector collections per chatbot (multi-tenant setup)
a meta system prompt that instructs an AI to write system prompts for other chatbots
context engineering (recent + summarized memory injected into prompts)
context-window budgeting so retrieval doesn’t blow up token limits
retrieval and filtering strategy (dynamic score-threshold filtering of retrieved documents)
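The post does not say how the dynamic score threshold is computed, so the following is only one possible reading: keep hits that score close to the best hit, but never below an absolute floor. The `ScoredChunk` type, the function name, and the default values are all assumptions for illustration.

```python
from dataclasses import dataclass


@dataclass
class ScoredChunk:
    text: str
    score: float  # similarity score from the vector search, higher is better


def filter_by_dynamic_threshold(
    hits: list[ScoredChunk],
    floor: float = 0.3,             # assumed absolute minimum score
    relative_margin: float = 0.15,  # assumed allowed distance from the best score
    max_chunks: int = 5,
) -> list[ScoredChunk]:
    """Keep hits within `relative_margin` of the top score and above `floor`."""
    if not hits:
        return []
    ranked = sorted(hits, key=lambda h: h.score, reverse=True)
    # The threshold moves with the best score, which is what makes it dynamic.
    threshold = max(floor, ranked[0].score - relative_margin)
    kept = [h for h in ranked if h.score >= threshold]
    return kept[:max_chunks]


hits = [
    ScoredChunk("refund policy", 0.82),
    ScoredChunk("return window", 0.75),
    ScoredChunk("shipping times", 0.60),
    ScoredChunk("careers page", 0.20),
]
for hit in filter_by_dynamic_threshold(hits):
    print(hit.score, hit.text)
```

The floor covers the case where even the best hit is weak: the function then returns nothing rather than passing low-relevance text to the model.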

Things that were harder than expected:

multi-tenant-first architecture, since this is all new to me
deciding chunk size vs retrieval quality
context-window budgeting - each model has a different context-window limit, so I designed the budget to be dynamic
building prompts to build system prompts for other chatbots
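As a rough illustration of dynamic context-window budgeting, the sketch below looks up a per-model limit, reserves room for the answer and the fixed prompt parts, and then adds retrieved chunks until the budget runs out. The model names, the limits, and the 4-characters-per-token estimate are placeholders, not values from IntelliChat or any provider.

```python
# Hypothetical per-model limits in tokens; real values come from each provider's docs.
CONTEXT_WINDOWS = {"small-model": 8_000, "large-model": 128_000}


def estimate_tokens(text: str) -> int:
    """Rough estimate (assumption): about 4 characters per token."""
    return max(1, len(text) // 4)


def fit_chunks_to_budget(
    model: str,
    system_prompt: str,
    memory: str,
    question: str,
    chunks: list[str],
    reserved_for_answer: int = 1_000,
) -> list[str]:
    """Select chunks, best first, that fit the model's remaining token budget."""
    limit = CONTEXT_WINDOWS[model]
    fixed = sum(estimate_tokens(t) for t in (system_prompt, memory, question))
    remaining = limit - reserved_for_answer - fixed
    selected = []
    for chunk in chunks:  # assumed to be sorted by relevance already
        cost = estimate_tokens(chunk)
        if cost > remaining:
            break  # stop at the first chunk that does not fit
        selected.append(chunk)
        remaining -= cost
    return selected


chunks = ["x" * 4_000] * 10  # ten chunks of roughly 1,000 tokens each
for model in CONTEXT_WINDOWS:
    picked = fit_chunks_to_budget(model, "s" * 400, "m" * 400, "q" * 40, chunks)
    print(model, len(picked))
```

Because the limit is looked up per model, the same retrieval results yield a shorter prompt for the small model and a longer one for the large model without any other code changing.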

Current limitations:

cold starts
no WebSocket support

How to help

Imperial_Benji developed IntelliChat, an open-source backend for reusable RAG pipelines, focusing on multi-tenant architecture and vector isolation. The backend uses FastAPI, LangChain, Qdrant, and Supabase, and is deployed on Google Cloud. The user seeks feedback and collaboration, acknowledging limitations like cold starts and lack of websocket support.

Help on Reddit

Replies (3)

I don't care at all about the product and this whole thing is probably all ai generated crap. But these screenshots are so bad. Buttons are all styled differently and hero is not even aligned.

Bernier154·4/3/2026, 12:05:19 AM

Yeah, I already centered the hero. After all, I specialize in backend.

Imperial_Benji·4/4/2026, 12:35:47 AM

I can say that the frontend is purely AI-generated using Antigravity. I can't write React with Tailwind; it's too much for me. The important parts of the frontend layer for me are state management, token life cycle, and security. Besides those, everything can be generated by AI.

Imperial_Benji·4/4/2026, 12:39:56 AM