Semantic Search

Semantic Search with pgvector and Supabase Edge Functions

Semantic search interprets the meaning behind user queries rather than exact keywords. It uses machine learning to capture the intent and context behind the query, handling language nuances like synonyms, phrasing variations, and word relationships.

Since Supabase Edge Runtime v1.36.0 you can run the gte-small model natively within Supabase Edge Functions without any external dependencies! This allows you to generate text embeddings without calling any external APIs!

In this tutorial you're implementing three parts:

A generate-embedding database webhook edge function which generates embeddings when a content row is added (or updated) in the public.embeddings table.
A query_embeddings Postgres function which allows us to perform similarity search from an Edge Function via Remote Procedure Call (RPC).
A search edge function which generates the embedding for the search term, performs the similarity search via RPC function call, and returns the result.

You can find the complete example code on GitHub

Create the database table and webhook

Given the following table definition:

1
create extension if not exists vector with schema extensions;
2
3
create table embeddings (
4
  id bigint primary key generated always as identity,
5
  content text not null,
6
  embedding extensions.vector (384)
7
);
8
alter table embeddings enable row level security;
9
10
create index on embeddings using hnsw (embedding vector_ip_ops);

You can deploy the following edge function as a database webhook to generate the embeddings for any text content inserted into the table:

1
const model = new Supabase.ai.Session('gte-small')
2
3
Deno.serve(async (req) => {
4
  const payload: WebhookPayload = await req.json()
5
  const { content, id } = payload.record
6
7
  // Generate embedding.
8
  const embedding = await model.run(content, {
9
    mean_pool: true,
10
    normalize: true,
11
  })
12
13
  // Store in database.
14
  const { error } = await supabase
15
    .from('embeddings')
16
    .update({ embedding: JSON.stringify(embedding) })
17
    .eq('id', id)
18
  if (error) console.warn(error.message)
19
20
  return new Response('ok')
21
})

Create a Database Function and RPC

With the embeddings now stored in your Postgres database table, you can query them from Supabase Edge Functions by utilizing Remote Procedure Calls (RPC).

Given the following Postgres Function:

1
-- Matches document sections using vector similarity search on embeddings
2
--
3
-- Returns a setof embeddings so that we can use PostgREST resource embeddings (joins with other tables)
4
-- Additional filtering like limits can be chained to this function call
5
create or replace function query_embeddings(embedding extensions.vector(384), match_threshold float)
6
returns setof embeddings
7
language plpgsql
8
as $$
9
begin
10
  return query
11
  select *
12
  from embeddings
13
14
  -- The inner product is negative, so we negate match_threshold
15
  where embeddings.embedding <#> embedding < -match_threshold
16
17
  -- Our embeddings are normalized to length 1, so cosine similarity
18
  -- and inner product will produce the same query results.
19
  -- Using inner product which can be computed faster.
20
  --
21
  -- For the different distance functions, see https://github.com/pgvector/pgvector
22
  order by embeddings.embedding <#> embedding;
23
end;
24
$$;

Query vectors in Supabase Edge Functions

You can use supabase-js to first generate the embedding for the search term and then invoke the Postgres function to find the relevant results from your stored embeddings, right from your Supabase Edge Function:

1
const model = new Supabase.ai.Session('gte-small')
2
3
Deno.serve(async (req) => {
4
  const { search } = await req.json()
5
  if (!search) return new Response('Please provide a search param!')
6
  // Generate embedding for search term.
7
  const embedding = await model.run(search, {
8
    mean_pool: true,
9
    normalize: true,
10
  })
11
12
  // Query embeddings.
13
  const { data: result, error } = await supabase
14
    .rpc('query_embeddings', {
15
      embedding,
16
      match_threshold: 0.8,
17
    })
18
    .select('content')
19
    .limit(3)
20
  if (error) {
21
    return Response.json(error)
22
  }
23
24
  return Response.json({ search, result })
25
})

You now have AI powered semantic search set up without any external dependencies! Just you, pgvector, and Supabase Edge Functions!

Semantic Search

Semantic Search with pgvector and Supabase Edge Functions

Create the database table and webhook#

Create a Database Function and RPC#

Query vectors in Supabase Edge Functions#

Is this helpful?

Create the database table and webhook

Create a Database Function and RPC

Query vectors in Supabase Edge Functions