I am blown away at how much information is densely packed into this. You got yourself a new subscriber, sir.
It’s staggering to think about how these technologies will shape the landscape for data and analytics. This is just the beginning.
Thanks for the sub! I am continuously blown away by the possibilities of large language models 😀
The clarity of this video while maintaining detailed granularity of the subject is very impressive and very appreciated. Thank you for making this video.
Damn, I spent like a week researching this shit on my own, and have been working on almost exactly the same thing. Processing MDX files into embeddings etc. It’s really cool to see somebody doing almost the same exact thing. Makes me think I am really on the right track!
Nice! Glad to help give some validation 😄 Are you also building for docs?
Hey! How did you turn your PDF files into a proper MDX format? Thanks
@@automioai In this project no PDF files were used - all documentation had been written directly in MDX. You'll have to do some research on ways to extract text from PDF files. Once you have that, I wouldn't bother with MDX at all - just generate embeddings directly on that content.
Thank you for sharing. I was struggling with how to get started. This was very well presented. From this side of the world, asante sana!
Glad it was helpful 😃
This was so valuable for settling my thoughts into a clear action plan for implementing this in production. Thank you!
Glad it helped!
Wow, this is incredible... I can see a future where every docs site does the same thing. This is truly powerful stuff! Well done 👏
I am both excited and blown away by the possibilities. Thanks for watching!
Prediction: This is gonna get a million views. Just saw Fireship's video about vector databases and wanted to understand embeddings. Before I could even search, this video was on the page. Though I wasn't interested in a 40-minute video (had a feeling I'd just stop after 5 minutes like I usually do), I ended up watching it all. The rabbit hole 🐇 🕳️ format is so naturally elegant. Clear end-to-end use case. I secretly don't want to share it with anyone, but I am forced to fulfill my prediction.
Thanks for the comment! Glad to hear the format is working 😃
wrong!!
Found your channel while learning React Three Fiber, subbed with notifications immediately. Today I get a notification for a well-explained ChatGPT tutorial, right as I embark on building a similar thing. Fantastic continued work, thank you very much!
Awesome! Thank you!
So you're the one who made this beautiful thing. Pretty nice.
Seems I've found one great coding channel among the noise of today; back to the good old days when coding was boring & nerdy. Great job!
I've been looking for this information for months. Such an excellent tutorial, and I love that Supabase's code is all open source so I can actually clone it and read how it works in detail later. Thank you so much for the walkthrough. Super talented dude too - love the Blender stuff.
Glad to hear it helped! Agreed - open source is amazing! Let me know if you hit any roadblocks along the way, happy to help 😃
@@RabbitHoleSyndrome why is the generate-embeddings file so different in the video than what is in the repo now? I can't find anything you talk about in minutes 10-13.
Hey @@fraternitas5117! Supabase moves pretty quick - the code I referenced has since been refactored to support multiple knowledge sources (i.e. more than just markdown). You can find the markdown-specific code here:
github.com/supabase/supabase/blob/1b2361c099c2573afa1fe59d3187343bb8f1bcab/apps/docs/scripts/search/sources/markdown.ts
This 40-minute video felt like 10 minutes. I have been researching related engineering topics recently, and this was very inspiring to me.
The potential of LLMs seems to be endless 🤯
Wow. I'm so thrilled to know that you were one of the ones behind that great feature.
I've been using Supabase for 6 months, and have been pretty happy with it.
Except for the docs and the transition to 2.0.
I was blown away when I saw that it generated the code for me when I started writing its documentation
Glad to hear it has helped you! Any feedback on the docs that you think should improve?
@@RabbitHoleSyndrome So far so good!
I think one challenge is to know how we can check if a user's email address exists. (Or other specific user's metadata)
I couldn't find it in the docs.
There was a GitHub issue which said to store the user's data in a separate table as the auth table was private.
I ended up doing that and haven't had any problems.
Btw, thanks again for all the awesomeness!
This content is really top notch. I appreciate the clear and detailed explanations of everything!
Glad you found it helpful!
What a fantastic video and content. I've gone through multiple videos trying to better understand embeddings and how to work with ChatGPT in the best way for querying large amounts of content and producing an analyzed response. I'm not a developer (I have a background in computer science, but I'm a software salesperson who is curious about technology), and I was able to completely understand your video and content. Subscribed, liked, and will be watching more of your videos. Thank you!
Glad it was helpful!
I am grateful for this video. It opened my eyes to embeddings
Glad it helped 😃
First time on this channel. The way you structured the video, the pace and the explanations are all on point. Keep up the good work. +1 subscriber.
Glad to hear that, thanks for the sub!
This channel is awesome! Love the rabbit holes you take us on! Keep them coming, please!
The most amazing thing is that I basically made a chatbot app in less than a week with only the help of GPT-4; I had no knowledge of AWS services, PostgreSQL, or Python. Everything you told us in the video is what GPT-4 told me.
All of the servers and the database are set up, and it has memory, STT, TTS, and Cognito login/register.
It is quite amazing - and I’m sure it will only get better!
I'm doing something even cooler with the vector embeds, I can't wait till I can share!
whaaaattt tell us??
Fireship guy?
Either way, it's been pretty useful in terms of learning APIs and how to connect them to my no-code builder. I spent hours trying to get things working, and the assistant basically told me what I was doing wrong and how to fix it. So, well done with the implementation.
Glad it helped!
Fantastic content, didn't expect it would be so informative in just 40 minutes. Looking forward to the next one!
Thanks for watching 😀
Insane, I love how you are able to do so many things. My laptop atm is unable to power every wish of mine (getting into 3D) but I hope I will soon be able to do so.
You are awesome! Now I can understand how these things work together.
Happy to hear it! 😀 thanks for supporting the channel!
Excellent. You are hands-on & practical.
This really is an epic video, clear and well laid out!
Amazed by your content. Fantastic work here.
Thanks, glad it helps!
Wonderful primer on prompt engineering.
You do such a great job, bro! I love what you have built and your video; keep building great things, bro.
Incredible end-to-end tutorial, nicely done
Thank you! 😃 Have you explored embeddings or pgvector?
This is amazing, you've created a Clippy just as enthusiastic and helpful as you are!
Thanks a lot
Thanks for watching 😄
@@RabbitHoleSyndrome how did you generate the MDX files?
The MDX files weren’t generated - the Supabase team wrote them as you would any markdown file.
Absolutely amazing quality here, glad I found your video. Subbed!
Thank you!
Thanks Greg for a great explanation. I like your presentational style!
Glad it was helpful!
This is a glorious illustration. Thank you very much! I've been trying to find an example of doing this, and yours has put it all together for me! Subscribed!
Glad it helped, thanks for the sub!
Dude, that's exactly what I was looking for. No more bullshit articles with clickbait titles, just DIY in the essence. Is there a way to support you through patreon or smth. ?
Glad to hear it! You support by watching 🙌
Best tutorial I have ever watched, you are a genius, thank you!
Amazing video! I'm going to put this in practice right now!
Awesome! Feel free to share as you make progress!
Many thanks for this insightful and helpful video!
Happy to help, thanks for watching!
incredible value in this video
This video was fantastic, lots of information given in an easily digestible way. Subscribing!
Glad it was helpful, thanks for the sub!
GREAT content. It explained everything you need to know about creating 'chat docs' or similar in one run, and all open source. Kudos! And subscribed.
Thanks for the sub! Great to hear 😃
Cool video. I was thinking about doing something similar for some reference PDFs. Thanks for the video
Thanks for watching & best of luck!
I'm reading the comments, and I'm like.. Yeah WTF 🗿🔥
This is truly valuable and useful content, thanks a lot.
You bet! Glad it was helpful
Just what I was looking for - thank you very much🥳
Great 😃 Thanks for watching!
Cool project, would be cool to create a tool like this that you can embed in any documentation, reading the markdown directly or scraping the website.
Definitely!
I'm at 1:42 and this video already is 10/10
That's great, you did a great job! This helps me a lot.
Glad it helped!
Amazing video! Great and detailed information!
Glad it helped!
This is really helpful and valuable, thanks a ton!!!
You bet, thanks for watching!
28:37 Whenever I see examples of decoder (GPT) prompts starting with "You are a helpful finance advisor" or "You are an enthusiastic support rep", I can almost see the AI clearing its throat and sitting up straight and saying "right, ok". Gimme that can-do attitude, GPT!
🤣
One thing that might help is if the answer showed links to the documents the information was acquired from. Since you are currently fetching which documents to run ChatGPT on based on feature similarity, maybe you can change the prompt so that it also returns the link of each document that was deemed a "similar document".
Absolutely! This should definitely be the next progression.
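For anyone curious what that could look like, here is a rough sketch of carrying each matched section's URL into the injected context so the model can cite its sources. The `PageSection` shape, URL prefix, and prompt wording are illustrative assumptions, not the video's actual code:

```ts
// Sketch: include each matched section's URL in the context so the
// model can cite its sources. Names and wording are hypothetical.
interface PageSection {
  content: string;
  path: string; // e.g. "/guides/auth"
}

function buildPrompt(question: string, sections: PageSection[]): string {
  const context = sections
    .map((s) => `Source: https://supabase.com/docs${s.path}\n${s.content}`)
    .join('\n---\n');

  return `Answer the question using only the context below. After the answer,
list the "Source" URLs of the sections you actually used.

Context:
${context}

Question: ${question}`;
}
```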
Thanks for the knowledge share my friend. 💪🙏
You bet!
This is really interesting! I'm looking to build something similar.
Best of luck!
Thanks for explaining everything so clearly :))
You bet! Thanks for watching 😃
excellent info, great presentation
Glad it was helpful!
This is awesome, thank you!
Glad it was helpful!
Super helpful, thanks. Love Supabase
Happy to help!
Really good video, nice job! 🎉
Cheers! 😀
This would be amazing for the Obsidian note-taking application
Great idea!
I'd love to learn how to also incorporate user feedback (thumbs up/down)
This will likely make it into the next iterations. Will be a good challenge!
@@RabbitHoleSyndrome That's awesome! Can't wait
@@RabbitHoleSyndrome amazing video! Please let us know if you get to it:)
@@RabbitHoleSyndrome any progress?
Such a great video.
Watched because of the content, subscribed because of the dog. 👍🏻
Thanks for the sub! 🐶
First time here. This is so well done. Subscribed. Your viewer numbers will explode! I like how you approached the topic in a very calm way, without jumping on the "LLMs will take over the world" train :) You don't happen to have the Clippy Blender asset somewhere?
You basically built LlamaIndex yourself, good job
Great job 👍, this was really helpful
Glad it helped!
Thanks for the share!
This is a fantastic video! Thank you very much for sharing :D
Quick question - Currently if the info is not in the documentation it responds "Sorry I don't know how to help with that". But how can we make it respond like this:
"Sorry I don't have relevant info in the documentation but you can do something like this".
For example: "I don't have any info about how to make banana pancakes in the documentation, but here is how you can make one..."
The idea here is to make it act like ChatGPT on top of the information provided. Keen to know more on this, and thank you so much for making this video :D
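One way to get that behavior (an assumption, not something shown in the video) is to soften the prompt's refusal instruction so the model may fall back to general knowledge while flagging that the answer isn't from the docs. A minimal sketch:

```ts
// Hypothetical prompt tweak: allow a general-knowledge fallback instead
// of a hard refusal, while making clear the answer is not from the docs.
const systemPrompt = `You are a helpful assistant for our documentation.
Answer from the provided context whenever possible. If the context does not
contain the answer, say "I don't have relevant info in the documentation,
but here is what I know:" and then answer from general knowledge.`;
```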
Damn that's a great, helpful video!
Glad to hear it was helpful, thanks for watching!
I have not seen such a great video in a while! How wonderfully you have explained the whole process 👍👍! Could you explain a bit more about how you chose 0.78 as the threshold for embedding comparison? Have you verified statistically whether the most relevant sections can be found with it?
Glad you liked it! 0.78 was a first-stab threshold that worked best based on a limited sample of test queries. I wouldn’t claim that this number is universal - almost certainly this could change by domain.
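For readers wondering where a number like 0.78 plugs in, here is a toy TypeScript sketch of threshold-based matching. In the real project the comparison runs inside Postgres via pgvector rather than in application code, so this is purely illustrative:

```ts
// Toy sketch of where a match threshold like 0.78 is applied: score each
// stored section against the query embedding, then drop low scorers.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

const MATCH_THRESHOLD = 0.78;

function matchSections(
  queryEmbedding: number[],
  sections: { content: string; embedding: number[] }[]
) {
  return sections
    .map((s) => ({ ...s, similarity: cosineSimilarity(queryEmbedding, s.embedding) }))
    .filter((s) => s.similarity > MATCH_THRESHOLD) // sections below 0.78 are dropped
    .sort((a, b) => b.similarity - a.similarity);
}
```

Since text-embedding-ada-002 vectors come back normalized, the dot product alone gives the same ranking as full cosine similarity.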
Subscribed, amazing content.
Brilliant.
Great content.
I wonder if it can generate code as well.
Let's say you are doing this for Cloudflare products (KV, Workers, Durable Objects, etc.), with all the documents injected. Then give a prompt like "generate a worker for x" and it will generate specifically from the given docs.
I think this is where the industry is going next!
Amazing content.
This was very useful! Thanks a lot for sharing. Are there plans to keep it up? I'm glad to have found this type of content
You bet! Definitely - videos will keep coming. Some take a bit longer than others - thanks for your patience 😃
@@RabbitHoleSyndrome Thank you for making this content for us for free! I very much appreciate it, and you motivate me to share knowledge with my friends :)
🔥🔥🔥 This is amazing!
I've got an internal API that returns train times for given stations. Could I use user prompts to ask the GPT LLM to find train times for a particular station, fetch an array of departing trains via an API GET request, and then get GPT to format that back to the user in a tailored, natural-language kind of way?
Definitely possible, will just take some work on the prompt engineering side. Your best bet today is probably LangChain - I recommend you check out their documentation on creating custom tools:
python.langchain.com/en/latest/modules/agents/tools.html
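To make the flow concrete without LangChain, here is a hedged sketch of the three steps (extract the station, call the internal API, rephrase the result). The `/api/departures` endpoint and both prompts are hypothetical placeholders; LangChain's custom tools automate essentially this loop:

```ts
// Hedged sketch of the train-times flow. Endpoint and prompts are
// placeholders, not a real API.
async function chat(content: string): Promise<string> {
  const res = await fetch('https://api.openai.com/v1/chat/completions', {
    method: 'POST',
    headers: {
      Authorization: `Bearer ${process.env.OPENAI_API_KEY}`,
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      model: 'gpt-3.5-turbo',
      messages: [{ role: 'user', content }],
    }),
  });
  const json = await res.json();
  return json.choices[0].message.content;
}

async function answerTrainQuery(userPrompt: string): Promise<string> {
  // 1. Extract the station name from the user's free-form question.
  const station = await chat(
    `Reply with only the station name mentioned in: "${userPrompt}"`
  );

  // 2. Fetch departures from the internal API (hypothetical endpoint).
  const departures = await fetch(
    `https://internal.example.com/api/departures?station=${encodeURIComponent(station.trim())}`
  ).then((r) => r.json());

  // 3. Have the model phrase the raw data as a friendly natural-language answer.
  return chat(
    `Summarize these departures for the user in a friendly tone:\n${JSON.stringify(departures)}`
  );
}
```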
Are there alternatives to OpenAI to create these vectors? I don't really feel comfortable building something around a closed-source API that is controlled by one vendor.
Really great question. You’ll want to look into sentence embeddings. There has been a lot of work on the OSS side with Sentence-BERT (SBERT) you can check out. You might also want to look into Universal Sentence Encoder (USE) and InferSent.
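As a concrete example (an assumption on my part, not something covered in the video), a library like Transformers.js can run an open-source sentence-embedding model locally in JavaScript, with no vendor API involved:

```ts
// Assumed example: generate sentence embeddings locally with an
// open-source MiniLM model via Transformers.js (@xenova/transformers).
import { pipeline } from '@xenova/transformers';

const embed = await pipeline('feature-extraction', 'Xenova/all-MiniLM-L6-v2');

// Mean-pool token embeddings and normalize into one sentence vector.
const output = await embed('How do I enable row level security?', {
  pooling: 'mean',
  normalize: true,
});
console.log(output.data.length); // 384 dimensions for this model
```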
LlamaIndex actually uses OpenAI (text-embedding-ada-002) by default for embeddings today. They're more of a toolkit layer to assist with the workflow. There are many other alternatives though (which LlamaIndex supports via LangChain) that are worth checking out:
langchain.readthedocs.io/en/latest/reference/modules/embeddings.html
LOL what computer arch r u on
amazing!
Thank you
Loved it, came for the embeddings and left with that and a whole lot more! Too bad that Clippy did not make it to the web on Supabase :). Copyright problems?
Thanks for watching! Clippy did make it - but things move fast at Supabase 😅. This feature has evolved into a unified cmd+K menu and was just renamed "Supabase AI".
you are great!
Thanks for watching!
Amazing video! This is exactly what I was looking for for a long time. You basically explain everything I wanted to know about how to create a search engine using OpenAI.
But I have a few questions:
How much did you spend on the OpenAI embedding API building this?
How much does Supabase spend monthly on searches using the OpenAI API?
Is it possible to use an open-source embedding API instead of calling the OpenAI API? Wouldn't it be less expensive than the approach you took?
Glad it was useful! As for costs, you may be surprised how inexpensive OpenAI embeddings are (at least I was).
To put it in perspective, for the Supabase guides we currently have around 1,500 page sections which total just over 220,000 tokens. At OpenAI's current embedding price ($0.0004/1k tokens), that brought us to just under $0.10 for the entire guide knowledge base (220,000 / 1,000 × $0.0004 ≈ $0.09, a roughly one-time pre-processing cost). After that, the average query is likely a fraction of a cent, since only the short user question needs to be embedded at query time.
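A quick back-of-envelope check of those numbers:

```ts
// Sanity check of the embedding cost quoted above.
const totalTokens = 220_000;
const pricePer1kTokens = 0.0004; // ada-002 embedding pricing at the time
const cost = (totalTokens / 1_000) * pricePer1kTokens;
console.log(`$${cost.toFixed(2)}`); // "$0.09"
```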
Excellent video, and a great methodology for a deep dive into so much information!!!! Will you consider Elastic as a vector database?
Thank you! I haven't looked too much into elastic yet. I might consider it if I was already using Elastic in my stack. Have you tried it?
This is Awesome
So if I understood correctly: The embeddings were only used to check for similarity between the user's input and the doc's content, in order to provide the prompt with relevant (text) context, right? Is there a way to provide the GPT model with the embeddings instead?
That's correct. You could have used an alternate search method, but embeddings have a nice alignment with LLMs since they are themselves produced by a language model.
Unfortunately, no, there is currently no way to inject embeddings directly into GPT. Maybe this will change in the future, or become available in open-source models like LLaMA, in the same way we've seen it happen with Stable Diffusion.
Great video! How closely do the .mdx files have to match this structure before they can be processed into embeddings? Do they need to export the meta const, for example?
The meta const is optional! You're also free to tweak the pre-processing logic to fit whichever format you need to work with.
Good job! :)
Hey, great content! I was looking for exactly this. Do you have a Discord for asking questions? I was making the same thing, but for ".md" files rather than MDX, and faced some issues.
Thanks! For now, feel free to reach out to the channel's email address and I'll do my best to give you a hand.
Hey, awesome video! Could I check with you if it is possible to search the database that contains CSV files?
Hey - CSV files are definitely doable. It will mostly come down to how you plan to pre-process them. Perhaps you can import your CSV into a database table and generate embeddings on the content within it.
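As a rough sketch of that idea (the table name "documents", the "content" column, and the CSV shape are all hypothetical, adapt them to your own schema), the pipeline could look like:

```ts
// Hedged sketch: load a CSV, embed a text column with OpenAI, and store
// rows plus embeddings in a Supabase table with a pgvector column.
import { readFileSync } from 'node:fs';
import { parse } from 'csv-parse/sync';
import { createClient } from '@supabase/supabase-js';

const supabase = createClient(
  process.env.SUPABASE_URL!,
  process.env.SUPABASE_SERVICE_ROLE_KEY!
);

const rows: { content: string }[] = parse(readFileSync('data.csv', 'utf8'), {
  columns: true,
});

for (const row of rows) {
  // Generate an embedding for this row's text content.
  const res = await fetch('https://api.openai.com/v1/embeddings', {
    method: 'POST',
    headers: {
      Authorization: `Bearer ${process.env.OPENAI_API_KEY}`,
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      model: 'text-embedding-ada-002',
      input: row.content,
    }),
  });
  const { data } = await res.json();

  // Store the row and its embedding in Supabase.
  await supabase
    .from('documents')
    .insert({ content: row.content, embedding: data[0].embedding });
}
```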
Woah, wonder if you could make this work with a browser extension, could have either a private or public store of indexed website embeddings
Can’t see any reason this wouldn’t work. If you build this please report back 😃
This is amazing. Super clear and short explanation of embeddings. Great walkthrough of the code, touching on the relevant parts. Subbed to the channel 👍
I see the code is still in the Supabase repo - but Clippy is not there? What happened?
Glad it was helpful!
Edit: Just realized you might have been talking about the Clippy graphic on the site, not the code. Search and Clippy have been combined into the same interface - you can find Clippy by clicking search, then switch to “Ask Clippy”.
Original:
Things move quick at Supabase 😆 The Clippy frontend code got moved when search was upgraded to also use embeddings. After the refactor everything is just under “Search”:
github.com/supabase/supabase/blob/0ecc238ad6d81202bb2301f7919b166a98929697/apps/docs/components/Search/SearchModal.tsx
The backend Clippy logic is still in the same edge function:
github.com/supabase/supabase/blob/0ecc238ad6d81202bb2301f7919b166a98929697/supabase/functions/clippy-search/index.ts
@@RabbitHoleSyndrome cool. You can give feedback to Supabase that a) this video significantly increased my interest in adopting Supabase, and b) the Clippy icon is too small - it should be large and obstructive.
Serious question on prompt engineering: with the Chat Completion API, do you set a system persona and pack the whole prompt into the user message? Or would you break the prompt, like here, into e.g. 3 user entries: context, question, "use markdown"? Does it make a difference?
Good feedback 🙌 and great question about prompt engineering in the chat API. We are actually experimenting with this right now and trying to understand what produces the best results. At the moment we do a bit of both (system message and user message with a bit of prompt overlap between both). OpenAI says that the model doesn’t pay strong attention to the system message, so it may be better to use system message strictly to set identity and provide instructions & context in a user message.
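A sketch of that "bit of both" split, with the caveat that the exact wording is illustrative rather than Supabase's production prompt: identity goes in the system message, while context, instructions, and the question go in the user message.

```ts
// Illustrative system/user split for the Chat Completions API.
const contextSections = '...matched doc sections go here...'; // placeholder
const userQuestion = 'How do I enable RLS?'; // placeholder

const messages = [
  {
    role: 'system',
    content: 'You are an enthusiastic Supabase support rep who loves to help developers.',
  },
  {
    role: 'user',
    content: `Answer using only the context below. Format the answer as markdown.

Context:
${contextSections}

Question: ${userQuestion}`,
  },
];

await fetch('https://api.openai.com/v1/chat/completions', {
  method: 'POST',
  headers: {
    Authorization: `Bearer ${process.env.OPENAI_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({ model: 'gpt-3.5-turbo', messages }),
});
```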
You are so cool! I love you!
Enjoyed this tutorial. Would love to know how you'd approach fine-tuning a model with this data to build a chatbot.
Thanks! Any reason you're looking to fine-tune vs. embeddings+context injection?
16:10 Would love an explanation of how embeddings work with document structure. E.g., the query is "summarize chapter 3". Embeddings, sans retrieval, don't seem to capture the structure of the chunks contained under the title chunk "chapter 3". All explanations of embeddings I've seen rely on the text content within a chunk.
Great explanation! I was wondering how to perform similarity searches in my files, there you have it!
I'm also wondering if there would be a way (or need??) to speed up the similarity-check step? For this application it might be fast enough, but imagine having a billion or trillion entries which you need to check your query against... That would be a nightmare. Something like clustering, say, N similar entries and, in the first step, only checking your query against their average value. If they are similar enough (above a certain threshold), you could then proceed to check against each of them, and you have made only 1 extra check. If they're not at all similar, then you don't have to check against each, and you have saved yourself N-1 unnecessary checks!!
What do you guys think?
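That two-stage idea is essentially what approximate nearest-neighbor indexes do - pgvector's ivfflat index, for example, clusters vectors and only scans the lists closest to the query. A toy sketch of the centroid prefilter (assuming normalized vectors, so a dot product equals cosine similarity):

```ts
// Toy two-stage search: rank cluster centroids first, then exhaustively
// score only the members of the closest clusters.
interface Cluster {
  centroid: number[];
  members: { content: string; embedding: number[] }[];
}

function dot(a: number[], b: number[]): number {
  return a.reduce((sum, v, i) => sum + v * b[i], 0);
}

function search(query: number[], clusters: Cluster[], probes = 3, topK = 5) {
  // Stage 1: keep only the few clusters whose centroid is closest.
  const nearest = [...clusters]
    .sort((a, b) => dot(query, b.centroid) - dot(query, a.centroid))
    .slice(0, probes);

  // Stage 2: score every member of those clusters, skipping the rest.
  return nearest
    .flatMap((c) => c.members)
    .map((m) => ({ ...m, similarity: dot(query, m.embedding) }))
    .sort((a, b) => b.similarity - a.similarity)
    .slice(0, topK);
}
```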
Any hints on papers on "context injection" for LM or LLMs?
You may find this post helpful:
mattboegner.com/knowledge-retrieval-architecture-for-llms/
The community is starting to settle with "retrieval-augmented generation" as the terminology for this now.
Amazing! How does Clippy update the vector DB and process the text as embeddings when NEW documentation is added? Is it automatic? Maybe I missed it in the video.
Great question! The `generate-embeddings` script was designed to be diff-based. So next time you run it, it will pull in only the documents that have changed and re-create embeddings on just those. It currently works using checksums:
1. Generate a checksum for the content and store in the DB
2. Next time the script runs, compare the checksums. If they don't match, the content has changed and embeddings should be re-generated.
The script runs on CI, so anytime documents change a GitHub Action will trigger the script. See this PR for details:
github.com/supabase/supabase/pull/13936
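A minimal sketch of that checksum diffing (not the actual script; the in-memory map stands in for the database table, and the embedding helper is hypothetical):

```ts
// Re-embed a document only when its content hash has changed.
import { createHash } from 'node:crypto';

const storedChecksums = new Map<string, string>(); // stands in for the DB table

async function processDocument(path: string, content: string): Promise<void> {
  const checksum = createHash('sha256').update(content).digest('hex');

  // Unchanged since the last run: skip the (paid) embedding call entirely.
  if (storedChecksums.get(path) === checksum) return;

  // New or changed content: regenerate embeddings, then record the checksum.
  // await generateAndStoreEmbeddings(path, content); // hypothetical helper
  storedChecksums.set(path, checksum);
}
```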
Is there a version of this that can apply to any type of content? Not just specifically made for Supabase documentation, but documentation of any type - or insert a book or textbook, and the search outputs relevant GPT responses?
Hey! I don’t have a pre-built suggestion off the top of my head, but solutions to this seem to be popping up everywhere (just hang out on Twitter for a few hours 😅). If you don’t mind coding, it should be relatively straightforward to swap out the MDX docs with really any kind of content source, and the remaining steps should be identical.
Yo, thank you for putting out your knowledge :). But I have a question regarding the data search for the context. So basically you're creating a graph database with the GPT models, and then you input the user's question into the database to find the relevant articles/information?