Building a RAG application using open-source models (Asking questions from a PDF using Llama2)

Science and Technology

GitHub Repository: github.com/svpino/llm
I teach a live, interactive program that'll help you build production-ready machine learning systems from the ground up. Check it out at www.ml.school.
Twitter/X: @svpino

Comments: 108

  • @anonymoustechnopath1138
    3 months ago

    Thanks a lot Santiago!! Really needed these videos for LLMs. Keep them coming!

  • @yasirgamieldien
    2 months ago

    This is an amazing video. Literally answered all the questions I had on building a RAG and it was really useful to see the comparison between GPT, Llama, and Mixtral

  • @GaurangDave
    2 months ago

    Oh please don't stop creating these videos, this is really helpful. Very detailed and well explained! Thank you so much for this!

  • @berkbatuhangurhan708
    3 months ago

    Came from X, this is an amazing and very detailed walk through. Thanks for explaining even the tiniest bits of everything. Highly recommend this.

  • @TooyAshy-100
    3 months ago

    Santiago, your videos on LLMs have been incredibly helpful! Thank you so much for sharing your expertise. I'm eager to see more of your content in the future.

  • @QuentinFennessy
    2 months ago

    This is an excellent walk through - easy to follow and very practical

  • @lokeshsharma4177
    1 month ago

    This is the BEST video ever made comparing all the LLMs performing same task. God Bless You

  • @sumitrana8114
    3 months ago

    Thank you for leaving your job and starting your channel.

  • @geethikaisurusampath
    2 months ago

    This is really helpful, especially the explanations of why each step is done. Keep up the good work. Respect to you, man.

  • @liuyan8066
    3 months ago

    I like these fundamental courses, especially the last RAG one. I followed other training to build AI products, and some of it is over 10 hours long. After I finished, I still didn't fully understand why I coded like that. Now these courses make the connection step by step. Thank you.

  • @sarash5061
    2 months ago

    This was just Amazing, You are a Star. Thanks for all the effort.

  • @SuhasKM-tl1rg
    3 months ago

    I love your content. More of this in my feed please!

  • @asifm3520
    1 month ago

    That was a really clear explanation. Even novices will have no trouble following along.

  • @junaidali1853
    3 months ago

    Lovely. Super useful video. I’ll be building a RAG system with a Vector Database and langchain for my freelance client for around $2,000 or more. Thanks Santiago for helping make my life better.

  • @swatantrasohni5235
    3 months ago

    Thanks, Santiago, for the wonderful video. Running an LLM locally is very handy for a variety of tasks. Eventually everyone will have their own LLM running locally on their device. That's the future.

  • @bhusanchettri8594
    3 months ago

    Great piece of work. Well explained!

  • @Orenji902
    4 days ago

    Incredible video, really like the longer content codealong.

  • @sushanths.l4865
    3 months ago

    This is a great video, Santiago. I really learned a lot.

  • @ravindarmadishetty736
    5 days ago

    It's such a fantastic video, Santiago. 🎉

  • @adinathdesai6880
    1 month ago

    Amazing Video. You added great value to our knowledge. Thank you so much.

  • @koko9712
    3 months ago

    Nice video Santiago ! Keep up the good work

  • @dannysuarez6265
    29 days ago

    What a great presentation! Thank you so much, sir!

  • @farukondertr
    2 months ago

    dude, its awesome! do not stop pls

  • @chanukyapekala
    2 months ago

    excellent work! so clear and concise..

  • @alextiger548
    1 month ago

    Man, thanks for what you are doing! Fantastic stuff.

  • @RameshBaburbabu
    3 months ago

    Wow, great video. I was able to follow along with you and finished it to the end. "Batch" is great. Thanks, please post more videos. 🙏🙏

  • @alexstele5315
    3 months ago

    Thanks a bunch! 🎉 I've been looking for something like that.

  • @fredericv3497
    1 month ago

    Really good job and clear tutorial ! Thank you

  • @seanb9949
    3 months ago

    Another great video Santiago! I really look forward to seeing more of these. Heck, I'll watch the ads to make sure you get some $$$ 🙏

  • @underfitted

    3 months ago

    Ha ha! Thanks!

  • @toddroloff93
    23 days ago

    Nicely done. I always learn something from your videos. Looking forward to more content. Thanks for doing them.

  • @underfitted

    23 days ago

    Thanks for coming back!

  • @user-dg9by2ju2y
    3 months ago

    Very informative video, Santiago!

  • @gonzaloplazag
    1 month ago

    Great video! incredibly helpful!!!

  • @epicfootball007
    3 months ago

    you are by far the best teacher on youtube regarding ML/AI. please consider launching a course on generative AI.

  • @underfitted

    3 months ago

    Thanks!

  • @nevildev
    3 months ago

    Thx! Very straightforward

  • 3 months ago

    Love your video ! Thanks !

  • @kergee
    1 month ago

    The lesson was very good, thanks

  • @TheMunishk
    3 months ago

    Congrats and well done for producing this useful content. Exactly what I was looking to kick start my langchain journey with the models. Let me practice this but I was also looking for how to integrate all this in the front end. Do you have a video of which tools to build a front end for the prompt that will interact with the backend LLMs?

  • @sumittupe3925
    2 months ago

    Thanks for the video..... Well Explained....!

  • @peacefullmusic8374
    1 month ago

    best tutorial for start

  • @subhajitchatterjee1637
    18 days ago

    Hella amazing vid

  • @Sam-oi3hw
    25 days ago

    Thank you very much for your explanation!

  • @square007tube
    1 month ago

    Many thanks for this video. I followed along and was able to install Ollama3 on my machine, but I have an NVIDIA MX250 GPU, which takes a long time to answer the questions: it takes 7 minutes to answer two questions. I will watch your LLM playlist.

  • @user-nl2ix1cj1q
    3 days ago

    This is great !! thanks

  • @tipiapagupo
    27 days ago

    Amazing video! Are you still planning to make the video on how to communicate with websites? I'm really curious about the technologies you consider most relevant.

  • @andresfelipeestradarodrigu301
    2 months ago

    AMAZING BRO, THANKS

  • @ThamBui-ll7qc
    2 months ago

    Great video, I would love to see how to properly structure the prompt and make the bot remember context as conversation goes on...

  • @malikgaruba4079
    3 months ago

    Awesome video. Thanks

  • @MarkoKhomytsya
    3 months ago

    Thank you for the video! I found it particularly intriguing to consider the possibility of obtaining more accurate responses from the PDF using the Llama2 model. Given that local Language Models (LMs) tend to be highly sensitive to how queries are formatted, I believe it's crucial to refine your example further. Here are a couple of suggestions: 1) Instead of relying on a basic parser, it would be beneficial to prepare a set of predefined questions and answers. For instance, a question like "How much does the course cost?" could have a straightforward answer like "$400." 2) It's also important to determine the optimal format for prompts, specifically tailored for models like Mistral. By addressing these points, you could develop a truly functional product that delivers accurate responses. As it stands, most examples seem to demonstrate that local models struggle with practical applications and aren't quite ready for real-world deployment.

  • @underfitted

    3 months ago

    Great suggestions!

  • @mehmetbakideniz

    3 months ago

    Hi. Prompt engineering would definitely solve the problem of verbose answers, but do you think it would also correct hallucinations like those seen in the video?

  • @MarkoKhomytsya

    3 months ago

    Good question, @mehmetbakideniz! I would like to know the answer too!

  • @researchpaper7440
    3 months ago

    Amazing channel, just a great guy.

  • @samcavalera9489
    3 months ago

    Thanks Santiago! I am a student of your ML School course and I have taken your course in two different cohorts. Your ML School course is definitely the best of its kind in the market. Can you please design a new course on RAG that covers everything about this awesome technology, including evaluation techniques and deployment? That will be wonderful and I cannot wait to enrol in your RAG (and any other AI) course!

  • @underfitted

    3 months ago

    Working on it!

  • @samcavalera9489

    3 months ago

    @@underfitted Many thanks 🙏 🙏 🙏

  • @joeldartez829
    3 months ago

    Truly, you're the best. I've never met someone who explains things so well. I apologize if it is written somewhere and I missed it, but I wanted to ask if I buy your course today, can I have access to the past content today? I don't want to wait until the live sessions in April (or I want to arrive prepared for them). Thank you very much.

  • @underfitted

    3 months ago

    Yes, you get immediate access to everything from day 1.

  • @sam-uw3gf
    3 months ago

    good video and your tweets are more informative....✌

  • @researchpaper7440
    3 months ago

    I was looking for these videos. Next, I am looking for a model to train on SQL data.

  • @mehmetbakideniz
    3 months ago

    This was super helpful. I noticed that on my M2 Pro laptop some cells took 16 seconds, while they took just 0.5 seconds on your computer. Then you said you are using an M3 GPU. How can I make sure that I am using the GPU instead of the CPU when executing this code? Or does langchain already utilize the GPU when needed?

  • @azharsham
    27 days ago

    Brilliant video! One quick question: are you passing the same string to both question and context here? If yes, does it always work in the case of a document reader?

  • @TexttoInvoice
    2 months ago

    This video is awesome so so great!!! Thank you so much for such a quality video. Question: what’s the best way to improve the results being accurate to the document, can using structured data such as spreadsheets and CSV files give you a more accurate answer and maybe the model prefers interacting with them? Also, if there was more instances of the data, say multiple different documents, containing the same information that needs to reference ? Anyone who has found the best way to optimize getting correct answers from your retrieval. Please let me know! Thanks

  • @fintech1378
    3 months ago

    is searching via embedding always better compared to 'traditional' search aka very long context window? where should we use one or the other..how bout if we wanna build multimodal video recommendation system

  • @MarcosScheeren
    3 months ago

    Came here from X. Great overview on how to implement an LLM+RAG locally. Any multimodal ones incoming?

  • @underfitted

    3 months ago

    Yes

  • @chiragharish1020
    14 days ago

    Great video 👏👏 Really helpful. Can you tell me which Mac model you have? I have an M2 Mac with 8 GB, and I doubt I will be able to run these powerful models.

  • @fatiga2426
    1 month ago

    Santiago, great video! One question: why do you use a parser to get the model's output as a string? Why not get the content directly? Regards

  • @noa2427
    2 months ago

    I am running into a vector store problem: an import error for docarray, which I have installed. I tried many versions of docarray and DocArrayInMemorySearch. Any help is appreciated, thanks.

  • @TomasTrenor
    1 month ago

    Amazing video, Santiago! Many thanks. I just tried it with Llama3 8b and it seems to be less accurate than Llama2 (which obviously does not make sense). I need to dig deeper into it.

  • @mrskenz1068
    3 months ago

    Thanks for the video. How can we handle scientific PDFs that contain a lot of mathematical and chemical formulas?

  • @nguyenquocviet4287
    3 months ago

    Dear Santiago, I would like to ask you about the evaluation metric? Do you know any evaluation metric for evaluating between the generated answers and true answers? (eg. Rouge metric) Thank you so much!
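The ROUGE metric mentioned in the question above scores a generated answer by how many words it shares with a reference answer. A minimal sketch of ROUGE-1 (unigram overlap) might look like the following. It assumes lowercased, whitespace-split tokens; the function name `rouge1` and the sample strings are made up for illustration, and this is not the official ROUGE package.

```python
from collections import Counter


def rouge1(candidate: str, reference: str) -> dict[str, float]:
    """Unigram-overlap precision, recall, and F1 (a simplified ROUGE-1)."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    # Counter intersection keeps the minimum count of each shared token,
    # so repeated words are not counted more often than they appear in both.
    overlap = sum((cand & ref).values())
    precision = overlap / sum(cand.values()) if cand else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    f1 = 2 * precision * recall / (precision + recall) if overlap else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical generated answer vs. hypothetical true answer.
scores = rouge1("the course costs 400 dollars", "the course costs 400")
print(scores)
```

Word overlap only rewards matching wording, so an answer that is correct but phrased differently can still score low. It is probably best treated as one signal among several.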

  • @mehmetbakideniz
    3 months ago

    Thanks!

  • @underfitted

    3 months ago

    Thank you so much! Really appreciate you!

  • @lindavid1975
    28 days ago

    Thank you Santiago - sorry about the code red.

  • @Jonathan-ru9zl
    2 months ago

    Great! Can this model and setup serve as an assistant to, lets say, a board design engineer that have thousands of components specs in pdf files? To find and analyze the components faster?

  • @nikkypuvvada2666
    2 months ago

    Thanks

  • @vivekatbitm
    14 days ago

    Great video, just curious though how come both gpt and llama model generated the same joke? Isn't that weird??

  • @gauravpratapsingh8840
    17 days ago

    Hey can you make a video for website page Q/A chatbot by using langchain framework? by using some open llms or free public API keys?

  • @jpagano569
    17 days ago

    Hmm is there a way to run this in CoLab or Github Codespace? I suppose the point is to run locally, but I hate setting up dev environments (because I'm new to coding!)

  • @derekottman9622
    2 months ago

    This video is supposed to have a link to ANOTHER "from scratch" video as a popup - but WHEN that link pops up, I think it's ACTUALLY a link within this video pointing in a circular loop BACK TO THIS VIDEO, instead of the "pointing elsewhere" link to the other video it's supposed to be. (This video has a link to itself, if I didn't get MY wires crossed.)

  • @underfitted

    2 months ago

    I don’t think that’s possible? Anyway, you’ll find the other video here: kzread.info/dash/bejne/dKan0cWDkpDMcdI.htmlsi=BVJfS_0Iq9lwRX0B

  • @_kissimusic
    6 days ago

    Can I embed this, such as with laravel, and then serve it on a host online. So I can access it anywhere?

  • @KumR
    24 days ago

    Can u also pl add an UI using streamlit

  • @user-pb8qi4ht4h
    1 month ago

    Sir how can i do this project using java or spring boot

  • @basantsingh6404
    3 months ago

    if you are using open_ai key, it means you are paying to use the open ai model. how is it open source ?

  • @user-rj1eu6kp3u
    3 months ago

    has anybody used ollamaembedding and got it working?

  • @Hizar_127
    1 month ago

    I want to deploy it on the cloud. Is that possible?

  • @antonioskarvelas1325
    13 days ago

    I have problem with the code. I run the code in VScode and I get the error: ValueError: Ollama call failed with status code 403. Could you help me?

  • @theDrewDag
    3 months ago

    Is it actually true that you need models to be aligned with their respective embeddings? I don't think so 🤔 Embeddings are used only for the vector search and lookup functionality. At the end of the day, all the model sees is your textual prompt. You can use OpenAI embeddings with any open-source model and vice versa.

  • @underfitted

    3 months ago

    You are right. In this example I only use the embeddings for the search, so what I said is irrelevant here.
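The point made in this thread can be sketched in a few lines: the embedding is only used to pick which chunk of text to retrieve, and the language model then receives nothing but a plain-text prompt. The sketch below is an illustration under loud assumptions: `embed` is a toy bag-of-words stand-in for a real embedding model, the chunks are made-up sample data, and no actual LLM or vector database is called.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question: str, chunks: list[str]) -> str:
    """Return the chunk whose vector is most similar to the question's."""
    q = embed(question)
    return max(chunks, key=lambda chunk: cosine(q, embed(chunk)))


def build_prompt(question: str, chunks: list[str]) -> str:
    """Only text leaves this function; the vectors stay on the search side."""
    context = retrieve(question, chunks)
    return f"Context: {context}\n\nQuestion: {question}"


# Hypothetical document chunks, invented for this example.
chunks = [
    "The course costs $400.",
    "Live sessions start in April.",
]
prompt = build_prompt("How much does the course cost?", chunks)
print(prompt)
```

Because `build_prompt` returns an ordinary string, any chat model could consume it, whichever embedding produced the search result. The one constraint is that the question and the stored chunks go through the same embedding function, so their vectors are comparable.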

  • @learningwithmahasin
    2 months ago

    kindly create another video in which you use Pinecone and also give a GUI making it a complete standalone application

  • @serhiua

    1 month ago

    Pinecone is already explained here: kzread.info/dash/bejne/dKan0cWDkpDMcdI.html&ab_channel=Underfitted

  • @user-uu1ko7oi8z
    28 days ago

    Can I use LLama 3 models with your tutorial?

  • @underfitted

    28 days ago

    Yes

  • @Sam-oi3hw
    25 days ago

    Does anyone know of similar videos on KZread? I'd really appreciate it.

  • @learningwithmahasin
    2 months ago

    kindly convert the same project into a GUI based application

  • @mohamadhasanzeinali3674
    1 month ago

    good video, but i see a lot of Ads.

  • @samcavalera9489
    2 months ago

    Many thanks Santiago! Before watching this video, I was mainly using the series of tests from this video for my rag applications: kzread.info/dash/bejne/oK6ZlLx_mpqsc84.htmlsi=-69PI_cJqJn4SgVf Now, my life is much simpler than ever 😂 Thanks hero 🙏

  • @RameshBaburbabu
    3 months ago

    Thanks Santiago, for wonderful videos !! , I am still pending to do step by step instructions shown in kzread.info/dash/bejne/dKan0cWDkpDMcdI.html . wonderful explanation. Thanks , it keep coming and helping us to follow and gain some knowledge..

  • @serhiua
    1 month ago

    Thank you again for the wonderful explanations; they are works of art! This was a very good logical continuation of kzread.info/dash/bejne/dKan0cWDkpDMcdI.html.

  • @UditAgarwalBME
    1 month ago

    langchain_community is not working, unable to import it.

  • @UditAgarwalBME

    1 month ago

    worked after pip install langchain-community, thanks
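The fix reported in this thread (installing the `langchain-community` package so that `langchain_community` can be imported) can be checked before running the notebook. This is a minimal sketch using only the standard library; the helper name `is_installed` is made up for illustration.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found in this environment."""
    return importlib.util.find_spec(module_name) is not None


# The pip package uses a hyphen; the importable module uses an underscore.
if not is_installed("langchain_community"):
    print("Module not found. Try: pip install langchain-community")
```

Running this in the same interpreter or notebook kernel that executes the tutorial code matters, since a package installed into a different environment will still appear missing.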

  • @fredericv3497
    1 month ago

    Thank you!
