Run ANY Open-Source LLM Locally (No-Code LMStudio Tutorial)

Ғылым және технология

LMStudio tutorial and walkthrough of their new features: multi-model support (parallel and serialized) and JSON outputs.
Join My Newsletter for Regular AI Updates 👇🏼
www.matthewberman.com
Need AI Consulting? ✅
forwardfuture.ai/
My Links 🔗
👉🏻 Subscribe: / @matthew_berman
👉🏻 Twitter: / matthewberman
👉🏻 Discord: / discord
👉🏻 Patreon: / matthewberman
Rent a GPU (MassedCompute) 🚀
bit.ly/matthew-berman-youtube
USE CODE "MatthewBerman" for 50% discount
Media/Sponsorship Inquiries 📈
bit.ly/44TC45V
Links:
LMStudio - lmstudio.ai/
LMStudio Tutorial 1 - • Run ANY Open-Source Mo...
Disclosures:
I'm an investor in LMStudio and CrewAI

Пікірлер: 279

@bonkywonky12 ай бұрын
Project idea: create a bunch of agents that are experts in specific areas, like coding, Wikipedia, reasoning, law, etc, and then an orchestrating agent. The orchestrator will be the only one the user then interacts with. The orchestrator then figures out how to respond to user queries by finding all available agents available and selecting either one or multiple to produce the best answer possible. Either the agents have descriptions of what it’s good at or if the orchestrator can see each agents metadata and even just recognize what they’re good at by just seeing how they’re setup, that would be even better.
@positivevibe142
2 ай бұрын
There are coding copilots/agents out there already, like Pythagora GPT Pilot, Devin, Devika, Auto-GPT, Github copilot, Warp...etc... Personally, I rely heavily on the current Claude 3 Opus, OpenAI ChatGPT 4 looks like a joke next to it! 😅
@Thedeepseanomad
2 ай бұрын
Yes. a specialized and optimized LLM will outperform a general model. If one could successfully train custom specializations, having a constellations of models for specialized tasks could result in very high capacity (like a model that is 7B for Cyberpunk story structure, 7B for dialogue in a Cyberpunk setting, 7B for pacing in adventure stories, etc, you could have a a setup running like 28B for only making a cyberpunk stories, but only running a single model at a time.
@executivelifehacks6747
2 ай бұрын
I have to say I use Claude 3 Opus as a first choice for AI.
@bigglyguy8429
2 ай бұрын
@@positivevibe142 I have both and I probably won't renew Opus as it's not giving me anything GPT doesn't, but GPT can do more
@agentxyz
2 ай бұрын
great idea! you could call it "mixture of experts"
@karankatke2 ай бұрын
We need more usecase and practical guides with LM studio. Love your videos. ❤
@starblaiz19862 ай бұрын
I've been using LMStudio since your last tutorial on it, and I can attest that it's FANTASTIC to use and takes all of the headache out of setting up local AI's. TIP: In LMStudio's settings you can specify the folder to download AI models to. It's worth getting a small dedicated flash drive to store them on. That way you can play about with them without having to worry about hard drive space as the smallest models are about 5GB and the largest can get into tripple digits. Yes loading them up will take slightly longer, but inferrance won't be affected as that's done entirely from RAM (and if it's too big for the RAM then it will use your main hard drive for VRAM just like it would normally, so having it on external USB flash doesn't affect it).
@jayr7741
2 ай бұрын
Hey should I consider perplexity ai pro subscription to analysis my previous year questions of exam or free LMStudio will also be good please reply
@serikazero128
2 ай бұрын
does LM studio allow chat with documents? I'm trying to set this up properly, but even Private GPT doesn't work as it used to
@myandrobox3427
2 ай бұрын
@@serikazero128Anything LLM is good for interacting with docs
@serikazero128
2 ай бұрын
@@myandrobox3427 thanks, I'll look into it
@bigglyguy8429
2 ай бұрын
@@jayr7741 Use Perplexity for now, unless you have a really high-end machine that can run big models (2x 3090 at 24GB each).
@abdelhakkhalil76842 ай бұрын
Thank you for the video, and thank you for disclosing that you are an investor in both LMStudio and CrewAI. I wish you could mention it in the video for better transparency.
@xtramoist9999
2 ай бұрын
Agreed. Would be highly regarded.
@matthew_berman
2 ай бұрын
I considered this, maybe I should have. I didn’t want to be…show-off-y.
@qwazy0158
2 ай бұрын
Are either of these public? Or are both private companies ?
@BudoReflex2 ай бұрын
I love how you move through topics and keep a concise summary of what is happening without going down rabbit holes. I learn a lot very quickly.
@matthew_berman
2 ай бұрын
Thank you!
@JohnLewis-old2 ай бұрын
Always love your reviews. Thanks!
@TheExcellentVideoChannel2 ай бұрын
Thanks Mat, what would we do without you to guide us on this journey!! I started on ollama and ran into some issues that I don't want to solve just yet but it looks like LMStudio is what I need to move to to get around the issues. Nice and timely tutorial/overview.
@lancemarchetti86732 ай бұрын
The system requirements are very helpful. Thanks
@OproDarius2 ай бұрын
Such an awesome software, can't wait to see the local open source software delivering agent and llms in 5 years, will be such a ride!
@krisknap2 ай бұрын
Thanks for sharing this update and demonstrating with the examples!
@riyadwahib47552 ай бұрын
Thanks Matt! Great video as usual :) Yes please would be nice to see you build something with powering agents!
@issiewizzie2 ай бұрын
I'm hoping in the future, there will be a way to train an LLM or specialised model more easily for a beginner. Almost iPhone-friendly.
@Yakibackk
2 ай бұрын
No hope bro
@alx8439
2 ай бұрын
Oobabooga has it in UI already. You just need to have hardware which can cope it
@DanielArnolf2 ай бұрын
This is getting cooler by the day, what a gift !!! Thanks for your professionalism and dedication.
@OzzyMoto2K102 ай бұрын
Great video, Matt - you have no longer jumped the shark. :)
@jackflash63772 ай бұрын
YES to the agents locally. Any interest in checking out Devika? Claim to be open source Devin.
@matthew_berman
2 ай бұрын
I spent a good bit of time trying to get it to work but couldn’t. I’ll certainly do a review when I get it working though. Based on its popularity, I suspect it’ll evolve quickly.
@rudolfviljoen2847
2 ай бұрын
@@matthew_berman If you do make a video on devika please please do a section on using it with a local LLM
@screamingiraffe2 ай бұрын
best video so far, thank you for this
@morena-jackson2 ай бұрын
Love this type of video, thanks so much for really going into LMStudio. I"ve had the program for a dew months now but never really played with.
@bigglyguy8429
2 ай бұрын
If you're using it for role-play also looking into Faraday, which is more setup for role-play and I find it runs the same models faster.
@morena-jackson
2 ай бұрын
@@bigglyguy8429 thank you!!
@eagleterry33492 ай бұрын
Looking forward to seeing you build a project .
@panagiotisgalinos13352 ай бұрын
Man, i like your videos. Very informative.
@supahfly_uk22 күн бұрын
Wow this is amazing, thank you.
@wakingdreamsroleplay2 ай бұрын
I am not a coder so I love your videos. This agents stuff excites and terrifies me. I would love to see a project that involves writing complicated documents such as text-based games (for improvised drama type activities) where the character descriptions have clue info about other players. So, the writing agent creates setting and background and character descriptions but an editor needs to go through the full document and check and make sure clues found in one character file also show up in other files and send back to writer to reiterate until everything checks out. I can do part of this with prompting but it always needs a lot of manual editing and so a way to automate would be nice. If you can demonstrate some similar process, I would certainly be thrilled. ;-)
@vibeymonk2 ай бұрын
Loved this video, please make a playlist out of it! More inclined for people like me.
@anubisai2 ай бұрын
Great video, Matt.
@lucademarco59692 ай бұрын
I would really love to see document q&a using lmstudio, because I think a lot of companies are interested in this kind of ai use.
@johnp90912 ай бұрын
Such a good way to test drive some of the local models. Great job on this and all of your other tutorials! I've really learned a lot from you vids.
@sperazza2 ай бұрын
fantastic, really great video
@armans44942 ай бұрын
Yes, please. Also publish your endpoints for consumers 🎉❤
@matthew_berman
2 ай бұрын
Endpoints?
@REDULE262 ай бұрын
Nice video as always 👍
@vladyslavkorenyak8722 ай бұрын
Wow, this is Gold! I would love a tutorial on how to integrate this into a website.
@mathieuboisvert68652 ай бұрын
Fantastic, I was just playing with this today. How would you integrate AnythingLLM with these different models running in parallel and interacting through crewAI?
@agentxyz2 ай бұрын
ty. great video
@aott67992 ай бұрын
Really excellent videos on LMStudio. Does it have the capability to access local files to update chats with data newer than the cut-off date of the LLM? I'd like to be able to input locally stored ebooks and generate summaries along the lines of "What is 'This Book' about?"
@teddygbg2 ай бұрын
Thanks for this Matthew! Super helpful overview.
@colmxbyrne2 ай бұрын
LM Studio really useful
@neugen10192 ай бұрын
Matt you tricked us yesterday with that 01 thing and that voice distraction
@erikjohnson9112
2 ай бұрын
I think that was actually real. The presenter just wants attention, they want to get noticed when they speak (a form of narcissism). I found the product to be interesting and the presentation to be distracting (a negative because it draws away from what is being presented).
@Akcvs
2 ай бұрын
@@erikjohnson9112 actually I think you are the narcissist and are just projecting. Transgender people have existed in every region of the Earth since before civilization itself. They're a real naturally occurring demographic. Get over it
@hiddenkirby2 ай бұрын
Thanks for this video. I love this development setup. How do you properly serve it all from a cloud?
@mikhailkalashnik0v2 ай бұрын
Great video thanks! Any known good models I can use for infosec & or application security (pen testing)?
@333dsteele12 ай бұрын
Great video
@YusriSalleh2 ай бұрын
Excellent video. Tqvm! Is there any video considering various option of hardwares for running local? Say various Nvidia GPUS, or AMD Rocm or even apple metal ?
@justinrose86612 ай бұрын
Thanks Matt! You're my favorite AI KZreadr. I'd love to see you build something cool, we all would I'm sure.
@punishedproduct2 ай бұрын
Agents working locally!!❤
@TomHimanen2 ай бұрын
Please demo LM Studio and Crew AI as combo! Also this was a great demo, thanks!
@carthagely1222 ай бұрын
Thanks alot
@tech-vp5xe2 ай бұрын
Hey Matt, been following you for a long time awesome work. Does LM Studio allow different models to exist on separate GPU's
@matthew_berman
2 ай бұрын
Like if you have multiple GPUs? I don’t think so
@drlordbasil2 ай бұрын
LM studios and their model server is soooo easy.
@drlordbasil
2 ай бұрын
im hoping they add combining and fine tuning functions or at least image/other gen models in future.
@myandrobox34272 ай бұрын
This is awesome! Thanks for sharing again!! I have quick question... I want to run this on server like you showed, create a nocode app using APIs, and have users access this application. Kinda creating for a local group of users. How do you think this is going to work in terms of machine requirements. Please guide if it's good approach! 🙏🏻
@dieselphiend2 ай бұрын
Considering what I was going through to install models previously, this is dumbfoundingly simple.
@alanmckeon83212 ай бұрын
Could you use these to help build an application?
@thewatersavior2 ай бұрын
branching chat would be a dope chrome plugin
@faultyogiАй бұрын
👌"I have learned a lot."
@Myplaylist892Ай бұрын
Is it possible to set folders within the LLMStudio in order to do some document querying?
@mendthedivide2 ай бұрын
what kinda chip does your laptop have Matthew Berman? m1 m2 m3?
@fruitpnchsmuraiG9 күн бұрын
Hey, can you share some advice for an undergrad to get into generative models, where to begin or rather what should i learn to understanding the working of LLMs and playing around with them?
@sizwemsomi2392 ай бұрын
this is fire
@NahFam132 ай бұрын
I'd love to see videos on running specific models. I've been able to run almost every model I've downloaded but I can't seem to get StarCoder or StarCoder2 regardless of the preset I use and I've love to get it running without hallucinations, or looping the same sentence. Another thing I'd love to know is what happened to TheBloke!! I heard he stopped converting models and I'm sure the reason behind has to be epic.
@Paulina-ds5bg2 ай бұрын
Can someone suggest the way/tool when can I train open source model with custom data? For example with pdf.? And giving a questions in terms the given data?
@profitunist587622 күн бұрын
Hi, I was wondering how you'd download gated models, like meta-llama/Meta-Llama-3-8B? There are quantized versions from other authors but I'd prefer to download the actual one released by Meta. Thanks
@babbagebrassworks42782 ай бұрын
Doesn't work on my Pi5 but Ollama does. They only seem to use memory when answering a prompt, so I can multiple version of Ollama running with different models, as long as I only prompt one at a time. AMD Ryzen AI and Intel Core Ultra have NPUs onboard now so no need for big GPU card.
@imacuser1012 ай бұрын
use autogen to create a series of agents with their agent builder to run a task
@rickevans79412 ай бұрын
How can good content from a genuine, ethical and knowledgeable person only have 200k subs in this niche when DS has 143k with his empty parroting grift? You are headed to ,,, broski, hope you're ready. You deserve it.
@jeffg46862 ай бұрын
They should get a visual node setup like comfy UI but for creating purely llm based apps. Some generate button or whatever spits out the python code (or just runs it) Not that it's hard to write, but some might prefer the node setup.
@Al-Storm2 ай бұрын
I run anythingllm on top of this. It has a nice rag setup.
@berer.2 ай бұрын
Can you install Grok from the download provided by Elon Musk? Or do you always have to use their dl link?
@horriblyblue62032 ай бұрын
Hello Matthew, does LMStudio supports AMD RX GPUs to power LLM? Because i can't figure it out, and it still uses my CPU only.
@nannan33472 ай бұрын
If LMStudio had a built in RAG it would be perfection.
@mydogsbutler2 ай бұрын
One thing which would be useful... how to sync models between LM studio and Ollama (to avoid duplication to save space). LM Studios defaults Local models in windows is: C:\Users\User\.cache\lm-studio\models When I point it to Ollama's in Windows 11 WSL2 ubuntu 22.04 it doesn't work ( \\wsl.localhost\Ubuntu-22.04\|usr\share\ollama\.ollama\models) Anyone know the answer? Is it even possible?
@irom772 ай бұрын
Hey, what laptop would you recommend for this stuff ?
@user-ef4df8xp8p2 ай бұрын
LMStudio is cool.....Please, make more videos on this tool....
@trazercreations84782 ай бұрын
also you can use this with AnythingLLM for documents
@racerx1777
2 ай бұрын
DO NOT USE ANYTHINGLLM it is no longer free! Im starting to see a pattern with this video creator. As soon as he releases a video these so called free things all of the sudden become paid for versions! I used AnythingLLM one night on this computer after watching this guys video on AnythingLLM the very next morning at work i went to put it on a computer at work and it was no longer free but subscription based! I AM DONE WITH THE MONEY GAMES BASED ON PRINCIPAL! THEY ARE ALL TRYING TO CASH IN ON THIS CRAP THAT AMOUNTS TO NOTHING MORE THAN A GIMMICK A AI TREND IF YOU WILL!!! BANKRUPT THESE PEOPLE!
@rodrimora2 ай бұрын
Does it support exl2 quants? If not that would be the only thing imssing to make the switch from textgen web ui
@chrisb90452 ай бұрын
Hi, it is possible to upload to LM studio your own document files ex excel files, photos, pdf, txt?
@infernosfmatt2 ай бұрын
Can u make or do you have a vid on how LLMs are created?
@Mangini0372 ай бұрын
Yes pleeeease do a video using AutoAgent tutorial. Thanks.
@therighteousagentАй бұрын
will there be a memory and roleplay function implemented?
@TheBlaser552 ай бұрын
Mathew, it would be great if there was a reference listing of all your videos so we could just pick a topic and find your videos that apply so we can watch them, again..... I think you did a video with agents before that did something simular, just wish they were easier to find
@michai3332 ай бұрын
I can’t believe how fast it’s running, fully offloaded, on my GPU (OC 4090). I’m using Mistral Claud merged 7B Q8.
@KaiPhox2 ай бұрын
There are many models of Grok to download. Do I need all the files or is there a one click download for LM studio?
@WylieWaspАй бұрын
Hi Matthew just try to sign up for the newsletter something is broken just to let you know
@timduck8506Ай бұрын
Hmm so which is better for a newbie VS coder dev, LM studio or Ollama
@thegooddoctor67192 ай бұрын
So the big question is, When are they going to start charging you money to use LMStudio ?????? You got the best content as usual !!!!!!!!!!
@MeinDeutschkurs2 ай бұрын
Throttle: genius! LM-Studio got definitely improved. Gorgeous video!
@robottalks73122 ай бұрын
crew ai with claude 3 is it possible ? comparison with Devin
@jackiekerouac20902 ай бұрын
Would that version be good for a professional translator from English to Spanish?
@user-bd8jb7ln5g2 ай бұрын
I'm especially interested in the ability to generate multiple responses from the same model then selecting the best one. Can LM Studio do that at this time?
@DaveEtchellsАй бұрын
Amazing to me how little RAM the models use. Time for me to get a maxed M3 Max MBP I guess, although gonna wait till after the May 7 event JIC. (I know very unlikely to have any MBP hardware impact, but I’m cautious 🙂)
@001110002 ай бұрын
In terms of being lightweight, how does this compare to Ollama?
@Brax1982Ай бұрын
I don't think that's how the compatibility filter works. I checked for a couple models that are in a repo with tensor files. They did not show up if that filter was active. I guess they include "does not work on LM Studio" as being not compatible with your system. Because it isn't compatible with their tool? At first I thought that it's not true that every model on HF could be found on there. But for those that were missing, I then saw that they show up if that filter is off. Of course, this is a sample size of trying 3 models or something. May be outliers. But do any models that are not GGUF work on LM Studio?
@Shari_TejpАй бұрын
can someone explain me importance of Ram VS Gpu vram? because for example on dolphin mixtral website the creator says you need at least 32 Gb of ram, and he's not mentioning nothing about GPU. and i taught the file size (for example 32 GB's) gets loded fully on your RAM, not on your gpu vram... im confused
@Parisneo2 ай бұрын
LM studio is a nice project. The only complaint I have is that it is not open source. People can use LM Studio along with lollms as it can be run as a server and they seem to get very good output. So yeah, this is a very cool and useful tool.
@aimademerich2 ай бұрын
Phenomenal
@jamesnaftalin61032 ай бұрын
I have quite an old machine with not much of a GPU, can you make LM Studio run in the cloud?
@ErickJohnson-qx8tb2 ай бұрын
You need to do a troubleshooting shoot video when the download folder gets moved it ruins everything and I cant seem to line realign path
@GetzAI2 ай бұрын
Matthew, you should conduct a Mac vs PC Local LLM comparisons. M2,M3, RTX 4090. MBP vs Studio.
@dreamyrhodes2 ай бұрын
Wow finally an UI with documentation? I always hated that they throw all that GGUF, GPTQ, 4Q 5Q _K_M_S at you without ever telling what it means and what it needs to run.
@JoaoWilliamRodriguesCardoso2 ай бұрын
Would there be any app similar to LM studio for mobile phone? I would love to run these open source LLM on my phone.
@u-save59892 ай бұрын
How to set it up on a server so I can sell access to chats of trained agents. Like GPTs in OpenAI shop. What GPU RAM CPU needed to use it per 5k daily users and if concurrent - how to calculate this and not crash server. Also - GROK, is it supported?
@ricardocnn2 ай бұрын
It also integrates with langchain and llamaindex
@farorasyid18322 ай бұрын
hi. what common computer spec for this ? thanks
@duckpear24422 ай бұрын
2 questions: (1) do these models run offline (good for private data?); and (2) is LMStudio better than ollama?
@phobes2 ай бұрын
I like LM Studio but I've had an issue with every single model hallucinating when I try to use the built-in server.