Microsoft Phi-3 Vision-the first Multimodal model By Microsoft- Demo With Huggingface

Phi-3-vision is the first multimodal model in the Phi-3 family, bringing together text and images, and the ability to reason over real-world images and extract and reason over text from images. It has also been optimized for chart and diagram understanding and can be used to generate insights and answer questions. Phi-3-vision builds on the language capabilities of the Phi-3-mini, continuing to pack strong language and image reasoning quality in a small model.
Phi-3-vision can generate insights from charts and diagrams:
Code Link: colab.research.google.com/dri...
----------------------------------------------------------------------------------------
Support me by joining membership so that I can upload these kind of videos
/ @krishnaik06
-----------------------------------------------------------------------------------
►GenAI on AWS Cloud Playlist: • Generative AI In AWS-A...
►Llamindex Playlist: • Announcing LlamaIndex ...
►Google Gemini Playlist: • Google Is On Another L...
►Langchain Playlist: • Amazing Langchain Seri...
►Data Science Projects:
• Now you Can Crack Any ...
►Learn In One Tutorials
Statistics in 6 hours: • Complete Statistics Fo...
End To End RAG LLM APP Using LlamaIndex And OpenAI- Indexing And Querying Multiple Pdf's
Machine Learning In 6 Hours: • Complete Machine Learn...
Deep Learning 5 hours : • Deep Learning Indepth ...
►Learn In a Week Playlist
Statistics: • Live Day 1- Introducti...
Machine Learning : • Announcing 7 Days Live...
Deep Learning: • 5 Days Live Deep Learn...
NLP : • Announcing NLP Live co...
---------------------------------------------------------------------------------------------------
My Recording Gear
Laptop: amzn.to/4886inY
Office Desk : amzn.to/48nAWcO
Camera: amzn.to/3vcEIHS
Writing Pad:amzn.to/3OuXq41
Monitor: amzn.to/3vcEIHS
Audio Accessories: amzn.to/48nbgxD
Audio Mic: amzn.to/48nbgxD

Пікірлер: 22

@krishnaik06Ай бұрын
Join my data science community discord group where we discuss many things. Happy Learning!! discord.gg/u7q6ZNSH
@KevinKreger18 күн бұрын
It's a great model. Very useful. Thanks Krish.
@mishrajii5298Ай бұрын
Thanks, sir I have watched your videos and I learned a lot
@AshrafAli-yb4tlАй бұрын
Awesome content Krish, you are really inspiring a generation who are interested in genai
@twinklepardeshi3113Ай бұрын
Amazing stuff !❤
@raph8240Ай бұрын
Thank you for your contribution to the open source community. Pls make video on crewai agents creation
@ashraf_isbАй бұрын
thanks again sir!
@smitparikh3969Ай бұрын
Hello Krish, thank you so much for the amazing video. Can you please make a video explaining the architecture of multimodal LLMs?
@maheshkuttymarar2694Ай бұрын
Hey Krish!! Please start a playlist on evaluation methods and techniques of LLM applications please.
@lenovo57787Ай бұрын
Hi Krish, can you please do an end-to-end ML model or project using Kubernetes? Every company is asking about deploy, deploy, deploy and they want us to have practical experience using Kubernetes. Something more than just a basic tutorial.
@Aditya-on9roАй бұрын
Is there any future plan to create a hugging face course
@krishnaik06
Ай бұрын
Yes comin up soon
@moderxАй бұрын
Thanks a lot sir , I have learned much about a.i from you For 1 year almost. And I'm upgrading my pc for a.i workflow, which setup should I consider single GPU or multi GPU.
@moderx
Ай бұрын
Please guide me sir , I want to become A.I Engineer.
@rishiraj2548Ай бұрын
🙏💯👍
@IdPreferNot1Ай бұрын
Can you do the equivalent but simpler with the new Hugging face Langchain SDK?
@amitguitarist2008Ай бұрын
I think it needs A100. Is it possible to run in free GPUs
@commoncats5437Ай бұрын
I was stucked here if anyone guide me i will get good idea…. Thanks krish ❤
@ibrahimmuhammad5414Ай бұрын
Please be sharing the link to the codes in the video
@RamaChandran-fc3hpАй бұрын
Sir talk about alpha fold 3
@sameerjadhav560329 күн бұрын
5:15 its 'CAUSAL' and NOT 'Casual'

Microsoft Phi-3 Vision-the first Multimodal model By Microsoft- Demo With Huggingface

Пікірлер: 22

@krishnaik06

Ай бұрын

@moderx

Ай бұрын

Келесі