Microsoft Phi-3 Vision-the first Multimodal model By Microsoft- Demo With Huggingface

Phi-3-vision is the first multimodal model in the Phi-3 family, bringing together text and images, and the ability to reason over real-world images and extract and reason over text from images. It has also been optimized for chart and diagram understanding and can be used to generate insights and answer questions. Phi-3-vision builds on the language capabilities of the Phi-3-mini, continuing to pack strong language and image reasoning quality in a small model.
Phi-3-vision can generate insights from charts and diagrams:
Code Link: colab.research.google.com/dri...
----------------------------------------------------------------------------------------
Support me by joining membership so that I can upload these kind of videos
/ @krishnaik06
-----------------------------------------------------------------------------------
►GenAI on AWS Cloud Playlist: • Generative AI In AWS-A...
►Llamindex Playlist: • Announcing LlamaIndex ...
►Google Gemini Playlist: • Google Is On Another L...
►Langchain Playlist: • Amazing Langchain Seri...
►Data Science Projects:
• Now you Can Crack Any ...
►Learn In One Tutorials
Statistics in 6 hours: • Complete Statistics Fo...
End To End RAG LLM APP Using LlamaIndex And OpenAI- Indexing And Querying Multiple Pdf's
Machine Learning In 6 Hours: • Complete Machine Learn...
Deep Learning 5 hours : • Deep Learning Indepth ...
►Learn In a Week Playlist
Statistics: • Live Day 1- Introducti...
Machine Learning : • Announcing 7 Days Live...
Deep Learning: • 5 Days Live Deep Learn...
NLP : • Announcing NLP Live co...
---------------------------------------------------------------------------------------------------
My Recording Gear
Laptop: amzn.to/4886inY
Office Desk : amzn.to/48nAWcO
Camera: amzn.to/3vcEIHS
Writing Pad:amzn.to/3OuXq41
Monitor: amzn.to/3vcEIHS
Audio Accessories: amzn.to/48nbgxD
Audio Mic: amzn.to/48nbgxD

Пікірлер: 22

  • @krishnaik06
    @krishnaik06Ай бұрын

    Join my data science community discord group where we discuss many things. Happy Learning!! discord.gg/u7q6ZNSH

  • @KevinKreger
    @KevinKreger18 күн бұрын

    It's a great model. Very useful. Thanks Krish.

  • @mishrajii5298
    @mishrajii5298Ай бұрын

    Thanks, sir I have watched your videos and I learned a lot

  • @AshrafAli-yb4tl
    @AshrafAli-yb4tlАй бұрын

    Awesome content Krish, you are really inspiring a generation who are interested in genai

  • @twinklepardeshi3113
    @twinklepardeshi3113Ай бұрын

    Amazing stuff !❤

  • @raph8240
    @raph8240Ай бұрын

    Thank you for your contribution to the open source community. Pls make video on crewai agents creation

  • @ashraf_isb
    @ashraf_isbАй бұрын

    thanks again sir!

  • @smitparikh3969
    @smitparikh3969Ай бұрын

    Hello Krish, thank you so much for the amazing video. Can you please make a video explaining the architecture of multimodal LLMs?

  • @maheshkuttymarar2694
    @maheshkuttymarar2694Ай бұрын

    Hey Krish!! Please start a playlist on evaluation methods and techniques of LLM applications please.

  • @lenovo57787
    @lenovo57787Ай бұрын

    Hi Krish, can you please do an end-to-end ML model or project using Kubernetes? Every company is asking about deploy, deploy, deploy and they want us to have practical experience using Kubernetes. Something more than just a basic tutorial.

  • @Aditya-on9ro
    @Aditya-on9roАй бұрын

    Is there any future plan to create a hugging face course

  • @krishnaik06

    @krishnaik06

    Ай бұрын

    Yes comin up soon

  • @moderx
    @moderxАй бұрын

    Thanks a lot sir , I have learned much about a.i from you For 1 year almost. And I'm upgrading my pc for a.i workflow, which setup should I consider single GPU or multi GPU.

  • @moderx

    @moderx

    Ай бұрын

    Please guide me sir , I want to become A.I Engineer.

  • @rishiraj2548
    @rishiraj2548Ай бұрын

    🙏💯👍

  • @IdPreferNot1
    @IdPreferNot1Ай бұрын

    Can you do the equivalent but simpler with the new Hugging face Langchain SDK?

  • @amitguitarist2008
    @amitguitarist2008Ай бұрын

    I think it needs A100. Is it possible to run in free GPUs

  • @commoncats5437
    @commoncats5437Ай бұрын

    I was stucked here if anyone guide me i will get good idea…. Thanks krish ❤

  • @ibrahimmuhammad5414
    @ibrahimmuhammad5414Ай бұрын

    Please be sharing the link to the codes in the video

  • @RamaChandran-fc3hp
    @RamaChandran-fc3hpАй бұрын

    Sir talk about alpha fold 3

  • @sameerjadhav5603
    @sameerjadhav560329 күн бұрын

    5:15 its 'CAUSAL' and NOT 'Casual'