Microsoft Phi-3 Vision-the first Multimodal model By Microsoft- Demo With Huggingface

Phi-3-vision is the first multimodal model in the Phi-3 family, bringing together text and images, and the ability to reason over real-world images and extract and reason over text from images. It has also been optimized for chart and diagram understanding and can be used to generate insights and answer questions. Phi-3-vision builds on the language capabilities of the Phi-3-mini, continuing to pack strong language and image reasoning quality in a small model.
Phi-3-vision can generate insights from charts and diagrams:
Code Link: colab.research.google.com/dri...
----------------------------------------------------------------------------------------
Support me by joining membership so that I can upload these kind of videos
/ @krishnaik06
-----------------------------------------------------------------------------------
►GenAI on AWS Cloud Playlist: • Generative AI In AWS-A...
►Llamindex Playlist: • Announcing LlamaIndex ...
►Google Gemini Playlist: • Google Is On Another L...
►Langchain Playlist: • Amazing Langchain Seri...
►Data Science Projects:
• Now you Can Crack Any ...
►Learn In One Tutorials
Statistics in 6 hours: • Complete Statistics Fo...
End To End RAG LLM APP Using LlamaIndex And OpenAI- Indexing And Querying Multiple Pdf's
Machine Learning In 6 Hours: • Complete Machine Learn...
Deep Learning 5 hours : • Deep Learning Indepth ...
►Learn In a Week Playlist
Statistics: • Live Day 1- Introducti...
Machine Learning : • Announcing 7 Days Live...
Deep Learning: • 5 Days Live Deep Learn...
NLP : • Announcing NLP Live co...
---------------------------------------------------------------------------------------------------
My Recording Gear
Laptop: amzn.to/4886inY
Office Desk : amzn.to/48nAWcO
Camera: amzn.to/3vcEIHS
Writing Pad:amzn.to/3OuXq41
Monitor: amzn.to/3vcEIHS
Audio Accessories: amzn.to/48nbgxD
Audio Mic: amzn.to/48nbgxD

Пікірлер: 22

  • @krishnaik06
    @krishnaik0625 күн бұрын

    Join my data science community discord group where we discuss many things. Happy Learning!! discord.gg/u7q6ZNSH

  • @KevinKreger
    @KevinKreger9 күн бұрын

    It's a great model. Very useful. Thanks Krish.

  • @mishrajii5298
    @mishrajii529825 күн бұрын

    Thanks, sir I have watched your videos and I learned a lot

  • @AshrafAli-yb4tl
    @AshrafAli-yb4tl25 күн бұрын

    Awesome content Krish, you are really inspiring a generation who are interested in genai

  • @raph8240
    @raph824025 күн бұрын

    Thank you for your contribution to the open source community. Pls make video on crewai agents creation

  • @ashraf_isb
    @ashraf_isb24 күн бұрын

    thanks again sir!

  • @twinklepardeshi3113
    @twinklepardeshi311323 күн бұрын

    Amazing stuff !❤

  • @smitparikh3969
    @smitparikh396923 күн бұрын

    Hello Krish, thank you so much for the amazing video. Can you please make a video explaining the architecture of multimodal LLMs?

  • @maheshkuttymarar2694
    @maheshkuttymarar269424 күн бұрын

    Hey Krish!! Please start a playlist on evaluation methods and techniques of LLM applications please.

  • @lenovo57787
    @lenovo5778724 күн бұрын

    Hi Krish, can you please do an end-to-end ML model or project using Kubernetes? Every company is asking about deploy, deploy, deploy and they want us to have practical experience using Kubernetes. Something more than just a basic tutorial.

  • @rishiraj2548
    @rishiraj254825 күн бұрын

    🙏💯👍

  • @moderx
    @moderx23 күн бұрын

    Thanks a lot sir , I have learned much about a.i from you For 1 year almost. And I'm upgrading my pc for a.i workflow, which setup should I consider single GPU or multi GPU.

  • @moderx

    @moderx

    23 күн бұрын

    Please guide me sir , I want to become A.I Engineer.

  • @commoncats5437
    @commoncats543725 күн бұрын

    I was stucked here if anyone guide me i will get good idea…. Thanks krish ❤

  • @IdPreferNot1
    @IdPreferNot125 күн бұрын

    Can you do the equivalent but simpler with the new Hugging face Langchain SDK?

  • @Aditya-on9ro
    @Aditya-on9ro25 күн бұрын

    Is there any future plan to create a hugging face course

  • @krishnaik06

    @krishnaik06

    25 күн бұрын

    Yes comin up soon

  • @RamaChandran-fc3hp
    @RamaChandran-fc3hp24 күн бұрын

    Sir talk about alpha fold 3

  • @ibrahimmuhammad5414
    @ibrahimmuhammad541424 күн бұрын

    Please be sharing the link to the codes in the video

  • @sameerjadhav5603
    @sameerjadhav560321 күн бұрын

    5:15 its 'CAUSAL' and NOT 'Casual'

  • @amitguitarist2008
    @amitguitarist200824 күн бұрын

    I think it needs A100. Is it possible to run in free GPUs