Google's New PaliGemma-Open Vision Language Model

PaliGemma is a powerful open VLM inspired by PaLI-3. Built on open components including the SigLIP vision model and the Gemma language model, PaliGemma is designed for class-leading fine-tune performance on a wide range of vision-language tasks. This includes image and short video captioning, visual question answering, understanding text in images, object detection, and object segmentation.
developers.googleblog.com/en/...
Code:colab.research.google.com/dri...
------------------------------------------------------------------------------------------------
Support me by joining membership so that I can upload these kind of videos
/ @krishnaik06
-----------------------------------------------------------------------------------
►GenAI on AWS Cloud Playlist: • Generative AI In AWS-A...
►Llamindex Playlist: • Announcing LlamaIndex ...
►Google Gemini Playlist: • Google Is On Another L...
►Langchain Playlist: • Amazing Langchain Seri...
►Data Science Projects:
• Now you Can Crack Any ...
►Learn In One Tutorials
Statistics in 6 hours: • Complete Statistics Fo...
End To End RAG LLM APP Using LlamaIndex And OpenAI- Indexing And Querying Multiple Pdf's
Machine Learning In 6 Hours: • Complete Machine Learn...
Deep Learning 5 hours : • Deep Learning Indepth ...
►Learn In a Week Playlist
Statistics: • Live Day 1- Introducti...
Machine Learning : • Announcing 7 Days Live...
Deep Learning: • 5 Days Live Deep Learn...
NLP : • Announcing NLP Live co...
---------------------------------------------------------------------------------------------------
My Recording Gear
Laptop: amzn.to/4886inY
Office Desk : amzn.to/48nAWcO
Camera: amzn.to/3vcEIHS
Writing Pad:amzn.to/3OuXq41
Monitor: amzn.to/3vcEIHS
Audio Accessories: amzn.to/48nbgxD
Audio Mic: amzn.to/48nbgxD

Пікірлер: 19

  • @ariouathanane
    @ariouathanane9 күн бұрын

    Thanks a lot, could you please provide How to Fine-tune PaliGemma for Object Detection Tasks?

  • @adityaramesh551

    @adityaramesh551

    6 күн бұрын

    Dafaq will you use VLM for object detection?

  • @steverogers34
    @steverogers3415 күн бұрын

    Sir is any model read horoscope or astrology

  • @maxinteltech3321
    @maxinteltech332115 күн бұрын

    I like your new look❤ welcome to the bald guys club 🎉

  • @rishiraj2548

    @rishiraj2548

    15 күн бұрын

    😃

  • @keedabyte

    @keedabyte

    14 күн бұрын

    😂

  • @mohsenghafari7652
    @mohsenghafari765214 күн бұрын

    thanks krish

  • @Sci-PiExplained
    @Sci-PiExplained15 күн бұрын

    Sir genrative ai for web developers

  • @HDSV10
    @HDSV1014 күн бұрын

    Chat Q n A with KZread video transcript by uploading yt link + multilingual text to speech sir make this project video

  • @satyamoahnty
    @satyamoahnty15 күн бұрын

    This is very bad at extracting key information from images

  • @rohansai715
    @rohansai71515 күн бұрын

    Hello is there anyone interested to collab and do a project ?

  • @gunavardhan000

    @gunavardhan000

    15 күн бұрын

    Yeah intrested in ml / genai projects

  • @CodeWonders_

    @CodeWonders_

    14 күн бұрын

    No ⌚

  • @annu8276

    @annu8276

    14 күн бұрын

    Yes I am interested to do project

  • @akshaysrivastava4304

    @akshaysrivastava4304

    11 күн бұрын

    sure

  • @lalithX1406
    @lalithX140613 күн бұрын

    anyone intrested in doing realtime projects using GENAI ?

  • @akshaysrivastava4304

    @akshaysrivastava4304

    11 күн бұрын

    yes

  • @AbhishekJain-lw5pe

    @AbhishekJain-lw5pe

    9 күн бұрын

    Yes