Selenium Headless Scraping For Servers & Docker

Science & Technology

In this video we learn how to do web scraping with Selenium in a server or container environment.
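
A minimal sketch of what a headless Chrome setup for a server or container can look like. The flags `--headless`, `--no-sandbox` and `--disable-dev-shm-usage` are commonly used Chrome switches for this kind of environment and are an assumption here, not a transcript of the video; `--disable-gpu` is the flag one commenter below reports needing. The helper names `headless_flags` and `make_headless_driver` are hypothetical.

```python
def headless_flags(disable_gpu: bool = False) -> list[str]:
    """Return Chrome command-line flags for running without a display."""
    flags = [
        "--headless",               # no visible browser window
        "--no-sandbox",             # often needed when Chrome runs as root in a container
        "--disable-dev-shm-usage",  # avoid relying on a small /dev/shm in containers
    ]
    if disable_gpu:
        flags.append("--disable-gpu")  # optional, reported to silence a GPU error
    return flags


def make_headless_driver(disable_gpu: bool = False):
    """Create a Chrome driver with the flags above (requires selenium and Chrome)."""
    # Imported lazily so the flag helper works even without selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options

    options = Options()
    for flag in headless_flags(disable_gpu):
        options.add_argument(flag)
    return webdriver.Chrome(options=options)
```

With Chrome and selenium installed, `driver = make_headless_driver()` followed by `driver.get(url)` and `driver.quit()` would be the typical usage.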

Comments: 62

  • @Akshatgiri (7 months ago)

    Good man. This was super helpful. Easily saved me 5+ hours of searching around.

  • @roflcopter645 (8 months ago)

    This video tutorial came at the perfect time. I'm currently working on a project that scrapes from a docker container, and I've been struggling to find out how to make it work. Thank you NeuralNine.

  • @amodseth8448 (8 months ago)

    It was very helpful, thank you! I'll definitely keep this in mind ❤

  • @martinloeffler2119 (2 months ago)

    Thanks a lot. I had struggled since yesterday to get Selenium up and running inside Docker. This works perfectly.

  • @andriezergalvamontero8978 (3 months ago)

    Truly one of the most useful videos on Python and Selenium. You're a crack!

  • @GeorgeChar95 (8 months ago)

    Thanks for the awesome video! This is exactly what I needed for my project!

  • @user-rb6gk4jq1m (2 months ago)

    Thank you from St. Petersburg! Your video helped me a lot in my automation work project. Now I can continue to create the project.

  • @joseantonioromeroespejo160 (5 months ago)

    "Great video man. Very helpful and well explained. Thank you very much!!!"

  • @desouzafelipe (7 months ago)

    Thank you so much for posting this video, it solves exactly what was blocking me!

  • @vkfalan (8 months ago)

    Great tutorial, thank you for your efforts !!

  • @bernardosilva697 (7 months ago)

    You saved me, you won a new subscriber.

  • @DanielLima97dlcs (6 months ago)

    Thank you my bro! Works like a charm!

  • @reaganlopezmusic (4 months ago)

    Thank you. This was really helpful.

  • @jaimesaldarriaga2910 (7 months ago)

    Thanks, this is an incredibly useful video.

  • @muhammadumer4127 (5 months ago)

    300K subscribers ❤🖤 Congratulations, man. All of your videos are always good and helpful. Keep it up. Thank you.

  • @agentDueDiligence (5 months ago)

    Very helpful video! Thank you

  • @paulthomas1052 (8 months ago)

    Useful utility - thanks !

  • @devmts (4 months ago)

    thanks! greetings from Brazil.

  • @davidlopezfelix3668 (1 month ago)

    Awesome video. I ran it as in the video and it worked!! Thanks

  • @TheShox79 (8 months ago)

    This is great! Thanks!

  • @felipe1990batista (6 months ago)

    Thanks a lot. from Brazil

  • @vitorsilva-or1dj (23 days ago)

    thanks bro! you solved my problem

  • @o2c4r1 (6 months ago)

    Thanks man!

  • @taz2177 (2 months ago)

    I had been trying to set up Chrome and ChromeDriver for the Docker image for the past 5 hours, and ChatGPT had me swinging from one command to another. Finally your 16-minute video helped me, thanks a ton. Alhamdulillah. AI cannot replace devs; today I finally got a taste of that.

  • @nygma6 (2 months ago)

    As-salamu alaykum. Which URL did you use to download Google Chrome? The one in the video returns a 404.

  • @po6577 (8 months ago)

    There is another way: you can use a remote WebDriver (set it up on a remote server with the official Selenium Docker image). Then run the scraping part on the remote.

  • @allailqadrillah (3 months ago)

    Can you provide some reference? I want to find out more.
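
A rough sketch of the remote-driver approach mentioned in this thread, assuming a Selenium server is already running from one of the official Selenium Docker images (for example `selenium/standalone-chrome`) and is reachable on port 4444. The host name, the port default and the helper names `remote_url` and `make_remote_driver` are illustrative assumptions, not something taken from the video.

```python
def remote_url(host: str = "localhost", port: int = 4444) -> str:
    """Build the address of a Selenium server (4444 is the usual default port)."""
    return f"http://{host}:{port}"


def make_remote_driver(host: str = "localhost", port: int = 4444):
    """Connect to a browser that runs in another container or on another machine."""
    # Imported lazily so remote_url() can be used without selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options

    options = Options()  # Chrome options are sent to the remote server
    return webdriver.Remote(command_executor=remote_url(host, port), options=options)
```

With this layout the scraping script needs only the selenium package; Chrome and its driver live in the remote container.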

  • @ammadkhan4687 (5 months ago)

    You are a genius

  • @digitalmachine0101 (10 days ago)

    Good information

  • @sahil5124 (7 months ago)

    Thanks man, this is very helpful. Can you also create one for Scrapy? What are the areas we should be concerned about when deploying a service that requires Scrapy?

  • @buzadam1144 (8 months ago)

    Thx bro

  • @thandokuhlebrianmsane7043 (1 month ago)

    You might want to revisit the documentation and see that some modifications have been made. Thanks.

  • @kanwaradnan4849 (8 months ago)

    Nice trick

  • @richardmarch3750 (8 months ago)

    Thank you so much for creating such helpful videos! Can you make a video on how to make an AI Spotify playlist generator where each track seamlessly transitions into the next?

  • @LegionLeague (8 months ago)

    Great video! Quick question: if you need to scrape several pages from your website, is it possible to make it async and print the results as soon as selenium is done scraping each page as opposed to printing the whole thing after every page is scraped? If so, I would love to see a video on that topic.
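
One possible way to get results as soon as each page is done, sketched with a thread pool rather than asyncio because Selenium's Python API is blocking. `scrape_as_completed` and `scrape_title` are hypothetical names, and the per-page job is passed in as a parameter so it can be swapped out.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from typing import Callable, Iterable, Iterator


def scrape_as_completed(
    urls: Iterable[str],
    scrape_page: Callable[[str], str],
    max_workers: int = 4,
) -> Iterator[tuple[str, str]]:
    """Yield (url, result) pairs in the order the pages finish, not the input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(scrape_page, url): url for url in urls}
        for future in as_completed(futures):
            yield futures[future], future.result()


def scrape_title(url: str) -> str:
    """Hypothetical per-page job: one browser per call, so no driver is shared across threads."""
    from selenium import webdriver  # lazy import: needs selenium and Chrome
    from selenium.webdriver.chrome.options import Options

    options = Options()
    options.add_argument("--headless")
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        return driver.title
    finally:
        driver.quit()
```

A loop such as `for url, title in scrape_as_completed(urls, scrape_title): print(url, title)` would then print each result as it arrives. Starting one browser per page is simple but heavy, so a small `max_workers` is a sensible default.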

  • @richardhoppe4991 (7 months ago)

    In the main file I was getting the error "Failed to send GpuControl.CreateCommandBuffer" when I ran the script locally. Adding chrome_options.add_argument('--disable-gpu') made the error go away. Just in case anyone else is running into that error message.

  • @UnderworldCoder (8 months ago)

    Nice. It would be great if you did a video on SeleniumBase, using `from seleniumbase import SB`.

  • @JustinK0 (5 months ago)

    My current setup works in a Docker container when I run it on Windows, but when I pull it to my EC2 instance on AWS, it doesn't work: it tries to go to the URL to get the data but just takes forever and then times out.

  • @Franx570 (8 months ago)

    Wouldn't it be better to use Selenium Grid instead? Then I could use the Grid as a driver instead of doing all that.

  • @haggard17335 (6 months ago)

    Hi, for several days I have been dealing with a problem that I cannot solve. I have a script that obtains the profile URL, but on some profiles it does not work. I made sure that the selectors are valid on both profiles, as is the HTML structure. I am running my code on a DigitalOcean server with Linux and no graphical interface.

  • @vihari2010 (8 months ago)

    Can we use Puppeteer to do this? Also, please make a video on Puppeteer.

  • @kanwaradnan4849 (3 months ago)

    Updated! Too bad it didn't work for Amazon.

  • @wallarichard8981 (8 months ago)

    Hi, I probably missed that information, but why is Selenium necessary? Can't you get the HTML content with Beautiful Soup?

  • @soul_maestro (8 months ago)

    Selenium is handy for JavaScript-heavy websites where you need a browser to execute the JavaScript to render parts of the site. With Beautiful Soup you'll pull in the bare HTML by itself and have to pull all the JavaScript separately and execute it correctly.
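
To illustrate the point about bare HTML, here is a small sketch that parses a made-up page with Python's standard-library `html.parser` (standing in for Beautiful Soup, which behaves the same way in this respect). The page content `STATIC_HTML` and the class `DivTextParser` are invented for the example. The `<div>` is filled in by a script, so a parser that does not run JavaScript finds it empty.

```python
from html.parser import HTMLParser

# Made-up page: the price only appears after the browser runs the script.
STATIC_HTML = """
<html><body>
  <div id="prices"></div>
  <script>
    document.getElementById("prices").textContent = "42 EUR";
  </script>
</body></html>
"""


class DivTextParser(HTMLParser):
    """Collect the text found inside <div id="prices">."""

    def __init__(self):
        super().__init__()
        self._inside = False
        self.text = ""

    def handle_starttag(self, tag, attrs):
        if tag == "div" and dict(attrs).get("id") == "prices":
            self._inside = True

    def handle_endtag(self, tag):
        if tag == "div":
            self._inside = False

    def handle_data(self, data):
        if self._inside:
            self.text += data


parser = DivTextParser()
parser.feed(STATIC_HTML)
print(repr(parser.text))  # prints '' because the script was never executed
```

A real browser driven by Selenium would run the script first, so the same element would contain the rendered text.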

  • @joseantonioromeroespejo160 (4 months ago)

    I've followed the steps, and it works correctly on my PC, but when deploying it on AWS EC2, Selenium fails and doesn't scrape. Do you know what could be causing this?

  • @jerick242 (8 months ago)

    Does it work with Streamlit Cloud?

  • @marcelfranca5304 (2 months ago)

    Since the 'new headless', it is not working for me anymore. Do you know how to make it work?

  • @aryangoel5578 (26 days ago)

    The Dockerfile isn't working: the dependencies for Chrome aren't getting installed in the Docker container.

  • @manickpillai (8 months ago)

    Good tutorial. Minor correction on the wording at 9:55: it's building a `docker image` from a Dockerfile. Then from the image we run a container using `docker run`.

  • @chillfill4866 (6 months ago)

    Does anyone know any good cloud options? I want my scraping script running 19 hours/day, and obviously that's expensive.

  • @hrithiksharma2047 (8 months ago)

    Wouldn't it be much easier to use firefox instead of chrome?

  • @collinsikotun1436 (2 months ago)

    This doesn't work on my M1 Mac, any suggestions?

  • @philtoa334 (8 months ago)

    Thx_.

  • @alexs7612 (3 months ago)

    No github code?

  • @cbacca2999 (29 days ago)

    I have Python 3.11 on Windows 10. I'm just using a text editor to edit the Python program and I'm using a virtual environment in my cmd.exe shell. On the line "driver = webdriver.Chrome(service=Service(ChromeDriverManager().install()))" I get this error: "'powershell' is not recognized as an internal or external command, operable program or batch file. 'powershell' is not recognized as an internal or external command, operable program or batch file." It looks like Selenium supports Python 3.11, so that shouldn't be an issue. I also have Selenium 4.21.0. Any idea how to fix this?

  • @ekopras6095 (5 months ago)

    Bro, you don't need ChromeDriver? Why does it work normally?

  • @TrNgTinh (5 months ago)

    Because webdriver manager automatically installs chromedriver, I think.
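
A short sketch of the pattern this reply refers to, based on the line quoted in an earlier comment. It assumes the third-party `webdriver-manager` package is installed alongside selenium; the function name `make_driver_with_webdriver_manager` is hypothetical.

```python
def make_driver_with_webdriver_manager():
    """Let webdriver-manager fetch a matching chromedriver, then start Chrome."""
    # Imported lazily: needs the selenium and webdriver-manager packages.
    from selenium import webdriver
    from selenium.webdriver.chrome.service import Service
    from webdriver_manager.chrome import ChromeDriverManager

    # install() downloads chromedriver (or reuses a cached copy) and returns its path.
    driver_path = ChromeDriverManager().install()
    return webdriver.Chrome(service=Service(driver_path))
```

Recent Selenium releases (4.6 and later) also ship their own driver resolution, Selenium Manager, so a plain `webdriver.Chrome()` may work without this helper.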

  • @AliceShisori (7 months ago)

    dude how come you know everything :D

  • @i4i3i360 (8 months ago)

    First comment bro ❤

  • @alex_law_codes (3 months ago)

    Has anyone run into this error: executor failed running [/bin/sh -c apt install -y ./google-chrome-stable_current_amd64.deb]: exit code: 100

  • @tianniezing1941 (3 months ago)

    Yes, me. Have you found a solution yet? I still can't figure out what the problem is.
