Web Scraping with Professional Proxy Servers in Python

Ғылым және технология

Today we learn how to do professional Web Scraping in Python by using reliable and high quality proxy servers from Oxylabs.
Oxylabs: oxylabs.io/pages/neuralnine?u...
Promo Code: NN25
Site To Scrape: books.toscrape.com/
◾◾◾◾◾◾◾◾◾◾◾◾◾◾◾◾◾
📚 Programming Books & Merch 📚
🐍 The Python Bible Book: www.neuralnine.com/books/
💻 The Algorithm Bible Book: www.neuralnine.com/books/
👕 Programming Merch: www.neuralnine.com/shop
🌐 Social Media & Contact 🌐
📱 Website: www.neuralnine.com/
📷 Instagram: / neuralnine
🐦 Twitter: / neuralnine
🤵 LinkedIn: / neuralnine
📁 GitHub: github.com/NeuralNine
🎙 Discord: / discord
🎵 Outro Music From: www.bensound.com/
Timestamps:
(0:00) Intro
(0:17) The Need For Professional Proxies
(3:36) Simple Web Scraping Example
(11:12) Sending Proxy Requests
(17:11) Scrape Multiple Pages
(23:04) Scrape Detail Information
(31:39) Outro

Пікірлер: 43

  • @CrazyFanaticMan
    @CrazyFanaticMan2 жыл бұрын

    As long as you're not hacking into a database or something, scraping is perfectly legal because all you're doing is gathering data that is being made publicly available to visitors on their websites

  • @NeuralNine

    @NeuralNine

    2 жыл бұрын

    Yeah but if you are constantly bombarding a website with requests to get "live data" it can be really annoying.

  • @CrazyFanaticMan

    @CrazyFanaticMan

    2 жыл бұрын

    @@NeuralNine I agree, I can't remember which site it was or what I was scraping, but I definitely sent way too many requests to the site and eventually it just didnt want to give me access to its pages anymore haha so your video is extremely useful

  • @starling0216

    @starling0216

    2 жыл бұрын

    @@NeuralNine proxy selenium python

  • @charlesgormley9075

    @charlesgormley9075

    2 жыл бұрын

    It’s not illegal but could go against the websites terms of service which if ignored could result in a legal case, if they pursue legal action.

  • @samuelcooper9880

    @samuelcooper9880

    2 жыл бұрын

    Lots of devs layout the rules anyway on the site. Many sites just straight up give out api keys to help.

  • @pitaeata8493
    @pitaeata84932 жыл бұрын

    LOL of all the books to pick first...Golden Condom. hahaha. you dealt with it quickly. good on you, funny shit.

  • @giannisnik5295
    @giannisnik5295 Жыл бұрын

    Does it automatic rotating the proxies or we have to change the country with code for each time that our ip is blocked?

  • @2ru2pacFan
    @2ru2pacFan2 жыл бұрын

    Do you know how to scrape data bypassing the CloudFlare? Been trying for weeks now

  • @antwanissa8099
    @antwanissa8099 Жыл бұрын

    can we use it with selenium ? And what about csrf problem for example doing instagram scraping ?

  • @ammaralzhrani6329
    @ammaralzhrani6329 Жыл бұрын

    Thanks. It is possible with asynic?

  • @arijanrahimic3210
    @arijanrahimic32102 жыл бұрын

    Bester Tutorial wie immer danke !!!

  • @haifengsu
    @haifengsu Жыл бұрын

    does this solution allow to pass 'headers' info if calling an api ?

  • @kameshs4109
    @kameshs41092 жыл бұрын

    How can I get terminal output stored into my.txt file

  • @TheMaxKids
    @TheMaxKids Жыл бұрын

    Could you do an example with Playwright and Oxylabs?

  • @TheMaxKids
    @TheMaxKids Жыл бұрын

    Secondly, if you went to Settings on PyCharm, and set the Proxy manually to OxyLabs, would that bypass the proxy code in your video?

  • @ayanwoyegideonayandele3729
    @ayanwoyegideonayandele3729 Жыл бұрын

    Is multithreading/parallel webscraping also okay even with proxy?

  • @atultanna
    @atultanna Жыл бұрын

    What would be the code for scraping proxies for SB

  • @mrlectus
    @mrlectus5 ай бұрын

    How does portfolio for web scrapers look like?

  • @uplifthabesha754
    @uplifthabesha754 Жыл бұрын

    Can you please do a video where you automatically download multiple files from a website using python. It can be pdf or csv files.

  • @user-gg4en1nc5s
    @user-gg4en1nc5s4 ай бұрын

    why am I receiving the message after a print(response.json()) ' {'message': 'provided url is not supported'}' . This happens when I try to web scrap from google scholar. Does this mean that the OxyLabs IP's didn't work? Because it couldn't support the url from scholar?

  • @JnWayn
    @JnWayn Жыл бұрын

    My question is, isn't it fair to conclude that some some web servers are just so adamant about not getting scraped, and good at implementing it, that no amount of coding is ever gonna get past their security? With one particular website that I need data from, which already has my IP blocked, I can get to it's home page, which just has a disclaimer, with a VPN connection, but as soon as I get to the page where I need to fill out the search form, I get nabbed and the IP from the VPN turns out to be listed as suspect and from that point, blocked. I would like to think that there are tricks in my court that I could use but I'm not seeing it at all. Can you shred some light on this? I'd rather know that it's not possible and stop pulling my hair out, than think I just need to do a little more, when that's not the case. Thank you

  • @AWhite_
    @AWhite_2 жыл бұрын

    You did great job, 👍

  • 2 жыл бұрын

    Greetings from brazil. Nice video!

  • @vidhya-mc9bc
    @vidhya-mc9bc Жыл бұрын

    What to do if this error occurs ? response_html = response.json()['results'][0]['content'] KeyError: 'results'

  • @ReligionAndMaterialismDebunked
    @ReligionAndMaterialismDebunked Жыл бұрын

    I remember that book, A Light in the Attic. Hehe

  • @nicohussein2635
    @nicohussein26352 жыл бұрын

    Why do you use statistics.mean(prices) instead of just avg(prices)? Thanks in advance

  • @midmanphilippines

    @midmanphilippines

    2 жыл бұрын

    Speed

  • @md.shazidalhasan6726
    @md.shazidalhasan67262 жыл бұрын

    Thanks

  • @ShawnM33
    @ShawnM333 ай бұрын

    just stumbled upon this vid about web scraping with proxies in Python. Been thinking of giving it a shot, especially with Proxy-Store's proxies. Anyone got experience with this? Lemme know!

  • @jaydenlewis5320
    @jaydenlewis53202 жыл бұрын

    Dude why are you so good??

  • @NeuralNine

    @NeuralNine

    2 жыл бұрын

    Glad you like the video! :)

  • @billigerfusel

    @billigerfusel

    2 жыл бұрын

    Practice and big brain

  • @starling0216
    @starling02162 жыл бұрын

    Proxy selenium python videos

  • @pavelerokhin1512
    @pavelerokhin1512 Жыл бұрын

    nice!

  • @almighysanti3061
    @almighysanti30612 жыл бұрын

    🔥🔥🔥

  • @NeuralNine

    @NeuralNine

    2 жыл бұрын

    🙏🏻

  • @user-lr1nz1xz6w
    @user-lr1nz1xz6w2 жыл бұрын

    I wish I was smart enough to understand this video lol.

  • @s6yx
    @s6yx2 жыл бұрын

    @13:37 couldn’t you just do requests.post

  • @NeuralNine

    @NeuralNine

    2 жыл бұрын

    Yes I think that would have the same effect

  • @Remedialpriya.
    @Remedialpriya.2 жыл бұрын

    Hi

Келесі