Hi everyone! Today we will talk about Playwright, a powerful browser automation library that works great for web scraping! 🎭
We will learn it step by step, going over the most important commands, from basic web page automation to advanced CAPTCHA solving! By the end of this video, you'll be able to scrape a wide range of websites, and your bots will be much harder to detect! 🤖🤖🤖
We will begin with a quick start guide and continue with web drivers, popular selectors, and mouse/keyboard controls. Best of all, I'll even show you how to bypass Cloudflare protection and break a massive variety of CAPTCHAs with a single tool! 😱
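For a taste of what the selector and keyboard sections look like in code, here is a minimal sketch using Playwright's synchronous Python API with the Firefox driver. It is not the code from the video (that lives in the GitHub repo linked below); the HTML snippet, the file paths in it, and the function names `collect_hrefs` and `run_demo` are made up for illustration.

```python
# Tiny stand-in page so the sketch needs no network access (hypothetical markup).
DEMO_HTML = """
<input placeholder="Search">
<button>Go</button>
<ul>
  <li><a href="/files/a.csv">Report A</a></li>
  <li><a href="/files/b.csv">Report B</a></li>
</ul>
"""


def collect_hrefs(page, selector="xpath=//ul//a[@href]"):
    """Return the href of every element matched by an XPath selector."""
    links = page.locator(selector)
    # nth(i) picks the i-th match (zero-based); count() says how many matched
    return [links.nth(i).get_attribute("href") for i in range(links.count())]


def run_demo(headless=True):
    """Launch Firefox, try each locator style once, and return the link hrefs."""
    # Imported here so collect_hrefs can be used without launching a browser
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.firefox.launch(headless=headless)
        page = browser.new_page()
        page.set_content(DEMO_HTML)

        # Get by placeholder, then type into the field
        page.get_by_placeholder("Search").fill("playwright")
        # Keyboard control: press a key on the focused element
        page.keyboard.press("Enter")
        # Get by role, using the button's visible name
        page.get_by_role("button", name="Go").click()
        # Chaining: narrow to list items, take the second, match text inside it
        label = page.get_by_role("listitem").nth(1).get_by_text("Report").inner_text()
        print(label)

        hrefs = collect_hrefs(page)
        browser.close()
    return hrefs
```

To try it, install the library with `pip install playwright`, download the browser with `playwright install firefox`, and call `run_demo(headless=False)` to watch the browser window while it runs.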
💸 Coupon 💸
———————————————-
⭐ Bright Data’s Web Unlocker:
https://brdta.com/pysimplified
📺 Related Videos 📺
———————————————-
⭐ Anaconda Beginners Guide:
💻 Code on GitHub 💻
———————————————-
⭐ Playwright Web Scraping:
https://github.com/MariyaSha/Playwright_WebScraping/tree/main
⏰ Time Stamps ⏰
———————————————-
00:00 – Intro
00:35 – Playwright Quickstart
02:27 – Install Playwright
03:55 – Firefox Web Driver
05:36 – Developer Tools
06:17 – Get by Placeholder
06:56 – Get by Role
07:12 – Get by Text Chaining
08:16 – Select Nth Element
09:20 – Locate by Xpath
12:06 – Download URLs as Files
14:27 – Bypass CAPTCHA with Proxies
15:08 – Set Up Web Unlocker
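The proxy sections near the end of the time stamps come down to handing Playwright a proxy when the browser launches. The sketch below shows the general shape only: the helper names `build_proxy_settings` and `fetch_html` are hypothetical, the host, port, and credentials are placeholders you would take from your own proxy provider, and this is not presented as Bright Data's documented setup.

```python
def build_proxy_settings(host, port, username, password):
    """Build the dict that Playwright's launch(proxy=...) option accepts."""
    return {
        "server": f"http://{host}:{port}",  # placeholder endpoint
        "username": username,               # placeholder credentials
        "password": password,
    }


def fetch_html(url, proxy, headless=True):
    """Open url in Firefox through the given proxy and return the page HTML."""
    # Imported here so build_proxy_settings works without launching a browser
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        # All traffic from this browser instance is routed through the proxy
        browser = p.firefox.launch(headless=headless, proxy=proxy)
        page = browser.new_page()
        page.goto(url)
        html = page.content()
        browser.close()
    return html
```

A call might look like `fetch_html("https://example.com", build_proxy_settings("proxy.example.com", 8080, "user", "secret"))`, with every value swapped for real ones. Whether a CAPTCHA or Cloudflare check is actually passed depends on the proxy service, not on this code.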
🤝 Let’s Connect 🤝
———————————————-
🔗 Github:
https://github.com/mariyasha
🔗 X:
https://x.com/MariyaSha888
🔗 LinkedIn:
https://ca.linkedin.com/in/mariyasha888
🔗 Blog:
https://www.pythonsimplified.org
🔗 Discord:
https://discord.com/invite/wgTTmsWmXA
💳 Credits 💳
———————————————-
⭐ Beautiful titles, transitions, sound FX:
mixkit.co
⭐ Icons and Graphics:
flaticon.com
Comments
Please make the next video on Linear Regression Simply Explained. It would be helpful.
Is there any other, manual way to avoid services like Bright Data and others that provide scrapers? As in, manually bypassing the CAPTCHAs in some way via Playwright, Selenium, etc.? There has to be some way, because even these services do it, right? Yes, they may have multiple proxies so that one IP doesn't get blocked for too many requests, but other than that, what about the CAPTCHA part?