mido-99/RV-competitros-pricing

upwork-glrv

Scrape glrv for an Upwork client

Questions

Notes on main.py

Targets

Delivery 7

Delivery 6

Delivery 5

Delivery 4

Delivery 3

  • https://gcrvc.com/

  • https://abcmotors.com/

  • www.pplmotorhomes.com

    • Pagination follows this pattern: https://www.pplmotorhomes.com/used-rvs-for-sale/all-available-vehicles?sortBy=price%20desc&startIndex=24 , where startIndex is the offset of the first item and each page has 24 items.

      So this is the pagination logic:

      per_page = 24   # PPLMOTORHOMES only
      # QUICK_TEST caps the run at the first 9 pages
      for i in range(1, min(10, max_page+1) if QUICK_TEST else max_page+1):
          url = competitor_url % ((i-1) * per_page)   # offset: page 1 = 0, page 2 = 24, page 3 = 48, ...
          driver.get(url)

  • https://batesrv.com/

    • Location not available:

      ![image-20250329211828872](C:\Users\Mido Hany\AppData\Roaming\Typora\typora-user-images\image-20250329211828872.png)

  • https://lakecorv.com/
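
The offset arithmetic from the pplmotorhomes note above can be checked without a browser. The following is a minimal sketch, not the code in main.py: `page_urls`, `URL_TEMPLATE`, and `QUICK_TEST_PAGES` are hypothetical names, and the template is assumed to carry a single `%d` placeholder for `startIndex`. Note that a literal `%20` inside a template used with the `%` operator has to be written as `%%20`, otherwise Python reads `%20d` as a format specifier.

```python
from typing import List

PER_PAGE = 24          # items per page on pplmotorhomes.com (from the note above)
QUICK_TEST_PAGES = 9   # the original loop stops after page 9 in quick-test mode

# ASSUMPTION: one %d placeholder for startIndex; the literal "%20" is escaped
# as "%%20" so the % operator does not treat it as a format specifier.
URL_TEMPLATE = (
    "https://www.pplmotorhomes.com/used-rvs-for-sale/all-available-vehicles"
    "?sortBy=price%%20desc&startIndex=%d"
)


def page_urls(max_page: int, quick_test: bool = False) -> List[str]:
    """Return one URL per page; page 1 has offset 0, page 2 has 24, and so on."""
    last_page = min(QUICK_TEST_PAGES, max_page) if quick_test else max_page
    return [
        URL_TEMPLATE % ((page - 1) * PER_PAGE)
        for page in range(1, last_page + 1)
    ]


if __name__ == "__main__":
    for url in page_urls(3):
        print(url)
```

Building the list of URLs first keeps the offset logic separate from `driver.get()`, so it can be tested on its own before a scraping run.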


Delivery 2

Drive URL


Delivery 1

  • https://smithrv.com/inventory/

    • Catch TimeoutException in driver.get()

    ![image-20250318175135709](C:\Users\Mido Hany\AppData\Roaming\Typora\typora-user-images\image-20250318175135709.png)

    • Content is JS-rendered and pagination doesn't work directly; it requires clicking "Next" at the end of each loop iteration.

  • https://www.glrv.com/rv-sales/

  • https://lewistonrv.com/shop/

  • https://www.buckeyerv.com/inventory-all/

  • https://www.aokrvs.com/new-rvs

    • I extracted both the designation and the stock number (if available) from the title.

    • I've included the part between the year and the stock number as rv_type. Sometimes it's "5th Wheel", and I'm not sure that's a correct rv_type, but we can omit it if it's not accurate enough.

    • ![image-20250318163701900](C:\Users\Mido Hany\AppData\Roaming\Typora\typora-user-images\image-20250318163701900.png)

    • If the part above the title is the model (I guess so), I can include it too.
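
The title split described in the aokrvs notes above (year, then a middle part used as rv_type, then an optional stock number) might look roughly like the sketch below. The real title layout on the site is not shown here, so the format is an assumption: titles are taken to look like `2024 5th Wheel #A1234`, with a hypothetical `#` prefix on the stock number. `parse_title` and `TITLE_RE` are illustrative names, not functions from main.py, and the designation is left out because its position in the title is unknown.

```python
import re
from typing import Optional, TypedDict


class TitleParts(TypedDict):
    year: Optional[int]
    rv_type: Optional[str]
    stock_number: Optional[str]


# ASSUMPTION: titles look like "2024 5th Wheel #A1234". The "#" prefix on the
# stock number is hypothetical and may not match the real listing titles.
TITLE_RE = re.compile(
    r"^\s*(?P<year>(?:19|20)\d{2})\s+"   # four-digit model year
    r"(?P<middle>.*?)"                    # text between year and stock number
    r"(?:\s*#(?P<stock>[\w-]+))?\s*$"     # optional stock number
)


def parse_title(title: str) -> TitleParts:
    """Split a listing title into year, rv_type, and stock number."""
    match = TITLE_RE.match(title)
    if match is None:
        # Title does not start with a year; leave every field empty.
        return {"year": None, "rv_type": None, "stock_number": None}
    middle = match.group("middle").strip()
    return {
        "year": int(match.group("year")),
        "rv_type": middle or None,
        "stock_number": match.group("stock"),
    }


if __name__ == "__main__":
    print(parse_title("2024 5th Wheel #A1234"))
    print(parse_title("2023 Travel Trailer"))
```

Returning `None` for missing fields rather than raising keeps the "if available" behaviour from the notes, and makes it easy to drop rv_type later if it turns out not to be accurate.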
