Scrape glrv for upwork client
- https://www.literv.com/
- https://www.jerrystrailers.com/
- https://wolfecampers.com/
- https://www.northtrailrv.com/
- https://www.epicrvliquidators.com/
-
https://www.schroedersbodyshop.com/
-
No obvious inventory. Found this link: https://www.schroedersbodyshop.com/skylinerv/
which says: search for “Skyline RV and Auto Sales” so is this the target site?
-
-
Details of changes in .py file
-
https://www.whiteautosalesandrvs.com/
Couldn't debug due to vpn not working with selenium
-
https://www.bestcampersales.com/

-
-
Follows this pattern in pagination:
https://www.pplmotorhomes.com/used-rvs-for-sale/all-available-vehicles?sortBy=price%20desc&startIndex=24
, as each page has 24 items.So this is its logic for pagination:
per_page = 24 #? PPLMOTORHOMES only for i in range(1, min(10, max_page+1) if QUICK_TEST else max_page+1): url = competitor_url % ((i-1) * per_page) # set off is 0, p2 = 24, p3 = 48, ... driver.get(url)
-
-
-
Location not available:

-
-
- Home shows only 3 items. has
/New-Inventory
and/Used-Inventory
pages - Should I add those as a list in
competitor_url_list
- In /new some items have
*** AVAILABLE IN STOCK ***
, others haveSOLD!!
values, should I scrape them? if yes to which key?
- Home shows only 3 items. has
-
Inventory: https://jjcamper.com/inventory/
-
-
Some values return
Consignment
like vin, designation, or stocknumber. Should i replace withNone
? -
A price is listed is this
msrp
orsale price
?
-
-

-
https://smithrv.com/inventory/
- Catch
TimeoutException
indriver.get()

- JS rendered content, pagination doesn't work, requires clicking on next at the end of the loop to work.
- Catch
-
https://www.aokrvs.com/new-rvs
-
I extracted both designation & stocknumber (if available) from title.
-
I've included the part in the middle between year & stocknumber as
rv_type
- sometimes it's "5th Wheel" not sure if that's correct rv_type - but we can omit it if not very accurate. -

-
If the part above title is model (I guess so) I can include it too.
-