Skip to content

Toolbox for scraping, good examples of the use of pandas, bs4 and concurrency via tornado.

License

Notifications You must be signed in to change notification settings

bjmask/scraping

Repository files navigation

scraping

Web-scraping examples

Tokyo Stock Exchange (tse)

Use the Tornado async HTTP client to achieve N concurrent requests, in comparison to a simple for-loop concurrency enables us to scrape ~4000 urls in ~3.5 min or approximately 60X increase in speed.

Note: Use this script at your own peril.

About

Toolbox for scraping, good examples of the use of pandas, bs4 and concurrency via tornado.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published