We often need to query the GitHub API to get data. However, there is a per-token limit on how many requests can be made per hour, and querying on the fly is also slow. The approach we took with our "bugginess" training set was to preload all the commits and save them into CSV files.

I have now started looking into tools that detect refactorings in a given commit. Unfortunately, one of them, RefactoringMiner, does not accept a git diff out of the box; it expects a GitHub URL or a path to a locally cloned repo. Given this, instead of pre-loading commits in a format that may not suit every use case, an alternative is to go back to querying the API on the fly. We can make that cheaper by setting up a proxy that caches GitHub API responses, so repeated queries do not consume quota. Another benefit is speed: if we run the proxy on the same machine as the pipeline (ironspeed), reading cached responses should be no slower than reading pre-loaded data from disk.
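As a rough illustration of how little the pipeline code would need to change, here is a minimal sketch of the client side. The proxy address and port are assumptions (any caching proxy on ironspeed would do), and the sketch also assumes the proxy is configured to actually cache the HTTPS responses (e.g., by terminating TLS), which is a matter of proxy setup rather than pipeline code:

```python
import os

import requests

# Hypothetical proxy address: assumes a caching proxy is listening on
# localhost:3128 on the pipeline machine; adjust to the real deployment.
PROXY_URL = "http://localhost:3128"


def get_commit(repo: str, ref: str) -> dict:
    """Fetch a single commit from the GitHub API, routed through the caching proxy."""
    response = requests.get(
        f"https://api.github.com/repos/{repo}/commits/{ref}",
        headers={
            "Accept": "application/vnd.github+json",
            # Assumes the token is available in the GITHUB_TOKEN env variable.
            "Authorization": f"token {os.environ['GITHUB_TOKEN']}",
        },
        # Route requests through the caching proxy; a repeated request for
        # the same commit can then be served from the cache instead of
        # spending API quota.
        proxies={"http": PROXY_URL, "https": PROXY_URL},
        timeout=30,
    )
    response.raise_for_status()
    return response.json()


if __name__ == "__main__":
    commit = get_commit("octocat/Hello-World", "HEAD")
    print(commit["commit"]["message"])
```

The only difference from a direct API call is the `proxies` argument (or, equivalently, the `HTTPS_PROXY` environment variable), so callers such as RefactoringMiner invocations or our own loaders would not need to be rewritten.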