You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
GitPrey is a tool for searching sensitive information or data according to company name or key word something.The design mind is from searching sensitive data leakling in Github:
61
+
GitPrey is a tool for searching sensitive information or data according to company name or key word something.The design mind is from searching sensitive data leakling in GitHub:
62
62
* Search code in file and path according to key word to get all related projects;
63
63
* Search code in every related project to find matching file or content in PATTERN_DB;
64
64
* Output all matching file information,project information and user information;
65
65
66
66
By the way, there is some missing file or mistake file with using Gitprey,the reason is:
67
-
* Only the default branch is considered by Github. In most cases, this will be the master branch.
68
-
* Only files smaller than 384 KB are searchable by Github.
69
-
*Github only make up to 1,000 results for each search.
67
+
* Only the default branch is considered by GitHub. In most cases, this will be the master branch.
68
+
* Only files smaller than 384 KB are searchable by GitHub.
69
+
*GitHub only make up to 1,000 results for each search.
70
70
71
71
Gitprey also provides the search level to adjust scanning deep, it's between Level 1 to Level 5:
72
72
* Level 1: Only search 10 pages in recently indexed code results.
@@ -75,12 +75,12 @@ Gitprey also provides the search level to adjust scanning deep, it's between Lev
75
75
* Level 4: Only search 70 pages in recently indexed code results.
76
76
* Level 5: Only search 100 pages in recently indexed code results.
77
77
78
-
You can modify the Level in Config.py.To search as quick as you can,you must configure your own Github account username and password to avoid 429 ERROR which is too many requests.
78
+
You can modify the Level in Config.py.To search as quick as you can,you must configure your own GitHub account username and password to avoid 429 ERROR which is too many requests.
79
79
80
80
### Tech detail introduction
81
81
There are some hints to declare about technological details:
82
-
*Github API is not used in searching code,because its rate limit up to 30 times per minute,even if you authenticate by access token.
83
-
* Only user information crawler used Github API,it's enough for scanning speed.
82
+
*GitHub API is not used in searching code,because its rate limit up to 30 times per minute,even if you authenticate by access token.
83
+
* Only user information crawler used GitHub API,it's enough for scanning speed.
84
84
You have to config FILE_DB/INFO_DB/PASS_DB/PATH_DB in config.py:
85
85
* PATH_DB is used to search specific file in related projects when searching file leaking.
86
86
* FILE_DB and PASS_DB are used to searching sensitive content in related projects when searching content leaking, while INFO_DB and PASS_DB is used to output code line._
0 commit comments