Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

lists of domains other than .gov and .mil? #1

Open
freegovinfo opened this issue Jan 26, 2017 · 4 comments
Open

lists of domains other than .gov and .mil? #1

freegovinfo opened this issue Jan 26, 2017 · 4 comments

Comments

@freegovinfo
Copy link

I see the seed lists for .gov and .mil, but wondering about .org/.us/.com etc. Is there a way to generate those lists as well?

@jeffersonbailey
Copy link
Contributor

Uh, those lists are old. From pre-crawl research. I hope to get the final (or final as of now) lists up soon.

That said, we have not been separating the lists by .org/.us/.com but instead by "basic" (i.e. http), FTP, and "social media." One could easily take the "basic" list and grep out the .org/.us/.com etc.

@edsu
Copy link

edsu commented Sep 15, 2017

I guess they were never added? The scrapes of those sites would be pretty darn useful if they aren't somewhere else by now.

@jeffersonbailey
Copy link
Contributor

We decided to wait until all the data was indexed. We are wrapping up ingesting the data from all the crawling partners now. That should finish this month. At that point we will have a "complete" copy of all EOT data and will be working with folks at the GSA to make (and hopefully maintain) a complete list.

@konklone
Copy link

At that point we will have a "complete" copy of all EOT data and will be working with folks at the GSA to make (and hopefully maintain) a complete list.

We're happy to help, just let us know!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants