You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is a repo for course projects @ Professor Torsten Suel's course Web Search Engines.
2
+
3
+
1. Jcrawler : a primitive multi-threaded focused web crawler to collect web pages from www, with concentration on given key words. Language : python
4
+
2. indexer : a c++ program to parse web pages, do reverse index, and generate final index for later query processing. involving massive data processing, file compression(var-byte).
5
+
3. query processor, ask former built inverted index to answer user's search queries.
6
+
7
+
8
+
4. Foursquare crawler and recommendation system : including a crawler to collect user, venue, rating, check in information from Foursquare, Twitter and Facebook, then apply machine learning algorithms (collaborative-filtering, SVD, etc) to recommend friends and venues to users.
0 commit comments