Skip to content

Commit f99a2a0

Browse files
author
Jeremy H. Shi
committed
Update README.md
1 parent 249c1f5 commit f99a2a0

File tree

1 file changed

+8
-0
lines changed

1 file changed

+8
-0
lines changed

README.md

+8
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,8 @@
1+
This is a repo for course projects @ Professor Torsten Suel's course Web Search Engines.
2+
3+
1. Jcrawler : a primitive multi-threaded focused web crawler to collect web pages from www, with concentration on given key words. Language : python
4+
2. indexer : a c++ program to parse web pages, do reverse index, and generate final index for later query processing. involving massive data processing, file compression(var-byte).
5+
3. query processor, ask former built inverted index to answer user's search queries.
6+
7+
8+
4. Foursquare crawler and recommendation system : including a crawler to collect user, venue, rating, check in information from Foursquare, Twitter and Facebook, then apply machine learning algorithms (collaborative-filtering, SVD, etc) to recommend friends and venues to users.

0 commit comments

Comments
 (0)