You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<li><ahref="#adoption">Reviews and Adoption</a></li>
67
+
</ul>
56
68
</div>
57
69
70
+
</header>
71
+
72
+
<sectionid="info">
73
+
<divclass="page-header">
74
+
<h1>Book Information</h1>
75
+
<pclass="pull-right"><ahref="#">Back to top</a></p>
76
+
</div>
77
+
78
+
<divclass="row">
79
+
<divclass="span4">
80
+
81
+
<h2>Table of Contents</h2>
82
+
83
+
<ol>
84
+
85
+
<li>Introduction</li>
86
+
87
+
<li>MapReduce Basics</li>
88
+
89
+
<li>MapReduce algorithm design</li>
90
+
91
+
<li>Inverted Indexing for Text Retrieval</li>
92
+
93
+
<li>Graph Algorithms</li>
94
+
95
+
<li>EM Algorithms for Text Processing</li>
96
+
97
+
<li>Closing Remarks</li>
98
+
99
+
</ol>
100
+
101
+
</div>
102
+
103
+
104
+
<divclass="span8">
105
+
58
106
<h2>Abstract</h2>
59
107
60
108
<p>Our world is being revolutionized by data-driven methods: access to
@@ -80,7 +128,8 @@ <h2>Abstract</h2>
80
128
81
129
<p>Quite explicitly, this book focuses on MapReduce algorithm design, not <ahref="http://hadoop.apache.org/">Hadoop</a> programming. Tom White's <ahref="http://www.amazon.com/gp/product/0596521979?ie=UTF8&tag=dataintetextp-20&linkCode=as2&camp=1789&creative=9325&creativeASIN=0596521979">Hadoop: The Definitive Guide</a><imgsrc="http://www.assoc-amazon.com/e/ir?t=dataintetextp-20&l=as2&o=1&a=0596521979" width="1" height="1" alt="" style="border:none !important; margin:0px !important;" /> is a great resource for learning Hadoop.</p>
82
130
83
-
<h2style="padding-top:15px">Getting the Book</h2>
131
+
132
+
<h2>Publisher</h2>
84
133
85
134
<p>This book is part of the Morgan & Claypool <a
86
135
href="http://www.morganclaypool.com/toc/hlt/1/1">Synthesis Lectures on
@@ -106,27 +155,21 @@ <h2 style="padding-top:15px">Getting the Book</h2>
106
155
resource helpful, please consider purchasing an actual copy to support
107
156
our work!</p>
108
157
109
-
<h2style="padding-top:15px">Table of Contents</h2>
158
+
</div>
110
159
111
-
<ol>
112
160
113
-
<li>Introduction</li>
114
-
115
-
<li>MapReduce Basics</li>
116
-
117
-
<li>MapReduce algorithm design</li>
118
-
119
-
<li>Inverted Indexing for Text Retrieval</li>
161
+
</div>
120
162
121
-
<li>Graph Algorithms</li>
122
163
123
-
<li>EM Algorithms for Text Processing</li>
124
164
125
-
<li>Closing Remarks</li>
165
+
</section>
126
166
127
-
</ol>
128
167
129
-
<h2style="padding-top:15px">Design Patterns & Algorithms</h2>
168
+
<sectionid="ref">
169
+
<divclass="page-header">
170
+
<h1>Reference Implementations</h1>
171
+
<pclass="pull-right"><ahref="#">Back to top</a></p>
is a MapReduce library for Hadoop designed to serve as both a teaching
@@ -152,7 +195,16 @@ <h2 style="padding-top:15px">Design Patterns & Algorithms</h2>
152
195
153
196
</ul>
154
197
155
-
<h2style="padding-top:15px">What People Are Saying</h2>
198
+
</section>
199
+
200
+
201
+
<sectionid="adoption">
202
+
<divclass="page-header">
203
+
<h1>Reviews and Adoption</h1>
204
+
<pclass="pull-right"><ahref="#">Back to top</a></p>
205
+
</div>
206
+
207
+
<h3>What others are saying...</h3>
156
208
157
209
<ul>
158
210
@@ -166,48 +218,60 @@ <h2 style="padding-top:15px">What People Are Saying</h2>
166
218
167
219
<li>Deepak Singh from Amazon <ahref="http://mndoci.com/2010/07/02/recommendation-data-intensive-text-processing-with-mapreduce/">recommends the book</a> (July 2, 2010)</li>
168
220
169
-
<li>Used in <ahref="http://www.csee.ogi.edu/~zak/cs506-pslc/">CS 506/606: Special Topics: Problem Solving with Large Clusters</a> by Izhak Shafran and Richard Sproat at Oregon Health & Science University (Spring 2010)</li>
221
+
</ul>
170
222
171
-
<li>Used in a Google-supported <ahref="http://net.pku.edu.cn/~course/cs402/2010/index.html">Peking University course</a> on cloud computing by Hongfei Yan and Bo Peng (Summer, 2010)</li>
223
+
<h3>Courses using this book...</h3>
172
224
173
-
<li>Used in <ahref="http://www.andyli.ece.ufl.edu/teaching/eel6935/">EEL 6935: Special Topics in Cloud Computing and Storage</a> by Andy Li at the University of Florida (Fall, 2010 and Fall, 2011)</li>
225
+
<ul>
226
+
227
+
<li><ahref="http://www.csee.ogi.edu/~zak/cs506-pslc/">CS 506/606: Special Topics: Problem Solving with Large Clusters</a> by Izhak Shafran and Richard Sproat at Oregon Health & Science University (Spring 2010)</li>
228
+
229
+
<li><ahref="http://net.pku.edu.cn/~course/cs402/2010/index.html">Peking University course</a> on cloud computing by Hongfei Yan and Bo Peng (Summer, 2010)</li>
174
230
175
-
<li>Used in <ahref="http://courses.cs.tamu.edu/caverlee/csce689/">CSCE 689: Internet-Scale Data Management</a> by James Caverlee at Texas A&M (Fall, 2010)</li>
231
+
<li><ahref="http://www.andyli.ece.ufl.edu/teaching/eel6935/">EEL 6935: Special Topics in Cloud Computing and Storage</a> by Andy Li at the University of Florida (Fall, 2010 and Fall, 2011)</li>
176
232
177
-
<li>Used in <ahref="http://courses.cse.tamu.edu/caverlee/csce670/">CSCE 670: Information Storage and Retrieval</a> by James Caverlee at Texas A&M (Spring, 2011 and Spring, 2012)</li>
233
+
<li><ahref="http://courses.cs.tamu.edu/caverlee/csce689/">CSCE 689: Internet-Scale Data Management</a> by James Caverlee at Texas A&M (Fall, 2010)</li>
178
234
179
-
<li>Used in <ahref="http://cs.ua.edu/691Vrbsky/">CS 691-001: Cloud Computing</a> by Susan Vrbsky at University of Alabama (Spring, 2011)</li>
235
+
<li><ahref="http://courses.cse.tamu.edu/caverlee/csce670/">CSCE 670: Information Storage and Retrieval</a> by James Caverlee at Texas A&M (Spring, 2011 and Spring, 2012)</li>
180
236
181
-
<li>Used in <ahref="http://courses.washington.edu/css534/syllabi/s11.html">CSS 534: Parallel Programming in Grid and Cloud</a> by Munehiro Fukuda at University of Washington (Spring, 2011)</li>
237
+
<li><ahref="http://cs.ua.edu/691Vrbsky/">CS 691-001: Cloud Computing</a> by Susan Vrbsky at University of Alabama (Spring, 2011)</li>
182
238
183
-
<li>Used in <ahref="http://snap.stanford.edu/class/cs341-2011/">CS341: Advanced Topics in Data Mining</a> by Jure Leskovec, Anand Rajaraman, and Jeff Ullman at Stanford (Spring, 2011)</li>
239
+
<li><ahref="http://courses.washington.edu/css534/syllabi/s11.html">CSS 534: Parallel Programming in Grid and Cloud</a> by Munehiro Fukuda at University of Washington (Spring, 2011)</li>
184
240
185
-
<li>Used in <ahref="http://www.eurecom.fr/~michiard/CCSS.html">Summer School on Cloud Computing: Challenges and opportunities</a> by Pietro Michiard (Summer, 2011)</li>
241
+
<li><ahref="http://snap.stanford.edu/class/cs341-2011/">CS341: Advanced Topics in Data Mining</a> by Jure Leskovec, Anand Rajaraman, and Jeff Ullman at Stanford (Spring, 2011)</li>
186
242
187
-
<li>Used in a <ahref="http://net.pku.edu.cn/~course/cs402/2011/index.html">Peking University course</a>on mass data processing/cloud computing by Hongfei Yan and Bo Peng (Summer, 2011)</li>
243
+
<li><ahref="http://www.eurecom.fr/~michiard/CCSS.html">Summer School on Cloud Computing: Challenges and opportunities</a>by Pietro Michiard (Summer, 2011)</li>
188
244
189
-
<li>Used in <ahref="http://dicta-f11.utcompling.com/">CS395T / INF385T / LIN386M: Data-Intensive Computing for Text Analysis</a>by Jason Baldridge and Matt Lease at the University of Texas, Austin (Fall, 2011)</li>
245
+
<li><ahref="http://net.pku.edu.cn/~course/cs402/2011/index.html">Peking University course</a>on mass data processing/cloud computing by Hongfei Yan and Bo Peng (Summer, 2011)</li>
190
246
191
-
<li>Used in <ahref="http://www.ccs.neu.edu/home/mirek/classes/2011-F-CS6240/index.htm">CS 6240: Parallel Data Processing in MapReduce</a> by Mirek Riedewald at Northeastern University (Fall 2011)</li>
247
+
<li><ahref="http://dicta-f11.utcompling.com/">CS395T / INF385T / LIN386M: Data-Intensive Computing for Text Analysis</a> by Jason Baldridge and Matt Lease at the University of Texas, Austin (Fall, 2011)</li>
192
248
193
-
<li>Used in <ahref="http://www.cs.gmu.edu/syllabus/syllabi-fall11/CS795BarbaraD.html">CS 795 Mining Massive Datasets</a> by Daniel Barbara at George Mason University (Fall, 2011)</li>
249
+
<li><ahref="http://www.ccs.neu.edu/home/mirek/classes/2011-F-CS6240/index.htm">CS 6240: Parallel Data Processing in MapReduce</a> by Mirek Riedewald at Northeastern University (Fall 2011)</li>
194
250
195
-
<li>Used in <ahref="http://www.cs.kent.edu/~jin/Cloud12Spring/Cloud.html">CS 4/5/6/79995: Advanced Computing Platforms for Data Processing</a> by Ruoming Jin at Kent State University (Spring 2012)</li>
251
+
<li><ahref="http://www.cs.gmu.edu/syllabus/syllabi-fall11/CS795BarbaraD.html">CS 795 Mining Massive Datasets</a> by Daniel Barbara at George Mason University (Fall, 2011)</li>
196
252
197
-
<li>Used in <ahref="http://beowulf.lcs.mit.edu/18.337/">18.337/6.338: Parallel Computing</a> by Alan Edelman at MIT (Fall, 2011)</li>
253
+
<li><ahref="http://www.cs.kent.edu/~jin/Cloud12Spring/Cloud.html">CS 4/5/6/79995: Advanced Computing Platforms for Data Processing</a> by Ruoming Jin at Kent State University (Spring 2012)</li>
198
254
199
-
<li>Used in <ahref="http://www.cse.buffalo.edu/~bina/cse487/fall2011/">CSE487/587 Data-Intensive Computing</a> by Bina Ramamurthy at SUNY Buffalo (Fall, 2011)</li>
255
+
<li><ahref="http://beowulf.lcs.mit.edu/18.337/">18.337/6.338: Parallel Computing</a> by Alan Edelman at MIT (Fall, 2011)</li>
200
256
201
-
<li>Used in <ahref="http://www.csc.lsu.edu/~wuyj/Teaching/7481/fa11/">CSC7481/LIS 7610 - Information Retrieval Systems</a> by Yejun Wu at LSU (Fall, 2011)</li>
257
+
<li><ahref="http://www.cse.buffalo.edu/~bina/cse487/fall2011/">CSE487/587 Data-Intensive Computing</a> by Bina Ramamurthy at SUNY Buffalo (Fall, 2011)</li>
202
258
203
-
<li>Used in <ahref="http://www.cs.brown.edu/courses/csci2950-u/f11/index.html">CSCI-2950u: Data-Intensive Scalable Computing</a> by Rodrigo Fonseca at Brown (Fall, 2011)</li>
259
+
<li><ahref="http://www.csc.lsu.edu/~wuyj/Teaching/7481/fa11/">CSC7481/LIS 7610 - Information Retrieval Systems</a> by Yejun Wu at LSU (Fall, 2011)</li>
204
260
205
-
<li>Used in <ahref="http://www.cs.sunysb.edu/~rezaul/CSE590-S12.html">CSE 590 (#50569): Topics in Computer Science (Supercomputing)</a> by Rezaul A. Chowdhury at Stony Brook University (Spring 2012)</li>
261
+
<li><ahref="http://www.cs.brown.edu/courses/csci2950-u/f11/index.html">CSCI-2950u: Data-Intensive Scalable Computing</a> by Rodrigo Fonseca at Brown (Fall, 2011)</li>
206
262
207
-
<li>Used in <ahref="http://www-scf.usc.edu/~csci572/">Course 572: Information Retrieval and Web Search Engines</a> by Ellis Horowitz at USC (Spring 2012)</li>
263
+
<li><ahref="http://www.cs.sunysb.edu/~rezaul/CSE590-S12.html">CSE 590 (#50569): Topics in Computer Science (Supercomputing)</a> by Rezaul A. Chowdhury at Stony Brook University (Spring 2012)</li>
264
+
265
+
<li><ahref="http://www-scf.usc.edu/~csci572/">Course 572: Information Retrieval and Web Search Engines</a> by Ellis Horowitz at USC (Spring 2012)</li>
208
266
209
267
</ul>
210
268
269
+
</section>
270
+
271
+
<footerclass="footer">
272
+
<pclass="pull-right"><ahref="#">Back to top</a></p>
0 commit comments