Buy new:
-51% $36.95$36.95
FREE delivery February 25 - 28
Ships from: firstclassbooks Sold by: firstclassbooks
Save with Used - Very Good
$12.34$12.34
$3.98 delivery February 24 - 25
Ships from: glenthebookseller Sold by: glenthebookseller
![Kindle app logo image](https://m.media-amazon.com/images/G/01/kindle/app/kindle-app-logo._CB668847749_.png)
Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet, or computer - no Kindle device required.
Read instantly on your browser with Kindle for Web.
Using your mobile phone camera - scan the code below and download the Kindle app.
Follow the author
OK
Mining of Massive Datasets 2nd Edition
Purchase options and add-ons
- ISBN-109781107077232
- ISBN-13978-1107077232
- Edition2nd
- PublisherCambridge University Press
- Publication dateDecember 29, 2014
- LanguageEnglish
- Dimensions7.25 x 1.25 x 10 inches
- Print length476 pages
There is a newer edition of this item:
Customers who viewed this item also viewed
Editorial Reviews
Book Description
About the Author
Anand Rajaraman is a serial entrepreneur, venture capitalist, and academic based in Silicon Valley. He is a Founding Partner of two early-stage venture capital firms, Milliways Labs and Cambrian Ventures. His investments include Facebook (one of the earliest angel investors in 2005), Aster Data Systems (acquired by Teradata), Efficient Frontier (acquired by Adobe), Neoteris (acquired by Juniper), Transformic (acquired by Google), and several others. Anand was, until recently, Senior Vice President at Walmart Global eCommerce and co-head of @WalmartLabs, where he worked at the intersection of social, mobile, and commerce. He came to Walmart when Walmart acquired Kosmix, the startup he co-founded, in 2011. Kosmix pioneered semantic search technology and semantic analysis of social media. In 1996, Anand co-founded Junglee, an e-commerce pioneer. As Chief Technology Officer, he played a key role in developing Junglee's award-winning Virtual Database technology. In 1998, Amazon.com acquired Junglee, and Anand helped launch the transformation of Amazon.com from a retailer into a retail platform, enabling third-party retailers to sell on Amazon.com's website. Anand is also a co-inventor of Amazon Mechanical Turk, which pioneered the concepts of crowdsourcing and hybrid Human-Machine computation. As an academic, Anand's research has focused at the intersection of database systems, the World-Wide Web, and social media. His research publications have won several awards at prestigious academic conferences, including two retrospective 10-year Best Paper awards at ACM SIGMOD and VLDB. In 2012, Fast Company magazine named Anand to its list of '100 Most Creative People in Business'. In 2013, he was named a Distinguished Alumnus by his alma mater, IIT Madras. You can follow Anand on Twitter at @anand_raj.
Jeffrey David Ullman is the Stanford W. Ascherman Professor of Computer Science (Emeritus) and he is currently the CEO of Gradiance. His research interests include database theory, data mining, and education using the information infrastructure. He is one of the founders of the field of database theory, and was the doctoral advisor of an entire generation of students who later became leading database theorists in their own right. He was the Ph.D. advisor of Sergey Brin, one of the co-founders of Google, and served on Google's technical advisory board. Ullman was elected to the National Academy of Engineering in 1989, the American Academy of Arts and Sciences in 2012, and he has held Guggenheim and Einstein Fellowships. Recent awards include the Knuth Prize (2000), and the Sigmod E. F. Codd Innovations award (2006). Ullman is also the co-recipient (with John Hopcroft) of the 2010 IEEE John von Neumann Medal, for 'laying the foundations for the fields of automata and language theory and many seminal contributions to theoretical computer science'.
Product details
- ASIN : 1107077230
- Publisher : Cambridge University Press; 2nd edition (December 29, 2014)
- Language : English
- Hardcover : 476 pages
- ISBN-10 : 9781107077232
- ISBN-13 : 978-1107077232
- Item Weight : 2.18 pounds
- Dimensions : 7.25 x 1.25 x 10 inches
- Best Sellers Rank: #1,977,316 in Books (See Top 100 in Books)
- #556 in Database Storage & Design
- #3,742 in Databases & Big Data
- Customer Reviews:
About the author
![Anand Rajaraman](https://m.media-amazon.com/images/I/01Kv-W2ysOL._SY600_.png)
Discover more of the author’s books, see similar authors, read book recommendations and more.
Customer reviews
Customer Reviews, including Product Star Ratings help customers to learn more about the product and decide whether it is the right product for them.
To calculate the overall star rating and percentage breakdown by star, we don’t use a simple average. Instead, our system considers things like how recent a review is and if the reviewer bought the item on Amazon. It also analyzed reviews to verify trustworthiness.
Learn more how customers reviews work on AmazonCustomers say
Customers find the book's content accessible and practical, with succinct summaries and examples. They appreciate the variety of heuristics and algorithms from data mining to large-scale machine learning.
AI-generated from the text of customer reviews
Customers find the book's content accessible and practical. They appreciate the succinct chapter summaries that are descriptive without being overly technical. The examples are intuitive and help distill the key points from each chapter. Overall, customers find the book a good reference guide for understanding data mining details.
"...in the book are very intuitive and the book follows an easy to understand train of thought. The chapter summaries are a pleasant surprise...." Read more
"...The book covers a wide range of topics from MapReduce and Locality Sensitive Hashing (LSH) to algorithms on graphs and large scale machine learning...." Read more
"...Content, they cover a lot of topics. I like the way the chapters are arranged. There are summaries at the end of every chapter...." Read more
"...Well written for a text book. A great reference guide to understand details, especially if you already know a bit about data mining." Read more
Customers find the book provides an assortment of heuristics and algorithms from data mining to big data, including graphs and large-scale machine learning. They appreciate the excellent background and examples of data mining.
"...Sensitive Hashing (LSH) to algorithms on graphs and large scale machine learning. I think you would not regret the purchase." Read more
"Excellent background and examples of data mining. Well written for a text book...." Read more
"...online course of same title, this books is an assortment of heuristics and algorithms from data mining to some big data applications nowadays...." Read more
Top reviews from the United States
There was a problem filtering reviews right now. Please try again later.
- Reviewed in the United States on June 22, 2015This book is a delight for anyone who deals with practical Data Mining applications. Over the past few years, I have gathered bits and pieces of knowledge from various sources about machine learning, Map Reduce programming paradigm, design and analysis of algorithms, information retrieval, etc. But this book serves to tie it all together beautifully. If you have delved in the above topics and are looking for a reference book that strikes a balance between rigor and practicality, this book will serve you right. On the other hand, if you are just starting out in the field of Data Mining/Machine Learning then you may do well by starting out with more detailed material.
The book has a nice compilation of many "greatest hits" algorithms, especially those related to mining graph data. The book treats the theory and the implementation aspects of algorithms with equal importance with ample consideration for scaling.The examples in the book are very intuitive and the book follows an easy to understand train of thought. The chapter summaries are a pleasant surprise. They are a great resource to help you distill and digest the key points from each chapter. The summaries are succinct enough to be un-intimidating and are descriptive enough to be useful.
The book does keep referring back and forth between chapters but that is only because much of the material is actually interlinked and treating the topics in isolation would miss the point.
All in all a great purchase for a lifetime!
- Reviewed in the United States on April 3, 2019If you hesitate to buy this book, I would suggest you go to the official site of the book and check it. Full official PDF is available on the MMDS site.
I got this book to take Stanford MMDS online course but then decided to read it fully (the course does not cover some advanced topics). The book content is very accessible. For example in chapter 5 authors cover PageRank algorithm, instead of introducing it via probability and linear algebra (Markov chains and eigenvectors) they touch the theory a little and then provide many examples, so the book is very practical oriented. Knowledge in probability and linear algebra would help, but not necessary, although you still need to know some very basic concepts like matrix by matrix multiplication.
The book covers a wide range of topics from MapReduce and Locality Sensitive Hashing (LSH) to algorithms on graphs and large scale machine learning. I think you would not regret the purchase.
- Reviewed in the United States on June 13, 2015First, the book is affordable at under $70. That is a big deal. You can download a PDF for free at several sites, but printing it would cost you $70 and the physical package would not be nearly as good. This is a significant physical hardback book.
Content, they cover a lot of topics.
I like the way the chapters are arranged. There are summaries at the end of every chapter. I found myself reading the summaries of topics before reading the pertinent sections and then reading the summaries again section by section. I learned much more using that practice instead of simply reading cover to cover in order.
This is a good book. It is a good substitute for any number of online learning programs in data science.
- Reviewed in the United States on March 9, 2017Excellent background and examples of data mining. Well written for a text book. A great reference guide to understand details, especially if you already know a bit about data mining.
- Reviewed in the United States on September 18, 2016liked it for the depth of the topic in the book.
- Reviewed in the United States on October 2, 2015very helpful information and easy to understand even for the new student
- Reviewed in the United States on March 11, 2015As the textbook of the Stanford online course of same title, this books is an assortment of heuristics and algorithms from data mining to some big data applications nowadays. I think this book can be especially suitable for those who:
1. Have some machine learning background and want to have a quick glance over every popular data mining techniques;
2. Have learned data mining and need to quickly look up some phrases along with compact explanations.
In other word, I don't think this book is for those who wish to see rigorous mathematical elements because frankly the content far from that; also, if you're totally new to machine learning or data mining, you can take your first step from here, but it'll be a struggled step I would guess. However, if you're buying this book to go with the online course, then this is a great complement.
- Reviewed in the United States on October 3, 2015The print is good.
Top reviews from other countries
- Tim VervoortReviewed in Germany on April 27, 2020
5.0 out of 5 stars Great introduction to data science
Extensive, yet easy to follow introduction to a lot of techniques. The book also focuses a lot on cloud algorithms.
- EdsReviewed in the United Kingdom on September 26, 2019
5.0 out of 5 stars Excellent
Excellent
- Amazon CustomerReviewed in India on February 25, 2019
5.0 out of 5 stars Great book!
Great book on algorithms and techniques for large scale data mining. Highly recommended for data scientists and big data enthusiasts. Loved the exercises. However, page quality is poor in the South Asia edition.
-
EleonoraReviewed in Italy on November 29, 2017
5.0 out of 5 stars Perfetto
Libro fondamentale per avere una panoramica delle tecniche fondamentali nel campo dei big data. A differenza di altri libri, risulta essere chiaro e conciso.
- Kevin LeeReviewed in Canada on September 16, 2016
5.0 out of 5 stars Five Stars
This book is very practical!