Minted - Shop now
Buy new:
-51% $36.95
FREE delivery February 25 - 28
Ships from: firstclassbooks
Sold by: firstclassbooks
$36.95 with 51 percent savings
List Price: $75.99
FREE delivery February 25 - 28. Details
Or fastest delivery February 19 - 21. Details
Only 3 left in stock - order soon.
$$36.95 () Includes selected options. Includes initial monthly payment and selected options. Details
Price
Subtotal
$$36.95
Subtotal
Initial payment breakdown
Shipping cost, delivery date, and order total (including tax) shown at checkout.
Ships from
firstclassbooks
firstclassbooks
Ships from
firstclassbooks
Returns
30-day refund/replacement
30-day refund/replacement
This item can be returned in its original condition for a full refund or replacement within 30 days of receipt. You may receive a partial or no refund on used, damaged or materially different returns.
Payment
Secure transaction
Your transaction is secure
We work hard to protect your security and privacy. Our payment security system encrypts your information during transmission. We don’t share your credit card details with third-party sellers, and we don’t sell your information to others. Learn more
$12.34
Very Good Very Good See less
$3.98 delivery February 24 - 25. Details
Only 1 left in stock - order soon.
$$36.95 () Includes selected options. Includes initial monthly payment and selected options. Details
Price
Subtotal
$$36.95
Subtotal
Initial payment breakdown
Shipping cost, delivery date, and order total (including tax) shown at checkout.
Access codes and supplements are not guaranteed with used items.
Ships from and sold by glenthebookseller.
Kindle app logo image

Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet, or computer - no Kindle device required.

Read instantly on your browser with Kindle for Web.

Using your mobile phone camera - scan the code below and download the Kindle app.

QR code to download the Kindle App

Follow the author

Something went wrong. Please try your request again later.

Mining of Massive Datasets 2nd Edition

4.3 4.3 out of 5 stars 63 ratings

{"desktop_buybox_group_1":[{"displayPrice":"$36.95","priceAmount":36.95,"currencySymbol":"$","integerValue":"36","decimalSeparator":".","fractionalValue":"95","symbolPosition":"left","hasSpace":false,"showFractionalPartIfEmpty":true,"offerListingId":"8Z8lqNqpv8PNXLVu3lglaxJIvd8XVgt7ED0YPzqDjOF5GjqiYe761FUsrcmwrQAvFzEu%2Bf%2FT%2FG1iv5JzSk3nmrdbhl4LVfHVdMyBVNmG4NyzFpePboTh05oWM7ya8lKo%2FWSkr6nYshTMO7%2BKrJsFGSCOZ0f6NgoashYGYck7FlR%2FUzdX4NIYNg%3D%3D","locale":"en-US","buyingOptionType":"NEW","aapiBuyingOptionIndex":0}, {"displayPrice":"$12.34","priceAmount":12.34,"currencySymbol":"$","integerValue":"12","decimalSeparator":".","fractionalValue":"34","symbolPosition":"left","hasSpace":false,"showFractionalPartIfEmpty":true,"offerListingId":"8Z8lqNqpv8PNXLVu3lglaxJIvd8XVgt70wpngG5A96cFF%2FnTPbEtk0Qrk%2FY0JF1SGsW%2FzFIeFWEecOhUVYx58dVnUgMqgjxboId91h%2BC0T9SzlndQ7LUAyO9nqATDy6YBgLBg21DcKhGgx%2FQvlo2pI65%2BuzgukpTA9YA%2BZgBmPf8l7KYRkuxxg8wBCIU5%2FD1","locale":"en-US","buyingOptionType":"USED","aapiBuyingOptionIndex":1}]}

Purchase options and add-ons

Written by leading authorities in database and Web technologies, this book is essential reading for students and practitioners alike. The popularity of the Web and Internet commerce provides many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and can be applied successfully to even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. Other chapters cover the PageRank idea and related tricks for organizing the Web, the problems of finding frequent itemsets and clustering. This second edition includes new and extended coverage on social networks, machine learning and dimensionality reduction.

There is a newer edition of this item:

Mining of Massive Datasets
$83.99
(26)
Only 1 left in stock - order soon.

Editorial Reviews

Book Description

Now in its second edition, this book focuses on practical algorithms for mining data from even the largest datasets.

About the Author

Jure Leskovec is Assistant Professor of Computer Science at Stanford University. His research focuses on mining large social and information networks. Problems he investigates are motivated by large scale data, the Web and on-line media. This research has won several awards including a Microsoft Research Faculty Fellowship, the Alfred P. Sloan Fellowship, Okawa Foundation Fellowship, and numerous best paper awards. His research has also been featured in popular press outlets such as the New York Times, the Wall Street Journal, the Washington Post, MIT Technology Review, NBC, BBC, CBC and Wired. Leskovec has also authored the Stanford Network Analysis Platform (SNAP, http://snap.stanford.edu), a general purpose network analysis and graph mining library that easily scales to massive networks with hundreds of millions of nodes and billions of edges. You can follow him on Twitter at @jure.

Anand Rajaraman is a serial entrepreneur, venture capitalist, and academic based in Silicon Valley. He is a Founding Partner of two early-stage venture capital firms, Milliways Labs and Cambrian Ventures. His investments include Facebook (one of the earliest angel investors in 2005), Aster Data Systems (acquired by Teradata), Efficient Frontier (acquired by Adobe), Neoteris (acquired by Juniper), Transformic (acquired by Google), and several others. Anand was, until recently, Senior Vice President at Walmart Global eCommerce and co-head of @WalmartLabs, where he worked at the intersection of social, mobile, and commerce. He came to Walmart when Walmart acquired Kosmix, the startup he co-founded, in 2011. Kosmix pioneered semantic search technology and semantic analysis of social media. In 1996, Anand co-founded Junglee, an e-commerce pioneer. As Chief Technology Officer, he played a key role in developing Junglee's award-winning Virtual Database technology. In 1998, Amazon.com acquired Junglee, and Anand helped launch the transformation of Amazon.com from a retailer into a retail platform, enabling third-party retailers to sell on Amazon.com's website. Anand is also a co-inventor of Amazon Mechanical Turk, which pioneered the concepts of crowdsourcing and hybrid Human-Machine computation. As an academic, Anand's research has focused at the intersection of database systems, the World-Wide Web, and social media. His research publications have won several awards at prestigious academic conferences, including two retrospective 10-year Best Paper awards at ACM SIGMOD and VLDB. In 2012, Fast Company magazine named Anand to its list of '100 Most Creative People in Business'. In 2013, he was named a Distinguished Alumnus by his alma mater, IIT Madras. You can follow Anand on Twitter at @anand_raj.

Jeffrey David Ullman is the Stanford W. Ascherman Professor of Computer Science (Emeritus) and he is currently the CEO of Gradiance. His research interests include database theory, data mining, and education using the information infrastructure. He is one of the founders of the field of database theory, and was the doctoral advisor of an entire generation of students who later became leading database theorists in their own right. He was the Ph.D. advisor of Sergey Brin, one of the co-founders of Google, and served on Google's technical advisory board. Ullman was elected to the National Academy of Engineering in 1989, the American Academy of Arts and Sciences in 2012, and he has held Guggenheim and Einstein Fellowships. Recent awards include the Knuth Prize (2000), and the Sigmod E. F. Codd Innovations award (2006). Ullman is also the co-recipient (with John Hopcroft) of the 2010 IEEE John von Neumann Medal, for 'laying the foundations for the fields of automata and language theory and many seminal contributions to theoretical computer science'.

Product details

  • ASIN ‏ : ‎ 1107077230
  • Publisher ‏ : ‎ Cambridge University Press; 2nd edition (December 29, 2014)
  • Language ‏ : ‎ English
  • Hardcover ‏ : ‎ 476 pages
  • ISBN-10 ‏ : ‎ 9781107077232
  • ISBN-13 ‏ : ‎ 978-1107077232
  • Item Weight ‏ : ‎ 2.18 pounds
  • Dimensions ‏ : ‎ 7.25 x 1.25 x 10 inches
  • Customer Reviews:
    4.3 4.3 out of 5 stars 63 ratings

About the author

Follow authors to get new release updates, plus improved recommendations.
Anand Rajaraman
Brief content visible, double tap to read full content.
Full content visible, double tap to read brief content.

Discover more of the author’s books, see similar authors, read book recommendations and more.

Customer reviews

4.3 out of 5 stars
63 global ratings

Review this product

Share your thoughts with other customers

Customers say

Customers find the book's content accessible and practical, with succinct summaries and examples. They appreciate the variety of heuristics and algorithms from data mining to large-scale machine learning.

AI-generated from the text of customer reviews

Select to learn more
6 customers mention "Content"6 positive0 negative

Customers find the book's content accessible and practical. They appreciate the succinct chapter summaries that are descriptive without being overly technical. The examples are intuitive and help distill the key points from each chapter. Overall, customers find the book a good reference guide for understanding data mining details.

"...in the book are very intuitive and the book follows an easy to understand train of thought. The chapter summaries are a pleasant surprise...." Read more

"...The book covers a wide range of topics from MapReduce and Locality Sensitive Hashing (LSH) to algorithms on graphs and large scale machine learning...." Read more

"...Content, they cover a lot of topics. I like the way the chapters are arranged. There are summaries at the end of every chapter...." Read more

"...Well written for a text book. A great reference guide to understand details, especially if you already know a bit about data mining." Read more

3 customers mention "Heuristics"3 positive0 negative

Customers find the book provides an assortment of heuristics and algorithms from data mining to big data, including graphs and large-scale machine learning. They appreciate the excellent background and examples of data mining.

"...Sensitive Hashing (LSH) to algorithms on graphs and large scale machine learning. I think you would not regret the purchase." Read more

"Excellent background and examples of data mining. Well written for a text book...." Read more

"...online course of same title, this books is an assortment of heuristics and algorithms from data mining to some big data applications nowadays...." Read more

Top reviews from the United States

  • Reviewed in the United States on June 22, 2015
    This book is a delight for anyone who deals with practical Data Mining applications. Over the past few years, I have gathered bits and pieces of knowledge from various sources about machine learning, Map Reduce programming paradigm, design and analysis of algorithms, information retrieval, etc. But this book serves to tie it all together beautifully. If you have delved in the above topics and are looking for a reference book that strikes a balance between rigor and practicality, this book will serve you right. On the other hand, if you are just starting out in the field of Data Mining/Machine Learning then you may do well by starting out with more detailed material.

    The book has a nice compilation of many "greatest hits" algorithms, especially those related to mining graph data. The book treats the theory and the implementation aspects of algorithms with equal importance with ample consideration for scaling.The examples in the book are very intuitive and the book follows an easy to understand train of thought. The chapter summaries are a pleasant surprise. They are a great resource to help you distill and digest the key points from each chapter. The summaries are succinct enough to be un-intimidating and are descriptive enough to be useful.

    The book does keep referring back and forth between chapters but that is only because much of the material is actually interlinked and treating the topics in isolation would miss the point.

    All in all a great purchase for a lifetime!
    6 people found this helpful
    Report
  • Reviewed in the United States on April 3, 2019
    If you hesitate to buy this book, I would suggest you go to the official site of the book and check it. Full official PDF is available on the MMDS site.

    I got this book to take Stanford MMDS online course but then decided to read it fully (the course does not cover some advanced topics). The book content is very accessible. For example in chapter 5 authors cover PageRank algorithm, instead of introducing it via probability and linear algebra (Markov chains and eigenvectors) they touch the theory a little and then provide many examples, so the book is very practical oriented. Knowledge in probability and linear algebra would help, but not necessary, although you still need to know some very basic concepts like matrix by matrix multiplication.

    The book covers a wide range of topics from MapReduce and Locality Sensitive Hashing (LSH) to algorithms on graphs and large scale machine learning. I think you would not regret the purchase.
  • Reviewed in the United States on June 13, 2015
    First, the book is affordable at under $70. That is a big deal. You can download a PDF for free at several sites, but printing it would cost you $70 and the physical package would not be nearly as good. This is a significant physical hardback book.

    Content, they cover a lot of topics.

    I like the way the chapters are arranged. There are summaries at the end of every chapter. I found myself reading the summaries of topics before reading the pertinent sections and then reading the summaries again section by section. I learned much more using that practice instead of simply reading cover to cover in order.

    This is a good book. It is a good substitute for any number of online learning programs in data science.
    3 people found this helpful
    Report
  • Reviewed in the United States on March 9, 2017
    Excellent background and examples of data mining. Well written for a text book. A great reference guide to understand details, especially if you already know a bit about data mining.
  • Reviewed in the United States on September 18, 2016
    liked it for the depth of the topic in the book.
  • Reviewed in the United States on October 2, 2015
    very helpful information and easy to understand even for the new student
  • Reviewed in the United States on March 11, 2015
    As the textbook of the Stanford online course of same title, this books is an assortment of heuristics and algorithms from data mining to some big data applications nowadays. I think this book can be especially suitable for those who:
    1. Have some machine learning background and want to have a quick glance over every popular data mining techniques;
    2. Have learned data mining and need to quickly look up some phrases along with compact explanations.
    In other word, I don't think this book is for those who wish to see rigorous mathematical elements because frankly the content far from that; also, if you're totally new to machine learning or data mining, you can take your first step from here, but it'll be a struggled step I would guess. However, if you're buying this book to go with the online course, then this is a great complement.
    9 people found this helpful
    Report
  • Reviewed in the United States on October 3, 2015
    The print is good.

Top reviews from other countries

Translate all reviews to English
  • Tim Vervoort
    5.0 out of 5 stars Great introduction to data science
    Reviewed in Germany on April 27, 2020
    Extensive, yet easy to follow introduction to a lot of techniques. The book also focuses a lot on cloud algorithms.
  • Eds
    5.0 out of 5 stars Excellent
    Reviewed in the United Kingdom on September 26, 2019
    Excellent
  • Amazon Customer
    5.0 out of 5 stars Great book!
    Reviewed in India on February 25, 2019
    Great book on algorithms and techniques for large scale data mining. Highly recommended for data scientists and big data enthusiasts. Loved the exercises. However, page quality is poor in the South Asia edition.
  • Eleonora
    5.0 out of 5 stars Perfetto
    Reviewed in Italy on November 29, 2017
    Libro fondamentale per avere una panoramica delle tecniche fondamentali nel campo dei big data. A differenza di altri libri, risulta essere chiaro e conciso.
  • Kevin Lee
    5.0 out of 5 stars Five Stars
    Reviewed in Canada on September 16, 2016
    This book is very practical!