Small Summaries for Big Data

Small Summaries for Big Data

by Graham Cormode and Ke Yi
English | 2020 | ASIN : B08GG8TJQF | 279 Pages | PDF | 2.27 MB

The massive volume of data generated in modern applications can overwhelm our ability to conveniently transmit, store, and index it. For many scenarios, building a compact summary of a dataset that is vastly smaller enables flexibility and efficiency in a range of queries over the data, in exchange for some approximation. This comprehensive introduction to data summarization, aimed at practitioners and students, showcases the algorithms, their behavior, and the mathematical underpinnings of their operation. The coverage starts with simple sums and approximate counts, building to more advanced probabilistic structures such as the Bloom Filter, distinct value summaries, sketches, and quantile summaries. Summaries are described for specific types of data, such as geometric data, graphs, and vectors and matrices. The authors offer detailed descriptions of and pseudocode for key algorithms that have been incorporated in systems from companies such as Google, Apple, Microsoft, Netflix and Twitter.

'A very thorough compendium of sketching and streaming algorithms, and an excellent resource for anyone interested in learning about them, understanding how they work and deploying them in applications. Good job!' - Piotr Indyk, Massachusetts Institute of Technology


[Fast Download] Small Summaries for Big Data

Related eBooks:
Linear Model Theory: Exercises and Solutions
Exercises in Numerical Linear Algebra and Matrix Factorizations
Python for Marketing Research and Analytics
Linear Model Theory: Exercises and Solutions
Topics in orbit equivalence
Mathematical Structures for Computer Science, 7th Edition
Martin Isaacs - Character Theory of Finite Group
Hadamard's Plane Geometry: A Reader's Companion
Mathematical Modelling and Numerical Analysis of Size-Dependent Structural Members in Temperature Fi
Homology in Group Theory (Lecture notes in mathematics) by U. Stammbach
The L2-Moduli Space and a Vanishing Theorem for Donaldson Polynomial Invariants
Process Control: The Passive Systems Approach (Advances in Industrial Control)
Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.