The massive volume of data generated in modern applications requires the ability to build compact summaries of datasets. This introduction, aimed at students and practitioners, covers algorithms for describing massive datasets, from simple sums to advanced probabilistic structures, with applications in big data, data science, and machine learning.