Introduction to Data Compression

4th Edition

Authors: Khalid Sayood
Published Date: 16th October 2012
Table of Contents

Editor Board

Preface

1 Audience

2 Course Use

3 Approach

4 Learning from This Book

5 Content and Organization

6 A Personal View

Introduction

1.1 Compression Techniques

1.2 Modeling and Coding

1.3 Summary

1.4 Projects and Problems


Mathematical Preliminaries for Lossless Compression

2.1 Overview

2.2 A Brief Introduction to Information Theory

2.3 Models

2.4 Coding

2.5 Algorithmic Information Theory

2.6 Minimum Description Length Principle

2.7 Summary

2.8 Projects and Problems


Huffman Coding

3.1 Overview

3.2 The Huffman Coding Algorithm

3.3 Nonbinary Huffman Codes

3.4 Adaptive Huffman Coding

3.5 Golomb Codes

3.6 Rice Codes

3.7 Tunstall Codes

3.8 Applications of Huffman Coding

3.9 Summary

Further Reading

3.10 Projects and Problems


Arithmetic Coding

4.1 Overview

4.2 Introduction

4.3 Coding a Sequence

4.4 Generating a Binary Code

4.5 Adaptive Arithmetic Coding

4.6 Binary Arithmetic Coding

4.7 Comparison of Huffman and Arithmetic Coding

4.8 Applications

4.9 Summary

4.10 Projects and Problems


Dictionary Techniques

5.1 Overview

5.2 Introduction

5.3 Static Dictionary

5.4 Adaptive Dictionary

5.5 Applications

5.6 Beyond Compression—Lempel-Ziv Complexity

5.7 Summary

5.8 Projects and Problems


Context-Based Compression

6.1 Overview

6.2 Introduction

6.3 Prediction with Partial Match (ppm)

6.4 The Burrows-Wheeler Transform

6.5 Associative Coder of Buyanovsky (ACB)



Each edition of Introduction to Data Compression has been widely considered the best introduction and reference text on the art and science of data compression, and the fourth edition continues this tradition. Data compression techniques and technology continue to evolve, with new applications in image, speech, text, audio, and video. The fourth edition includes the cutting-edge updates the reader will need at work and in class.

Khalid Sayood provides an extensive introduction to the theory underlying today’s compression techniques, with detailed instruction on their application and several examples to explain the concepts. Encompassing the entire field of data compression, Introduction to Data Compression covers lossless and lossy compression, Huffman coding, arithmetic coding, dictionary techniques, context-based compression, and scalar and vector quantization. Khalid Sayood provides a working knowledge of data compression, giving the reader the tools to develop a complete and concise compression package upon completion of the book.

Key Features

  • New content provides a more detailed description of the JPEG 2000 standard
  • New content covers speech coding for internet applications
  • Explains established and emerging standards in depth, including JPEG 2000, JPEG-LS, MPEG-2, H.264, JBIG 2, ADPCM, LPC, CELP, MELP, and iLBC
  • Source code provided via a companion web site gives readers the opportunity to build their own algorithms and to choose and implement techniques in their own applications


Professionals, software and hardware engineers, students; digital libraries, multimedia


This text is a truly introductory treatment of the entire field of data compression, including lossless coding, speech coding, and audio coding, which are often neglected in other data compression books. Sayood’s book has the very best tutorial treatment of lossless source coding anywhere, with detailed coverage of Lempel-Ziv, arithmetic, Golomb, and Tunstall coding, in addition to treatments of fixed and adaptive Huffman coding and context-based methods. Additionally, the book contains material on M-band quadrature mirror filter banks, the polyphase decomposition, and wavelets beyond what is normally found in any introductory text. I have used Sayood’s book for a reference and as a text for a course on signal compression. I highly recommend it for adoption. --Jerry D. Gibson, Professor of Electrical and Computer Engineering, University of California, Santa Barbara

Khalid Sayood's book has long been the standard academic reference for those interested in data compression. I am very pleased to see his ongoing effort to keep the content timely with the release of the fourth edition this fall. If you want to be well versed in the state of the art, ranging from simple lossless coding up to complex video compression, this is the only book I know that will stay with you on every step of the journey. --Mark Nelson, Engineer at Cisco Systems, Inc. and Senior Member of IEEE

About the Author

Khalid Sayood

Khalid Sayood received his BS and MS in Electrical Engineering from the University of Rochester in 1977 and 1979, respectively, and his Ph.D. in Electrical Engineering from Texas A&M University in 1982. In 1982, he joined the University of Nebraska, where he is the Heins Professor of Engineering. His research interests include data compression, joint source-channel coding, and bioinformatics.

Affiliations and Expertise

Department of Electrical and Computer Engineering, University of Nebraska, Lincoln, Nebraska, USA