Lossless vs lossy compression of images
Image formats
Photographic images can be represented as a very large array of pixels. Those dot-like pixels have varying amounts of the primary colors mixed together to form the final color of those dots. All image formats are really just ways of storing that image information in a computer file until the original image is reconstituted for display or for printing. The differences in the formats have primarily to do with the efficiency with which the original information can be stored.

Resolution
Image resolution includes a measure of the pixel density of an image. In reality this would be the number pixels in a given area of the image. However, practically speaking it is given as the number of pixels per inch of linear distance, and assumes that this is the same horizontally and vertically. In printing, the density is often referred to as "dpi" or "dots per inch," which is equivalent to pixels per inch.

In addition to the spatial resolution given by pixel density, there are also differences in the color depth. The individual pixels of a digital image are generally stored in the computer in a mix of amounts of red, green and blue in a color mode called RGB. Most commonly, the intensity of the color these three channels is represented by an 8-bit number which in turn means that there are 256 different levels of each color. In some cases more bits are used to represent each color, giving greater depth of color representation. For printing processes, 8 bits per color channel or 24-bit color, is probably greater than the printing method can represent. However, for film processes using transmitted light, or for the flexibility in image processing for detail, more color depth can be quite useful. Practically speaking most images and most image processing is carried out at a 24-bit color depth.

Image compression
Inventors and software  engineers have devised clever ways to store a given amount of digital information in a smaller space. Virtually all image formats have one or another of these methods of compression applied to them for making the overall file size smaller. When the compression method allows the original image to be reconstituted intact, without loss of any resolution, it is called "lossless compression.

The TIFF file format can fully specify the content of an original bit mapped image. ZIP, LZW and CCITT are the names of a few of the compression algorithms that have been applied to shrink TIFF images and still allow them to be fully reconstituted. In principle there is nothing wrong with an image compressed with one of these techniques. In practice, the RIP used by Lulu's print partners can produce artifacts when uncompressing LZW or CCITT compressed TIFFs in the process of creating the raster image that is printed.

PNG is another lossless image format with some more advanced image compression techniques. It generally results in somewhat smaller file size than compressed TIFF files. In addition to RGB color channels, PNG has an "alpha" channel for specifying a transparency level for each pixel. RIP software sometimes has issues with transparencies in other images. I do not know if these issues are a problem with PNGs in PDFs.

Lossy compression
Lossy compression is achieved by combining standard compression techniques plus simplifications to the image that reduce the amount of data required to store it. An older style of lossy compression is GIF. With GIF, the colors in an image are reduced to a 256 color palette. This limits the ability to faithfully render photographs but it may work fine for some illustrations that may have relatively few colors in their palettes. It can be very useful for line drawings.

A very popular image format, jpeg, is also lossy. The simplifications that go into jpeg compression make clever use of knowledge about the limitations of human visual discrimination. In saving an image in jpg format one can usually choose a value for the "quality" of the image or the amount of compression. The goal in lossy compression is to reduce the file size further than is possible with lossless compression but to keep the appearance of the image as intact as possible while making the simplifications. There is an almost continuous trade off that can be made between the apparent resolution of the image and the reduction in file size.

The numbers that a given image application assigns to the quality level is an arbitrary number that does not necessarily correlate with the same quality number of another application. Importantly, a quality of 100 or maximum, is not lossless. On the other hand, the color output of printing devices is sufficiently limited that even lower jpeg quality settings can be indistinguishable from the printed output of a TIFF image.

Video displays generate colored light while printed images are limited to reflecting light that is not absorbed by the colors of the image. Because of that, the range of colors and intensities that can be distinguished in looking at a video display of an image is greater than the range of colors and intensities that can be distinguished in a printed image. Thus, in general, if you cannot see the losses in a jpg when looking at it on your screen, you are highly unlikely to be able to distinguish those losses in the printed output.

Examples
The test photograph below was converted from a "raw" digital camera image. (It is shown here reduced to 1/3 original size.)

Original png

This raw image was converted to TIFF and then saved in several other file formats to test the size of the image in those formats.

File sizes for identical images saved in non-lossy formats:
TIFF 11.2 Mb  
TIFF with LZW compression 7.3 Mb
PNG 6.2 Mb  

File sizes for this image saved with various JPG quality settings in The GIMP:
JPG quality-100 (maximum quality) 3.0 Mb
JPG quality-88  0.93 Mb
JPG quality-80  0.69 Mb

Note that the "quality" values in various applications are arbitrary. They are not generally equivalent from application to application. Notice that JPG with The GIMP's quality setting of 80 results in almost a 20-fold reduction in file size. How much does that compression decrease the actual noticeable quality of the resulting image?

Note also that PNG compression is more effective than LZW compression of TIFF files. This is important since LZW compression is not recommended for printing at Lulu.com.

Below are screenshots, of a magnified section of our test image after being saved as PNG, JPG quality100, JPG quality88 and JPG quality80 (but not in that order). Can you perceive the difference in these blow ups with this kind of test? I highly doubt it.

Even when the images are stacked one on top of the other and then switched rapidly, the differences are extremely hard to notice. Not only are the differences very difficult to discern--it is even more challenging to decide, regardless of small differences, which image is superior. If the PNG is not distinguishable from the JPG-80, you should ask yourself what you are getting for all that extra storage space required by the file.

ATest image

B
test image

C
test image

D
test image

So, lets look closer and put the compared sections side by side:
detail image
A
detail image
B
detail image
C
detail image
D
You might convince yourself that the difference is noticeable at this gain. I am skeptical. I have put one image on top of another and switched back and forth rapidly trying to find differences. What differences I see are minimal. Importantly, the printing process will give you less detail than any of these screen images.

Some images are not ideal for JPG compression
JPG is terrific for photographs where there is considerable detail. The compression techniques are not as well suited for images where there are sharp lines dividing areas of single color. Such images include diagrams, art with text and some architectural images. The following image is an example:
circle test image
Original PNG at 100 %
jpg 100
JPG quality 100
jpg 88
JPG quality 88
JPG 80
JPG quality 80

File sizes:
PNG  5,076 bytes
JPG-100  13,890 bytes
JPG-88  6,708 bytes
JPG-80  5,460 bytes

The PNG file is the smallest and it is also lossless. This is because the PNG format can take advantage of the low number of colors and the large areas of single color to greatly reduce file size.

Magnified images detailing the introduction of "jpeg artifact" with higher levels of jpeg compression:
circle test
PNG 

circle test image
JPG 100

circle test image
JPG 88

circle test image
JPG 80

The bottom line:
JPG compression of photographs is extremely effective at reducing file size of some photos with minimal loss of printed image quality. By contrast, PNG compression of images with large blocks of single color maybe both more efficient and greater in printed quality.


©2006 by Don Campbell
last edited 06/28/2006