|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
4. DIGITAL PRODUCTION2007 Revised Edition | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
4.1 Setting up a Digital Production Station
Digital production is probably one of the easiest functions of creating a digital project --- and the most fun! It is exciting to see long-stored items, fragile materials, and negatives come to life on a screen. While scanning or photographing is easy, it can be deceptively so. And if you are looking for a high quality product it can get complicated. Once the initial thrill wears off, the processes in production become repetitive rapidly and even boring, causing mistakes to be made. Aside from the physical handling of original materials, digital production is time-consuming and redundant work, and therefore the imaging should be done correctly the first time. It is very difficult and expensive to go back and scan or photograph again or to recover documentation that did not accompany the original production image. To this end, NC ECHO follows the "scan once methodology" (this covers both scanning and digital photography). In basic terms, this means that in creating digital images, the digital production should be done at the highest level of quality that an institution can afford. The higher the quality, the longer the life of the image and the more versatile its uses. As you plan for digital production and are determining the level of quality your institution can support, consider the future uses of the digital images. Do not anticipate returning to re-digitize. Many originals could suffer from the handling and exposure to the bright light required by digitization. For instance, it is reported that scanning exposes a document to four times the destructive light as one photocopy. Therefore it is best to simply "scan once," to create a master image, and make any future duplicates from that master image. This chapter of the Guidelines specifies the equipment, standards, and techniques required to conduct the digital production portion of a digital project. It discusses digital production stations, the "scan once methodology," purchasing decisions for hardware and software, imaging standards, basic steps to follow, documentation, and quality assurance. Setting up a Digital Production StationOne of the major considerations in beginning a digitization project is to make best use of the available workspace. While some considerations listed here may not be practicable for your institution, we provide the best possible scenario and reasoning so that you can make appropriate decisions according to your physical configurations. In an ideal world, institutions would have the resources to have designated digital production station that will not only be perfect for scanning or digital imaging with a camera but also not deprive you of space already devoted to other activities. This ideal is rare, so institutions are often faced with multi-purposing space or reconfigurations that put additional strains on an already limited resource. In order to make the best use of existing space, begin with the following questions: 1. What will the space be used for?
2. What type of materials will be captured?
3. What sort of equipment will be used?
4. How many staff will be working at one time?
Once you have considered the above questions and surveyed your existing physical facilities, consider the following as "best practice" for digital production workspace configurations:
Establishing an environment that is functional and pleasant will make the experience of a digital project all the more rewarding and will increase your efficiency in producing the digital images. The "Scan Once Methodology"It is expensive for institutions to go back and re-digitize their holdings. Few ever do so. In addition, many originals could suffer from the handling and exposure to bright light required by digitization. Therefore, it is best to simply "scan once," create a master image, and make any future duplicates from it. Step One - Create a Master ImageThe highest quality copy of a digital image, often called the mater image, is expected to be a quality surrogate of the original. As such, it should represent the un-manipulated original and be created at a high resolution and stored in an uncompressed format (usually TIFF). High resolution equals large amounts of information captured, and large amounts of information captured usually equal a higher quality digital image. The higher the quality, the longer the life of the digital copy and the more versatile its uses. It is the master image that holds the promise of versatility and longevity. From it, high quality prints or publications might be made as well as derivatives for a variety of uses. Step Two - Create an Access ImageAccess images are lower resolution copies taken from the master by using a "save as" function and changing the storage format and resolution. Access images may be of varying quality and are generally manipulated for better display upon the screen or page (cropping, re-sizing, etc.) Additional images, such as "thumbnails" (even lower resolution copies) may also be created from the master or access image. These thumbnails allow for even quicker downloads of pages, and faster retrieval of large numbers of images. Suggested resolutions, bit depths, and storage formats of each of these types of digital reproductions (master, access, and thumbnail) are outlined below. Images created from the master are often referred to as derivative images. Step Three - Storing the Master ImageThe master image is the copy to be maintained for the long-term. As such, it should be stored appropriately. Master images take up a great deal of space, and most institutions will not wish to store them for the long-term on computer hard-drives. Some institutions maintaining large amounts of digital images will wish to work with a form of tape or server backup, while those institutions engaged in more modest digital products may choose to store master images on CDs. If an institution decides to use CDs as a storage medium, it is suggested that two copies of each CD be prepared and stored separately. One will serve as the "master" CD and the other will be the "use" CD from which access images, copies for users, etc. may be prepared. CDs used in this way should be "refreshed" regularly, that is copied from the old CD to a new CD (approximately every 5 years)..
Getting the EquipmentSelection of the necessary equipment can have the greatest impact on the quality of images for a digital project. The development of scanning and digital camera technology has led to a proliferation of equipment varying in quality and availability. This section provides the necessary information to make an effective decision for your institution. Before any equipment is purchased, consider the following overall questions:
Hardware: Digital CaptureThere are basically six types of digital capture devices.
Pros and Cons of Digital Capture Devices
Table adapted from Don Williams, "Selecting a Scanner," Guides to Quality in Visual Resource Imaging Hardware: ComputersSelect the computer that will be used in the digital production. It is recommended to devote one computer to this and below are outlined some guidelines on the best selection for this. Select a computer that:
If you are going to be purchasing a new computer to act as your digitization station, it is recommended that you review trade publications such as PC Magazine to help make an informed decision. In making these decisions, it is recommended that you involve your technology support as much as possible. Not only can technology personnel provide help in making decisions, but they will be better able to perpetuate their support throughout your digitization project. Digital camera reviews can be found at http://www.dpreview.com/. Hardware: PurchasingIn purchasing hardware, consider these issues: What are the resolution capabilities? Is the scan bed large enough to handle your originals? How long does it take to scan one image at your master image specifications? Does the manufacturer have a good reputation for service and durability? Optics quality is important. Manufacturers' claims sometime may be unreliable, especially relating to the number of pages scanned per minute and the maximum possible resolutions. Look for product reviews, ask those using the equipment, and play close attention to actual rather than interpolated resolution. A scanner's speed is directly related to the associated computer's capabilities. The higher or faster the RAM, Hard disk space and CPU speed, the better. Reviews:
SoftwareSome kind of software usually accompanies the digital production device. For a scanner, this is the scanning software and for a digital camera, this is the software that provides the interface to download images from the camera to the computer. A second kind of software is used to manipulate the scanned image. This is image manipulation software. It may come with the scanner, but it will usually allow for only the very basic editing of an image. Manipulation software is mounted on the hard drive of a computer and is used to orient the image; crop it; adjust brightness, contrast, and resolution; transform; flip; or otherwise manipulate the image. The de facto standard for image manipulation is the software package, Adobe PhotoShop. It can import the scanning software so that you are able to scan and manipulate the image within the PhotoShop umbrella application. There are several versions of PhotoShop, ranging from PhotoShop Elements (about $40.00) to PhotoShop Creative Suite Premium (about $1,200). Other imaging software is adequate for basic tasks (Paint Shop Pro, Deskscan II, etc.). It is recommended that you look for software that allows you some flexibility for advanced manipulation and saves the image in all the common formats (i.e., TIFF, JPEG, GIF). It is also recommended that the software allows conversion from one format to another. If the project will require the processing of a large volume of images, it is best to consider additional software that allows batch processing (i.e., PhotoShop, Debabelizer or ImageMagic) that will enable the automatic processing of files and the standardization of compression. When selecting image manipulation software, institutions should look for
Software: Purchasing
Purchasing EquipmentThe main factors to consider in purchase: Cost Installation Destination of the image Resolution needed Number of items to be scanned Format of items to be digitized Additional Tools If you are purchasing an expensive capture device, company representatives should demonstrate its capabilities. You should also negotiate a trial period in which you can evaluate the results of digitizing a full range of materials. What Can Be Digitized and HowBelow is a table showing types of materials that can be digitized: the type of file (master, access, and thumbnail) and suggestions for corresponding resolution, storage format, and bit-depth. These suggestions are based upon standards and best practices being followed by some of the nation's major digitization projects. The resolutions, abbreviations, bit types etc. mentioned in this table are discussed later in this section.
Digital Audio StandardsMany digitization projects are interested in including digital audio, whether digitizing analog audio media or creating new digital media. Audio files provide depth and variety to digital projects. Transferring analog audio to a digital media is a relatively simple process. The conversion involves four devices: an analog audio playback device, an analog-to-digital converter, a computer to process the digital signal, and a device for digital file storage. Other devices can include a mixing device. There are several audio software programs available to allow manipulation of the audio, including volume adjustments, tracking, equalization, noise reduction, and compression. For master files, these methods are used sparingly, but for derivative files can help to provide enhanced access to the audio file. Digital audio files can be recorded in many formats, such as WAV, AIF, and MP3. The most important aspect in selecting a file format is to choose one that is non-proprietary, with a high potential for future readability. Uncompressed formats will provide maximum audio fidelity. The WAV file was developed by Microsoft and is in widespread use. WAV is readable by virtually all audio software programs. AIF file type was developed by Apple Computer and is also used widely. Both WAV and AIF are uncompressed and accepted for long-term file storage. MP3 file format has emerged as the file type of choice for many applications. This file format Is highly compressed for electronic transfer. It is recommended that institutions use WAV for master files but can use MP3 for access files and delivery on the web. Pros and Cons of Digital Capture Devices
Elements of a Digital Object3 Types of ScanScanners generally support three types of scans and present these options to their users.
File FormatsDigital images are stored in five major types of formats. It is the type of format and level of resolution, which is the difference between the "level" of scans. In order to save space and "move faster" over the Internet, some formats drop information from an image. Later the software analyzes what it did not drop, infers what must have been discarded and partially reconstructs the original image. This process is called "compression." It is recommended that master images are not compressed to maintain as much original information as possible.
Master images must be of the highest quality. Web images need not require such stringent quality controls. But, before compromising on image quality, consider the cost of migrating the image. Because migration is costly, it is far sounder to migrate a high quality (master) image than one of lesser quality. All digital images will have to be migrated, if kept long enough. While the primary use of images in North Carolina ECHO is focused on Web access, repositories need to be mindful of future use, remembering the fragile nature of the originals and potential damage digital capture can do. Publishing on the Web will result in requests for high quality copies of the images, so consider all possible needs before you produce your digital master image. Remember the advice, it is better to "Scan Once, Save Twice!" Basic Production StepsScanningThe basic steps in scanning an image will be determined by the format of the material to be scanned, but all formats (color, B&W, and Bi-tonal) share common scanning techniques. While a full scanning manual is beyond the scope of this document, scanning an individual image might look like this:
Digital Cameras
Digital Audio
Image Size and ProportionWhen trying to determine the size of the image as it will appear on a monitor, confusion often arises from the method of measurement. What is the difference in dpi, ppi, and lpi? The original image may be measured by inches, centimeters or millimeters.
A longer horizontal dimension indicates a "landscape" view. A longer vertical dimension indicates a "portrait" view. In the example above, if you want the entire image to show on the 640 x 480 pixel computer screen, you would have to resize the portrait view to reduce the vertical dimension to 480 pixels or less. The information in a pixel is fixed and does not change. What can change is the array of numbers of pixels. Increasing the numbers of pixels increases the size and resolution of an image. All scanning is a "sampling" of portions of the original. The higher the resolution of this sampling, the more real information you have to work with. However, the higher the ppi, the longer it will take the computer to load the image to the screen. Consider the following examples when figuring out the proportion of computer screen image to original.
Let's say you want a print of your image, and you capture it in at 600 dpi (remember, dots per inch for printing). The image will print satisfactorily, but the screen resolution will be over-kill. Typically, the resolution of the access image should be about 1/2 the resolution you want to make a print. Note that the 300 ppi (remember pixels per inch for screen) used for the image above is 1/2 the print resolution of 600 dpi. Keeping your image resolution at 1/2 your print resolution is a good rule of thumb and will lessen the confusion often found between print resolution and image resolution. To re-cap: the print size of an image is important only for printing. If you go by the print pixel size and carry that image to the screen, it will usually exceed the screen size. For example, a printed image of 3 x 2 inches scanned at 300 ppi will be 900 x 600 pixels on the screen and the same image scanned at 600 ppi will be 1800 x 1200 pixels! Therefore, the image intended for a printer is far too large to fit most screens and must be re-sized with your photo application by lowering the resolution. COLOR information in an image is dependent on the number of colors or shades of gray that can be carried by a pixel. This carrying capacity is referred to as the pixel's dynamic range or its bit-depth. The standard minimum carrying capacity is 8 bit and currently the maximum is 42 bits. However, most image viewing software cannot display 42 bits yet. A BIT is the smallest storage unit in a computer. It is the unit of measurement for determining the range of color or shades of gray found in an image. The greater the dynamic range or bit-depth, the greater the subtlety of color or gray. Remember the trinity of three types of scan: bi-tonal, grayscale, and color? There is a preferred bit-depth for scanning each of these. BIT DEPTH
One bit bi-tonal is obviously more suited to line drawings and text and 8 bit and 24 bit color more suited to images where the full range of colors are needed. Eight bit grayscale is the de facto standard for black and white photography and many graphics. One bit bi-tonal may not be satisfactory for some text and line drawings because it does not capture enough information. Experimentation with grayscale may also be necessary. Likewise, some badly faded color photographs may be better scanned in 8 bit grayscale. Some will wish to know why their institution should digitize at a higher resolution than a computer screen is able to present. The answer lies in potential and multiple uses. High quality printers (the type used in books and magazines, for example) use higher resolutions than do computer screens. If a publisher spots an image in a digital collection, he or she may wish to use it in a printed work. If the original has only been scanned for access (low resolution or compressed), then it will necessarily need to be scanned again at higher resolution and an uncompressed format. This means re-handling and another exposure to a bright light source. More handling and more light means more damage to the original. And who knows? Higher resolution computer screens and delivery systems that can handle the correspondingly large files may be a common part of homes and offices in the not-too-distant future. Signal to Noise or Image Quality FeaturesThe signal to noise ratio is the primary measure of image quality. If the signal to noise (S/N) ratio is high, the image quality will be high. However, measuring signal to noise is largely a subjective process. "Noise," or degradation of the image, is not generally a good thing. The more noise, the poorer the image. The aim in a good scan is to decrease the noise. There are several ways to do this that use standard image quality features found in most image manipulation software. The most common noise reduction features are:
Be careful that features are not used that can increase noise. The following features should be used with caution as they increase the noise of a scan:
Reminder: Image quality is generally measured by evaluating some or all of the following features:
Monitor DisplayMonitor resolutions can vary with the type of monitor used, which can affect your image display. Standard monitor resolutions follow:
Monitors should also be calibrated for best results. Software packages like Adobe Photoshop often include a basic monitor-calibration tool. Quality AssuranceQuality assurance needs to be performed throughout the creation of your digital images. While it may seem like a daunting task, digital images should be checked to ensure that they are at the level of quality specified by your institution. For example, documents can be skewed and not noticed by the staff member creating the digital images. Always use more than one set of eyes to assure that your images are consistent and of high quality. It is recommended to establish a system early for quality assurance and do the work on a regular basis. Digital objects tend to pile up, and you don't want to have a large number to go through at one time. That situation can lead to sloppy checking. Some institutions choose to look at a sampling of the images once initial production set up has been established. A good quality assurance program will help you to stick to the "Scan Once Methodology" by ensuring that the master image is accurate.
Optical Character RecognitionWhen a computer scans a text, all it duplicates are graphical bits on a virtual page. In other words, it creates a digital image or copy of the page. A user can not edit or search in the newly-created document. If that image is passed through an Optical Character Recognition program (OCR), the software converts the shapes it recognizes into individual letters, creating a text document. However, OCR recognizes and converts few documents perfectly. It makes frequent errors, especially if the original image is blurred, faded, or otherwise unclear. Work with OCR is even more labor intensive than straightforward imaging of pages, requiring a great deal of editing and quality control. It does, however, produce a much more versatile digital product. OCR'd documents must be proofed, word by word. Such things as unusual proper names, blemishes on the document, uneven light, tables, borders around text, offset fonts, superscripts and subscripts can throw the word recognition off. There are various devices available for capturing images for OCR but a desktop scanner will also work. Because a desktop scanner digitizes the image by dividing it into hundreds of pixel-sized boxes per inch and represents each box with either a 1 or a 0, the OCR program organizes the patterns of dots into characters. This allows for the computer to translate character images into editable text. In addition to OCR, there is the PDF (Portable Document Format) which is an open and universal file format that preserves the fonts, images, graphics, and layout of any source document, regardless of the application and platform used to create it. Governments and enterprises around the world have adopted PDF to streamline document management, increase productivity, and reduce reliance on paper. PDF is available to anyone who wants to develop tools to create, view, or manipulate PDF documents. A good place to research PDF's is http://www.pdfzone.com. Documenting the Digital Production ProcessMany collections are now realizing the importance of documenting the process of digital production. There are many reasons documentation is wise. One of the most critical reasons is that as technology changes, migration of earlier information is a reality. The more documentation that is available, the easier and less costly the migration will be when it occurs. And it will occur. NC ECHO has created a preservation metadata standard that helps you document your digital production of images by encouraging you to record information about each digital image you create (http://www.ncecho.org/presmet/index.htm). Some aspects of the preservation metadata remain relatively stable while others will change for each image. But preservation metadata is not the only documentation you should consider. Other documentation that can inform your digital product and management include:
An example of scanning documentation is for the Historic American Sheet Music Project from the Rare Book, Manuscript, and Special Collections Library at Duke University.
ConclusionDigital production is the fun part of a digitization project, and it can be done relatively easily. Unfortunately, the ease can be deceptive. To produce a "down and dirty" quick digital image takes almost no time and effort at all. With just a bit more time, effort, and storage space, an image can be created that would
To support the long-term viability of a digitization project, the production process should be thoroughly documented to help future caretakers and users of the images. Once there is a documented digital master, a variety of processes (including some that are automated) can be used to manipulate the access and thumbnail images for better presentation over the Web. The scanner or camera awaits--and it only takes a little more time and effort to give those images a better chance at varied use and long-term viability. Further ReadingBesser, Howard. Procedures and Practices for Scanning. http://sunsite.berkeley.edu/Imaging/Databases/Scanning/ CDP Digital Audio Working Group, Digital Audio Best Practices, version 2.0, November 2005. http://www.cdpheritage.org/digital/audio/documents/CDPDABP_1-2.pdf Research Libraries Group and Digital Library Federation. Guides to Quality in Visual Resource Imaging. http://www.rlg.org/legacy/visguides/visguide6.html Kenney, Anne and Steven Chapman. Digital Imaging for Libraries and Archives. Ithaca: New York, Department of Preservation and Conservation, Cornell University Library, June 1996. Kenny, Anne R. and Oya Y. Rieger. Moving Theory into Practice. Research Libraries Group, 2000. See also: http://www.library.cornell.edu/preservation/tutorial/index.html Williams, Don. "Selecting a Scanner," Guides to Quality in Visual Resource Imaging. http://www.rlg.org/visguides/visguide2.html
Return to NC ECHO Home Page |