Digitization Guidelines
Scanning Specifications
Any materials digitized for the NIUDL must adhere to the following minimum requirements. Any divergences from these minimum requirements will be documented in the project's specifications.
Text (Books, Newspapers, Documents)
Image Types | Bit Depth | Color Space | Resolution (ppi) | Scale | File Format |
B&W Text Only | 1-bit | Bitonal | 600dpi | 100% (1:1) | TIFF (uncompressed or lossless compression) |
B&W Text with Illustrations | 8-bit | Grayscale | 400dpi | 100% (1:1) | TIFF (uncompressed) |
Text with Continuous-Tone Images & Photos (Color) | 24-bit | RGB | 400dpi | 100% (1:1) | TIFF (uncompressed) |
Photographs (Prints)
Image Types | Bit Depth | Color Space | Resolution (ppi) | Scale | File Format |
B&W Photographs | 8-bit | Grayscale | 600dpi | 100% (1:1) | TIFF (uncompressed) |
Color Photographs | 24-bit | RGB | 600dpi | 100% (1:1) | TIFF (uncompressed) |
Photographs (Negatives and Slides)
Image Types | Bit Depth | Color Space | Resolution (ppi) | Scale | File Format |
B&W Negatives and Slides | 8-bit | Grayscale | 5,000 pixels across the longest side | 100% (1:1) | TIFF (uncompressed) |
Color Negatives and Slides | 24-bit | RGB | 5,000 pixels across the longest side | 100% (1:1) | TIFF (uncompressed) |
Large Format (Posters and Maps)
Image Types | Bit Depth | Color Space | Resolution (ppi) | Scale | File Format |
B&W Posters and Maps | 8-bit | Grayscale | 5,000 - 10,000 pixels across the longest side | 100% (1:1) | TIFF (uncompressed) |
Color Posters and Maps | 24-bit | RGB | 5,000 - 10,000 pixels across the longest side | 100% (1:1) | TIFF (uncompressed) |
Video Recordings
Video Types | Codec | Resolution | Sampling Size | Data Rate | Frame Rate | File Format |
Archival Preservation | Uncompressed YCbCr or lossless JPEG2000 | 640 x 480 (4:3) or 1280 x 720 (16:9) | 30 bit | 30 MiB/s | 30p (24p for film) | AVI, QuickTime, or MXF |
Access Preservation | DV or ProRes | 640 x 480 (4:3) or 1280 x 720 (16:9) | 30 bit | 30 MiB/s | 30p (24p for film) | AVI or QuickTime |
Born Digital | DV or ProRes | >1280 x 720 (16:9) | 30 bit | >50 MiB/s | 30p (24p for film) | AVI or QuickTime |
Scanning Practices and Procedures
Guidelines for Digital Images
- If digitizing a multi-part resource, always digitize the entire item, including the binding or wrapper and any blank pages.
- Ensure that there is adequate space around each side of the resource (approximately 1/8"), including the gutter of any opposing pages. Each edge should be clearly defined.
- If the image contains important color information, scan with a ruler and color checker, providing space between the resource and the ruler.
- Always use a plain underlay, preferably dark gray, that will contrast with the resource being digitized.
Handling
- Materials are handled as little as possible and stored in a secured location when not in use.
- Bound materials are scanned lying flat, but should not be opened beyond the point where the binding is stressed.
- Scanning of tightly-bound materials occasionally requires removing the binding, which is done in consultation with the Project Manager and Content Provider.
- Glass is used to flatten pages for scanning, but only when it poses no danger to the materials. Fingers or pens should never be used to flatten resources.
- Uncut book edges will be cut by scanning technicians, unless otherwise requested.
- Materials are handled with clean bare hands, unless otherwise requested.
Scanner Calibration
- Scanners are calibrated every week using targets to enure accurate color capture and focus.
- Monitors are calibrated annually.
- Scanners are serviced every year.
Image Manipulation
Image manipulation is kept to a minimum after scanning and should always be avoided whenever possible. In the event that significant problems are identified, the batch will be re-scanned. For this reason, consistent quality control is extremely important. Some exceptions to image manipulation include:
- Images may be desewed or rotated, as needed.
- Images are often cropped, leaving a small border of space (1/8") around each item.
Filenaming and Storage
Files are given meaningful names, typically indicating the collection or project that the files belong to using an alpha-numberic identifier. The name should always end with numerals in a sequnce of at least three digits, seperated from the identifier using the underscore character (_). Filenames should not contain any spaces or other special characters.
If the collection contains resources with multiple parts (e.g. books or manuscripts), each part should be assigned a unique and meaningful identifier after the collection identifier, preceded by an underscore (_). For example:
SEADL999_CambodianManuscripts025
--> SEADL999_CambodianManuscripts025_001.tif
--> SEADL999_CambodianManuscripts025_002.tif
--> SEADL999_CambodianManuscripts025_003.tif
Files are stored on the University Libraries' Network Attached Storage (NAS) in the Scanning directory during production, then copied to the Staging directory when they are ready to be published. Files are removed from the Scanning directory after they have been published and reviewed for quality.
Publishing
Two mechanisms are used for publishing digital content to NIUDL: batch ingest and form ingest. Batch ingest involves creating a Submission Information Package (SIP), which typically consist of a MODS record and an archival file in the appropriate directory structure, which varies by content type. This allows NIUDL to publish many assets at the same time. Form ingest involves publishing one file at a time through the NIUDL front-end and does not require technical knowledge or server access, allowing those outside of NIUDL staff to participate in publishing, include staff from other Library units and partners at other libraries. Ingest forms are developed by the Digital Collections and Metadata Librarian in consultation with the Project Manager.