Blog > Digitizing the Photothek’s Card Catalogues
Digitizing the Photothek’s Card Catalogues: Insights into the Project’s Development
June 2025
Introduction
-
Digitize and Transcribe:
All analogue cards are converted into machine-readable records through an AI-driven transcription pipeline. This system processes both handwritten and typed entries, including archival symbols and shorthand, and outputs structured text. Transcriptions with low confidence scores are flagged for manual verification to ensure accurate capture of metadata.
-
Enhance Access
The project aims to enable full-text search and advanced metadata queries across the entire collection, allowing researchers to filter results by specific fields and discover photographs, including those not yet digitized, within the Photothek’s holdings.
-
Integrate and Preserve
The integration of transcribed data into the Photo Library database is planned to enhance existing records and add new entries, supporting the database’s ongoing expansion and enrichment.
Digitizing and annotating each card ensures these archival records are preserved and accessible in a single, searchable catalogue that connects the analogue and digital collections. For more details, visit the project page.
On May 29, 2025, we shared our preliminary findings and methodology at the European University Institute’s event “Photography and Archives: Discovery, Technology and Innovation” presenting our pilot scanning test, AI-assisted transcription process, and plans to expand the project.
The Card Catalogue

© Bärbel Reinhard 2024
Card Structure and Metadata Fields

Each index card contains key information about the photograph and its depicted object. Typical fields include:
- The photograph’s inventory number, often marked with a red dot to indicate cataloguing status
- The photograph’s location within the archive
- The object’s title, artist, and date of creation
- The geographic location associated with the object
-
Information specific to the photograph, such as the photographer’s name and negative number
Digitization
-
Evaluate Scanning Quality
To assess the technical feasibility of digitizing the entire collection while maintaining high image quality and legibility.
-
Test Transcription Techniques
To evaluate various AI-based OCR models and prompt strategies in order to determine the most effective approach for accurate and reliable data extraction.
The results of this test are currently under review to guide the scaling of the project to include the entire catalogue.
As the project moves forward, the evaluation of scanning techniques remains a critical step. Identifying the most suitable hardware and workflows will ensure both efficiency and long-term preservation quality for the full collection.
AI-Aided Automatic Transcription
Prompt Engineering
- Instructing the model to recognize and categorize metadata fields despite minor variations in card layouts
- Accurately transcribing handwritten annotations and corrections, some of which are difficult to read even for human reviewers
- Correctly identifying categories and subcategories, a task complicated by inconsistencies in how these are visually indicated across cards.
Several iterations of prompt design were tested to address these challenges. Each version was evaluated for its ability to handle field recognition, layout variability, and metadata assignment. Feedback from these tests informed progressive refinements.
The final prompt guides the model through a multi-step process: first, distinguishing separator cards from index cards; second, constructing a hierarchical folder structure based on identified categories; and finally, extracting all relevant data from index cards and mapping each element to the corresponding metadata field. This prompt includes comprehensive descriptions of all known field types, layout variations, and exceptional cases. The central challenge lay in achieving a balance between clarity and completeness to reduce the risk of transcription errors.Cards Transcription Platform


A primary goal is to train the AI to distinguish between category (separator) cards and index cards. As soon as a category is identified, the system creates a folder structure that mirrors the original card catalogue’s hierarchy. After each transcription run, index cards are automatically sorted into these folders based on the category cards, preserving the archival organization in the digital environment. Each card links to its own metadata page, which displays the transcription output—including recognized fields and any flagged uncertainties that require human review.

Evaluating Transcriptions
-
Flagged Images Gallery
The platform displays all “flagged” cards, containing transcription errors or uncertain fields, as a series of thumbnail previews. Users can click on any thumbnail to visualize a detailed comparison of model outputs versus user corrections, helping to quickly identify problematic scans or fields.

-
Detailed Comparison View
This view presents a side-by-side layout showing:
- The original scanned card image
- The “ground truth” or user-corrected transcription
- Transcriptions generated by each selected LLM and prompt

-
Evaluation Heatmap
A heatmap summarizes how each LLM and prompt combination performed on every metadata field. Cells are color-coded—green for exact matches, yellow for partial matches, and red for mismatches—enabling quick identification of which fields and prompts produce consistent errors or require further refinement. The summary interface helps quickly reveal recurring transcription issues across multiple cards.
