How does Talis Aspire Digitised Content (TADC) treat my PDFs?

When a PDF is uploaded into a digitisation request, TADC performs the following:

  1. Generates a single-page coversheet with the necessary information from the request. Note: A new coversheet is generated for each request.
  2. Begins the packing process to create a 'bundle'. This combines the TADC-generated single-page coversheet with the originally uploaded PDF into a new PDF. TADC then generates an image of every page in the 'bundle'; these images are what is served to the user when they view a bundle in the application.
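
For readers who find it easier to follow the packing steps as code, here is a minimal conceptual sketch. It is not TADC's actual implementation: the names `Bundle`, `make_coversheet` and `pack_bundle`, the request fields, and the image file names are all hypothetical, and pages are modelled as plain strings rather than real PDF pages.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Bundle:
    pages: List[str]        # pages of the combined PDF, coversheet first
    page_images: List[str]  # one image name per page, as shown in the viewer


def make_coversheet(request_info: dict) -> str:
    """Build a one-page coversheet from request details (hypothetical fields)."""
    details = ", ".join(f"{key}={value}" for key, value in sorted(request_info.items()))
    return "COVERSHEET: " + details


def pack_bundle(request_info: dict, uploaded_pages: List[str]) -> Bundle:
    """Put a fresh coversheet in front of the uploaded pages, then image every page."""
    pages = [make_coversheet(request_info)] + list(uploaded_pages)
    # Stand-in for rendering: one image name per page of the combined PDF.
    page_images = [f"page-{number}.png" for number in range(1, len(pages) + 1)]
    return Bundle(pages=pages, page_images=page_images)


bundle = pack_bundle({"title": "Chapter 3", "course": "HIST101"}, ["p1", "p2", "p3"])
print(len(bundle.pages))      # coversheet plus the three uploaded pages
print(bundle.page_images[0])  # image for the coversheet page
```

In this model a three-page upload produces a four-page bundle, because the coversheet always occupies the first page.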

When is the coversheet updated?

Once you have updated information within a 'Live' digitisation request, the bundle is repacked with an updated coversheet reflecting the changes you have made to the request. The repack is documented in the request's worklog.

Can I edit the coversheet?

No, the copyright text contained in the coversheet generated by TADC cannot be edited.

When is a coversheet not automatically applied?

Coversheets are created in TADC to display the approved copyright statement associated with your copyright ruleset. 

When the rights clearance for the digitisation has been obtained manually, a coversheet is not automatically created and attached to the bundle. In these cases, a coversheet can be manually uploaded to the digitisation request.

What does this mean for accessibility?

When viewing a bundle on-screen:

Bundles are presented on-screen within the viewer. Currently, the viewer does not provide any hidden text for screen readers.

Downloading the bundle:

If the original PDF was created with accessibility in mind, it remains accessible when the user downloads the bundle and opens it in a PDF reader that includes screen-reading capabilities, e.g. Adobe's PDF reader or the OSX PDF player with VoiceOver enabled.

The coversheet has no accessible information purposefully built into it. However, our testing with Adobe Reader and the OSX Preview player indicates that some screen readers are able to read the coversheet.

Update - July 2020 

You can preview OCR scans in Talis Aspire Copyright Clearance. To do this, use the OCR toggle in the Preview player to see the plain-text OCR already available in the downloaded version of the scan.

PDF_Viewer_non_ocr.png

OCR:

PDF_Viewer_OCR.png

Note that on individual pages or scans that do not contain OCR text, the following message is displayed: “This page has no accessible text”.

PDF_Viewer_no_accessible_text.png

This OCR layer has also been added to existing request previews, as a quick way to check whether one of your existing scans has already been OCR’d.

Image_and_txt_1Digitised_Content____Talis_Aspire.png

OCR:

PDF_Viewer_OCR.png
