HT Ingest Service

From MPublishing


Revision as of 13:10, 24 October 2012

What is the Projects2HT service?

In order to ingest digitized books (or booklike volumes) into HathiTrust, each included image must possess proper OCR, pageturner, and preservation metadata. Although books digitized by Google, Internet Archive, and other vendors already have these, locally digitized images may not. MPublishing has developed software and workflows to produce this metadata and to begin the ingest process on behalf of HathiTrust partners.


Frequently Asked Questions

1. How is pricing structured?

Fees are structured in two tiers:

1-100 books: $5.00/book

101+ books: $4.50/book

The first 100 books incur a slightly higher fee to account for one-time setup tasks.
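As a rough illustration of the two tiers, the sketch below estimates a project fee. It assumes the $5.00 rate applies only to the first 100 books and the $4.50 rate to each book beyond that, which is one reading of the fee description rather than a confirmed billing rule. The function name `estimate_fee` and the constants are hypothetical.

```python
from decimal import Decimal

# Assumption: tiers are marginal, so only books beyond the first 100
# are charged at the lower rate. Names here are illustrative only.
FIRST_TIER_LIMIT = 100
FIRST_TIER_RATE = Decimal("5.00")   # dollars per book, books 1-100
SECOND_TIER_RATE = Decimal("4.50")  # dollars per book, books 101+


def estimate_fee(book_count: int) -> Decimal:
    """Return the estimated fee in dollars for a project of book_count books."""
    if book_count < 0:
        raise ValueError("book_count must not be negative")
    first_tier_books = min(book_count, FIRST_TIER_LIMIT)
    second_tier_books = max(book_count - FIRST_TIER_LIMIT, 0)
    return first_tier_books * FIRST_TIER_RATE + second_tier_books * SECOND_TIER_RATE


if __name__ == "__main__":
    for count in (40, 100, 250):
        print(count, "books:", estimate_fee(count))
```

Under this reading, a 250-book project would cost 100 x $5.00 plus 150 x $4.50. Confirm the actual billing rule with MPublishing before budgeting.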


2. Does MPublishing offer volume discounts for larger projects?

No, not at this time.


3. What is the turnaround time?

Normally, books should appear in HathiTrust within 4-6 weeks of initial delivery, provided that they meet the image specifications. [link]


4. Where can I see examples?

[Link to Utah State]


5. What are the steps in the process?

1. If needed, convert the PDF to bitonal and contone TIFFs.

2. Send bitonal TIFFs to OCR.

3. Add the needed preservation headers to the TIFFs and convert the contone TIFFs to JP2s.

4. Manually add necessary structural metadata for Pageturner, page by page.

5. Integrate the OCR, image, and Pagetag data into a package and pass it to Core Services for ingest.
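The ordering of the steps above can be sketched as a simple stage list. This is only an illustration of the sequence, not MPublishing's actual software: the names `STAGES` and `plan_stages` are hypothetical, and the sketch assumes that step 1 is skipped only when the partner already delivers TIFFs instead of a PDF.

```python
# Stage descriptions paraphrase the five steps listed above.
# STAGES and plan_stages are illustrative names, not part of any real tool.
STAGES = [
    "Convert the PDF to bitonal and contone TIFFs",
    "Send bitonal TIFFs to OCR",
    "Add preservation headers to TIFFs and convert contone TIFFs to JP2s",
    "Manually add structural metadata for Pageturner, page by page",
    "Package OCR, image, and Pagetag data and pass to Core Services for ingest",
]


def plan_stages(source_is_pdf: bool) -> list[str]:
    """Return the ordered stages for a delivery.

    Assumption: the PDF conversion stage is needed only for PDF deliveries.
    """
    return list(STAGES) if source_is_pdf else list(STAGES[1:])


if __name__ == "__main__":
    for number, stage in enumerate(plan_stages(source_is_pdf=True), start=1):
        print(f"{number}. {stage}")
```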
