Itemviewer Image Conversion
From DLXS Documentation
(→Scenario 2: "pdf" requested) |
|||
(One intermediate revision not shown.) | |||
Line 1: | Line 1: | ||
[[DLXS Wiki|Main Page]] > [[Mounting Collections: Class-specific Steps]] > [[Mounting a Text Class Collection]] > [[Newspaper Clip Image Access Mechanisms]] > Itemviewer Image Conversion | [[DLXS Wiki|Main Page]] > [[Mounting Collections: Class-specific Steps]] > [[Mounting a Text Class Collection]] > [[Newspaper Clip Image Access Mechanisms]] > Itemviewer Image Conversion | ||
- | + | ==General Information== | |
<p><span class="command">Itemview.pm</span> is a DLXS Perl class module, | <p><span class="command">Itemview.pm</span> is a DLXS Perl class module, | ||
found in <span class="command">$DLXSROOT/lib</span>. It contains, among | found in <span class="command">$DLXSROOT/lib</span>. It contains, among | ||
Line 9: | Line 9: | ||
and <span class="command">$DLXSROOT/cgi/t/text/ClipView.pm</span> are both | and <span class="command">$DLXSROOT/cgi/t/text/ClipView.pm</span> are both | ||
subclasses of it, and so therefore inherit those methods.</p> | subclasses of it, and so therefore inherit those methods.</p> | ||
- | + | ||
+ | ==Handler Resolution== | ||
<p> After a <span class="command">Pageview</span> or <span class="command">ClipView</span> object | <p> After a <span class="command">Pageview</span> or <span class="command">ClipView</span> object | ||
is instantiated, it contains information about every page in the | is instantiated, it contains information about every page in the | ||
Line 19: | Line 20: | ||
which conversion "helper" program to use. </p> | which conversion "helper" program to use. </p> | ||
- | + | ==Scenarios== | |
<ul> | <ul> | ||
<li>General information about scenarios</li> | <li>General information about scenarios</li> | ||
Line 26: | Line 27: | ||
</ul> | </ul> | ||
- | + | ===General information about scenarios=== | |
<p>In the running | <p>In the running | ||
of <span class="command">pageviewer-idx</span> and its running | of <span class="command">pageviewer-idx</span> and its running | ||
Line 43: | Line 44: | ||
would determine the proper method to invoke the conversion, which | would determine the proper method to invoke the conversion, which | ||
is in turn coded to use a particular helper app. </p> | is in turn coded to use a particular helper app. </p> | ||
- | + | ||
+ | ===Scenario 1: "image" requested=== | ||
<p>Assume that the CGI requests "image". First, the application | <p>Assume that the CGI requests "image". First, the application | ||
Line 69: | Line 71: | ||
is looked for. If there is a TIF file, available, then the <span class="command">TifTGifHandler</span> is | is looked for. If there is a TIF file, available, then the <span class="command">TifTGifHandler</span> is | ||
called which in turn uses <span class="command">tif2web</span>.</p> | called which in turn uses <span class="command">tif2web</span>.</p> | ||
- | + | ||
+ | ===Scenario 2: "pdf" requested=== | ||
<p>Assume that the CGI requests "pdf". First, the application | <p>Assume that the CGI requests "pdf". First, the application | ||
checks the <span class="command">%gStoredFilePriorityMap</span> hash | checks the <span class="command">%gStoredFilePriorityMap</span> hash | ||
in <span class="command">ItemView.cfg</span>. This | in <span class="command">ItemView.cfg</span>. This | ||
hash says that if the CGI is requesting a pdf file, the first | hash says that if the CGI is requesting a pdf file, the first | ||
- | file format to look for | + | file format to look for is a PDF containing the full contents of the item, stored on disk as a .pdf file whose filename is the full idno.</p> |
- | + | ||
- | + | ||
- | + | <p>If there is such a PDF file, then the <span class="command">DeliverFileHandler</span> method | |
- | + | ||
is called, which simply copies the file from its storage location | is called, which simply copies the file from its storage location | ||
(<span class="command">$DLXSROOT/obj/... </span>to | (<span class="command">$DLXSROOT/obj/... </span>to | ||
- | the cached image directory <span class="command">$DLXSROOT/web/cache/...</span>).</p> | + | the cached image directory <span class="command">$DLXSROOT/web/cache/...</span>). It also passes a parameter so that Adobe Reader browser plugin can open the PDF to the page that was last viewed (note this feature is not supported by other PDF viewing applications).</p> |
+ | |||
+ | <p>If the full PDF is not available, the next format checked for (on disk, known via the <span class="command">Pageview</span> database | ||
+ | table and therefore via the <span class="command">PageView</span> or <span class="command">ItemView | ||
+ | object</span>), is a PDF for the current page.</p> | ||
+ | |||
+ | <p>If there is such a PDF file, then the requested | ||
+ | format matches the stored image format and <span class="command">DeliverFileHandler</span> is called as described above.</p> | ||
<p>If | <p>If | ||
there is no PDF file stored, then the <span class="command">%gStoredFilePriorityMap</span> says | there is no PDF file stored, then the <span class="command">%gStoredFilePriorityMap</span> says |
Current revision
Main Page > Mounting Collections: Class-specific Steps > Mounting a Text Class Collection > Newspaper Clip Image Access Mechanisms > Itemviewer Image Conversion
Contents |
[edit] General Information
Itemview.pm is a DLXS Perl class module, found in $DLXSROOT/lib. It contains, among other things, the methods (handlers) that may invoke external image conversion programs. $DLXSROOT/cgi/t/text/PageView.pm and $DLXSROOT/cgi/t/text/ClipView.pm are both subclasses of it, and so therefore inherit those methods.
[edit] Handler Resolution
After a Pageview or ClipView object is instantiated, it contains information about every page in the document, including how many and what kinds of image files are stored for each page. Based on the CGI's request for a particular output format in which to view a page image (whether to retrieve and display a PDF or a GIF or a JPG file) and the types of file formats available for that page, the application object can determine which conversion "helper" program to use.
[edit] Scenarios
- General information about scenarios
- Scenario 1: "image" requested
- Scenario 2: "pdf" requested
[edit] General information about scenarios
In the running of pageviewer-idx and its running of PVApp/CVApp, a PageView/ClipView object is instantiated. By using the Pageview/ArticleClips table in the dlxs database, and querying by the document's idno and the page's sequence number, the object is loaded up with all the page information necessary for a particular document. Therefore, it reflects every page and which file formats are stored on disk for each page.
PVApp/CVApp, invoked by pagevewer-idx and through its parent object, ItemViewApp, would look at the "stored image format" and the "requested image format", and using a hash found in $DLXSROOT/lib/ItemView.cfg, would determine the proper method to invoke the conversion, which is in turn coded to use a particular helper app.
[edit] Scenario 1: "image" requested
Assume that the CGI requests "image". First, the application checks the %gStoredFilePriorityMap hash in ItemView.cfg. This hash says that if the CGI is requesting an "image" file, the first file format to look for, stored on disk, is "jp2". If there is a JP2 file on disk, it will be used, then the Jp2ToJpgHandler would be used, which in turn, would use the helper app kdu_expand_jpg which is a wrapper script we provide to run kdu_expand_jpg with the proper parameters.
If there is no JP2 file on disk, the next file format to look for is "sid"; that is, MrSid. If there is a MrSid file on disk, then the SidToJpgHandler would be used, which in turn, would use the helper app mr_sid_retrieve.
If there is no MrSid file on disk, the next file format to look for is "jpg". If there is a JPG file on disk, then the requested and stored file formats match. Whenever this is the case, DeliverFileHandler is used, which simply copies the file from its storage location ($DLXSROOT/obj/... to the cached image directory $DLXSROOT/web/cache/...).
Finally, if there is no JPG file available on disk, a TIF file is looked for. If there is a TIF file, available, then the TifTGifHandler is called which in turn uses tif2web.
[edit] Scenario 2: "pdf" requested
Assume that the CGI requests "pdf". First, the application checks the %gStoredFilePriorityMap hash in ItemView.cfg. This hash says that if the CGI is requesting a pdf file, the first file format to look for is a PDF containing the full contents of the item, stored on disk as a .pdf file whose filename is the full idno.
If there is such a PDF file, then the DeliverFileHandler method is called, which simply copies the file from its storage location ($DLXSROOT/obj/... to the cached image directory $DLXSROOT/web/cache/...). It also passes a parameter so that Adobe Reader browser plugin can open the PDF to the page that was last viewed (note this feature is not supported by other PDF viewing applications).
If the full PDF is not available, the next format checked for (on disk, known via the Pageview database table and therefore via the PageView or ItemView object), is a PDF for the current page.
If there is such a PDF file, then the requested format matches the stored image format and DeliverFileHandler is called as described above.
If there is no PDF file stored, then the %gStoredFilePriorityMap says to look for a TIF file. If there is one stored, then the Tif2PdfHandler is called which in turn uses c42pdf.