I was actually thinking of you when drafting a number of the LiveDocs articles, including a few I haven't published yet. Despite the overwhelming character of your hundreds of corpuses (a problem that can be dealt with better thanks to the new import/export features from a few versions ago, which allow consolidation), you're probably the best example I know of a power user who gets the most out of the corpora for your legal and financial translation.
I wait in eager anticipation for this year's Sustainability Report from a favourite client (130-odd pages in Word) and exporting that LiveDocs to a TM is a great way to start.
If the published (final) source and target versions are in PDF format, I can show you better ways than OCR to capture and align those versions. The relevant information is in my old YouTube video about iceni InFix, but I'll recap it all. This is especially important in cases where the PDF publications are protected (with a password), because most import filters, OCR software, etc. won't work in these cases. InFix just blows past all that "protection" and gives you XML. I used to do that with annual reports from the German Patent Office, though they eventually stopped passwording their publications.
In this case, I am ready, since there were only a few changes made on the final proof in PDF format, and I manually updated my memoQ translation file last year.
PDFKey Pro is excellent for unlocking PDFs: I routinely unlock PDFs before indexing them with dtSearch or creating bilingual files with AlignFactory, bilingual files I import into LiveDocs.
I haven't been able to warm to Iceni InFix and find it too cumbersome for my needs. Also, it's one more annual subscription I don't need.
Excellent piece. I am prompted to rethink my LiveDocs strategy.
I was actually thinking of you when drafting a number of the LiveDocs articles, including a few I haven't published yet. Despite the overwhelming character of your hundreds of corpuses (a problem that can be dealt with better thanks to the new import/export features from a few versions ago, which allow consolidation), you're probably the best example I know of a power user who gets the most out of the corpora for your legal and financial translation.
I wait in eager anticipation for this year's Sustainability Report from a favourite client (130-odd pages in Word) and exporting that LiveDocs to a TM is a great way to start.
If the published (final) source and target versions are in PDF format, I can show you better ways than OCR to capture and align those versions. The relevant information is in my old YouTube video about iceni InFix, but I'll recap it all. This is especially important in cases where the PDF publications are protected (with a password), because most import filters, OCR software, etc. won't work in these cases. InFix just blows past all that "protection" and gives you XML. I used to do that with annual reports from the German Patent Office, though they eventually stopped passwording their publications.
In this case, I am ready, since there were only a few changes made on the final proof in PDF format, and I manually updated my memoQ translation file last year.
PDFKey Pro is excellent for unlocking PDFs: I routinely unlock PDFs before indexing them with dtSearch or creating bilingual files with AlignFactory, bilingual files I import into LiveDocs.
I haven't been able to warm to Iceni InFix and find it too cumbersome for my needs. Also, it's one more annual subscription I don't need.
PDFKey Pro costs all of €22 + tax.