Hi!
I've done some testing with PDF properties and assigned some metadata to a pdf file.
While using Directory Opus this looks like this:
Searching in ET, added some columns, this file looks like:
I'm missing the title, the subject and the tags.
When opening the pdf in Adobe Reader, looking at the file properties, I see:
ET is configured to index properties in Options-Properties: Comment, Description, Subject, Tags, Title for *.pdf files, so I wonder why I'm missing at least some of those properties?
I'm a bit lost - PDF metadata
Re: I'm a bit lost - PDF metadata
Thank you for the bug report Michi,
Could you please send me your test pdf to support@voidtools.com
I'll look into this issue.
I'm wondering if indexing these properties is causing the issue..
Could you please send me your test pdf to support@voidtools.com
I'll look into this issue.
I'm wondering if indexing these properties is causing the issue..
Re: I'm a bit lost - PDF metadata
I would not say it's a bug, rather an issue right now
I assume that Directory Opus stores the metadata somewhere else. At least the comment is saved to an ADS as Leo from DO support team mentioned. Only the tags and the subject found their way into the PDF.
Nevertheless, ET did only index the comment so far, previously edited by using Directory Opus.
I just did an additional test, modified the subject and the tags again and had a look into Index Journal of ET: Guess, ET recognized the file change. However, in the search result, still the comment is visible.
I've sent you the PDF file I'm currently testing on...
Thanks for looking into!
Michael
I assume that Directory Opus stores the metadata somewhere else. At least the comment is saved to an ADS as Leo from DO support team mentioned. Only the tags and the subject found their way into the PDF.
Nevertheless, ET did only index the comment so far, previously edited by using Directory Opus.
I just did an additional test, modified the subject and the tags again and had a look into Index Journal of ET: Guess, ET recognized the file change. However, in the search result, still the comment is visible.
I've sent you the PDF file I'm currently testing on...
Thanks for looking into!
Michael
Re: I'm a bit lost - PDF metadata
Just to be clear: my additional modification, that was recorded by the journal, was correctly index by ET, however only the modified comment was indexed again - nothing else...
Re: I'm a bit lost - PDF metadata
Maybe this picture can help: Picture 3. -> Extended.
The pdf document was opened in Adobe Acrobat 11.0.23 to display the document properties.
...
The pdf document was opened in Adobe Acrobat 11.0.23 to display the document properties.
...
Re: I'm a bit lost - PDF metadata
Thank you for sending the test PDF.
The metadata is being stored as XMP instead of PDF metadata.
Everything currently only looks at the PDF metadata.
The PDF is using Cross-reference streams.
Everything doesn't support Cross-reference streams yet.
I am guessing Windows Explorer also does not show any metadata under Properties -> Details for this PDF?
Everything will fall back to the system to gather properties for the PDF.
In Everything, you should see the same as Windows Explorer.
I am looking into adding native XMP support and Cross-reference stream support.
The metadata is being stored as XMP instead of PDF metadata.
Everything currently only looks at the PDF metadata.
The PDF is using Cross-reference streams.
Everything doesn't support Cross-reference streams yet.
I am guessing Windows Explorer also does not show any metadata under Properties -> Details for this PDF?
Everything will fall back to the system to gather properties for the PDF.
In Everything, you should see the same as Windows Explorer.
I am looking into adding native XMP support and Cross-reference stream support.
Re: I'm a bit lost - PDF metadata
Hi! Just got your e-mail and your answer! Many thanks for this!
To be fair, this document was the one and only I've seen so far with this issue, so it's certainly not a big problem. Anyway, I will look forward for your enhancement which will again bring ET one step ahead of others similar tools!
To be fair, this document was the one and only I've seen so far with this issue, so it's certainly not a big problem. Anyway, I will look forward for your enhancement which will again bring ET one step ahead of others similar tools!
Re: I'm a bit lost - PDF metadata
Yes, exactly!
With Adobe Reader installed, Explorer does not show even a single metadata property. However, Microsoft Edge is configured as the primary PDF viewer, if this matters.
Re: I'm a bit lost - PDF metadata
Just discovered this interesting thread, which I'd like to join right away.
I am also looking for a way to search metadata in PDF files, e.g. the ones shown below:
Have I set this up correctly below in Everything's options (using "Producer" as an example)?
The file extensions *.mkv;*.mp4;*.avi;*.flv;*.webm were present by default, and I only added *.pdf.
Does Everything need to read the entire file contents of all corresponding files for indexing metadata? I'm asking because I have a few 100 GB of files on a slow remote Windows server over VPN.
I am also looking for a way to search metadata in PDF files, e.g. the ones shown below:
Have I set this up correctly below in Everything's options (using "Producer" as an example)?
The file extensions *.mkv;*.mp4;*.avi;*.flv;*.webm were present by default, and I only added *.pdf.
Does Everything need to read the entire file contents of all corresponding files for indexing metadata? I'm asking because I have a few 100 GB of files on a slow remote Windows server over VPN.
Re: I'm a bit lost - PDF metadata
No.Does Everything need to read the entire file contents of all corresponding files for indexing metadata?
Properties with the type: "Metadata" are stored in a file header.
Everything will only read the file header. (not the entire file content)
Properties with the type: "Content" will read the entire file content.
Re: I'm a bit lost - PDF metadata
Thanks very much