Skip to main content
Solved

PDF reader - reading hyperlinks

  • August 7, 2020
  • 3 replies
  • 46 views

oliver.morris
Contributor
Forum|alt.badge.img+14

Is there a way to extract the hyperlinks from a pdf using the pdf reader. At the moment I can see the text but not the underlying hyperlink.

 

Thank you for the help in advance.

Best answer by oliver.morris

I resolved by first exporting using adobe pro from pdf to html, then got FME to read the html as text and then did some string matching using the inline query tool.

This post is closed to further activity.
It may be an old question, an answered question, an implemented idea, or a notification-only post.
Please check post dates before relying on any information in a question or answer.
For follow-up or related questions, please post a new question or idea.
If there is a genuine update to be made, please contact us and request that the post is reopened.

3 replies

jdh
Contributor
Forum|alt.badge.img+38
  • Contributor
  • August 7, 2020

Could you provide a sample pdf?


oliver.morris
Contributor
Forum|alt.badge.img+14
  • Author
  • Contributor
  • Best Answer
  • August 14, 2020

I resolved by first exporting using adobe pro from pdf to html, then got FME to read the html as text and then did some string matching using the inline query tool.


oliver.morris
Contributor
Forum|alt.badge.img+14
  • Author
  • Contributor
  • August 14, 2020

Could you provide a sample pdf?

thanks for the help, I cant share the content I am using but any pdf with a hyperlink isnt currently exposed using the fme pdf reader, just the hyperlink text as it appears in the page.