Skip to main content
Solved

PDF reader - reading hyperlinks


oliver.morris
Contributor
Forum|alt.badge.img+12

Is there a way to extract the hyperlinks from a pdf using the pdf reader. At the moment I can see the text but not the underlying hyperlink.

 

Thank you for the help in advance.

Best answer by oliver.morris

I resolved by first exporting using adobe pro from pdf to html, then got FME to read the html as text and then did some string matching using the inline query tool.

View original
Did this help you find an answer to your question?

3 replies

jdh
Contributor
Forum|alt.badge.img+28
  • Contributor
  • August 7, 2020

Could you provide a sample pdf?


oliver.morris
Contributor
Forum|alt.badge.img+12
  • Author
  • Contributor
  • Best Answer
  • August 14, 2020

I resolved by first exporting using adobe pro from pdf to html, then got FME to read the html as text and then did some string matching using the inline query tool.


oliver.morris
Contributor
Forum|alt.badge.img+12
  • Author
  • Contributor
  • August 14, 2020
jdh wrote:

Could you provide a sample pdf?

thanks for the help, I cant share the content I am using but any pdf with a hyperlink isnt currently exposed using the fme pdf reader, just the hyperlink text as it appears in the page.


Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

 
Cookie settings