PDF Reader
- January 27, 2015
- 21 replies
- 40 views
- xiaomengatsafe
- rylanatsafe
- hollyatsafe
- siennaatsafe
- nathanatsafe
- fmelizard
- redgeographics
- takashi
- danilo_fme
- dustin
- erik_jan
- sigtill
- itay
- tomf
- todd_davis
- davideagle
- stalknecht
- jelle
- geosander
- kennyo
- philippeb
- fhilding
- ciarab
- courtney_m
- revesz
- paalped
- franco69
- gerhard
- marko
- gschleusner
- dunuts
- adieporter
- arnebrucksch
- makt
- ml56067
- dmatranga
- geospatiallover
- cwarren
- dfresh
- zubairsm
- mygis
- luigibr
- kd
- ekkischeffler
- jatoxa
- lau
- roland.martin
- zzupljanin
- ngstoke
- dannymatranga
- jneujens
- taojunabc
- setld_solutions
- adriano
- jpvo
- mb_fdfa
- wicki
- howard_l
- ville
- mostafabahloul
- drose
- derek
- richardsnyder3
- wellis11
- akituo
- lpalli
- vyaenec
- battlezone77
- dom
- zu
- rbh22988
- maarten
- juanfrasan
- fme4me
- marten_m
- westdakota
- davidwesstrom
- sonya_k
- m
- roger7467
- Safer
- 3719 replies
It may be an old question, an answered question, an implemented idea, or a notification-only post.
Please check post dates before relying on any information in a question or answer.
For follow-up or related questions, please post a new question or idea.
If there is a genuine update to be made, please contact us and request that the post is reopened.
21 replies
- Contributor
- 7 replies
- January 27, 2015
- Celebrity
- 3682 replies
- January 4, 2016
This would make some of my projects a lot easier. For a certain client I have to design a map in vector and then deliver individual layers as PNG files (sometimes in excess of 20000 pixels per side). My current workflow is to write out individual PDF's from Illustrator and rasterize them in Photoshop. This often takes a lot of time and requires some manual actions by me (open file, set size, save file). If I could just run it through FME that would at the very least save me the manual work.

- 68 replies
- January 8, 2016
Would be very useful. We get lots of site plans and data in PDF's, FME could save lots of time in digisiting sites.
- Contributor
- 167 replies
- February 25, 2016
This would be very useful at the moment. We have upcoming requests to read PDF so would be interested in having a reader to analyse geospatial PDF
- Author
- Safer
- 3719 replies
- February 25, 2016
I can confirm that we've been laying the groundwork for this. Won't be in 2016.1, but I'd be surprised if we didn't a form of PDF reading by end of calendar 2016. @ciarab can I ask you to send a couple sample PDFs into support@safe.com so we can be sure your scenario is targetted?
- Participant
- 70 replies
- June 7, 2016
I never realized I have to do this but true enough I have are projects that would require this. Thanks for the update @daleatsafe. If you need some more PDFs to try let me know.
- Supporter
- 956 replies
- August 24, 2016
There is a new open source implementation towards GDAL that reads PDF - more information here http://blog.klokantech.com/2016/08/pdfium-geopdf-driver-in-gdal-21.html?utm_source=feedburner&utm_medium=feed&utm_campaign=Feed%3A+klokan-blog-osgeo+%28Blog%3A+Klokan+Petr+P%C5%99idal%29 @fme_lizard @daleatsafe

- 25 replies
- March 10, 2017
When are we expecting this (pdf reader), if at all
Thanks
- 4 replies
- April 13, 2017
We have been using A-PDF Data extractor to extract data from pdfs. We use a system caller to connect to the app. We hope to see a similiar feature directly in FME without the need of a 3rd party app.
- Contributor
- 2179 replies
- April 17, 2017
At this moment I have no need for a PDF reader.
But I will vote for it as it might speed up the improvements for the PDF writer that I do need:
https://knowledge.safe.com/idea/38680/better-pdf-writer-support.html
- 195 replies
- July 16, 2017
I tried to read text from a pdf file using a PythonCaller and the pdfminer plugin, and it went pretty well. For a start? Like this:
import fme import fmeobjects import sys import chardet from pdfminer.pdfinterp import PDFResourceManager, PDFPageInterpreter from pdfminer.pdfpage import PDFPage from pdfminer.converter import XMLConverter, HTMLConverter, TextConverter from pdfminer.layout import LAParams from cStringIO import StringIO # Template Function interface: # When using this function, make sure its name is set as the value of # the 'Class or Function to Process Features' transformer parameter def processFeature(feature): data = FME_MacroValues['SourcePdfFile'] fp = file(data, 'rb') rsrcmgr = PDFResourceManager() retstr = StringIO() codec = 'utf-8' laparams = LAParams() device = TextConverter(rsrcmgr, retstr, codec=codec, laparams=laparams) # Create a PDF interpreter object. interpreter = PDFPageInterpreter(rsrcmgr, device) # Process each page contained in the document. for page in PDFPage.get_pages(fp): interpreter.process_page(page) data = retstr.getvalue() e = chardet.detect(data) u = None try: if e['confidence'] > 0.3: u = unicode(data, e['encoding']) except: pass if u: feature.setAttribute('pdfcontent', u) else: feature.setAttribute('pdfcontent', data) pass- Contributor
- 305 replies
- July 17, 2017
- Contributor
- 130 replies
- December 7, 2017
I use poppler to read PDF as Raster. Basically it just converts pdf files to jpgs and then u read the jpg.
- Author
- Safer
- 3719 replies
- December 10, 2017
- Influencer
- 626 replies
- December 15, 2017
I'm late to the party, but I vote for this. My primary use would be change detection between two GeoPDF's.
- Author
- Safer
- 3719 replies
- January 4, 2018
Hi all -- what better way to start the year than to try out the new PDF reader in FME 2018 betas. Builds 18236 and later have it. Get it from http://www.safe.com/download and let us know what you think. @ciarab @marko @redgeographics @geospatiallover @gschleusner @sigtill @cartoscro @dannymatranga @zubairsm FYI
- Contributor
- 167 replies
- January 4, 2018
@croningarrett our long awaited PDF reader ;)
This would make some of my projects a lot easier. For a certain client I have to design a map in vector and then deliver individual layers as PNG files (sometimes in excess of 20000 pixels per side). My current workflow is to write out individual PDF's from Illustrator and rasterize them in Photoshop. This often takes a lot of time and requires some manual actions by me (open file, set size, save file). If I could just run it through FME that would at the very least save me the manual work.
Would be very useful. We get lots of site plans and data in PDF's, FME could save lots of time in digisiting sites.
This would be very useful at the moment. We have upcoming requests to read PDF so would be interested in having a reader to analyse geospatial PDF
I can confirm that we've been laying the groundwork for this. Won't be in 2016.1, but I'd be surprised if we didn't a form of PDF reading by end of calendar 2016. @ciarab can I ask you to send a couple sample PDFs into support@safe.com so we can be sure your scenario is targetted?
I never realized I have to do this but true enough I have are projects that would require this. Thanks for the update @daleatsafe. If you need some more PDFs to try let me know.
There is a new open source implementation towards GDAL that reads PDF - more information here http://blog.klokantech.com/2016/08/pdfium-geopdf-driver-in-gdal-21.html?utm_source=feedburner&utm_medium=feed&utm_campaign=Feed%3A+klokan-blog-osgeo+%28Blog%3A+Klokan+Petr+P%C5%99idal%29 @fme_lizard @daleatsafe
When are we expecting this (pdf reader), if at all
Thanks
We have been using A-PDF Data extractor to extract data from pdfs. We use a system caller to connect to the app. We hope to see a similiar feature directly in FME without the need of a 3rd party app.
At this moment I have no need for a PDF reader.
But I will vote for it as it might speed up the improvements for the PDF writer that I do need:
https://knowledge.safe.com/idea/38680/better-pdf-writer-support.html
Any notable progress on the PDF reader?
I tried to read text from a pdf file using a PythonCaller and the pdfminer plugin, and it went pretty well. For a start? Like this:
import fme import fmeobjects import sys import chardet from pdfminer.pdfinterp import PDFResourceManager, PDFPageInterpreter from pdfminer.pdfpage import PDFPage from pdfminer.converter import XMLConverter, HTMLConverter, TextConverter from pdfminer.layout import LAParams from cStringIO import StringIO # Template Function interface: # When using this function, make sure its name is set as the value of # the 'Class or Function to Process Features' transformer parameter def processFeature(feature): data = FME_MacroValues['SourcePdfFile'] fp = file(data, 'rb') rsrcmgr = PDFResourceManager() retstr = StringIO() codec = 'utf-8' laparams = LAParams() device = TextConverter(rsrcmgr, retstr, codec=codec, laparams=laparams) # Create a PDF interpreter object. interpreter = PDFPageInterpreter(rsrcmgr, device) # Process each page contained in the document. for page in PDFPage.get_pages(fp): interpreter.process_page(page) data = retstr.getvalue() e = chardet.detect(data) u = None try: if e['confidence'] > 0.3: u = unicode(data, e['encoding']) except: pass if u: feature.setAttribute('pdfcontent', u) else: feature.setAttribute('pdfcontent', data) passis there a PDF to Excel reader in FME?
Please build something for PDF converter!
I use poppler to read PDF as Raster. Basically it just converts pdf files to jpgs and then u read the jpg.
https://poppler.freedesktop.org/
I'm late to the party, but I vote for this. My primary use would be change detection between two GeoPDF's.
Hi all -- what better way to start the year than to try out the new PDF reader in FME 2018 betas. Builds 18236 and later have it. Get it from http://www.safe.com/download and let us know what you think. @ciarab @marko @redgeographics @geospatiallover @gschleusner @sigtill @cartoscro @dannymatranga @zubairsm FYI
@croningarrett our long awaited PDF reader ;)
Related Topics
PDF Reader: Can it read bookmarks?icon
GeneralCant find pdf-reader in FME Form 2024?icon
TransformersCombining the output of PATH and PDF readersicon
GeneralNeed Correct Settings for PDF Reader(2D) and Writer - Are there Limitations?icon
TransformersBlack Background after deleting Alpha Band layer from Pdf reader, how to avoid transparency turning Blackicon
Transformers
Helpful Members This Week
- redgeographics
18 votes
- ebygomm
15 votes
- hkingsbury
10 votes
- nielsgerrits
9 votes
- liamfez
9 votes
- david_r
9 votes
- geomancer
9 votes
- philippeb
8 votes
- takashi
8 votes
- alexbiz
6 votes
Recently Solved Questions
Community Stats
- 31,940
- Posts
- 121,312
- Replies
- 39,556
- Members
Latest FME
Enter your E-mail address. We'll send you an e-mail with instructions to reset your password.
Scanning file for viruses.
Sorry, we're still checking this file's contents to make sure it's safe to download. Please try again in a few minutes.
OKThis file cannot be downloaded
Sorry, our virus scanner detected that this file isn't safe to download.
OKCookie policy
We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.
Cookie settings
We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.