Skip to main content
Question

Extract image from PDF

  • April 2, 2021
  • 3 replies
  • 240 views

Hello !

I'm actually trying to extract image (raster) from a PDF, but I'm having a black background. I've tried to extract as vector, but for FME, the pdf is a raster image with text as polygons and area box as line. I need the inside information from the pdf ( see attached image). Unfortunately, I'm not being able to have a proper raster representing the initial image. Should I perform an operation on resulting extracted image, so as to remove the black background. If yes, can someone help me.

Thanks a lot!

 

3 replies

nielsgerrits
VIP
Forum|alt.badge.img+62

If you share the pdf somebody can try and reproduce.


caracadrian
Contributor
Forum|alt.badge.img+23
  • Contributor
  • April 2, 2021

Use RasterBandNodataSetter to remove what you need.

RasterToPolygonCoercer to go from raster to polygons.

Usually you need to do a lot of cleanup on rasters from pdf because of the compression so good luck.


  • Author
  • April 6, 2021

Use RasterBandNodataSetter to remove what you need.

RasterToPolygonCoercer to go from raster to polygons.

Usually you need to do a lot of cleanup on rasters from pdf because of the compression so good luck.

Yes, you are right. The problem is from the PDF compression. I've tried to deflate the PDF and I don't have the black background anymore! Thanks for the hints!