Question

Extract image from PDF

  • 2 April 2021
  • 3 replies
  • 46 views

Hello !

I'm actually trying to extract image (raster) from a PDF, but I'm having a black background. I've tried to extract as vector, but for FME, the pdf is a raster image with text as polygons and area box as line. I need the inside information from the pdf ( see attached image). Unfortunately, I'm not being able to have a proper raster representing the initial image. Should I perform an operation on resulting extracted image, so as to remove the black background. If yes, can someone help me.

Thanks a lot!

 


3 replies

Userlevel 6
Badge +33

If you share the pdf somebody can try and reproduce.

Badge +20

Use RasterBandNodataSetter to remove what you need.

RasterToPolygonCoercer to go from raster to polygons.

Usually you need to do a lot of cleanup on rasters from pdf because of the compression so good luck.

Use RasterBandNodataSetter to remove what you need.

RasterToPolygonCoercer to go from raster to polygons.

Usually you need to do a lot of cleanup on rasters from pdf because of the compression so good luck.

Yes, you are right. The problem is from the PDF compression. I've tried to deflate the PDF and I don't have the black background anymore! Thanks for the hints!

Reply