I received a few old Soviet geological map scans to experiment with georeferencing. Can we place simple JPEGs where they belong in the world? It actually quite possible today with Gemini (other models, unfortunately, are not even close).
The idea is simple - find the the map on the scan with a segmentation mask, clip the scan with the mask, make small crops of the map corners and ask AI to find pixel coordinates of the neatline corner and read the coordinates. The old Soviet map sheet nomenclature is quite elegant, and it’s easy to make names and calculate projection zone from scale, latitudes and longitudes in the map corners, so I was able to build control vectors to transform the pixel coordinates into Gauss-Krüger. Of course it only makes sense with automation on multiple maps - drop your files in a folder, let FME do the work while you enjoy your coffee 😎
Now I am exploring whether I can replace Gemini with other models, especially local ones such as Gemma 4. So far, the results are mixed, but I am testing Visual Anchoring, which allows any model to be a little bit better than it is by itself. If I get something relatively reasonable, I certainly will share it here.
Check my LinkedIn post with more details about this scenario and watch the video of how it works.
Dmitri





