Skip to main content

Looking for ideas on geocoding from text blocks. Each feature has a block of text (~50 to 100 words) describing an incident. More than one place could be mentioned so results would likely include more than one place. But in any case the goal is to geocode based on places mentioned in the text.

An example might look like this:

"...Something happed in Streetsville and people were walking toward Placeville."

And we would want geocoding results from Streetsville and Placeville.

Hi @braggken​,

Not sure if this is too helpful but a while ago I had a similar requirement with address being stored in blocks with a variety of other attributes and text. Admittedly, probably not as long as 50-100 words though.

At first, as we were only looking for a general geocode (Suburb/City level), we split the text with an attribute splitter (and list exploder from memory I believe) and attempted to geocode all values.

In the end, It was actually easier to split the text block and then attempt to join to administrative data (city, suburb, town etc.) using a feature merger.

 

Also interested to see if there are other thoughts on this one.

 


Do you have a list of locations that are likely included in the text?


Do you have a list of locations that are likely included in the text?

Yes I can create a list of most of locations that are likely to be in the text.

In fact this the road I was starting down. Short story: I look for each location in each block with a StringSearcher. This kind of works but two limitations:

  • the names must match exactly
  • my list is likely not complete.

Yes I can create a list of most of locations that are likely to be in the text.

In fact this the road I was starting down. Short story: I look for each location in each block with a StringSearcher. This kind of works but two limitations:

  • the names must match exactly
  • my list is likely not complete.

Yeah, I fear that that may be the case. I wonder if you can use ChatGPT (or other AI, I don't really work with it that much) to extract locations out of the text and then a geocoding API.


Reply