Skip to main content

I have an export of an MS Whiteboard in pdf form. (I also have the JSON and HTML. The JSON doesn’t store the note contents). 

 

PDF reader is spotting text, but I’m struggling to pull it out ‘per note’. Any ideas? 

Hi ​@p_c_20 
This workbench may work for you. The text ends up being tagged with a sticky note name based on the following steps:
1. Read in pdf
2. Create polygons using AreaBuilder
3. Calculate Area
4. Filter for Area of StickyNote (this may change, might have to tweak)
5. Count stickynotes and name from counter (Sticky_1, Sticky_2….)
6. Spatial Relate text to Sticky and explode that list.
7. Text will be tagged according to the Sticky name it was in.

Hope that helps!


Hi ​@p_c_20 
This workbench may work for you. The text ends up being tagged with a sticky note name based on the following steps:
1. Read in pdf
2. Create polygons using AreaBuilder
3. Calculate Area
4. Filter for Area of StickyNote (this may change, might have to tweak)
5. Count stickynotes and name from counter (Sticky_1, Sticky_2….)
6. Spatial Relate text to Sticky and explode that list.
7. Text will be tagged according to the Sticky name it was in.

Hope that helps!

Thank you so much. Will have a play!


Hi ​@p_c_20 
This workbench may work for you. The text ends up being tagged with a sticky note name based on the following steps:
1. Read in pdf
2. Create polygons using AreaBuilder
3. Calculate Area
4. Filter for Area of StickyNote (this may change, might have to tweak)
5. Count stickynotes and name from counter (Sticky_1, Sticky_2….)
6. Spatial Relate text to Sticky and explode that list.
7. Text will be tagged according to the Sticky name it was in.

Hope that helps!

Is definitely getting me closer to where I need to be. Thank you again.