Skip to main content
Solved

Read word doc tables

  • October 2, 2025
  • 5 replies
  • 91 views

hannahwh05
Contributor
Forum|alt.badge.img+7

I’m looking to read in a load of word documents that contain tables of information. I’d like to extract all the text from those tables so that I can then search the text for specific information. I’ve tried using the Microsoft Word Reader and the MSWordStyler but I’m only seeing the paragraph text come through to the attributes. Any ideas how to extract the table text? 

 

Best answer by rchoucroun

I’m using 2025.1, but I suspect it’s the same in 2024.2. I created the following Word Document:

I used the MS Word Reader and the content appears as unexposed attributes. The relevant attributes for the tables are lists called msword_table_col_header{} and msword_table_col_value{}. Apparently, it doesn’t matter if the table header is defined as a header in Word or not. In FME, the top row of the table is the header.

Each row of the table is it’s own feature. Because my tables have 3 columns, the lists have 3 values. 


You’ll have to expose those attributes as required and search them using a ListSearcher.

 

Hope that helps.

This post is closed to further activity.
It may be an old question, an answered question, an implemented idea, or a notification-only post.
Please check post dates before relying on any information in a question or answer.
For follow-up or related questions, please post a new question or idea.
If there is a genuine update to be made, please contact us and request that the post is reopened.

5 replies

danilo_fme
Celebrity
Forum|alt.badge.img+51
  • Celebrity
  • October 2, 2025

Hi ​@hannahwh05 

 

What is the version of your FME Form?

 

Thanks in Advance,


hannahwh05
Contributor
Forum|alt.badge.img+7
  • Author
  • Contributor
  • October 2, 2025

Hi ​@hannahwh05 

 

What is the version of your FME Form?

 

Thanks in Advance,

Hi ​@danilo_fme 2024.2.4.0


danilo_fme
Celebrity
Forum|alt.badge.img+51
  • Celebrity
  • October 3, 2025

Hi ​@hannahwh05 

 

What is the version of your FME Form?

 

Thanks in Advance,

Hi ​@danilo_fme 2024.2.4.0

Thank you! Could you share an example?


rchoucroun
Contributor
Forum|alt.badge.img+20
  • Contributor
  • Best Answer
  • October 4, 2025

I’m using 2025.1, but I suspect it’s the same in 2024.2. I created the following Word Document:

I used the MS Word Reader and the content appears as unexposed attributes. The relevant attributes for the tables are lists called msword_table_col_header{} and msword_table_col_value{}. Apparently, it doesn’t matter if the table header is defined as a header in Word or not. In FME, the top row of the table is the header.

Each row of the table is it’s own feature. Because my tables have 3 columns, the lists have 3 values. 


You’ll have to expose those attributes as required and search them using a ListSearcher.

 

Hope that helps.


hannahwh05
Contributor
Forum|alt.badge.img+7
  • Author
  • Contributor
  • October 6, 2025

I’m using 2025.1, but I suspect it’s the same in 2024.2. I created the following Word Document:

I used the MS Word Reader and the content appears as unexposed attributes. The relevant attributes for the tables are lists called msword_table_col_header{} and msword_table_col_value{}. Apparently, it doesn’t matter if the table header is defined as a header in Word or not. In FME, the top row of the table is the header.

Each row of the table is it’s own feature. Because my tables have 3 columns, the lists have 3 values. 


You’ll have to expose those attributes as required and search them using a ListSearcher.

 

Hope that helps.

Thank you! That was really helpful - I completely missed the lists that could be exposed in the table view schema