Is there a transformer for stripping HTML from text? I see the HTMLStripper, but it appears to replace HTML with XML. I want all HTML tags removed, including the <!DOCTYPE html>.
Thanks
Is there a transformer for stripping HTML from text? I see the HTMLStripper, but it appears to replace HTML with XML. I want all HTML tags removed, including the <!DOCTYPE html>.
Thanks
Best answer by jdh
Using the StringReplacer as @erik_jan says, but with <[^>]*> instead should remove all opening and closing tags.