I think it takes a lot of time to convert one by one.
Best answer by daveatsafe
Hi @lily,
Yes, it's possible with a simple Text File to Text File conversion. Set the encoding on the Text File reader to Latin-1 (windows-1252) and the encoding on the Text File writer to Unicode 8-bit (utf-8). This will create an output file identical to the input, except with the different encoding.
However, if you have any tags within the HTML identifying the encoding, these will need to be changed as well. You can do this by adding a StringReplacer to the workspace to replace the string 'iso-8859-1' with 'UTF-8'.
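For readers who want to see the same two steps outside FME, here is a minimal Python sketch. It is not part of the workspace described above. The function name `reencode_html` is hypothetical, and the case-insensitive match on 'iso-8859-1' is an assumption about how the tag may be written in the files.

```python
import re


def reencode_html(raw: bytes) -> bytes:
    """Re-encode windows-1252 HTML bytes as UTF-8 and update the charset label.

    Hypothetical helper for illustration only.
    """
    # Step 1: interpret the input bytes as windows-1252 (Python codec "cp1252").
    # Python raises UnicodeDecodeError for the few bytes cp1252 leaves undefined.
    text = raw.decode("cp1252")
    # Step 2: replace the declared encoding, as the StringReplacer would.
    # Matching case-insensitively is an assumption, not something the thread states.
    text = re.sub("iso-8859-1", "UTF-8", text, flags=re.IGNORECASE)
    # Step 3: write the same characters back out as UTF-8 bytes.
    return text.encode("utf-8")
```

Only the byte encoding and the charset label change; the characters themselves stay the same.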
@daveatsafe has the correct solution here - but to illustrate it I made this one of my question-of-the-week and added it to a video here: https://youtu.be/uyF7MEuBdK0
Thank you Dave and Mark! I will try Dave's solution and give a reply as soon as I can!
Hi @daveatsafe,
Thank you for your solution!
I have tried it and it works with one file at a time.
Then I tried using a Zip file instead, since I want to process all the files with the encoding workspace. But I ended up with one big HTML file (instead of several HTML files, which should match the number of files in the original).
So I tried batch processing with the "Directory and File Pathnames" reader,
but now I am facing the problem that the destination folder option is not available. Instead, it writes everything to a single file too.
Any tips?
Thank you @mark2atsafe ! I have seen your youtube video and it helps a lot! =)
Hi @lily,
You can use the Dataset Fanout to distinguish the output files:
This should write each input file to a separate output file in the output zip file.
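As a rough illustration of the "one output file per input file" behaviour, here is a Python sketch that does the equivalent outside FME. It is not how the Dataset Fanout is implemented. The function name `convert_zip` and the `.html`/`.htm` filter are assumptions, and the charset-tag replacement is left out to keep the example short.

```python
import zipfile


def convert_zip(src_zip: str, dst_zip: str) -> int:
    """Copy each HTML file in src_zip to dst_zip, re-encoded from windows-1252 to UTF-8.

    Hypothetical helper for illustration only. Returns the number of files written.
    """
    count = 0
    with zipfile.ZipFile(src_zip) as src, \
            zipfile.ZipFile(dst_zip, "w", zipfile.ZIP_DEFLATED) as dst:
        for info in src.infolist():
            # Skip folders and anything that is not an HTML file (assumed filter).
            if info.is_dir() or not info.filename.lower().endswith((".html", ".htm")):
                continue
            text = src.read(info).decode("cp1252")
            # Reusing the input name keeps one output file per input file.
            dst.writestr(info.filename, text.encode("utf-8"))
            count += 1
    return count
```

Because each entry keeps its original name, the output zip contains the same number of HTML files as the input, rather than one merged file.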
Thank you @daveatsafe !
I will give feedback as soon as I can! BeSafe =)
It works perfectly!! Now I can move on to my next assignment =)
Thank you!! @daveatsafe @mark2atsafe