BOM removal from zip file

asked 2018-07-30 21:54:03 -0500

anonymous user


A ZIP file in an HDFS directory contains thousands of XML files. While parsing the XML files, I am getting an xml parser_00 error. It looks like the files contain a BOM (byte order mark). I followed the steps from another post to remove the BOM and write the result to a directory as a whole ZIP file, but it didn't work. What is the process for removing the BOM from the files inside a ZIP file?

1 Answer

answered 2018-07-31 09:22:52 -0500

metadaddy

The process should be:

  1. Extract XML files from ZIP
  2. Remove BOM from each XML file
  3. Create new ZIP with BOM-less XML files

Did you do this? What was the result?
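
For illustration, the three steps above could be sketched in plain Python, outside of any pipeline tool. This is only a rough sketch under a few assumptions: the ZIP has already been copied from HDFS to a local filesystem, the BOM in question is the 3-byte UTF-8 BOM, and each entry fits in memory. The function name strip_bom_from_zip and the file names in the usage note are hypothetical.

```python
import codecs
import zipfile


def strip_bom_from_zip(src_path, dst_path):
    """Copy src_path to dst_path, dropping a leading UTF-8 BOM from .xml entries.

    Returns the number of entries that had a BOM removed.
    Assumption: only the UTF-8 BOM (EF BB BF) is handled; UTF-16/32 files,
    which rely on their BOM, are left untouched.
    """
    stripped = 0
    with zipfile.ZipFile(src_path, "r") as src, \
            zipfile.ZipFile(dst_path, "w", zipfile.ZIP_DEFLATED) as dst:
        for info in src.infolist():
            # Step 1: extract the entry's bytes from the source ZIP.
            data = src.read(info)
            # Step 2: remove the BOM if the XML file starts with one.
            is_xml = info.filename.lower().endswith(".xml")
            if is_xml and data.startswith(codecs.BOM_UTF8):
                data = data[len(codecs.BOM_UTF8):]
                stripped += 1
            # Step 3: write the (possibly cleaned) bytes into the new ZIP.
            dst.writestr(info.filename, data)
    return stripped
```

Calling strip_bom_from_zip("input.zip", "output.zip") would write a new archive and return how many XML entries were changed. The source ZIP is never modified in place, so the original can be kept for comparison if the parser error persists.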

Could you please tell me which processor can be used for the above logic?

edukondalu
