XML zip file to HDFS

asked 2017-11-14 11:15:02 -0500

sanjay12

I have a zip file in S3 that contains XML files. I want to pick the file up from S3, unzip it, and save the extracted XML files to HDFS. I have not been able to find a processor that can do this. Can someone please suggest a solution or workaround?


1 Answer

answered 2017-11-20 13:38:21 -0500

jeff

You should be able to use the Amazon S3 origin as is: choose XML as the Data Format and Compressed File as the Compression Format. Do you get errors with that configuration?
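
If the origin configuration does not work out, the unzip step itself can be done outside the pipeline. The following is only a minimal sketch of that workaround, not a StreamSets feature: `extract_xml_members` is a hypothetical helper name, and it assumes the zip archive has already been downloaded from S3 into memory (for example with boto3). Writing the extracted files to HDFS (for example with `hdfs dfs -put`) is left out.

```python
import io
import zipfile


def extract_xml_members(zip_bytes):
    """Return {member_name: content_bytes} for every .xml entry in a zip archive.

    zip_bytes is assumed to hold the raw bytes of the archive fetched from S3.
    """
    extracted = {}
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for info in archive.infolist():
            # Skip directory entries and anything that is not an XML file.
            if info.is_dir() or not info.filename.lower().endswith(".xml"):
                continue
            extracted[info.filename] = archive.read(info)
    return extracted
```

Each value in the returned dictionary is the raw content of one XML file, so it can be written to a local staging directory and then copied to HDFS with whatever client is available on the cluster.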


Comments

Hi jeff, I tried sending a compressed XML file from an SFTP origin to a Hadoop FS destination. The StreamSets pipeline ran without errors, but the file never arrived in Hadoop FS.

Sam Sambath ( 2017-12-04 06:49:02 -0500 )

Stats

Asked: 2017-11-14 11:15:02 -0500

Seen: 137 times

Last updated: Nov 20 '17