How to parse an XML file using StreamSets?

2019-05-14

2019-05-20

I have a simple XML file that needs to be parsed using StreamSets. I created a pipeline with a Directory origin, an XML Parser processor, and a Local FS (local directory) destination.

The pipeline validates successfully, but the XML Parser processor produces no output. I configured the delimiter element as the XML element.

Please tell me which configuration parameters I need to set for the file to be parsed successfully.

Can you provide an example of the XML and the XML parser configuration?

2019-05-20

You should parse the XML in the Directory origin rather than in a separate XML Parser processor.

Set Data Format to XML, and Delimiter Element to msg.

Now the origin will emit a record for each <msg> element in the input file.
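To illustrate what "one record per delimiter element" means, here is a rough Python sketch that splits an XML document on a chosen element. It is not StreamSets code, and the sample XML is hypothetical, since the original file was only shown in a screenshot. The function name split_records and the flat dictionaries are assumptions made for the illustration; real Data Collector records have a richer field structure.

```python
import io
import xml.etree.ElementTree as ET

# Hypothetical input file contents (the real file was not posted as text).
SAMPLE_XML = """<root>
  <msg><time>8:59</time><request>GET /index.html</request></msg>
  <msg><time>9:01</time><request>GET /images.html</request></msg>
</root>"""


def split_records(xml_text, delimiter_element):
    """Return one dict per delimiter element, mapping child tag to text."""
    records = []
    source = io.BytesIO(xml_text.encode("utf-8"))
    # Stream through the document and act whenever an element closes.
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag == delimiter_element:
            records.append({child.tag: child.text for child in elem})
            elem.clear()  # release the children once the record is built
    return records


records = split_records(SAMPLE_XML, "msg")
for record in records:
    print(record)
```

With this sample, choosing msg as the delimiter gives two records, one per <msg> element. Choosing an element name that does not occur in the file gives no records at all, which is one thing to check when a parser stage shows no output.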

Asked: 2019-05-14 23:42:05 -0500

Last updated: May 20 '19