
How can I read Parquet files from HDFS, and which component should I use?

asked 2020-02-13 06:09:55 -0600

Aakash

updated 2020-02-13 10:10:26 -0600

metadaddy

I want to read a Parquet file from HDFS. I tried the Hadoop FS Standalone origin, but Parquet is not listed among its Data Format options.


1 Answer


answered 2020-02-13 10:09:32 -0600

metadaddy

Parquet is a columnar file format, which does not fit StreamSets Data Collector's stream-based architecture. Instead, use StreamSets Transformer, which supports reading Parquet files.
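
For reference, Transformer pipelines run on Apache Spark, so the read it performs is roughly what the PySpark sketch below does. This is not Transformer's pipeline configuration, just an illustration of the underlying Spark call. The helper names (`hdfs_uri`, `read_parquet_from_hdfs`, `main`), the NameNode host and port, and the file path are all hypothetical placeholders, and the example assumes `pyspark` is installed and the cluster is reachable.

```python
def hdfs_uri(namenode: str, port: int, path: str) -> str:
    """Build an hdfs:// URI from a NameNode host, port, and absolute path."""
    if not path.startswith("/"):
        raise ValueError("path must be absolute, e.g. /data/events.parquet")
    return f"hdfs://{namenode}:{port}{path}"


def read_parquet_from_hdfs(spark, uri: str):
    """Read a Parquet file (or directory of Parquet files) into a DataFrame.

    `spark` is expected to be a pyspark.sql.SparkSession.
    """
    return spark.read.parquet(uri)


def main() -> None:
    # Imported here so the helpers above can be used without pyspark installed.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("read-parquet-from-hdfs").getOrCreate()
    try:
        # Placeholder host, port, and path: replace with your cluster's values.
        uri = hdfs_uri("namenode.example.com", 8020, "/data/events.parquet")
        df = read_parquet_from_hdfs(spark, uri)
        df.printSchema()  # the schema comes from the Parquet file metadata
        df.show(5)        # print the first five rows
    finally:
        spark.stop()
```

Call `main()` from a machine that can reach the cluster, for example via `spark-submit`. Spark reads the schema from the Parquet metadata, so no data format needs to be declared separately.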
