How can I run StreamSets Data Collector in cluster mode for Kafka origin pipelines?

Does StreamSets Data Collector support cluster deployment for Kafka origin pipelines?

1 Answer

Yes - in fact, Kafka is the only origin supported for Cluster Streaming pipelines.

Thanks for the quick response. Is there a tutorial or screenshot guide to follow?

I am using Apache Kafka 1.0.0 and Hadoop/YARN 2.7.3, without CDH or HDP, but validation fails with: VALIDATION_0071 - Stage 'Kafka Consumer' from 'Apache Kafka 1.0.0' library does not support 'Cluster Yarn Streaming' execution mode. Can this stage library not be used in Cluster YARN Streaming mode?

