Ask Your Question

Revision history [back]

Although StreamSets Data Collector is a Java app, which in theory can run 'anywhere', it relies on several bash shell scripts, for example bin/streamsets and libexec/_sdc set up the environment and run the actual Java app.

There are several possible ways forward:

Of course, even the app starts, there may well be code changes required due to assumptions in the code that do not hold true on Windows.

The main reason none of the above has been tried is that we haven't seen much demand for running SDC on Windows. Most big data processing takes place on Linux - even Azure HDInsight is Linux-based.

The whole product is open source, of course, so you are very welcome to take a look at running it on Windows, and even submit pull requests with any necessary changes.

For the purposes of this answer, I'll assume you've considered and rejected the possibility of running the SDC Docker image with Docker for Windows.

Although StreamSets Data Collector is a Java app, which in theory can run 'anywhere', it relies on several bash shell scripts, for example bin/streamsets and libexec/_sdc set up the environment and run the actual Java app.

There are several possible ways forward:

Of course, even the app starts, there may well be code changes required due to assumptions in the code that do not hold true on Windows.

The main reason none of the above has been tried is that we haven't seen much demand for running SDC on Windows. Most big data processing takes place on Linux - even Azure HDInsight is Linux-based.

The whole product is open source, of course, so you are very welcome to take a look at running it on Windows, and even submit pull requests with any necessary changes.