I have a requirement to update a job parameter via the REST API. I get the job details via a GET call and modify the content in code, but I am not sure how to pass the modified JSON content from a variable or file in the REST API POST call. I need to run the API call as an automated batch with no manual intervention. I tried the two approaches below, but neither worked, and I don't see any other viable option for passing the modified content in the curl command.

var1=`cat /home/script/modified.json`

curl -X POST https://XXX.hub.streamsets.com/jobrunner/rest/v1/job/72b8200c-0e3b-426e-b564-bceb17220b1e:8c2c652f-e3d9-11eb-9fb3-b974ac4c3f67 -d '{ @var1}' -H "Content-Type:application/json" -H "X-Requested-By:curl" -H "X-SS-REST-CALL:true" -H "X-SS-App-Component-Id: $CRED_ID" -H "X-SS-App-Auth-Token: $CRED_TOKEN" -i

curl -X POST https://XXX.hub.streamsets.com/jobrunner/rest/v1/job/72b8200c-0e3b-426e-b564-bceb17220b1e:8c2c652f-e3d9-11eb-9fb3-b974ac4c3f67 -d '{ /home/script/modified.json}' -H "Content-Type:application/json" -H "X-R
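A minimal sketch of one common way to do this, assuming the job ID, credential variables, and file path from the question above: curl can read the request body directly from a file with its @ syntax, so the modified JSON never has to be spliced into the command line.

#!/bin/bash
# Hypothetical batch step: POST the locally modified job definition back to Control Hub.
# CRED_ID and CRED_TOKEN are assumed to be exported by the surrounding script.
JOB_ID="72b8200c-0e3b-426e-b564-bceb17220b1e:8c2c652f-e3d9-11eb-9fb3-b974ac4c3f67"

curl -X POST "https://XXX.hub.streamsets.com/jobrunner/rest/v1/job/${JOB_ID}" \
  -H "Content-Type: application/json" \
  -H "X-Requested-By: curl" \
  -H "X-SS-REST-CALL: true" \
  -H "X-SS-App-Component-Id: ${CRED_ID}" \
  -H "X-SS-App-Auth-Token: ${CRED_TOKEN}" \
  --data-binary @/home/script/modified.json \
  -i

If the JSON is already in a shell variable, the equivalent is -d "$var1", with the variable double-quoted so the content is passed through unchanged.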
We are standing up a new Control Hub 3.75 with Data Collector (DC) versions 3.18 and 5.9.1 running in parallel on the new Control Hub. We are able to ingest data on DC 3.18 with no issues. However, we are receiving the following error on DC 5.9.1: "Datacollector 'https://streamsets:18636' has pipeline in status 'STARTING_ERROR'; message is 'JDBC_09 - Could not retrieve information regarding the columns for table 'SECTION_SCHEDULES''". We have run the DDL (Start Up) job to populate the schema in AWS S3 with no errors on DC 3.18 and DC 5.9.1. The job fails when running DC 5.9.1 against the JenzEx and Colleague Student Information Systems. If anyone has come across something that would help in troubleshooting this error, it would be greatly appreciated.
I'm having trouble opening a pipeline in StreamSets. Everything else in the software works, such as settings. But when trying to open a pipeline to edit it, for example, it only shows the message "loading". The log messages do not indicate the likely problem.
How do I create a pipeline in StreamSets for XML data read from SFTP? We are getting only one record from SFTP in the preview data.
Hello, I am creating the simplest possible pipeline: a query (JDBC Query Consumer) whose output is written to files with Local FS. I have split the output into files of 200k records each, but this job never stops running and keeps writing file after file of what I assume is duplicate data. I cannot for the life of me figure out what I did wrong. Any insight would be greatly appreciated! I have created complex pipelines in the past, so this is strange.
Hi Team, I generate a Credential ID and Token via the StreamSets UI → Manage → API Credentials, and with the generated ID and Token I can run a curl command; the returned status is "HTTP/1.1 200 OK", and it also returns JSON showing my organization ID, email, etc. However, when I try to connect with the code snippet below (using the same Credential ID and Token), it returns a 403 error. Code and error pasted below:

>>> from streamsets.sdk import ControlHub
>>> sch = ControlHub(credential_id='absd_myid', token="abcd_mytoken.")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/opt/module/sfsdcvenv/lib/python3.8/site-packages/streamsets/sdk/sch.py", line 141, in __init__
    self.api_client = sch_api.ApiClient(
  File "/opt/module/sfsdcvenv/lib/python3.8/site-packages/streamsets/sdk/sch_api.py", line 96, in __init__
    raise ValueError('Encountered error while decoding auth token: {}'.format(e))
ValueError: Encountered error while decoding aut
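For reference, a sketch of the kind of curl check being described, assuming the same CRED_ID/CRED_TOKEN variables and the security app's currentUser endpoint on the na01 instance (both assumptions; substitute your own Control Hub URL and endpoint):

# Hypothetical check: the API credential pair is passed as request headers,
# and a 200 response returns the current user's organization ID and email.
curl -X GET "https://na01.hub.streamsets.com/security/rest/v1/currentUser" \
  -H "Content-Type: application/json" \
  -H "X-Requested-By: curl" \
  -H "X-SS-REST-CALL: true" \
  -H "X-SS-App-Component-Id: ${CRED_ID}" \
  -H "X-SS-App-Auth-Token: ${CRED_TOKEN}" \
  -i

The SDK takes the same pair directly as ControlHub(credential_id=..., token=...); the "error while decoding auth token" message suggests the token string reaching the constructor may not be identical to the one curl was given, so comparing the two values character by character is a reasonable first check.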
Hi team, we have a pipeline with a Stop Event configured; it runs a SQL statement once the pipeline finishes processing data. However, we found that when the pipeline fails for some reason, this Stop Event still runs, which is not what we expect. Can you please let me know if there is somewhere this can be configured so that the pipeline Stop Event does not run when the pipeline fails while processing data? I found that if the pipeline Start Event fails, the pipeline will not run, which is expected; however, the Stop Event always runs even when the pipeline fails.
The HTTP Client in my pipeline is not processing all of the input records that it receives. For example, the input to the HTTP Client is 1430 records, but the output records processed by the same stage are only 1360, with 0 error records. I am not sure if I am missing some configuration that would balance the input and output record counts and send the failing records to the error stage.
Hello there, I am trying to solve a very specific use case. I am querying a DB using an API, and this API takes multiple query parameters. One of the query parameters is ids, of which there are more than 100. The catch is that I can't pass all 100 ids as an array in the API call, because the API is not designed to accept an array for that parameter. So it has to loop over those 100 ids one by one, calling the API with a new id as the parameter value in each iteration. Also, these IDs need to be fetched from a Snowflake table and then passed as the parameter to the API call, so I am thinking of using a Snowflake or JDBC Query Consumer as the origin. These IDs will also increase over time, so I want to make this as dynamic as possible, but that is not the priority for now. Having multiple jobs to solve this would lead to 100+ jobs, and that number would keep increasing, which is not good practice at all. Could someone please suggest the best possi
Hi, in my pipeline I have a Stream Selector stage. I want to parameterize it and use the following expression for the condition:

${record:value('/rating_text') == '${pipeline_rating_text}'}

Here, pipeline_rating_text is a parameter that I have defined for my pipeline. The problem is that when I run the pipeline, it does not work. If I use this expression instead, everything is fine:

${record:value('/rating_text') == 'Excellent'}

Can somebody help me, please?
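A sketch of the commonly suggested form, assuming the cause is the nested ${...} (an assumption, not a confirmed diagnosis): a parameter can be referenced by name inside an expression rather than wrapped in a second ${...}, so the condition becomes:

${record:value('/rating_text') == pipeline_rating_text}

where pipeline_rating_text is the parameter defined on the pipeline.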
Hi, I have a JDBC connection for a database that is located in a Docker container on my local machine. This connection works perfectly when I build a Data Collector pipeline. I installed a Transformer engine in a Docker container on my local machine (with the external JDBC libraries installed). Then I made a very simple pipeline to read from my database using this JDBC connection. I constantly get this error message: "[JDBC Table 1] Cannot connect to specified database: Communications link failure. The last packet sent successfully to the server was 0 milliseconds ago. The driver has not received any packets from the server. (JDBC_00)". Can anyone help me with solving the problem?
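A minimal sketch of the kind of setup this usually needs, assuming the failure is the common container-to-container networking issue and using hypothetical container names db and transformer (neither appears in the question):

# Put both containers on a shared user-defined Docker network so they can
# reach each other by container name instead of localhost.
docker network create streamsets-net
docker network connect streamsets-net db
docker network connect streamsets-net transformer
# The JDBC URL configured in the Transformer pipeline would then point at the
# container name, e.g. jdbc:mysql://db:3306/mydb, rather than localhost.

On Docker Desktop, host.docker.internal is another common way for a container to reach a database whose port is published on the host.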
Dears, I have configured local Docker images for StreamSets and Kafka to try a simple Kafka connection, but I'm receiving the following:

Http failure response for https://na01.hub.streamsets.com/tunneling/rest/660c2f92-c396-4322-a9ea-cd73758897a1/rest/v1/pipeline/dynamicPreview?TUNNELING_INSTANCE_ID=tunneling-1: 500 OK

The Kafka image names are:

bash-3.2$ docker-compose ps
NAME        IMAGE                    COMMAND                  SERVICE     CREATED          STATUS          PORTS
kafka       wurstmeister/kafka       "start-kafka.sh"         kafka       27 minutes ago   Up 27 minutes   0.0.0.0:9092->9092/tcp
zookeeper   wurstmeister/zookeeper   "/bin/sh -c '/usr/sb…"   zookeeper   27 minutes ago   Up 27 minutes   22/tcp, 2888/tcp, 3888/tcp, 0.0.0.0:2181->2181/tcp

I try the Test Connection with kafka:9092 or localhost:9092 and receive the Http failure response for https://na01.hub.streamsets.com/tunneling