PostgreSQL CDC Client loses some data

asked 2020-01-09 06:29:00 -0500

shixinbao

updated 2020-01-09 20:14:26 -0500

Hello, I use SDC (v3.9.1) with the PostgreSQL CDC Client origin, using the default parameters, to collect change data from PostgreSQL. When PostgreSQL generates a large number of log records in a short time, the origin loses some data in the interval between batches. At first I thought that wal2json was not emitting the changes, so I used Debezium and StreamSets to collect data from the same database at the same time. The results showed that Debezium collected all data change records, while StreamSets lost some. Interestingly, I found that the missing data was generated between batches.
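For anyone who wants to repeat this kind of comparison, it could be sketched roughly as follows. This is only a minimal illustration: the record fields `lsn` and `ts`, the sample values, and the batch windows are all assumptions made up for the example, not the actual output format of Debezium or StreamSets.

```python
def find_missing(reference, candidate):
    """Return records that appear in `reference` but not in `candidate`.

    Records are matched on a hypothetical "lsn" field (assumed to be
    unique per change record in this sketch).
    """
    seen = {rec["lsn"] for rec in candidate}
    return [rec for rec in reference if rec["lsn"] not in seen]


def in_batch_gap(ts, batch_windows):
    """True if timestamp `ts` lies outside every (start, end) batch window."""
    return all(not (start <= ts <= end) for start, end in batch_windows)


# Hypothetical sample data: what each tool captured from the same database.
debezium_records = [
    {"lsn": "0/16B3748", "ts": 10.0},
    {"lsn": "0/16B3790", "ts": 12.5},
    {"lsn": "0/16B37D8", "ts": 14.0},
]
streamsets_records = [
    {"lsn": "0/16B3748", "ts": 10.0},
    {"lsn": "0/16B37D8", "ts": 14.0},
]

# Hypothetical (start, end) times of two consecutive batches.
batch_windows = [(9.0, 11.0), (13.0, 15.0)]

missing = find_missing(debezium_records, streamsets_records)
for rec in missing:
    print(rec["lsn"], "in gap:", in_batch_gap(rec["ts"], batch_windows))
```

If every missing record reports `in gap: True`, that would be consistent with changes being dropped between batches rather than never being emitted by wal2json.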

It seems that the "query timeout" setting doesn't work.


1 Answer

answered 2020-01-10 15:51:54 -0500

metadaddy

Please file a P1 bug at with as much detail as possible.

Hello Pat! I have already raised the issue. I think this is a very serious bug, and it is bothering us. I hope you can solve it as soon as possible.

shixinbao ( 2020-01-13 05:40:35 -0500 )
