This is a continuation of previous work (klog 19671).
The job that receives MDC alerts via igwn-alert failed for some reason. The job ran during the following periods:
(1) 2/10 18:00 ~ 2/14 12:00 (3 days 18 hours)
(2) 2/16 23:33 ~ 2/18 19:33 (1 day 20 hours)
(3) 2/19 16:33 ~ (still running)
Error log:
Traceback (most recent call last):
File "/home/controls/bin/miniconda2/envs/o4_dqr_proto_4/lib/python3.8/site-packages/igwn_alert/client.py", line 224, in listen
for payload, metadata in s.read(metadata=True,
File "/home/controls/bin/miniconda2/envs/o4_dqr_proto_4/lib/python3.8/site-packages/hop/io.py", line 313, in read
for message in self._consumer.stream(autocommit=autocommit, **kwargs):
File "/home/controls/bin/miniconda2/envs/o4_dqr_proto_4/lib/python3.8/site-packages/adc/consumer.py", line 120, in _stream_forever
messages = self._consumer.consume(batch_size, batch_timeout.total_seconds())
File "/home/controls/bin/miniconda2/envs/o4_dqr_proto_4/lib/python3.8/site-packages/adc/errors.py", line 22, in log_client_errors
raise(KafkaException.from_kafka_error(kafka_error))
adc.errors.KafkaException: Error communicating with Kafka: code=_TIMED_OUT sasl_ssl://kb-1.prod.hop.scimma.org:9092/1: 1 request(s) timed out: disconnect (after 193924539ms in state UP)
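
Since the exception propagates out of the listen call and ends the job, one possible workaround is to wrap the listener in a restart loop. The following is only a minimal sketch, not what the job currently does: run_with_restart, listen_fn, max_restarts and wait_seconds are hypothetical names introduced here, and listen_fn stands for whatever call starts the igwn-alert listener in the actual job.

```python
import time


def run_with_restart(listen_fn, max_restarts=5, wait_seconds=60.0, sleep=time.sleep):
    """Call listen_fn() and restart it whenever it raises an exception.

    Hypothetical helper (not part of igwn_alert, hop or adc).
    Returns the number of restarts performed if listen_fn() returns normally.
    Re-raises the last exception once max_restarts restarts have been used up.
    """
    restarts = 0
    while True:
        try:
            listen_fn()
            return restarts
        except Exception as err:  # would include adc.errors.KafkaException
            if restarts >= max_restarts:
                raise
            restarts += 1
            print(f"listener stopped ({err!r}); "
                  f"restart {restarts}/{max_restarts} in {wait_seconds} s")
            sleep(wait_seconds)
```

Catching Exception rather than BaseException means Ctrl-C (KeyboardInterrupt) still stops the job immediately. Capping the number of restarts keeps a persistent failure, such as invalid credentials, from looping forever without being noticed.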