2020.12.18 16:37:20.970111 [ 623 ] {} <Error> Application: Coordination::Exception: All connection tries failed while connecting to ZooKeeper. nodes: 10.253.9.3:2181
Code: 33, e.displayText() = DB::Exception: Cannot read all data.
Bytes read: 0. Bytes expected: 4.: while receiving handshake from ZooKeeper (version 20.8.9.6 (official build)), 10.253.9.3:2181
Code: 33, e.displayText() = DB::Exception: Cannot read all data.
Bytes read: 0. Bytes expected: 4.: while receiving handshake from ZooKeeper (version 20.8.9.6 (official build)), 10.253.9.3:2181
Code: 33, e.displayText() = DB::Exception: Cannot read all data.
Bytes read: 0. Bytes expected: 4.: while receiving handshake from ZooKeeper (version 20.8.9.6 (official build)), 10.253.9.3:2181
(Connection loss): Cannot attach table `csplogs`.`stat_load_log_row_shard` from metadata file /var/lib/clickhouse/metadata/csplogs/stat_load_log_row_shard.sql from query ATTACH TABLE csplogs.stat_load_log_row_shard (`file_date` Date, `load_start` DateTime, `load_end` DateTime, `file_timestamp` DateTime, `file_type` String, `server_role` String, `server_name` String, `file_name` String, `row_count` UInt64) ENGINE = ReplicatedMergeTree('/clickhouse/{cluster}/tables/{shard}/stat_load_log_row', '{replica}', file_date, file_name, 8192): while loading database `csplogs` from path /var/lib/clickhouse/metadata/csplogs
2020.12.18 16:37:20.970126 [ 623 ] {} <Information> Application: shutting down
2020.12.18 16:37:20.970129 [ 623 ] {} <Debug> Application: Uninitializing subsystem: Logging Subsystem
2020.12.18 16:37:20.970168 [ 631 ] {} <Trace> BaseDaemon: Received signal -2
2020.12.18 16:37:20.970178 [ 631 ] {} <Information> BaseDaemon: Stop SignalListener thread
/etc/init.d/clickhouse-server restart
The next restart is successful.
We saw something similar.
At the same time, zkCli was reporting something like this:
2020-12-05 05:17:00,514 [myid:] - INFO [main-SendThread(127.0.0.1:2181):ClientCnxn$SendThread@852] - Socket connection established to 127.0.0.1/127.0.0.1:2181, initiating session
2020-12-05 05:17:00,516 [myid:] - INFO [main-SendThread(127.0.0.1:2181):ClientCnxn$SendThread@1098] - Unable to read additional data from server sessionid 0x0, likely server has closed socket, closing socket connection and attempting reconnect
And the ZooKeeper logs showed:
2020-12-04 20:44:03,059 [myid:1] - INFO [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2181:NIOServerCnxnFactory@197] - Accepted socket connection from /172.18.49.160:11944
2020-12-04 20:44:03,060 [myid:1] - WARN [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2181:NIOServerCnxn@362] - Exception causing close of session 0x0 due to java.io.IOException: ZooKeeperServer not running
AFAIK it was some kind of ZooKeeper misbehavior related to system/network reconfiguration. Unfortunately, I have no details.
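If it helps anyone hitting the same thing: since the ZooKeeper side accepted the socket but logged "ZooKeeperServer not running", one possible workaround is to wait until ZooKeeper actually serves requests before (re)starting clickhouse-server. Below is a rough sketch of such a check, not something taken from ClickHouse itself. It assumes the `srvr` four-letter command is enabled on the ZooKeeper node (on newer ZooKeeper versions this is controlled by `4lw.commands.whitelist`), and the helper names `is_serving`, `zk_four_letter` and `wait_for_zookeeper` are made up for illustration.

```python
import socket
import time

# Reply ZooKeeper gives to four-letter commands while it is not serving clients.
NOT_SERVING = "This ZooKeeper instance is not currently serving requests"


def is_serving(reply: str) -> bool:
    """Return True if a `srvr` reply looks like it came from a serving node."""
    if NOT_SERVING in reply:
        return False
    # A serving node reports its role, e.g. "Mode: standalone" or "Mode: follower".
    return any(line.startswith("Mode:") for line in reply.splitlines())


def zk_four_letter(host: str, port: int, cmd: str = "srvr", timeout: float = 2.0) -> str:
    """Send a four-letter command and return the raw reply ('' on socket errors)."""
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            sock.sendall(cmd.encode("ascii"))
            chunks = []
            while True:
                data = sock.recv(4096)
                if not data:  # server closes the connection after replying
                    break
                chunks.append(data)
            return b"".join(chunks).decode("utf-8", errors="replace")
    except OSError:  # refused, reset, timed out, ...
        return ""


def wait_for_zookeeper(host, port, attempts=30, delay=1.0,
                       probe=zk_four_letter, sleep=time.sleep) -> bool:
    """Poll until the node reports that it is serving, or give up."""
    for _ in range(attempts):
        if is_serving(probe(host, port)):
            return True
        sleep(delay)
    return False


if __name__ == "__main__":
    # Address taken from the ClickHouse error above; adjust to your setup.
    ok = wait_for_zookeeper("10.253.9.3", 2181)
    raise SystemExit(0 if ok else 1)
```

Run from an init script ahead of `/etc/init.d/clickhouse-server restart`, the exit code can gate the restart. The check uses `srvr` rather than `ruok` because `ruok` only tells you that the process is up and bound to the port, not that it is serving clients.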