扩容失败

  • nebula 版本:2.6.1
  • 部署方式:分布式
  • 安装方式:RPM
  • 是否为线上版本:Y
  • 硬件信息
    • 磁盘 SSD
    • CPU、内存信息 32C 128G
旧的storaged
E0701 10:20:03.451078  9599 WalFileIterator.cpp:30] [Port: 9780, Space: 6, Part: 2] The given log id 1 is out of the range, the wal firstLogId is 100744225
E0701 10:20:03.451082  9603 WalFileIterator.cpp:30] [Port: 9780, Space: 6, Part: 1] The given log id 1 is out of the range, the wal firstLogId is 100567485
E0701 10:20:03.451041  9598 WalFileIterator.cpp:30] [Port: 9780, Space: 5, Part: 4] The given log id 1 is out of the range, the wal firstLogId is 10073329
E0701 10:20:03.451155  9604 WalFileIterator.cpp:30] [Port: 9780, Space: 1, Part: 2] The given log id 1 is out of the range, the wal firstLogId is 533127095
E0701 10:20:03.451166  9611 WalFileIterator.cpp:30] [Port: 9780, Space: 6, Part: 5] The given log id 1 is out of the range, the wal firstLogId is 100725310
E0701 10:20:03.451359  9599 WalFileIterator.cpp:30] [Port: 9780, Space: 5, Part: 11] The given log id 1 is out of the range, the wal firstLogId is 10163191
E0701 10:20:03.451396  9611 WalFileIterator.cpp:30] [Port: 9780, Space: 5, Part: 5] The given log id 1 is out of the range, the wal firstLogId is 10265225
E0701 10:20:03.451417  9604 WalFileIterator.cpp:30] [Port: 9780, Space: 5, Part: 8] The given log id 1 is out of the range, the wal firstLogId is 10279013

新加入storaged
I0701 07:41:44.148319 39687 Part.cpp:458] [Port: 9780, Space: 5, Part: 9] Clean rocksdb part data
I0701 07:41:44.177561 39685 RaftPart.cpp:1200] [Port: 9780, Space: 1, Part: 3] Clean up the snapshot
I0701 07:41:44.177579 39685 RaftPart.cpp:1220] [Port: 9780, Space: 1, Part: 3] Clean up the snapshot
I0701 07:41:44.178150 39685 FileBasedWal.cpp:634] Removing /data1/nebula/data/storage/nebula/1/wal/3/0000000000528490122.wal
I0701 07:41:44.178200 39685 Part.cpp:458] [Port: 9780, Space: 1, Part: 3] Clean rocksdb part data

从日志上看,是WAL日志没了。

能否有更完整的日志,包括meta,storaged的?单从贴上来的日志只能判断有space被remove了。

E0701 14:20:56.033053  9604 Host.cpp:348] [Port: 9780, Space: 5, Part: 5] [Host: 192.168.1.184:9780] Failed to append logs to the host (Err: E_UNKNOWN_PART)
E0701 14:37:06.893186  9608 Host.cpp:348] [Port: 9780, Space: 5, Part: 5] [Host: 192.168.1.184:9780] Failed to append logs to the host (Err: E_UNKNOWN_PART)
E0701 15:16:48.244459  9607 Host.cpp:348] [Port: 9780, Space: 5, Part: 5] [Host: 192.168.1.184:9780] Failed to append logs to the host (Err: E_UNKNOWN_PART)

是否wal_ttl设的太小

此话题已在最后回复的 30 天后被自动关闭。不再允许新回复。