在导入数据时,nebula-storaged 异常挂机

  • nebula 版本:3.20
  • 部署方式:分布式
  • 安装方式:RPM
  • 是否为线上版本:Y
  • 硬件信息
    • 磁盘( 推荐使用 SSD)
    • CPU、内存信息 32核 128G
  • 问题的具体描述
  • 相关的 meta / storage / graph info 日志信息
gdb /usr/local/nebula/bin/nebula-storaged core.22389
GNU gdb (GDB) Red Hat Enterprise Linux 7.6.1-120.el7
Copyright (C) 2013 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.  Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64-redhat-linux-gnu".
For bug reporting instructions, please see:
<http://www.gnu.org/software/gdb/bugs/>...
Reading symbols from /usr/local/nebula/bin/nebula-storaged...(no debugging symbols found)...done.
[New LWP 22449]
[New LWP 22468]
[New LWP 22476]
[New LWP 22510]
[New LWP 22527]
[New LWP 22477]
[New LWP 22404]
[New LWP 22450]
[New LWP 22519]
[New LWP 22545]
[New LWP 22393]
[New LWP 22555]
[New LWP 22473]
[New LWP 22544]
[New LWP 22558]
[New LWP 22466]
[New LWP 22458]
[New LWP 22478]
[New LWP 22529]
[New LWP 22479]
[New LWP 23136]
[New LWP 22536]
[New LWP 22521]
[New LWP 22550]
[New LWP 23140]
[New LWP 22465]
[New LWP 23168]
[New LWP 22455]
[New LWP 22514]
[New LWP 23138]
[New LWP 23149]
[New LWP 22559]
[New LWP 22452]
[New LWP 22482]
[New LWP 22537]
[New LWP 23135]
[New LWP 22553]
[New LWP 22402]
[New LWP 23108]
[New LWP 23116]
[New LWP 22551]
[New LWP 22437]
[New LWP 22472]
[New LWP 22523]
[New LWP 23111]
[New LWP 23152]
[New LWP 23171]
[New LWP 23153]
[New LWP 22528]
[New LWP 23170]
[New LWP 22557]
[New LWP 23109]
[New LWP 22400]
[New LWP 39536]
[New LWP 23112]
[New LWP 22539]
[New LWP 22445]
[New LWP 23107]
[New LWP 23141]
[New LWP 23156]
[New LWP 22583]
[New LWP 22428]
[New LWP 23145]
[New LWP 23110]
[New LWP 22461]
[New LWP 23174]
[New LWP 22394]
[New LWP 23142]
[New LWP 36486]
[New LWP 22552]
[New LWP 22460]
[New LWP 23162]
[New LWP 23165]
[New LWP 23169]
[New LWP 22517]
[New LWP 22396]
[New LWP 23143]
[New LWP 22549]
[New LWP 22546]
[New LWP 39535]
[New LWP 22421]
[New LWP 23178]
[New LWP 22447]
[New LWP 23155]
[New LWP 23151]
[New LWP 36485]
[New LWP 23157]
[New LWP 36484]
[New LWP 36488]
[New LWP 22556]
[New LWP 22894]
[New LWP 23172]
[New LWP 22554]
[New LWP 22422]
[New LWP 22397]
[New LWP 22406]
[New LWP 22401]
[New LWP 23147]
[New LWP 43357]
[New LWP 22427]
[New LWP 23115]
[New LWP 23177]
[New LWP 23144]
[New LWP 22417]
[New LWP 23134]
[New LWP 23159]
[New LWP 22535]
[New LWP 22410]
[New LWP 22548]
[New LWP 23137]
[New LWP 23166]
[New LWP 22530]
[New LWP 23139]
[New LWP 23114]
[New LWP 23175]
[New LWP 23173]
[New LWP 23176]
[New LWP 22531]
[New LWP 23150]
[New LWP 23154]
[New LWP 23161]
[New LWP 23163]
[New LWP 43386]
[New LWP 23113]
[New LWP 22408]
[New LWP 22439]
[New LWP 22436]
[New LWP 22415]
[New LWP 23167]
[New LWP 22414]
[New LWP 39537]
[New LWP 22418]
[New LWP 22426]
[New LWP 36487]
[New LWP 23164]
[New LWP 22389]
[New LWP 23146]
[New LWP 23148]
[New LWP 22448]
[New LWP 22420]
[New LWP 23160]
[New LWP 23158]
[New LWP 22431]
[New LWP 22433]
[New LWP 22434]
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib64/libthread_db.so.1".
Core was generated by `/usr/local/nebula/bin/nebula-storaged --flagfile /usr/local/nebula/etc/nebula-s'.
Program terminated with signal 6, Aborted.
#0  0x00007fa8291ff387 in raise () from /lib64/libc.so.6
Missing separate debuginfos, use: debuginfo-install nebula-graph-3.2.0-1.x86_64
(gdb) bt
#0  0x00007fa8291ff387 in raise () from /lib64/libc.so.6
#1  0x00007fa829200a78 in abort () from /lib64/libc.so.6
#2  0x000000000256310a in ?? ()
#3  0x0000000002566204 in ?? ()
#4  0x0000000002562bd9 in ?? ()
#5  0x00000000025668b9 in ?? ()
#6  0x0000000001befb62 in nebula::raftex::RaftPart::processAppendLogResponses(std::vector<std::pair<unsigned long, nebula::raftex::cpp2::AppendLogResponse>, std::allocator<std::pair<unsigned long, nebula::raftex::cpp2::AppendLogResponse> > > const&, folly::EventBase*, nebula::raftex::AppendLogsIterator, long, long, long, long, long, std::vector<std::shared_ptr<nebula::raftex::Host>, std::allocator<std::shared_ptr<nebula::raftex::Host> > >) ()
#7  0x0000000001bf0217 in ?? ()
#8  0x0000000001bf0807 in ?? ()
#9  0x000000000249acac in ?? ()
#10 0x00000000020ba2d7 in virtual thunk to apache::thrift::concurrency::FunctionRunner::run() ()
#11 0x0000000002216e08 in apache::thrift::concurrency::ThreadManager::Impl::Worker::run() ()
#12 0x0000000002218f0e in apache::thrift::concurrency::PthreadThread::threadMain(void*) ()
#13 0x00007fa82959eea5 in start_thread () from /lib64/libpthread.so.0
#14 0x00007fa8292c7b0d in clone () from /lib64/libc.so.6
1 个赞

我可能需要点日志和debug信息 这个栈看不太出来
可以开trace_raft=true试试能不能复现(会打印很多额外信息出来,小心爆了)

线上不太能操作,看起来是raft log日志有问题

你看看是不是盘满了

这个没有的,磁盘还空闲50%

那不太好说,我得需要点日志,最近家里没发现有这个问题,回头在关注下

此话题已在最后回复的 30 天后被自动关闭。不再允许新回复。