nebula3.0 fails to start

I've made the change, but it still doesn't work.

E20220301 13:51:24.924636 11690 ActiveHostsMan.cpp:303] Get last update time failed, error: E_KEY_NOT_FOUND
I20220301 13:51:26.088774 11690 HBProcessor.cpp:33] Receive heartbeat from "hg45":9669, role = GRAPH
E20220301 13:51:26.091428 11690 ActiveHostsMan.cpp:303] Get last update time failed, error: E_KEY_NOT_FOUND

Have you run add hosts "hg46":9779? Could you paste the latest show hosts output again?

Now it starts sometimes and fails to start at other times.

Could you paste the logs from a time when it fails to start? That would help us troubleshoot.

I ran nebula.service status all, and storaged shows up in red.
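It was actually fine all along; I just hadn't run add hosts.
Don't you think having to add hosts manually is really user-hostile?
Does a single-node deployment need add hosts too?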

Oh, I see. There's a bug here that will be fixed soon; there is already a PR for it.

See this:
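Currently, add hosts affects meta's ready response, so storage keeps waiting and does not finish starting until add hosts has been run. If you check the storage port during that time, it shows red.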

The fix PR is as follows:

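As for add hosts being a backward-incompatible change in 3.0.0, we hope you understand. We may later support automatic registration through a parameter to make this easier to use.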
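For anyone scripting the registration step, here is a minimal sketch that builds the statement in the same form as the add hosts "hg46":9779 quoted earlier in this thread. The helper name build_add_hosts is hypothetical, and joining several hosts with commas is an assumption to check against the nGQL documentation for your version. The script only produces the statement text; you still have to run it yourself in nebula-console.

```python
def build_add_hosts(hosts):
    """Build an ADD HOSTS statement from (hostname, port) pairs.

    Hypothetical helper: it only formats text and never contacts a cluster.
    """
    if not hosts:
        raise ValueError("at least one host is required")
    parts = []
    for name, port in hosts:
        port = int(port)
        if not 0 < port < 65536:
            raise ValueError(f"invalid port: {port}")
        # Hostnames are double-quoted, as in the statement quoted in the thread.
        parts.append(f'"{name}":{port}')
    # Assumption: multiple hosts are separated by commas in one statement.
    return "ADD HOSTS " + ", ".join(parts) + ";"


# Hostname and storaged port taken from the thread.
print(build_add_hosts([("hg46", 9779)]))
```

After running the generated statement, show hosts is the way to confirm whether the host was registered.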


I have the same problem. Honestly, I'm very disappointed.

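Reading this, I'm really disappointed. So it was a bug after all. To me this isn't bug hunting; it's more like being bitten by mosquitoes until you're covered in welts. I'm in the same situation as the original poster. How could such a basic bug be released? That is irresponsible. We set aside working hours to trial nebula 3.0, and this bug cost us half a day while we kept looking for a cause on our own machines. nebula 3.0 has lowered my expectations of you a great deal. How did such a basic bug get through? Does your own testing take no responsibility?

If this PR has already been tested, I expect you to immediately replace the RPM packages of the buggy release.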

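We sincerely apologize for the inconvenience! We will fix the problem as soon as possible and release a 3.0.1 package. Going forward we will step up testing to give the community more stable releases, and to reduce and avoid the lost efficiency and time that this kind of unnecessary troubleshooting causes.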


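@SuperYoko My understanding is that this issue makes the service status flicker before add hosts is run, but that it should not affect the service after add hosts, right?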

Now @Veggie has found that the host is still offline after add hosts. Is that expected?

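  1. Yes, the status should be normal after add hosts.
  2. The host goes online only after the storage service has started normally. If it has not, try restarting the storage service after add hosts: if meta goes a long time without responding ready, storage may exceed its wait time. If there are other problems, please paste the relevant logs and we can look at them together.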
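The "wait, then restart if it never comes up" step in point 2 can be sketched as a simple port poll. This is only a sketch under assumptions: the helper names, the 60-second deadline, and the 2-second interval are hypothetical, since the thread does not say how long storage actually waits. A port that accepts connections only shows that storaged is listening; show hosts remains the real check for online status.

```python
import socket
import time


def port_is_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def wait_for_port(host, port, deadline_s=60.0, interval_s=2.0):
    """Poll host:port until it accepts connections or deadline_s elapses.

    Returns True as soon as a connection succeeds, False on timeout.
    """
    end = time.monotonic() + deadline_s
    while True:
        if port_is_open(host, port):
            return True
        if time.monotonic() >= end:
            return False
        time.sleep(interval_s)


if __name__ == "__main__":
    # "hg46" and 9779 are the storaged host and port mentioned in the thread.
    if wait_for_port("hg46", 9779):
        print("storaged port is accepting connections; now check show hosts")
    else:
        print("port still closed; consider restarting storaged after add hosts")
```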

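Thanks @SuperYoko. We confirmed offline that the cause was a missing cluster.id file, and that it had not been deleted by hand. Could this small issue affect when the file gets created?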

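In theory it shouldn't. The documentation colleague and I went through this same flow when we tested the issue and added an FAQ entry to the docs.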

That's odd. cluster.id cannot be created; I cleaned everything up and reinstalled, and it's the same after add hosts.

Generally, how long should one wait after add hosts?

It just occurred to me: which IP range was your storaged configured with yesterday? Could there be a firewall or security group issue?

@SuperYoko
While reproducing @Veggie's problem, I found an issue on aliyun CentOS: storaged's cluster.id is not generated. The configuration is the unmodified default, on 3.0.0.

I'll send you the connection details offline. Could you take a look?

(root@nebula) [(none)]> show hosts
+------+------+--------+--------------+---------------------+------------------------+---------+
| Host | Port | Status | Leader count | Leader distribution | Partition distribution | Version |
+------+------+--------+--------------+---------------------+------------------------+---------+
+------+------+--------+--------------+---------------------+------------------------+---------+
Empty set (time spent 579/861 us)

Fri, 11 Mar 2022 10:02:16 CST

(root@nebula) [(none)]> 

Bye root!
Fri, 11 Mar 2022 10:02:18 CST

[root@nebula-aliyun nebula]# tail logs/*storage*
==> logs/nebula-storaged.ERROR <==
E20220311 10:00:48.757405  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:00:58.758133  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:01:08.758744  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:01:18.759363  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:01:28.759989  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:01:38.760632  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:01:48.761250  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:01:58.761898  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:02:08.762518  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:02:18.763087  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!

==> logs/nebula-storaged.INFO <==
I20220311 10:01:48.761276  1661 MetaClient.cpp:133] Waiting for the metad to be ready!
W20220311 10:01:58.761387  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:01:58.761898  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
I20220311 10:01:58.761924  1661 MetaClient.cpp:133] Waiting for the metad to be ready!
W20220311 10:02:08.762028  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:02:08.762518  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
I20220311 10:02:08.762544  1661 MetaClient.cpp:133] Waiting for the metad to be ready!
W20220311 10:02:18.762632  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:02:18.763087  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
I20220311 10:02:18.763142  1661 MetaClient.cpp:133] Waiting for the metad to be ready!

==> logs/nebula-storaged.nebula-aliyun.root.log.ERROR.20220311-094418.1661 <==
E20220311 10:00:48.757405  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:00:58.758133  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:01:08.758744  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:01:18.759363  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:01:28.759989  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:01:38.760632  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:01:48.761250  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:01:58.761898  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:02:08.762518  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
E20220311 10:02:18.763087  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!

==> logs/nebula-storaged.nebula-aliyun.root.log.INFO.20220311-094416.1661 <==
I20220311 10:01:48.761276  1661 MetaClient.cpp:133] Waiting for the metad to be ready!
W20220311 10:01:58.761387  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:01:58.761898  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
I20220311 10:01:58.761924  1661 MetaClient.cpp:133] Waiting for the metad to be ready!
W20220311 10:02:08.762028  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:02:08.762518  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
I20220311 10:02:08.762544  1661 MetaClient.cpp:133] Waiting for the metad to be ready!
W20220311 10:02:18.762632  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:02:18.763087  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
I20220311 10:02:18.763142  1661 MetaClient.cpp:133] Waiting for the metad to be ready!

==> logs/nebula-storaged.nebula-aliyun.root.log.WARNING.20220311-094418.1661 <==
W20220311 10:01:38.760131  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:01:38.760632  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
W20220311 10:01:48.760751  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:01:48.761250  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
W20220311 10:01:58.761387  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:01:58.761898  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
W20220311 10:02:08.762028  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:02:08.762518  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
W20220311 10:02:18.762632  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:02:18.763087  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!

==> logs/nebula-storaged.WARNING <==
W20220311 10:01:38.760131  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:01:38.760632  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
W20220311 10:01:48.760751  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:01:48.761250  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
W20220311 10:01:58.761387  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:01:58.761898  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
W20220311 10:02:08.762028  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:02:08.762518  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!
W20220311 10:02:18.762632  1661 FileBasedClusterIdMan.cpp:43] Open file failed, error No such file or directory
E20220311 10:02:18.763087  1661 MetaClient.cpp:102] Heartbeat failed, status:Machine not existed!

==> logs/storaged-stderr.log <==

==> logs/storaged-stdout.log <==
[root@nebula-aliyun nebula]# ls
bin  data  etc  logs  nebula-console-linux-amd64-v3.0.0  pids  scripts  share
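Since the tail output above repeats the same few messages across several files, a small summarizer can make the pattern easier to see. This is a sketch only: the regular expression assumes the glog-style prefix visible in the pasted lines, and the two hints just restate what this thread reports (an empty show hosts result alongside "Machine not existed", and a missing cluster.id alongside "Open file failed"). They are not an official diagnosis, and all names are hypothetical.

```python
import re
from collections import Counter

# Prefix as seen in the pasted logs: severity letter, yyyymmdd, time,
# thread id, source file:line, then the message.
LINE_RE = re.compile(
    r"^(?P<sev>[IWEF])(?P<date>\d{8}) (?P<time>\d{2}:\d{2}:\d{2}\.\d+)\s+"
    r"(?P<tid>\d+) (?P<src>[\w.]+:\d+)\] (?P<msg>.*)$"
)


def summarize(lines):
    """Count occurrences of each (severity, source, message) triple.

    Lines that do not match the prefix (such as tail's "==> file <==") are skipped.
    """
    counts = Counter()
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m:
            counts[(m["sev"], m["src"], m["msg"])] += 1
    return counts


def hints(counts):
    """Return hedged hints based only on messages discussed in this thread."""
    msgs = {msg for (_, _, msg) in counts}
    out = []
    if any("Machine not existed" in m for m in msgs):
        out.append("heartbeat rejected: check whether add hosts was run and what show hosts returns")
    if any("Open file failed" in m for m in msgs):
        out.append("a file could not be opened: check whether cluster.id exists")
    return out


if __name__ == "__main__":
    import sys

    # Example: tail logs/*storage* | python summarize_logs.py
    result = summarize(sys.stdin)
    for (sev, src, msg), n in result.most_common():
        print(f"{n:4d}  {sev} {src}  {msg}")
    for hint in hints(result):
        print("hint:", hint)
```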

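I'm in the same state as you: 9779 never comes up. I've reinstalled several times and even rebuilt the system a few times, and it's still not fixed. Have you solved it on your side?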