Error installing nebula 3.0.0

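  • nebula version: 3.0.0
  • Deployment: distributed
  • Installation method: RPM
  • Production environment: Y / N
  • Hardware
    • Disk: SSD 1T
    • CPU and memory: 48C 256G
  • Detailed description of the problem
    Today I followed the installation docs to set up a 3.0.0 database. It is a distributed deployment on 3 physical machines, each running one metad, one graph, and one storage. The problem appeared as soon as I started the 3 metad processes: after all three were up, a status check showed that one metad kept flapping. The log of that metad is as follows: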
I20220316 11:22:21.697365 23370 RaftPart.cpp:1184] [Port: 9560, Space: 0, Part: 0] Sending out an election request (space = 0, part = 0, term = 8, lastLogId = 0, lastLogTerm = 0, candidateIP = 172.17.141.116, candidatePort = 9560), isPreVote = 1
W20220316 11:22:21.698217 23363 RaftPart.cpp:1122] [Port: 9560, Space: 0, Part: 0] Receive response about askForVote from "172.17.141.118":9560, error code is E_RAFT_TERM_OUT_OF_DATE, isPreVote = 1
I20220316 11:22:21.698267 23363 RaftPart.cpp:1150] [Port: 9560, Space: 0, Part: 0] Did not get enough votes from election of term 8, isPreVote = 1
I20220316 11:22:22.591348 23326 MetaDaemonInit.cpp:116] Leader has not been elected, sleep 1s
I20220316 11:22:22.716485 23367 RaftPart.cpp:1184] [Port: 9560, Space: 0, Part: 0] Sending out an election request (space = 0, part = 0, term = 8, lastLogId = 0, lastLogTerm = 0, candidateIP = 172.17.141.116, candidatePort = 9560), isPreVote = 1
W20220316 11:22:22.717331 23363 RaftPart.cpp:1122] [Port: 9560, Space: 0, Part: 0] Receive response about askForVote from "172.17.141.118":9560, error code is E_RAFT_TERM_OUT_OF_DATE, isPreVote = 1
I20220316 11:22:22.717381 23363 RaftPart.cpp:1150] [Port: 9560, Space: 0, Part: 0] Did not get enough votes from election of term 8, isPreVote = 1
I20220316 11:22:23.591557 23326 MetaDaemonInit.cpp:116] Leader has not been elected, sleep 1s

Are all 3 meta services configured with port 9559?

Yes

Yes, the ports are all 9559

Please paste the config file, :thinking: the election may have gone wrong

########## basics ##########
# Whether to run as a daemon process
--daemonize=true
# The file to host the process id
--pid_file=pids/nebula-metad.pid

########## logging ##########
# The directory to host logging files
--log_dir=logs
# Log level, 0, 1, 2, 3 for INFO, WARNING, ERROR, FATAL respectively
--minloglevel=0
# Verbose log level, 1, 2, 3, 4, the higher of the level, the more verbose of the logging
--v=0
# Maximum seconds to buffer the log messages
--logbufsecs=0
# Whether to redirect stdout and stderr to separate output files
--redirect_stdout=true
# Destination filename of stdout and stderr, which will also reside in log_dir.
--stdout_log_file=metad-stdout.log
--stderr_log_file=metad-stderr.log
# Copy log messages at or above this level to stderr in addition to logfiles. The numbers of severity levels INFO, WARNING, ERROR, and FATAL are 0, 1, 2, and 3, respectively.
--stderrthreshold=2
# wether logging files' name contain time stamp, If Using logrotate to rotate logging files, than should set it to true.
--timestamp_in_logfile_name=true

########## networking ##########
# Comma separated Meta Server addresses
--meta_server_addrs=172.17.141.116:9559,172.17.141.117:9559,172.17.141.118:9559
# Local IP used to identify the nebula-metad process.
# Change it to an address other than loopback if the service is distributed or
# will be accessed remotely.
--local_ip=172.17.141.116
# Meta daemon listening port
--port=9559
# HTTP service ip
--ws_ip=0.0.0.0
# HTTP service port
--ws_http_port=19559
# HTTP2 service port
--ws_h2_port=19560
# Port to listen on Storage with HTTP protocol, it corresponds to ws_http_port in storage's configuration file
--ws_storage_http_port=19779

########## storage ##########
# Root data path, here should be only single path for metad
--data_path=data/meta

########## Misc #########
# The default number of parts when a space is created
--default_parts_num=100
# The default replica factor when a space is created
--default_replica_factor=1

--heartbeat_interval_secs=10
--agent_heartbeat_interval_secs=60

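I noticed that the machine reporting the error is always 116. No matter how I start the services, it is always this machine that gets the error response from 118: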
W20220316 15:53:56.024557 687 RaftPart.cpp:1122] [Port: 9560, Space: 0, Part: 0] Receive response about askForVote from "172.17.141.118":9560, error code is E_RAFT_TERM_OUT_OF_DATE, isPreVote = 1
I20220316 15:53:56.024608 687 RaftPart.cpp:1150] [Port: 9560, Space: 0, Part: 0] Did not get enough votes from election of term 11, isPreVote = 1
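
One way to see which peers actually answer the vote requests is to tally the `askForVote` responses in the metad log. The following is a minimal sketch, assuming the log line format matches the lines pasted in this thread; the function names `tally_vote_responses` and `silent_peers` are made up for illustration and are not part of nebula.

```python
import re
from collections import Counter

# Matches the "Receive response about askForVote" lines shown in this thread.
# Both straight and curly quotes around the IP are accepted.
VOTE_RESPONSE = re.compile(
    r'Receive response about askForVote from ["“]?(?P<ip>[\d.]+)["”]?:(?P<port>\d+), '
    r'error code is (?P<code>\w+)'
)


def tally_vote_responses(log_text):
    """Count askForVote responses per (peer address, error code)."""
    counts = Counter()
    for line in log_text.splitlines():
        m = VOTE_RESPONSE.search(line)
        if m:
            counts[(f"{m['ip']}:{m['port']}", m['code'])] += 1
    return counts


def silent_peers(log_text, expected_peers):
    """Return the expected peers that never show up as a responder."""
    seen = {peer for peer, _ in tally_vote_responses(log_text)}
    return sorted(set(expected_peers) - seen)
```

Feeding it the log from 116 together with the two other peers (`172.17.141.117:9560` and `172.17.141.118:9560`, using the port that appears in the log) would list any peer that never responds.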

Please confirm that this setting is the same on all three machines:
--meta_server_addrs=172.17.141.116:9559,172.17.141.117:9559,172.17.141.118:9559
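
A quick way to compare this setting across machines is to parse each config file and compare the address lists. This is only a sketch: it assumes the gflags-style `--name=value` format shown in the pasted config, it treats the address list as order-insensitive (an assumption on my part), and the helper names are hypothetical.

```python
def parse_flags(text):
    """Parse gflags-style lines ('--name=value') into a dict, skipping comments."""
    flags = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line.startswith("--") or "=" not in line:
            continue  # comment, blank line, or something that is not a flag
        name, value = line[2:].split("=", 1)
        flags[name.strip()] = value.strip()
    return flags


def meta_addrs(text):
    """Return the meta_server_addrs entries as a sorted tuple."""
    value = parse_flags(text).get("meta_server_addrs", "")
    return tuple(sorted(a.strip() for a in value.split(",") if a.strip()))


def addrs_consistent(configs):
    """configs maps a host label to its config text.

    Returns (True/False, {host: addrs}) so a mismatch can be inspected.
    """
    lists = {host: meta_addrs(text) for host, text in configs.items()}
    return len(set(lists.values())) <= 1, lists
```

The config text for each host would be read from that machine's own nebula-metad config file, wherever it lives in the installation.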

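In principle 116 should receive RPCs from both 117 and 118, but in the log you posted 116 only hears from 118 and never from 117, so firewalls and the like need to be checked as well.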
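
Reachability between the machines can be probed with a plain TCP connect before digging into firewall rules. The sketch below is an illustration, not a nebula tool: port 9559 comes from the pasted config, port 9560 is taken from the `[Port: 9560, ...]` prefix in the logs and is assumed to be the port used for the election traffic, and `port_open` and `check_peers` are hypothetical helper names.

```python
import socket


def port_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


def check_peers(peers, ports=(9559, 9560), timeout=2.0):
    """Probe every (host, port) pair and return a dict of results."""
    return {
        (host, port): port_open(host, port, timeout)
        for host in peers
        for port in ports
    }
```

Running `check_peers(["172.17.141.116", "172.17.141.117", "172.17.141.118"])` from each of the three machines in turn would show which direction is blocked, since a firewall on one host usually only affects connections into that host.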

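Impressive, thank you! Yes, in the end I found that the firewall on 116 was still on for some reason; unlike on the other two machines, it had not been turned off.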

:ok_hand: :ok_hand:
