执行 submit job stats, 然后执行 show jobs 状态是FAILED

1 个赞

大佬,mate服务 报错



求教,是怎么回事

请问是干净的环境么? data 目录有铲掉么?

data目录还在

你要是从2.6 重新部署,要清理掉,数据不兼容的。
另外,我注意到你修改了容器名字,这里假设了容器名字是domain name,配置里要匹配好,因为彼此通信的配置项也是依赖这些域名的。

可以先不要修改,用原始的 3metad,3graphd,3storaged 的,起来确定没有问题再一点点取修改。

不不不,我换了台服务器重新搞的,跟2.6没关系

先试一下,原始的 compose file 不改动的哈

还是一样的报错,这个我得yaml文件
docker-compose.yaml (7.1 KB)

发错了,是这个
docker-compose.yaml (7.1 KB)

好的,我看了没有问题,能确保docker有权限创建 data 在里边写进去东西么?
现在是什么状态:

rock@rock-3a:~/nebula-docker-compose$ docker-compose ps
              Name                             Command                  State                                                         Ports
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
nebula-docker-compose_console_1     sh -c sleep 3 &&                 Exit 1
                                    nebula-co ...
nebula-docker-compose_graphd1_1     /usr/local/nebula/bin/nebu ...   Up (healthy)   0.0.0.0:49174->19669/tcp,:::49174->19669/tcp, 0.0.0.0:49171->19670/tcp,:::49171->19670/tcp,
                                                                                    0.0.0.0:49177->9669/tcp,:::49177->9669/tcp
nebula-docker-compose_graphd2_1     /usr/local/nebula/bin/nebu ...   Up (healthy)   0.0.0.0:49175->19669/tcp,:::49175->19669/tcp, 0.0.0.0:49172->19670/tcp,:::49172->19670/tcp,
                                                                                    0.0.0.0:49178->9669/tcp,:::49178->9669/tcp
nebula-docker-compose_graphd_1      /usr/local/nebula/bin/nebu ...   Up (healthy)   0.0.0.0:49176->19669/tcp,:::49176->19669/tcp, 0.0.0.0:49173->19670/tcp,:::49173->19670/tcp,
                                                                                    0.0.0.0:9669->9669/tcp,:::9669->9669/tcp
nebula-docker-compose_metad0_1      /usr/local/nebula/bin/nebu ...   Up (healthy)   0.0.0.0:49155->19559/tcp,:::49155->19559/tcp, 0.0.0.0:49153->19560/tcp,:::49153->19560/tcp,
                                                                                    0.0.0.0:49158->9559/tcp,:::49158->9559/tcp, 9560/tcp
nebula-docker-compose_metad1_1      /usr/local/nebula/bin/nebu ...   Up (healthy)   0.0.0.0:49157->19559/tcp,:::49157->19559/tcp, 0.0.0.0:49154->19560/tcp,:::49154->19560/tcp,
                                                                                    0.0.0.0:49160->9559/tcp,:::49160->9559/tcp, 9560/tcp
nebula-docker-compose_metad2_1      /usr/local/nebula/bin/nebu ...   Up (healthy)   0.0.0.0:49159->19559/tcp,:::49159->19559/tcp, 0.0.0.0:49156->19560/tcp,:::49156->19560/tcp,
                                                                                    0.0.0.0:49161->9559/tcp,:::49161->9559/tcp, 9560/tcp
nebula-docker-compose_storaged0_1   /usr/local/nebula/bin/nebu ...   Up (healthy)   0.0.0.0:49163->19779/tcp,:::49163->19779/tcp, 0.0.0.0:49162->19780/tcp,:::49162->19780/tcp, 9777/tcp,
                                                                                    9778/tcp, 0.0.0.0:49165->9779/tcp,:::49165->9779/tcp, 9780/tcp
nebula-docker-compose_storaged1_1   /usr/local/nebula/bin/nebu ...   Up (healthy)   0.0.0.0:49168->19779/tcp,:::49168->19779/tcp, 0.0.0.0:49166->19780/tcp,:::49166->19780/tcp, 9777/tcp,
                                                                                    9778/tcp, 0.0.0.0:49170->9779/tcp,:::49170->9779/tcp, 9780/tcp
nebula-docker-compose_storaged2_1   /usr/local/nebula/bin/nebu ...   Up (healthy)   0.0.0.0:49167->19779/tcp,:::49167->19779/tcp, 0.0.0.0:49164->19780/tcp,:::49164->19780/tcp, 9777/tcp,
rock@rock-3a:~/nebula-docker-compose$ tree data -L 3
data
├── meta0
│   └── nebula
│       └── 0
├── meta1
│   └── nebula
│       └── 0
├── meta2
│   └── nebula
│       └── 0
├── storage0
│   └── nebula
│       ├── 0
│       ├── 4
│       └── 5
├── storage1
│   └── nebula
│       ├── 0
│       ├── 4
│       └── 5
└── storage2
    └── nebula
        ├── 0
        ├── 4
        └── 5

能去确保data目录可以创建 但是 在data/meta0 下ls 是空,不确定能不能写进去
docker-compose.yaml (7.1 KB)

logs是可以写进去的,data应该也是可以的吧

还是这个报错对么?是的话,就是 filesystem_error 哈,disk 没满?

DiskManager::DiskManager(const std::vector<std::string>& dataPaths,
                         std::shared_ptr<thread::GenericWorker> bgThread)
    : bgThread_(bgThread) {
  try {
    // atomic is not copy-constructible
    std::vector<std::atomic_uint64_t> freeBytes(dataPaths.size() + 1);
    Paths* paths = new Paths();
    paths_.store(paths);
    size_t index = 0;
    for (const auto& path : dataPaths) {
      auto absolute = boost::filesystem::absolute(path);
      if (!boost::filesystem::exists(absolute)) {
        if (!boost::filesystem::create_directories(absolute)) {
          LOG(FATAL) << folly::sformat("DataPath:{} does not exist, create failed.", path);
        }
      } else if (!boost::filesystem::is_directory(absolute)) {
        LOG(FATAL) << "DataPath is not a valid directory: " << path;
      }
      auto canonical = boost::filesystem::canonical(path);
      auto info = boost::filesystem::space(canonical);
      paths->dataPaths_.emplace_back(std::move(canonical));
      freeBytes[index++] = info.available;
    }
    freeBytes_ = std::move(freeBytes);
  } catch (boost::filesystem::filesystem_error& e) {
    LOG(FATAL) << "DataPath invalid: " << e.what(); <----------------
  }

disk还有2.9T

能分享一下您的操作系统信息么?还有文件系统信息 disk 是网盘么?
应该是 boost::filesystem::status() 没有对应的实现status() 是一个兼容性问题 bug,我去提一个 issue。
期待回复。

@steam 捉虫活动还在继续么?

结束了,不过我们论坛捉虫的徽章一直都是在发放的。别担心

1 个赞

大佬,那这个情况咋整

能打印一下 uname -a 么?,sorry,看到了。
@yee 老师,能帮忙看下么,这个 4.14 kernel arcch64 容器环境里

boost 的 filessystem 报没有实现 status()

https://github.com/vesoft-inc/nebula/issues/4205