Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pkgs: HPC packages need change from apt to environment modules/spack #7

Open
bowling233 opened this issue Nov 24, 2024 · 4 comments
Open
Assignees

Comments

@bowling233
Copy link
Member

目前暂时采用 APT 安装系统级别的 oneAPI 和 CUDA。

等 Spack 和 Lmod 测试完成,应当将 HPC 相关包依赖从 APT 迁移出去。

@bowling233
Copy link
Member Author

参考:编译器、MPI库、数学库、应用工具 - 上海交大超算平台用户手册,可学习他们的部署、使用和测试方式

@bowling233
Copy link
Member Author

spack 安装 intel-oneapi-* 出现磁盘空间不足的问题

# cat spack-build-out.txt 
==> intel-oneapi-itac: Executing phase: 'install'
==> [2025-01-15-01:52:44.929515] 'bash' 'l_itac_oneapi_p_2022.1.0.158_offline.sh' '-s' '-a' '-s' '--action' 'install' '--eula' 'accept' '--install-dir' '/opt/spack/opt/spack/linux-debian12-haswell/gcc-12.2.0/intel-oneapi-itac-2022.1.0-q2rag6dchjnqruamgbuozacvh7v65igr'
Checking system requirements...
Error:
System requirements are not met.
Failed to check that system has enough disk space.
Cannot get free space: volume containing '/opt' folder is not valid or not ready to work with.
Please make sure that your target directory has at least 132.5 MB available for installation.

那么它是怎么检查磁盘空间的呢?看看 log:

<Bootstrapper> CHECKING PREREQUISITES IS STARTED
PreRequisite for free disk space begin [PreRequisiteFreeDiskSpace]
Create test file /home/bowling/intel/oneapi/installer/tmp.1736884181290 of size 138969532 bytes.
<RemoveFile>: file '/home/bowling/intel/oneapi/installer/tmp.1736884181290' is removed.
PreRequisite for free disk space passed
PreRequisite for free disk space end [PreRequisiteFreeDiskSpace]
<Bootstrapper> Prerequisite 'Free Disk Space' is correct!

在家目录创建临时文件以检测磁盘空间。不管是空间还是权限,都是在家目录下检查的,即使 --install-dir 并不在家目录。抽象嗷。

那么问题来了,它是在 Docker 内 chroot 的 root 的家目录中创建文件失败了吗?为什么会失败呢?

@bowling233
Copy link
Member Author

/tmp 下的 installer log 信息:

<Bootstrapper> CHECKING PREREQUISITES IS STARTED
PreRequisite for free disk space begin [PreRequisiteFreeDiskSpace]
Cannot get free space: volume containing '/opt' folder is not valid or not ready to work with.
Please make sure that your target directory has at least 132.5 MB available for installation.
MESSAGE : PreRequisite for free disk space failed
MESSAGE : Title [Failed to check that system has enough disk space.]
MESSAGE : Description [Cannot get free space: volume containing '/opt' folder is not valid or not ready to work with.
irectory has at least 132.5 MB available for installation.]
MESSAGE : PreRequisite for free disk space end [PreRequisiteFreeDiskSpace]

在 chroot 环境内的 df -h 结果:

+ df -h
Filesystem      Size  Used Avail Use% Mounted on
/dev            4.0M     0  4.0M   0% /dev
/run            126G     0  126G   0% /run
/tmp            126G   35M  126G   1% /tmp

难道必须使其挂载吗?

@bowling233
Copy link
Member Author

intel-oneapi-* fixed df77018

bind mount 根目录即可

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants