-
Notifications
You must be signed in to change notification settings - Fork 1
Update to add several fixes (seeds, updated environments) and metric collection plots (from Mark) #30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
* run from post-0.2.0 tag, main branch * docs: use public repo URIs * chore: add convenience target to build environments * add top-level Makefile to prepare env * feat: parametrize num of cores on the makefile * chore: ignore common temporary outputs and image build artifacts * update .eb files to easybuild 5.0 * remove remote storage * do not run artifacts if not in main repo * inject checksums to rmarkdown easyconfig * update sklearn singularity definition * feat: add microbenchmark for numpy operations * chore: bump clustering-benchmarks to 1.1.6 * feat: templatize the definitions * feat: mv output folders to timestamped names * feat: add --yes flag * docs: update README
@imallona extend the makefile and the yaml-izer starting with conda on some versions, e.g. to reproduce Mark's singularity recipe results (with conda) |
Surely this is known and planned for, but noting that this PR is not able to run the whole clustbench example (missing some of the envs etc). Should I review the current state, or wait? In other words: is the idea to merge changes incrementally or? |
This comment was marked as resolved.
This comment was marked as resolved.
FCPS instability is fixed (it was the seed) imallona/clustering_report#3 (comment) (this fixes it for repeated runs, but not for repeated ks within a run) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- envs/README could use some update (remove reference to unexsisting env files)
- maybe add a script to do the easybuild builds?
FWIW at d414d26 (omnibenchmark) ✘-INT ~/clustering_example/envs [imallona|✔]
12:45 $ eb clustbench.eb --robot
== Temporary log file in case of crash /tmp/eb-65zdr4j2/easybuild-6yjcgpbh.log
== found valid index for /home/ben/micromamba/envs/omnibenchmark/easybuild/easyconfigs, so using it...
== found valid index for /home/ben/micromamba/envs/omnibenchmark/easybuild/easyconfigs, so using it...
== resolving dependencies ...
== processing EasyBuild easyconfig /home/ben/clustering_example/envs/clustbench.eb
== building and installing clustbench/1-foss-2023b...
>> installation prefix: /home/ben/.local/easybuild/software/clustbench/1-foss-2023b
== fetching files and verifying checksums...
== ... (took < 1 sec)
== FAILED: Installation ended unsuccessfully: Checksum verification for extension source genieclust-1.1.6.tar.gz failed (took 0 secs)
== Results of the build can be found in the log file(s) /tmp/eb-65zdr4j2/easybuild-clustbench-1-20250618.124547.mKhtA.log
== Summary:
* [FAILED] clustbench/1-foss-2023b
ERROR: Installation of clustbench.eb failed: 'Checksum verification for extension source genieclust-1.1.6.tar.gz failed'
(omnibenchmark) ✘-42 ~/clustering_example/envs [imallona|✔]
12:45 $ eb --version
This is EasyBuild 5.1.0 (framework: 5.1.0, easyblocks: 5.1.0) on host omnibenchmark.
12:46 $ git rev-parse --short HEAD
d414d26
I'd be ok with merging the branch, though, and debug building of the easyconfigs in a separate issue, more isolated from everything else. |
does it work ignoring checksums? |
ignoring, or fixing the checksum, lets it go beyond
The problem, to me, seems to be improper configuration by the include flags in the genieclust package:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to merge, apart from the conda env question left.
envs/fcps.yml
Outdated
- conda-forge::r-rmarkdown | ||
- conda-forge::r-cairo | ||
- conda-forge::r-svglite | ||
- conda-forge::r-ggplot2 | ||
- conda-forge::r-tidyr | ||
- bioconda::bioconductor-complexheatmap | ||
- conda-forge::r-jsonlite | ||
- conda-forge::r-dplyr | ||
- conda-forge::r-r.utils |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Shouldn't these be pinned to a version?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks 65262c9
Doing a fresh build of the
From the log file:
Do note this is from a shared environment if you want to debug.
|
Perhaps premature but it's running well for me, user mkdir -p /home/easybuild01/imallona/src
## retrieve `develop` easyconfigs as for today, because Perl 5x CPAN errors
## were common in the past but issues have been fixed already (I hope)
## please see https://github.com/easybuilders/easybuild-easyconfigs/issues?q=is%3Aissue%20Perl-bundle-CPAN
git clone https://github.com/easybuilders/easybuild-easyconfigs.git
## clone the clustering example repo - current state (`branch imallona`)
git clone [email protected]:omnibenchmark/clustering_example.git
cd clustering_example
git checkout imallona
## specify recent easyconfigs as robotspath hoping someone fixed the Perl 5 CPAN easyconfig (I think it's fixed)
eb --robot-paths=/home/easybuild01/imallona/src/easybuild-easyconfigs/easybuild/easyconfigs/ \
--robot \
/home/easybuild01/imallona/src/clustering_example/envs/clustbench.eb edit it runs; please note I've updated genieclust and clustbench to 1.6 and clarified they're from the author's repo and from source dab8639 . Trying now fcps.eb using the same tmux/user/path. |
Something's going on with pydantic-core (oras recipe), conda works but singularity doesn't anymore. Oddly enough it used to work with older omnibenchmark versions, same apptainer. |
hmm should we document or script the fetching of easybuilders/easybuild-easyconfigs ? |
Another fresh build that took several hours to complete. I had to manually remove a bogus source file in the Doxygen build to let it continue.
|
This originally aimed to incorporate (parts of) btraven00@e0989fb and #29 but sequentially to have a better understanding of the many features
But currently it mostly deals with pinning software environments and fixing random seeds in some of the methods, as well as plotting results (from Mark).
First iteration
a conda with the apptainer's versionsa grid of versions in, separately, apptainer and conda