The Koala Benchmark Suite

Benchmarks | Quick Setup | More Info | License

For issues and ideas, open a GitHub issue.

Koala is a benchmark suite aimed at the characterization of performance-oriented research targeting the POSIX shell. It consists of 14 sets of real-world shell programs from diverse domains ranging from CI/CD and AI/ML to biology and the humanities. They are accompanied by real inputs that facilitate small- and large-scale performance characterization and varying opportunities for optimization.

If any aspect of Koala is useful, please cite the ATC'25 Koala paper:

@inproceedings {koala:atc:2025,
 title = {The Koala Benchmarks for the Shell: Characterization and Implications},
 author = {Evangelos Lamprou and Ethan Williams and Georgios Kaoukis and Zhuoxuan Zhang and Michael Greenberg and Konstantinos Kallas and Lukas Lazarek and Nikos Vasilakis},
 booktitle = {Proceedings of the 2025 USENIX Annual Technical Conference (USENIX ATC '25)},
 year = {2025},
 isbn = {978-1-939133-48-9},
 address = {Boston, MA},
 pages = {449--64},
 url = {https://www.usenix.org/conference/atc25/presentation/lamprou},
 publisher = {USENIX Association},
 month = jul
}

As part of the ATC'25 Artifact Evaluation process, the Koala frozen atc25-ae branch received all three badges—artifact Available, Functional, and Reproduced.

Benchmarks

Each of the top-level folders (except infrastructure) contains a benchmark set. Please explore the individual benchmark directories for more details on their specific inputs, dependencies, and usage.

Benchmark	Description
`analytics`	processes real-world network logs to extract and summarize key events.
`bio`	performs genomic and transcriptomic analysis using population and RNA-seq data.
`ci-cd`	builds and tests open-source software projects.
`covid`	analyzes public transit activity during the covid-19 pandemic.
`file-mod`	compresses, encrypts, and converts various file formats.
`inference`	runs media-related inference tasks using large foundation models.
`ml`	implements a full machine learning pipeline using scikit-learn.
`nlp`	processes books using shell-based nlp pipelines from unix for poets.
`oneliners`	executes classic and modern one-liner shell pipelines.
`pkg`	builds aur packages and analyzes npm packages for permissions.
`repl`	performs security auditing and git-based development workflow replay.
`unixfun`	solves unix text-processing problems from the 50-year anniversary challenge.
`weather`	computes and visualizes historical weather statistics.
`web-search`	implements crawling, indexing, and querying of wikipedia data.

Quick Setup

Koala can be obtained using the following ways:

Run curl up.kben.sh | sh from your terminal,
Clone the repo and run cd koala && ./setup.sh,
Fetch the oficial Docker container by running docker pull ghcr.io/kbensh/koala:latest, or
Build the Docker container from scratch.

Note: Docker version 20.10.0 or later is required to run or build the Koala container successfully.

More info

Setup and configuration: Extended instructions on Koala's setup and configuration can be found in the INSTRUCTIONS file.

Inputs and dependencies: Information on Koala's multiple inputs (e.g., min, small, and full) as well as the dependencies of all benchmarks, can be found in the INSTRUCTIONS file.

Contributions, always welcomed! Further details on how to contribute to the Koala benchmark project, take a look at the CONTRIBUTING file.

License

The Koala Benchmarks are licensed under the MIT License. See the LICENSE file for more information.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

The Koala Benchmark Suite

Benchmarks

Quick Setup

More info

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 25

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1,018 Commits
.github/workflows		.github/workflows
analytics		analytics
bio		bio
ci-cd		ci-cd
covid		covid
file-mod		file-mod
inference		inference
infrastructure		infrastructure
ml		ml
nlp		nlp
oneliners		oneliners
pkg		pkg
repl		repl
unixfun		unixfun
weather		weather
web-search		web-search
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
INSTRUCTIONS.md		INSTRUCTIONS.md
LICENSE		LICENSE
README.md		README.md
main.sh		main.sh
setup.sh		setup.sh

License

kbensh/koala

Folders and files

Latest commit

History

Repository files navigation

The Koala Benchmark Suite

Benchmarks

Quick Setup

More info

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 25

Uh oh!

Languages

Packages