Select a tag to browse associated projects and drill deeper into the tag cloud.
Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and grids. It is based on a hierarchical design targeted at federations of clusters. Ganglia is currently in use on over 500 clusters around the world and has scaled to handle clusters with 2000 nodes.
RIDE is a multi-flow HPC paradigm and the system that implements it. It aims to beat MPI and OpenMP for simplicity of use. It also beats approaches such as MapReduce and Task Flow because RIDE solves wider range of tasks.
Likwid stands for Like I knew what I am doing. This project contributes easy to use command line tools for Linux to support programmers in developing high performance multi threaded programs. It contains the following tools: likwid-topology: Show the thread and cache topology likwid-perfCtr: ... [More]
io-watchdog is a facility for monitoring user applications and parallel jobs for "hangs" which typically have a side effect of ceasing all IO in a cyclic application (i.e. one that writes something to a log or data file during each cycle of computation). The io-watchdog attempts to watch ... [More]