Hi,
Has anyone built a converged cluster using the storage on the compute nodes to reduce costs? If so:
- how was it set up?
- how well did it perform in practice?
- did it actually save money in the end?
- is there a sweet spot for performance/cost?
I have a requirement to build a roughly 30 node cluster with 1 PB scratch storage. The workload will be non-MPI/pleasingly-parallel so there are no synchronization/jitter concerns. The results will be sent to an archive so no stringent requirements on the storage reliability/redundancy. The intended use is for a specific project so there is no need to scale compute and storage separately.
Previously we’ve always had a separate storage system, but I’m curious if it would be cost-effective to add storage on the compute nodes and build a filesystem using beegfs. However would inifinband/RDMA be essential to reduce CPU load, or would RoCE be sufficient? If infiniband is required could I fit everything including the head node, subnet manager, and metadata server under single switch? Is it possible to do build in house or would it really require a vendor?
Thanks in advance for your opinions, recommendations, and experiences, Ben