ScaleBugs: Reproducible Scalability Bugs

Hi! I’m Zahra, an undergraduate at Universitas Dian Nuswantoro, Indonesia. As part of the ScaleBugs my proposal under the mentorship of Cindy Rubio González and Haryadi S. Gunawi aims to build a dataset of reproducible scalability bugs.
Analyzing through the existing bug reports for popular distributed systems, such as Cassandra, HDFS, Ignite, and Kafka. For each bug report, we will analyzing whether the reported bug is influenced by the scale of the operation, such as the number of nodes being used. With the collected scale-dependent bugs, we will design specific workloads to replicate those scalability issues. These workloads will be carefully crafted to activate certain system functionalities under different configurations, such as varying the number of nodes. By doing so, we aim to demonstrate how the system’s functionality is affected as the number of nodes increases.