Publications – Analytics of Software, GAmes And Repository Data (ASGAARD) Lab

21.

Hao Li; Filipe R. Cogo; Cor-Paul Bezemer

An Empirical Study of Yanked Releases in the Rust Package Registry Journal Article

Transactions of Software Engineering (TSE), 2022.

Tags: Release Management, Software Ecosystem

22.

Mikael Sabuhi; Ming (Chloe) Zhou; Cor-Paul Bezemer; Petr Musilek

Applications of Generative Adversarial Networks in Anomaly Detection: A Systematic Literature Review Journal Article

IEEE Access, 2021.

@article{mikael_gan2021,

title = {Applications of Generative Adversarial Networks in Anomaly Detection: A Systematic Literature Review},

author = {Mikael Sabuhi and Ming (Chloe) Zhou and Cor-Paul Bezemer and Petr Musilek},

year  = {2021},

date = {2021-12-01},

urldate = {2021-12-01},

journal = {IEEE Access},

abstract = {Anomaly detection has become an indispensable tool for modern society, applied in a wide

range of applications, from detecting fraudulent transactions to malignant brain tumors. Over time, many

anomaly detection techniques have been introduced. However, in general, they all suffer from the same

problem: lack of data that represents anomalous behaviour. As anomalous behaviour is usually costly (or

dangerous) for a system, it is difficult to gather enough data that represents such behaviour. This, in turn,

makes it difficult to develop and evaluate anomaly detection techniques. Recently, generative adversarial

networks (GANs) have attracted much attention in anomaly detection research, due to their unique ability

to generate new data. In this paper, we present a systematic review of the literature in this area, covering

128 papers. The goal of this review paper is to analyze the relation between anomaly detection techniques

and types of GANs, to identify the most common application domains for GAN-assisted and GAN-based

anomaly detection, and to assemble information on datasets and performance metrics used to assess them.

Our study helps researchers and practitioners to find the most suitable GAN-assisted anomaly detection

technique for their application. In addition, we present a research roadmap for future studies in this area. In

summary, GANs are used in anomaly detection to address the problem of insufficient amount of data for the

anomalous behaviour, either through data augmentation or representation learning. The most commonly used

GAN architectures are DCGANs, standard GANs, and cGANs. The primary application domains include

medicine, surveillance and intrusion detection.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

23.

Arthur V. Kamienski

Studying Trends, Topics, and Duplicate Questions on Q&A Websites for Game Developers Masters Thesis

University of Alberta, 2021.

Tags: Computer games, Q&A websites

@mastersthesis{msc_arthur,

title = {Studying Trends, Topics, and Duplicate Questions on Q&A Websites for Game Developers},

author = {Arthur V. Kamienski},

year  = {2021},

date = {2021-09-29},

urldate = {2021-09-29},

school = {University of Alberta},

abstract = {The game development industry is growing and there is a high demand for developers that can produce high-quality games. These developers need resources to learn and improve the skills required to build those games in a reliable and easy manner. Question and Answer (Q&A) websites are learning resources that are commonly used by software developers to share knowledge and acquire the information they need. However, we still know little about how game developers use and interact with Q&A websites. In this thesis, we analyze the largest Q&A websites that discuss game development to understand how effective they are as learning resources and what can be improved to build a better Q&A community for their users.

In the first part of this thesis, we analyzed data collected from four Q&A websites, namely Unity Answers, the Unreal Engine 4 (UE4) AnswerHub, the Game Development Stack Exchange, and Stack Overflow, to assess their effectiveness in helping game developers. We also used the 347 responses collected from a survey we ran with game developers to gauge their perception of Q&A websites. We found that the studied websites are in decline, with their activity and effectiveness decreasing over the last few years and users having an overall negative view of the studied Q&A communities. We also characterized the topics discussed in those websites using a latent Dirichlet allocation (LDA) model, and analyze how those topics differ across websites. Finally, we give recommendations to guide developers to the websites that are most effective in answering the types of questions they have, which could help the websites in overcoming their decline.

In the second part of the thesis, we explored how we can further help Q&A websites for game developers by automatically identifying duplicate questions. Duplicate questions have a negative impact on Q&A websites by overloading them with questions that have already been answered. Therefore, we analyzed the performance of seven unsupervised and pre-trained techniques on the task of detecting duplicate questions on Q&A websites for game developers. We achieved the highest performance when comparing all the text content of questions and their answers using a pre-trained technique based on MPNet. Furthermore, we could almost double the performance by combining all of the techniques into a single question similarity score using supervised models. Lastly, we show that the supervised models can be used on websites different from the ones they were trained on with little to no decrease in performance. Our findings can be used by Q&A websites and future researchers to build better systems for duplicate question detection, which can ultimately provide game developers with better Q&A communities.},

keywords = {Computer games, Q&A websites},

pubstate = {published},

tppubtype = {mastersthesis}

}

24.

Arthur V. Kamienski; Cor-Paul Bezemer

An Empirical Study of Q&A Websites for Game Developers Journal Article

Empirical Software Engineering Journal (EMSE), 2021.

Tags: Game development, Q&A communities

25.

Quang N. Vu; Cor-Paul Bezemer

Improving the Discoverability of Indie Games by Leveraging their Similarity to Top-Selling Games: Identifying Important Requirements of a Recommender System Inproceedings

International Conference on the Foundations of Digital Games (FDG), pp. 1–12, 2021.

Tags: Computer games, Game discoverability, Indie games, itch.io, Steam

26.

Filipe R. Cogo; Gustavo A. Oliva; Cor-Paul Bezemer; Ahmed E. Hassan

An Empirical Study of Same-day Releases of Popular Packages in the npm Ecosystem Journal Article

Empirical Software Engineering Journal (EMSE), 2021.

Tags: Dependencies, Release Management, Same-day Release, Software Ecosystem

@article{cogo2021,

title = {An Empirical Study of Same-day Releases of Popular Packages in the npm Ecosystem},

author = {Filipe R. Cogo and Gustavo A. Oliva and Cor-Paul Bezemer and Ahmed E. Hassan},

year = {2021},

date = {2021-04-05},

urldate = {2021-04-05},

journal = {Empirical Software Engineering Journal (EMSE)},

abstract = {Within a software ecosystem, client packages can reuse provider

packages as third-party libraries. The reuse relation between client and provider packages is called a dependency. When a client package depends on the code of a provider package, every change that is introduced in a release of the provider has the potential to impact the client package. Since a large number of dependencies exist within a software ecosystem, releases of a popular provider package can impact a large number of clients. Occasionally, multiple releases of a popular package need to be published on the same day, leading to a scenario in which the time available to revise, test, build, and document the release is restricted compared to releases published within a regular schedule. In this paper, our objective is to study the same-day releases that are published by popular packages in the npm ecosystem. We design an exploratory study to characterize the type of changes that are introduced in same-day releases, the prevalence of same-day releases in the npm ecosystem, and the adoption of same-day releases by client packages. A preliminary manual analysis of the existing release notes suggests that same-day releases introduce non-trivial changes (e.g., bug fixes). We then focus on three RQs. First, we study how often same-day releases are published. We found that the median proportion of regularly scheduled releases that are interrupted by a same-day release (per popular package) is 22%, suggesting the importance of having timely and systematic procedures to cope with same-day releases. Second, we

study the performed code changes in same-day releases. We observe that 32% of the same-day releases have larger changes compared with their prior release, thus showing that some same-day releases can undergo significant maintenance activity despite their time-constrained nature. In our third RQ, we study how client packages react to same-day releases of their providers. We observe that the vast majority of client packages that adopt the release preceding the same-day release would also adopt the latter without having to change their versioning statement (implicit updates). We also note that explicit adoptions of same-day releases (i.e., adoptions that require a change to the versioning statement of the provider in question) are significantly faster than explicit adoptions of regular releases. Based on our findings, we argue that (i) third-party tools that support the automation of dependency management (e.g., Dependabot) should consider explicitly flagging same-day releases, (ii) popular packages should strive for optimized release pipelines that can properly handle same-day releases, and (iii) future research should design scalable, ecosystem-ready tools that support provider packages in assessing the impact of their code changes on client packages.},

keywords = {Dependencies, Release Management, Same-day Release, Software Ecosystem},

pubstate = {published},

tppubtype = {article}

}

27.

Markos Viggiato; Dayi Lin; Abram Hindle; Cor-Paul Bezemer

What Causes Wrong Sentiment Classifications of Game Reviews? Journal Article

IEEE Transactions on Games, pp. 1–14, 2021.

Tags: Computer games, Natural language processing, Sentiment analysis, Steam

@article{markos2021sentiment,

title = {What Causes Wrong Sentiment Classifications of Game Reviews?},

author = {Markos Viggiato and Dayi Lin and Abram Hindle and Cor-Paul Bezemer},

year  = {2021},

date = {2021-04-05},

urldate = {2021-04-05},

journal = {IEEE Transactions on Games},

pages = {1--14},

institution = {University of Alberta},

abstract = {Sentiment analysis is a popular technique to identify the sentiment of a piece of text. Several different domains have been targeted by sentiment analysis research, such as Twitter, movie reviews, and mobile app reviews. Although several techniques have been proposed, the performance of current sentiment analysis techniques is still far from acceptable, mainly when applied in domains on which they were not trained. In addition, the causes of wrong classifications are not clear. In this paper, we study how sentiment analysis performs on game reviews. We first report the results of a large-scale empirical study on the performance of widely-used sentiment classifiers on game reviews. Then, we investigate the root causes for the wrong classifications and quantify the impact of each cause on the overall performance. We study three existing classifiers: Stanford CoreNLP, NLTK, and SentiStrength. Our results show that most classifiers do not perform well on game reviews, with the best one being NLTK (with an AUC of 0.70). We also identified four main causes for wrong classifications, such as reviews that point out advantages and disadvantages of the game, which might confuse the classifier. The identified causes are not trivial to resolve and we call upon sentiment analysis and game researchers and developers to prioritize a research agenda that investigates how the performance of sentiment analysis of game reviews can be improved, for instance by developing techniques that can automatically deal with specific game-related issues of reviews (e.g., reviews with advantages and disadvantages). Finally, we show that training sentiment classifiers on reviews that are stratified by the game genre is effective.},

keywords = {Computer games, Natural language processing, Sentiment analysis, Steam},

pubstate = {published},

tppubtype = {article}

}

28.

Arthur V. Kamienski; Luisa Palechor; Cor-Paul Bezemer; Abram Hindle

PySStuBs: Characterizing Single-Statement Bugs in Popular Open-Source Python Projects Inproceedings

MSR Mining Challenge, pp. 1–5, 2021.

Tags: Open-source projects, Python, Single-statement bugs

29.

Rain Epp; Dayi Lin; Cor-Paul Bezemer

An Empirical Study of Trends of Popular Virtual Reality Games and Their Complaints Journal Article

IEEE Transactions on Games, pp. 1–12, 2021.

Tags: Gamer complaints, Virtual reality games

30.

Sara Gholami; Hamzeh Khazaei; Cor-Paul Bezemer

Should you Upgrade Official Docker Hub Images in Production Environments? Inproceedings

ICSE New Ideas and Emerging Results (NIER), pp. 1–5, 2021.

Tags: Containerization, Dependency upgrades, Docker, Docker Hub, Downgrades

31.

Hareem Sahar; Abram Hindle; Cor-Paul Bezemer

How are Issue Reports Discussed in Gitter Chat Rooms? Journal Article

Journal of Systems and Software (JSS), pp. 1–53, 2020.

Tags: Developer discussions, Gitter, Issue reports

32.

Quang N. Vu

Leveraging Data From the Itch.io Online Game Distribution Platform to Help Indie Game Developers Masters Thesis

University of Alberta, 2020.

@mastersthesis{msc_quang,

title = {Leveraging Data From the Itch.io Online Game Distribution Platform to Help Indie Game Developers},

author = {Quang N. Vu},

year  = {2020},

date = {2020-09-01},

urldate = {2020-09-01},

school = {University of Alberta},

abstract = {In the game distribution world, Steam is often regarded as the most prominent digital platform for its many famous games made by large developers. On the other hand, the itch.io game distribution platform is praised for its friendliness toward small independent (indie) games developed by small teams or even a single developer. itch.io allows game developers to participate in online game jams (hackathons during which games are built) or publish their games at no publishing cost. In this thesis, we study game data mined from itch.io to help indie game developers: (1) have a higher chance of winning a game jam and (2) increase the discoverability of their games.

In the first part of the thesis, we study the game jams and their high-ranking submissions to better understand the characteristics of a popular game jam (i.e., a jam that receives many submissions) and the characteristics of high-ranking game submissions in these jams. We collected data of 1,290 past game jams and their 3,752 submissions for our analysis. We found that a quality description contributes positively to a jam's popularity and a game's ranking. Additionally, more manpower organizing a jam or developing a game increases their likelihood of being popular or high-ranking respectively. High-ranking games tend to support Windows or macOS, and belong to the Puzzle, Platformer, Interactive Fiction, or Action genres. Finally, shorter competitive jams tend to be more popular. Our findings are useful for both future game jam organizers and participants.

In the second part of the thesis, we study an approach to increase the discoverability of the indie games hosted on itch.io by recommending similar indie games to players of top-selling Steam games. We implemented a content-based recommendation technique that leverages the similarity in tags, genres, and game description between an indie game and a top-selling game using the metadata of 2,830 itch.io indie games and 326 top-selling Steam games. We then contacted the indie game developers for feedback and suggestions on our approach. We found that the majority (67.9%) of them show positive support for our idea. We analyzed the downvoted recommendations to understand the reasons and lay out the important requirements for such an indie game recommendation approach. These requirements are useful for future research and development in indie game discoverability and recommendation.},

keywords = {},

pubstate = {published},

tppubtype = {mastersthesis}

}

33.

Markos Viggiato; Cor-Paul Bezemer

Trouncing in Dota 2: An Investigation of Blowout Matches Inproceedings

The 16th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE), pp. 1–7, 2020.

34.

Safwat Hassan; Cor-Paul Bezemer; Ahmed E. Hassan

Studying Bad Updates of Top Free-to-Download Apps in the Google Play Store Journal Article

The Transactions of Software Engineering (TSE) journal, 2020.

Tags: Android mobile apps, Bad updates, Google Play Store, Mobile app reviews

35.

Sara Gholami

Studying Dependency Updates and a Framework for Multi-Versioning in Docker Containers Masters Thesis

University of Alberta, 2020.

@mastersthesis{msc_sara,

title = {Studying Dependency Updates and a Framework for Multi-Versioning in Docker Containers},

author = {Sara Gholami},

year  = {2020},

date = {2020-06-01},

urldate = {2020-06-01},

school = {University of Alberta},

abstract = {Containerized software systems are becoming more popular and complex as they are one of the essential techniques that enable cloud computing. One of the enabling technologies for containerized software systems is the Docker framework. Docker is an open-source framework for deploying containers, lightweight, standalone, and executable units of software with all their dependencies (packages and libraries) that can run on any computing environment. Docker images facilitate deploying and upgrading systems as all of the dependencies required for a software package are included in an image. However, there exist several risks with running Docker images in production environments. One risky situation can occur when upgrading images, as an upgrade may result in many changing packages or libraries at once.

Therefore, in this thesis, we study the Docker images and analyze them to identify the risks of package changes. Also, we propose our solution, DockerMV, to mitigate this risk by running multiple versions of an image at the same time.

In the first part of this thesis, we analyze the official Docker image repositories that are available on Docker Hub, Docker's public registry that holds Docker images. For each image in these repositories, we extract details about its native, Node, and Python packages. Afterward, we investigate which types of applications have more package changes in their image upgrades. We find that, depending on the type of applications, the package changes have different trends. For example, Operating systems and Base Images repositories have a lower median number of changes. However, Analytics and Application Services repositories have the highest median number of package changes. Our findings show that practitioners should be extra cautious when doing in-place upgrades of images of such applications in their production environments.

In the second part of this thesis, we provide a solution for mitigating this risk by applying software multi-versioning to Docker images. We present DockerMV, an open-source extension of the Docker framework that supports multi-versioning for containerized software systems. We demonstrate the usefulness of DockerMV from the performance point of view and test it on two open-source subject systems. In particular, we demonstrate how DockerMV can be used to balance the workload between Docker images that contain different versions of the same application. In both experiments, DockerMV maintained the system's performance while using a limited set of resources.},

keywords = {},

pubstate = {published},

tppubtype = {mastersthesis}

}

36.

Daniel Lee; Gopi Krishnan Rajbahadur; Dayi Lin; Mohammed Sayagh; Cor-Paul Bezemer; Ahmed E. Hassan

An Empirical Study of the Characteristics of Popular Minecraft Mods Journal Article

Empirical Software Engineering (EMSE) Journal, 2020.

Tags: CurseForge, Minecraft, Mod development, Mods

37.

Hammam M. AlGhamdi; Cor-Paul Bezemer; Weiyi Shang; Ahmed E. Hassan; Parminder Flora

Towards Reducing the Time Needed for Load Testing Journal Article

Journal of Software Evolution and Process (JSEP), 2020.

Tags: Load testing, Performance analysis, Performance testing

@article{AlGhamdi2020loadtests,

title = {Towards Reducing the Time Needed for Load Testing},

author = {Hammam M. AlGhamdi and Cor-Paul Bezemer and Weiyi Shang and Ahmed E. Hassan and Parminder Flora},

year  = {2020},

date = {2020-05-12},

urldate = {2020-05-12},

journal = {Journal of Software Evolution and Process (JSEP)},

abstract = {The performance of large-scale systems must be thoroughly tested under various levels of workload, as load-related issues can have a disastrous impact on the system. However, load tests often require a large amount of time, running from hours to even days, to execute. Nowadays, with the increased popularity of rapid releases and continuous deployment, testing time is at a premium and should be minimized while still delivering a complete test of the system. In our prior work, we proposed to reduce the execution time of a load test by detecting repetitiveness in individual performance metric values, such as CPU utilization or memory usage, that are observed during the test. However, as we explain in this paper, disregarding combinations of performance metrics may miss important information about the load-related behaviour of a system.

Therefore, in this paper we revisit our prior approach, by proposing a new approach that reduces the execution time of a load test by detecting whether a test no longer exercises new combinations of the observed performance metrics. We conduct an experimental case study on three open source systems (CloudStore, PetClinic, and Dell DVD Store 2), in which we use our new and prior approaches to reduce the execution time of a 24-hour load test. We show that our new approach is capable of reducing the execution time of the test to less than 8.5 hours, while preserving a coverage of at least 95% of the combinations that are observed between the performance metrics during the 24-hour tests. In addition, we show that our prior approach recommends a stopping time that is too early for two of the three studied systems. Finally, we discuss the challenges of applying our approach to an industrial setting, and we call upon the community to help us to address these challenges.},

keywords = {Load testing, Performance analysis, Performance testing},

pubstate = {published},

tppubtype = {article}

}

38.

Quang N. Vu; Cor-Paul Bezemer

An Empirical Study of the Characteristics of Popular Game Jams and Their High-ranking Submissions on itch.io Inproceedings

International Conference on the Foundations of Digital Games (FDG), pp. 1–12, 2020.

Abstract | BibTeX | Tags: Empirical software engineering, Game development, Game jams, itch.io, Mining software repositories

@inproceedings{Quang20,

title = {An Empirical Study of the Characteristics of Popular Game Jams and Their High-ranking Submissions on itch.io},

author = {Quang N. Vu and Cor-Paul Bezemer},

year  = {2020},

date = {2020-04-14},

urldate = {2020-04-14},

booktitle = {International Conference on the Foundations of Digital Games (FDG)},

pages = {1--12},

abstract = {Game jams are hackathon-like events that allow participants to develop a playable game prototype within a time limit. They foster creativity and the exchange of ideas by letting developers with different skill sets collaborate. Having a high-ranking game is a great bonus to a beginning game developer’s résumé and their pursuit of a career in the game industry. However, participants often face time constraints set by jam hosts while balancing what aspects of their games should be emphasized to have the highest chance of winning. Similarly, hosts need to understand what to emphasize when organizing online jams so that their jams are more popular, in terms of submission rate. In this paper, we study 1,290 past game jams and their 3,752 submissions on itch.io to understand better what makes popular jams and high-ranking games perceived well by the audience. We find that a quality description has a positive contribution to both a jam’s popularity and a game’s ranking. Additionally, more manpower organizing a jam or developing a game increases a jam’s popularity and a game’s high-ranking likelihood. High-ranking games tend to support Windows or macOS, and belong to the “Puzzle”, “Platformer”, “Interactive Fiction”, or “Action” genres. Also, shorter competitive jams tend to be more popular. Based on our findings, we suggest jam hosts and participants improve the description of their products and consider co-organizing or co-participating in a jam. Furthermore, jam participants should develop multi-platform multi-genre games. Finally, jam hosts should introduce a tighter time limit to increase their jam’s popularity.},

keywords = {Empirical software engineering, Game development, Game jams, itch.io, Mining software repositories},

pubstate = {published},

tppubtype = {inproceedings}

}

39.

Jiayuan Zhou; Shaowei Wang; Cor-Paul Bezemer; Ying Zou; Ahmed E. Hassan

Studying the Association between Bountysource Bounties and the Issue-addressing Likelihood of GitHub Issue Reports Journal Article

Transactions on Software Engineering (TSE), 2020.

Abstract | BibTeX | Tags: Bounties, Bountysource, GitHub, Open source software, Software evolution

40.

Simon Eismann; Cor-Paul Bezemer; Weiyi Shang; Dušan Okanović; André van Hoorn

Microservices: A Performance Tester's Dream or Nightmare? Inproceedings

ACM/SPEC International Conference on Performance Engineering (ICPE), pp. 1–12, 2020.

Abstract | BibTeX | Tags: DevOps, Microservices, Performance, Regression testing