Leveraging E2E Test Context for LLM-Enhanced Test Data and Descriptions
Enhancing Automated Software Testing with Runtime Data Integration
Abstract
Automated software testing plays a critical role in improving software quality and reducing manual testing costs. However, generating understandable and meaningful unit tests remains challenging, especially with coverage-optimized frameworks such as Search-Based Software Testing (SBST). Large Language Models (LLMs) can generate human-like text, while capture/replay techniques can supply realistic data scenarios through trace logs; together, these can support more meaningful test case generation. This study introduces UTGen+, an approach that enhances LLM-based SBST by integrating trace logs from end-to-end tests, aiming to further improve test case understandability.
We conducted a comparative user study with 9 participants using UTGen+, the original UTGen, and conventional SBST (EvoSuite), focusing on how trace log inclusion affects the naturalness and relevance of comments, identifiers, and test data across several projects. The results indicated that while UTGen+ did not improve the naturalness or relevance of comments and identifiers, it significantly enhanced the relevance of test data. These findings suggest that incorporating contextual runtime data can indeed help generate more relevant and understandable automated test cases.