Influence of External Dependency Retrieval and Prompt Engineering in Test Case Generation using LLMs

Abstract

The recent rise of large language models (LLMs) has enabled the generation of higher-quality test cases by leveraging the semantics of the methods under test. However, existing LLM-based approaches still struggle to achieve high coverage levels. To mitigate this issue, we present two complementary techniques: Prompt Engineering and External Dependency Retrieval for context enrichment. We evaluated our improvements through an ablation study on three open-source and four proprietary projects, encompassing 261 distinct methods. For each method, we generated test suites with four implementation variants and performed ten independent runs per variant, yielding a total of 10,440 executions. Our combined approach yields an average coverage increase of 12% on industrial software, with statistically significant gains over all other variants studied in this paper. Although our enhancements enlarge the context (the number of input tokens rises by 66.3%), this is partially offset by a reduction in output tokens due to fewer repair attempts, so the overall cost overhead remains moderate at about 16%. As future work, we aim to identify the minimal context that still yields significant improvements in test coverage, which could help to further reduce costs.
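
To make the idea of context enrichment concrete, the following is a minimal, hypothetical Python sketch of how external dependency retrieval and prompt engineering might be combined: the sources of types referenced by the method under test are looked up in a project index and appended to the prompt sent to the LLM. All names (MethodContext, retrieve_dependencies, build_prompt) and the prompt wording are illustrative assumptions, not the implementation evaluated in the paper.

```python
"""Illustrative sketch only: a hypothetical pipeline combining external
dependency retrieval with prompt engineering for LLM-based test generation.
Names and prompt wording are assumptions, not the authors' implementation."""

from dataclasses import dataclass


@dataclass
class MethodContext:
    """Source of the method under test plus its enclosing class."""
    class_name: str
    method_source: str
    class_source: str


def retrieve_dependencies(method: MethodContext, project_index: dict[str, str]) -> list[str]:
    # Hypothetical retrieval step: look up the source of external types
    # referenced by the method under test in a pre-built project index
    # (type name -> source snippet).
    return [
        source
        for type_name, source in project_index.items()
        if type_name in method.method_source and type_name != method.class_name
    ]


def build_prompt(method: MethodContext, dependencies: list[str]) -> str:
    # Prompt-engineering step: combine an instruction, the method under
    # test, and the retrieved dependency sources into a single prompt.
    dependency_block = "\n\n".join(dependencies) if dependencies else "(none)"
    return (
        "Write a unit test suite for the following method. "
        "Aim for high branch coverage and use only the APIs shown.\n\n"
        f"Method under test (class {method.class_name}):\n{method.method_source}\n\n"
        f"Relevant external dependencies:\n{dependency_block}\n"
    )


if __name__ == "__main__":
    # Toy usage: a fake project index stands in for real dependency retrieval.
    index = {"PriceCalculator": "class PriceCalculator { int applyDiscount(int cents) { ... } }"}
    mut = MethodContext(
        class_name="Checkout",
        method_source="int total(Cart cart) { return new PriceCalculator().applyDiscount(cart.sum()); }",
        class_source="class Checkout { ... }",
    )
    print(build_prompt(mut, retrieve_dependencies(mut, index)))
```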
