Improvement of Source Code Conversion for Code Completion

Bachelor thesis (2022)

Authors

M.J. Turk Electrical Engineering, Mathematics and Computer Science

Contributors

M. Izadi Software Engineering (mentor)

A. van Deursen Software Technology (mentor)

A. Lukina Algorithmics (graduation committee member)

Faculty

Electrical Engineering, Mathematics and Computer Science

To reference this document use:

http://resolver.tudelft.nl/uuid:9acaa0c3-ba8d-443a-8d06-c296f65c6895

Published Date

24-06-2022

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Code completion is advancing rapidly, with new research appearing regularly. One such advancement is CodeFill, which converts source files into token sequences for type prediction. Training the CodeFill model requires a large number of source files, and converting them takes a long time before training can begin. Conversion also contributes to the total latency of completions, because the file the end user is working on must be converted as well, and longer files can degrade the experience of using the model. In this study, we aimed to improve the performance and success rate of this conversion. Our results indicate that we improved performance by a factor of 83 or more, depending on the input file length, and increased the success rate by up to 45%.

Files

CSE3000_Paper_Mika_Turk.pdf

(pdf | 0.427 Mb)