Improvement of Source Code Conversion for Code Completion

More Info
expand_more

Abstract

Code Completion is advancing constantly, with new research coming out all the time. One such advancement is CodeFill, which converts source files into token sequences for type prediction. To train the CodeFill model, a lot of source files are needed which take a long time to convert before training can begin. Converting the file the end-user is working on for completions is also essential for the total latency as longer files can affect the experience of using the model. In this study we aimed to improve the performance and success rate of this conversion. Our results indicate that we increased both the performance by 83 times or more depending on the input file length and the success rate by up to 45%.