DS

D. Sochirca

1 records found

Authored

Compressing code generation language models on CPUs

Using Group Lasso pruning and post-training quantization

Code generation models have become more popular recently, due to the fact that they assist developers in writing code in a more productive manner. While these large models deliver impressive performance, they require significant computational resources and memory, making them dif ...