Petros, Callum. “Dynamic Token Pruning via Semantic-Aware Distillation for Efficient Transformer Compression”. Journal of Computer Technology and Software, vol. 3, no. 8, Nov. 2024, https://ashpress.org/index.php/jcts/article/view/189.