PETROS, Callum. Dynamic Token Pruning via Semantic-aware Distillation for Efficient Transformer Compression. Journal of Computer Technology and Software, [S. l.], v. 3, n. 8, 2024. Available at: https://ashpress.org/index.php/jcts/article/view/189. Accessed on: 8 Jul. 2025.