[1]
C. Petros, “Dynamic Token Pruning via Semantic-aware Distillation for Efficient Transformer Compression”, JCTS, vol. 3, no. 8, Nov. 2024.