(1)
Petros, C. Dynamic Token Pruning via Semantic-Aware Distillation for Efficient Transformer Compression. JCTS 2024, 3.