You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Query-aware visual token pruning for VLMs. Five-component pipeline (cross-attention scorer, entropy controller, spatial coherence, progressive schedule, token recycling) attached via PyTorch forward hooks — no upstream model changes. Evaluated on POPE with LLaVA-1.5-7B.