Text this: ViQAgent: zero-shot video question answering via agent with open-vocabulary grounding validation