Authors: Emmanuel Daucé; Laurent PerrinetVisual search is an essential cognitive ability, offering a prototypical control problem to be addressed with Active Inference. Under a Naive Bayes assumption, the maximization of the information gain objective is consistent with the separation of the visual sensory flow in two independent pathways, namely the ``What'' and the ``Where'' pathways. On the ``What'' side, the processing of the central part of the visual field (the fovea) provides the current interpretation of the scene, here the category of the target. On the ``Where'' side, the processing of the full visual field (at lower resolution) is expected to provide hints about future central foveal processing. A map of the classification accuracies, as obtained by counterfactual saccades, defines a utility fonction on the motor space, whose maximal argument prescribes the next saccade. The comparison of the foveal and the peripheral predictions finally forms an estimate of the future information gain, providing a simple and resource-efficient way to implement information gain seeking policies in active vision. This dual-pathway information processing framework is found efficient on a synthetic visual search task and we show here quantitatively the role of the precision encoded within the accuracy map. More importantly, it is expected to draw connections toward a more general actor-critic principle in action selection, with the accuracy of the central processing taking the role of a value (or intrinsic reward) of the previous saccade.
Add to my calendar
Create your personal schedule through the official app, Whova!Get Started