Computer Engineering and Applications ›› 2022, Vol. 58 ›› Issue (18): 1-15.DOI: 10.3778/j.issn.1002-8331.2202-0295

• Research Hotspots and Reviews • Previous Articles     Next Articles

Survey on Visual Affordance Research

LI Yunlong, QING Linbo, HAN Longmei, WANG Yuchen   

  1. 1.College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China
    2.Chengdu Institute of Planning and Design, Chengdu 610041, China
  • Online:2022-09-15 Published:2022-09-15



  1. 1.四川大学 电子信息学院,成都 610065
    2.成都市规划设计研究院,成都 610041

Abstract: Affordance refers to a series of interaction possibilities provided by objects in the environment, and describes the connection process between environmental attributes and individuals. Among them, visual affordance research is to use visual data such as images and videos to explore the possibility of visual subjects interacting with the environment or objects, involving scene recognition, action recognition, object detection, and other related fields. Visual affordance can be widely used in robotics, scene understanding, and other fields. Based on a large number of literature reviews, the visual affordance is classified and expounded according to three aspects:functional affordance, behavioral affordance, and social affordance. And for each type of affordability detection method, it is discussed in detail according to traditional machine learning methods and deep learning methods. At the same time, the current typical visual affordance datasets are summarized and analyzed, the application direction of visual affordance and possible future research directions are discussed.

Key words: affordance, deep learning, computer vision, machine learning

摘要: 可供性是指在环境内物体所提供的一系列交互可能,描述环境属性与个体之间的连接过程。其中,视觉可供性研究即通过使用图像、视频等视觉数据,探究视觉主体与环境或物体交互的可能性,涉及到场景识别、动作识别、物体检测等相关领域。视觉可供性可广泛应用于机器人、场景理解等领域。根据目前已有的相关研究,按功能可供性、行为可供性、社交可供性三方面对视觉可供性进行分类,并针对每一类可供性检测方法按照传统机器学习方法和深度学习方法进行详细论述。对当前典型的视觉可供性数据集进行归纳与分析,对视觉可供性的应用方向及未来可能的研究方向进行讨论。

关键词: 可供性, 深度学习, 计算机视觉, 机器学习