Computer Engineering and Applications ›› 2022, Vol. 58 ›› Issue (7): 97-105. DOI: 10.3778/j.issn.1002-8331.2102-0122

• Big Data and Cloud Computing •

Measuring Content and Performance of IPFS Network

DING Bowen, XU Yuedong, WANG Liang   

  1. Department of Electronic Engineering, School of Information Science and Technology, Fudan University, Shanghai 200433, China
  • Online: 2022-04-01 Published: 2022-04-01

IPFS网络内容和性能测量

丁博文,徐跃东,王亮   

  1. 复旦大学 信息科学与工程学院 电子工程系,上海 200433

Abstract: The Interplanetary File System (IPFS) is a decentralized file system. Because it can serve as a storage solution for various blockchain systems, it has been used in many applications. However, questions remain about IPFS itself, including what kinds of content are stored on the system and which key factors affect its performance. By implementing and deploying a crawler in the IPFS network, both the content and the performance of IPFS are measured. The content measurements show that more than 80% of the files stored on IPFS are smaller than 1 MB. The graph formed by files and directories has a degree distribution that approximately follows a power law. In the performance measurements, the IPFS network is found to have around 15,000 peers, which form a connectivity graph whose in-degree follows a power-law distribution. When files are stored in IPFS, chunk size and the number of chunks have a direct impact on performance. DHT resolution time differs considerably between private and public network modes. In both modes, DHT resolution is expected to be faster when the file being resolved has more providers.

Key words: interplanetary file system (IPFS), peer-to-peer network, network measurement, power law distribution

摘要: IPFS(interplanetary file system,星际文件系统)是一个去中心化的分布式文件系统,作为对区块链系统在存储方面的重要拓展获得许多应用。然而,对于IPFS本身尚有许多问题亟待回答,包括IPFS被用于存储怎样的内容、影响性能的关键因素等。通过在IPFS网络中部署爬虫程序,收集数据,从内容和性能两方面进行了测量研究。内容测量的结果发现,IPFS网络中存储的文件有80%以上是小于1 MB的小文件;IPFS中存储的文件和目录的构成的图的度近似满足幂律分布。在性能测量中,发现当前IPFS网络由约15 000个节点组成,节点互相连接构成的网络的入度满足幂律分布。在IPFS中存储文件时,分片大小和数量对性能有直接的影响,而在不同的网络环境中,即私有集群和公开网络中,DHT查询的性能相差较大;提供者数量越多,DHT解析越快。

关键词: 星际文件系统(IPFS), 点对点网络, 网络测量, 幂律分布