Computer Engineering and Applications ›› 2021, Vol. 57 ›› Issue (2): 113-118.DOI: 10.3778/j.issn.1002-8331.1912-0323

Previous Articles     Next Articles

Privacy Protection Method Facilitating Data Sharing for Grid Manufacturing Data Lake

XIE Yuqing, WANG Yuan, JIANG Ying, YANG Miao, WANG Yongli   

  1. 1.Information & Telecommunication Branch, State Grid Zhejiang Electric Power Corporation, Hangzhou 310000, China
    2.NARI Group Co., Ltd., State Grid Electric Power Research Institute Co., Ltd., Nanjing 211100, China
    3.China Realtime Database Co., Ltd., Nanjing 211100, China
    4.School of Computer Science and Technology, Nanjing University of Science and Technology, Nanjing 210094, China
  • Online:2021-01-15 Published:2021-01-14

便于数据共享的电网数据湖隐私保护方法

谢裕清,王渊,江樱,杨苗,王永利   

  1. 1.国网浙江省电力有限公司 信息通信分公司,杭州 310000
    2.南瑞集团有限公司(国网电力科学研究院有限公司),南京 211100
    3.江苏瑞中数据股份有限公司,南京 211100
    4.南京理工大学 计算机科学与工程学院,南京 210094

Abstract:

Data lake is the most commonly used method of data sharing in big data applications. However, the privacy problem caused by data sharing is still a missing part in the design of data lake, especially the higher requirement of data confidentiality in power grid. In order to facilitate the flow of valuable data, the lack of data privacy in existing data lakes must be completed so that data can be more securely shared across domains. In this regard, this paper proposes a data access and sharing model and optimizes the smart contract module in the blockchain. The experimental result shows that storing the hash value of the data in the block chain and storing the encrypted original data in the data lake with the trusted execution environment not only achieves the purpose of data access control and security sharing, but also reduces the gas usage and system throughput required for operations.

Key words: data lake, blockchain, smart contract, privacy protection

摘要:

数据湖是大数据应用最常用的数据共享方式,然而数据共享带来隐私问题仍然是数据湖设计中缺失的部分,特别是应用于电网数据保密性的要求更高。为了促进有价值数据的流通,必须完成现有数据湖缺失的数据隐私,以便数据更为安全的跨域共享。对此,提出一种数据访问与共享模型,并在此基础上对区块链中的智能合约模块进行了优化。实验结果表明,在区块链中存储数据的哈希值并使用可信执行环境将加密后的原始数据存储在数据湖中,不仅实现了数据访问控制和安全共享的目的,也降低了操作所需的gas使用量和系统吞吐量。

关键词: 数据湖, 区块链, 智能合约, 隐私保护