Journal of Software (软件学报), ISSN 1000-9825, CODEN RUXUEW
[doi: 10.13328/j.cnki.jos.006180]
E-mail: jos@iscas.ac.cn; http://www.jos.org.cn; Tel: 86-10-62562563
© Institute of Software, Chinese Academy of Sciences. All rights reserved.

PandaDB: An Intelligent Management System for Heterogeneous Data

SHEN Zhi-Hong¹, ZHAO Zi-Hao¹,², WANG Hua-Jin¹, LIU Zhong-Xin¹, HU Chuan¹,², ZHOU Yuan-Chun¹

¹ (Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China)
² (University of Chinese Academy of Sciences, Beijing 100049, China)

Corresponding author: SHEN Zhi-Hong (沈志宏), E-mail: bluejoe@cnic.cn

Abstract: As big data applications mature, the demand for fused management and analysis of large-scale structured and unstructured data is becoming increasingly prominent. However, structured and unstructured data differ in how they are stored and managed, processed, and retrieved, and these differences pose technical challenges for fused management and analysis. This paper proposes an extended property graph model for heterogeneous data fusion management and semantic computing, and defines the related property operators and query syntax. Based on this intelligent property graph model, the paper implements PandaDB, an intelligent heterogeneous data fusion management system, and describes its overall architecture, storage mechanism, query mechanism, property co-storage, AI algorithm integration and scheduling, and distributed architecture. Performance tests and case studies show that PandaDB's co-storage mechanism, distributed architecture, and semantic indexing mechanism deliver good performance acceleration for ad-hoc query and analysis of large-scale heterogeneous data, and that the system can be applied in practice to fused data management scenarios such as entity disambiguation and visualization for academic knowledge graphs.

Key words: data management system; heterogeneous data fusion; graph data model; ad-hoc query; artificial intelligence (AI)

Chinese Library Classification number: TP311

Citation format: Shen ZH, Zhao ZH, Wang HJ, Liu ZX, Hu C, Zhou YC. PandaDB: An intelligent management system for heterogeneous data. Ruan Jian Xue Bao/Journal of Software, 2021 (in Chinese). http://www.jos.org.cn/1000-9825/6180.htm

Foundation item: Strategic Priority Research Program (Category B) of CAS (XDB38030300); Key Project of National Natural Science Foundation of China (61836013); Ministry of Science and Technology Innovation Methods Special Work Project (2019IM020100); Informatization Plan of Chinese Academy of Sciences

Received: 2020-07-20; Revised: 2020-09-03; Revised: 2020-11-06; JOS published online: 2021-01-20