论文标题
通过自动信息提取监视能源趋势
Monitoring Energy Trends through Automatic Information Extraction
论文作者
论文摘要
能源研究至关重要,但使用计算机科学技术(例如自动文本处理和对能源领域的数据管理)的使用仍然很少见。在能源领域中采用这些技术将是对``能源信息学''的跨学科主题的重要贡献,就像``生物信息学''跨学科领域的相关进展一样。在本文中,我们介绍了一个基于Web的语义系统的架构,称为Enemonie(通过信息提取通过信息提取),以通过使用自动,连续和指导性的信息从网络上可用的不同类型的媒体中提取自动,连续和指导的信息提取,以监视最新的能源趋势。该系统处理的媒体类型将包括在线新闻文章,社交媒体文本,在线新闻视频以及开放式学术论文和技术报告,以及能源组织公开提供的各种数字能源数据。该系统将利用并促进与能源相关的本体论及其最终形式,将构成(i)文本分类的组成部分,(ii)命名的实体识别,(iii)时间表达式提取,(iv)事件提取,(v)社交网络构建,(vi)情感分析,(vii)信息融合,介绍(vii)信息融合(vii),媒体(vii),媒体(vii),媒体(vii),(vii)和可视化。智慧其多样化的数据源,自动文本处理功能以及供公众使用的演示设施; Enemonie将成为为决策者提供蒸馏和简洁信息的重要来源,包括能源生成,传输和分销系统运营商,能源研究中心,相关投资者和企业家以及院士,学生,其他对能源活动和技术节奏感兴趣的人。
Energy research is of crucial public importance but the use of computer science technologies like automatic text processing and data management for the energy domain is still rare. Employing these technologies in the energy domain will be a significant contribution to the interdisciplinary topic of ``energy informatics", just like the related progress within the interdisciplinary area of ``bioinformatics". In this paper, we present the architecture of a Web-based semantic system called EneMonIE (Energy Monitoring through Information Extraction) for monitoring up-to-date energy trends through the use of automatic, continuous, and guided information extraction from diverse types of media available on the Web. The types of media handled by the system will include online news articles, social media texts, online news videos, and open-access scholarly papers and technical reports as well as various numeric energy data made publicly available by energy organizations. The system will utilize and contribute to the energy-related ontologies and its ultimate form will comprise components for (i) text categorization, (ii) named entity recognition, (iii) temporal expression extraction, (iv) event extraction, (v) social network construction, (vi) sentiment analysis, (vii) information fusion and summarization, (viii) media interlinking, and (ix) Web-based information retrieval and visualization. Wits its diverse data sources, automatic text processing capabilities, and presentation facilities open for public use; EneMonIE will be an important source of distilled and concise information for decision-makers including energy generation, transmission, and distribution system operators, energy research centres, related investors and entrepreneurs as well as for academicians, students, other individuals interested in the pace of energy events and technologies.