论文标题
深度学习 - 对跨科学学科的选定评论的第一个元信息,它们的共同点,挑战和研究影响
Deep Learning -- A first Meta-Survey of selected Reviews across Scientific Disciplines, their Commonalities, Challenges and Research Impact
论文作者
论文摘要
深度学习属于人工智能领域,机器执行通常需要某种人类智能的任务。与大脑的基本结构类似,深度学习算法由人工神经网络组成,它类似于生物学大脑结构。模仿人类的学习过程,深度学习网络被(感官)数据(如文本,图像,视频或声音)提供了馈送。这些网络的表现优于不同任务中的最新方法,因此,整个领域在过去几年中的指数增长。在过去几年中,这种增长每年导致超过10,000个出版物。例如,仅搜索引擎PubMed仅涵盖医学领域的所有出版物的一个子集,在搜索词“深度学习”中已经提供了11,000多个结果,其中约有90%的结果来自过去三年。因此,已经无法获得对深度学习领域的完整概述,并且在不久的将来,很难获得对子领域的概述。但是,有几篇有关深度学习的评论文章,这些文章集中在特定的科学领域或应用上,例如计算机视觉的深度学习进展或诸如对象检测之类的特定任务。以这些调查为基础,这一贡献的目的是提供第一个高级,分类的,对跨不同科学学科的深度学习的选定评论进行分类。根据基本数据源(图像,语言,医学,混合)选择了类别(计算机视觉,语言处理,医学信息学和其他工作)。此外,我们回顾了每个子类别的共同体系结构,方法,利弊,评估,挑战和未来方向。
Deep learning belongs to the field of artificial intelligence, where machines perform tasks that typically require some kind of human intelligence. Similar to the basic structure of a brain, a deep learning algorithm consists of an artificial neural network, which resembles the biological brain structure. Mimicking the learning process of humans with their senses, deep learning networks are fed with (sensory) data, like texts, images, videos or sounds. These networks outperform the state-of-the-art methods in different tasks and, because of this, the whole field saw an exponential growth during the last years. This growth resulted in way over 10,000 publications per year in the last years. For example, the search engine PubMed alone, which covers only a sub-set of all publications in the medical field, provides already over 11,000 results in Q3 2020 for the search term 'deep learning', and around 90% of these results are from the last three years. Consequently, a complete overview over the field of deep learning is already impossible to obtain and, in the near future, it will potentially become difficult to obtain an overview over a subfield. However, there are several review articles about deep learning, which are focused on specific scientific fields or applications, for example deep learning advances in computer vision or in specific tasks like object detection. With these surveys as a foundation, the aim of this contribution is to provide a first high-level, categorized meta-survey of selected reviews on deep learning across different scientific disciplines. The categories (computer vision, language processing, medical informatics and additional works) have been chosen according to the underlying data sources (image, language, medical, mixed). In addition, we review the common architectures, methods, pros, cons, evaluations, challenges and future directions for every sub-category.