论文标题
迈向非黑色素瘤皮肤癌的高度富有表现力的机器学习模型
Towards Highly Expressive Machine Learning Models of Non-Melanoma Skin Cancer
论文作者
论文摘要
病理学家拥有丰富的词汇,他们可以描述细胞形态的所有细微差别。在他们的世界中,图像和单词有自然的配对。最近的进步表明,现在可以对机器学习模型进行培训,以学习高质量的图像功能并将其表示为离散信息。这使得自然语言(也是离散的语言)可以与成像旁边共同建模,从而对成像内容的描述进行描述。在这里,我们介绍了将离散建模技术应用于非黑色素瘤皮肤癌的问题结构域的实验,特别是皮皮性癌(IEC)的组织学图像。通过实施VQ-GAN模型以重建IEC图像的高分辨率(256x256)图像,我们训练了序列到序列变压器,以使用病理学家术语来生成自然语言描述。结合使用连续生成方法获得的交互式概念矢量的概念,我们展示了一个额外的解释性角度。结果是为高度表现力的机器学习系统而努力的一种有希望的方法,不仅可以用作预测/分类工具,而且还意味着要进一步了解我们对疾病的科学理解。
Pathologists have a rich vocabulary with which they can describe all the nuances of cellular morphology. In their world, there is a natural pairing of images and words. Recent advances demonstrate that machine learning models can now be trained to learn high-quality image features and represent them as discrete units of information. This enables natural language, which is also discrete, to be jointly modelled alongside the imaging, resulting in a description of the contents of the imaging. Here we present experiments in applying discrete modelling techniques to the problem domain of non-melanoma skin cancer, specifically, histological images of Intraepidermal Carcinoma (IEC). Implementing a VQ-GAN model to reconstruct high-resolution (256x256) images of IEC images, we trained a sequence-to-sequence transformer to generate natural language descriptions using pathologist terminology. Combined with the idea of interactive concept vectors available by using continuous generative methods, we demonstrate an additional angle of interpretability. The result is a promising means of working towards highly expressive machine learning systems which are not only useful as predictive/classification tools, but also means to further our scientific understanding of disease.