Converting Tabular Data into Images for Deep Learning with Convolutional Neural Networks | Computational Resources for Cancer Research

Go To Publications

https://www.nature.com/articles/s41598-021-90923-y

Publication Type

Journal Article

Publication Year

2021

Authors

Zhu, Yitan

Brettin, Thomas

Xia, Fangfang

Partin, Alexander

Shukla, Maulik

Yoo, Hyunseung

Evrard, Yvonne A.

Doroshow, James H.

Stevens, Rick L.

Abstract

The community has successfully used convolutional neural networks (CNNs) in many applications where important information about data is embedded in the order of features, such as speech and imaging. However, most tabular data do not assume a spatial relationship between features, and thus are unsuitable for modeling using CNNs. To meet this challenge, the authors developed a novel algorithm, image generator for tabular data (IGTD), to transform tabular data into images by assigning features to pixel positions so that similar features are close to each other in the image. The algorithm searched for an optimized assignment by minimizing the difference between the ranking of distances between features and the ranking of distances between their assigned pixels in the image. The authors applied IGTD to transform gene expression profiles of cancer cell lines (CCLs) and molecular descriptors of drugs into their respective image representations. Compared with existing transformation methods, IGTD generated compact image representations with better preservation of feature neighborhood structure. Evaluated on benchmark drug screening datasets, CNNs trained on IGTD image representations of CCLs and drugs exhibited a better performance of predicting anti-cancer drug response than both CNNs trained on alternative image representations and prediction models trained on the original tabular data.

Citation

Date

2021-05-31

Issue

Volume

Publication Title

Scientific Reports

ISSN

2045-2322

DOI

10.1038/s41598-021-90923-y

Publication Tags

Manual Tags

Nature Research Journals, computational models, machine learning, virtual drug screening

Related Materials

Project

Cellular-Level Pilot