View Software Catalog | Computational Resources for Cancer Research

Showing 13 Results

Project: MOSSAIC

Description: Offers an active learning framework for natural language processing of pathology reports to reduce the amount of labelled data required to effectively train a model.

IMPACT: Enables rapid annotation of pathology reports via machine learning.

PRIMARY PUBLICATION: Deep Active Learning for Classifying Cancer Pathology Reports

INPUT DATA TYPE: Text

INPUT DATA FORMAT: Tabular

LEVEL OF DOCUMENTATION: Minimal

AVAILABLE ON GITHUB

https://github.com/CBIIT/NCI-DOE-Collab-Pilot3-Active_learning_NLP

RELATED Models

RELATED Publications
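
EXAMPLE (illustrative sketch, not taken from the repository): The description above says active learning reduces the amount of labelled data needed. One common way to do that is uncertainty sampling, where the reports the current model is least sure about are sent to annotators first. The snippet below ranks unlabeled reports by the entropy of their predicted class probabilities. The function names, report ids, and probabilities are hypothetical, and the repository may use a different acquisition strategy.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a predicted class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def select_for_annotation(predictions, budget):
    """Return the ids of the `budget` most uncertain unlabeled reports.

    `predictions` maps a report id to the model's class probabilities.
    """
    ranked = sorted(
        predictions,
        key=lambda report_id: entropy(predictions[report_id]),
        reverse=True,  # highest entropy (most uncertain) first
    )
    return ranked[:budget]


# Hypothetical model outputs for four unlabeled pathology reports.
predictions = {
    "report_a": [0.98, 0.01, 0.01],
    "report_b": [0.40, 0.35, 0.25],
    "report_c": [0.70, 0.20, 0.10],
    "report_d": [0.34, 0.33, 0.33],
}

to_label = select_for_annotation(predictions, budget=2)
print(to_label)
```

In a full loop, the selected reports would be annotated, added to the training set, and the model retrained before the next selection round.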

ATOM Modeling PipeLine AMPL

Project: ATOM

Description: Offers an open source, modular, extensible software pipeline for building and sharing models to advance in silico drug discovery.

IMPACT: Provides free and open-source chemical property and activity modeling and prediction using machine learning for industry, government, and academic drug discovery.

PRIMARY PUBLICATION: Accelerating Therapeutics for Opportunities in Medicine: A Paradigm Shift in Drug Discovery

INPUT DATA FORMAT: Unspecified

LEVEL OF DOCUMENTATION: Minimal

AVAILABLE ON GITHUB

https://github.com/ATOMScience-org/AMPL

RELATED Datasets

RELATED Publications

RELATED Education

Autoencoder Node Saliency ANS

Project: Cellular-Level Pilot

Description: Identifies the saliency of hidden nodes in autoencoders by ranking hidden nodes in the latent layer of the autoencoder according to their capability of performing a learning task.

IMPACT: Explains the unsupervised learning process in autoencoders

PRIMARY PUBLICATION: Autoencoder Node Saliency: Selecting Relevant Latent Representations

INPUT DATA TYPE: Agnostic

INPUT DATA FORMAT: Tabular

LEVEL OF DOCUMENTATION: Minimal

AVAILABLE ON GITHUB

https://github.com/CBIIT/NCI-DOE-Collab-Pilot1-Autoencoder-Node-Saliency/

RELATED Datasets

RELATED Models

RELATED Publications

CANcer Distributed Learning Environment CANDLE

Project:

Description: Improves machine/deep learning models by performing hyperparameter optimization.

IMPACT: Enables hyperparameter optimization on machine/deep learning models.

PRIMARY PUBLICATION: CANDLE/Supervisor: A Workflow Framework for Machine Learning Applied to Cancer Research

INPUT DATA FORMAT: Unspecified

LEVEL OF DOCUMENTATION: Minimal

AVAILABLE ON GITHUB

https://github.com/ECP-CANDLE

RELATED Publications

RELATED Education
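
EXAMPLE (illustrative sketch, not the CANDLE API): Hyperparameter optimization, as named in the description above, means searching over settings such as learning rate and batch size for the combination that gives the best validation result. The snippet below shows plain random search, one of the simplest strategies. The objective function is a toy stand-in for training a model, and all names and values are hypothetical. CANDLE's own workflows are more elaborate and are documented in its repositories.

```python
import math
import random


def random_search(objective, space, n_trials, seed=0):
    """Sample `n_trials` random configurations and keep the lowest-scoring one."""
    rng = random.Random(seed)  # seeded so that runs are reproducible
    best_params, best_score = None, float("inf")
    for _ in range(n_trials):
        params = {name: rng.choice(values) for name, values in space.items()}
        score = objective(params)
        if score < best_score:
            best_params, best_score = params, score
    return best_params, best_score


def toy_validation_loss(params):
    """Stand-in for 'train a model and return its validation loss'."""
    lr_term = (math.log10(params["learning_rate"]) + 3) ** 2
    batch_term = abs(params["batch_size"] - 64) / 64
    return lr_term + batch_term


# Hypothetical search space.
space = {
    "learning_rate": [1e-1, 1e-2, 1e-3, 1e-4],
    "batch_size": [16, 32, 64, 128],
}

best_params, best_score = random_search(toy_validation_loss, space, n_trials=50)
print(best_params, best_score)
```

In practice each call to the objective is an expensive training run, which is why such searches are usually distributed across many compute nodes.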

Dynamic Importance Sampling DynIm

Project: ADMIRRAL

Description: Performs “dynamic” sampling where the input distribution can change over time and the sampling adapts itself to the new distribution.

IMPACT: Enables machine learning-based adaptive multiscale simulations for cancer biology.

INPUT DATA TYPE: NumPy Arrays

INPUT DATA FORMAT: Unspecified

LEVEL OF DOCUMENTATION: Minimal

AVAILABLE ON GITHUB

https://github.com/CBIIT/NCI-DOE-Collab-Pilot2-DynIm

RELATED Software

Enhanced Co-Expression Extrapolation E-COXEN

Project: Cellular-Level Pilot

Description: Extends the original COXEN method to select genes that are predictive of the efficacies of multiple drugs for building general drug response prediction models that are not specific to a particular drug.

IMPACT: Enables building of anti-cancer drug response prediction models using selected genes and drugs.

PRIMARY PUBLICATION: Enhanced Co-Expression Extrapolation (COXEN) Gene Selection Method for Building Anti-Cancer Drug Response Prediction Models

INPUT DATA TYPE: RNA-Seq, Drug Molecular Descriptors

INPUT DATA FORMAT: Tabular

LEVEL OF DOCUMENTATION: Minimal

AVAILABLE ON GITHUB

https://github.com/CBIIT/NCI-DOE-Collab-Pilot1-Enhanced_COXEN

RELATED Models

RELATED Publications

Framework for Exploring Scalable Computational Oncology FrESCO

Project: MOSSAIC

Description: The Framework for Exploring Scalable Computational Oncology (FrESCO) is a modular deep learning natural language processing (NLP) library for extracting structured information from clinical text documents and classifying that information according to a given data standard.

IMPACT: The FrESCO framework is a computational science tool that enables the automatic extraction of information from dense clinical reports. FrESCO’s modular deep learning natural language processing (NLP) library and associated tools provide the foundation for downstream research tasks and prediction algorithms.

INPUT DATA FORMAT: Unspecified

LEVEL OF DOCUMENTATION: Moderate

AVAILABLE ON GITHUB

https://github.com/DOE-NCI-MOSSAIC/FrESCO

Imaging Generator for Tabular Data IGTD

Project: Cellular-Level Pilot

Description: Transforms tabular data into images by assigning features to pixel positions so that similar features are close to each other in the image.

IMPACT: Enables building convolutional neural networks (CNNs) on the image representations for prediction tasks.

PRIMARY PUBLICATION: Converting Tabular Data into Images for Deep Learning with Convolutional Neural Networks

INPUT DATA TYPE: Agnostic

INPUT DATA FORMAT: Tabular

LEVEL OF DOCUMENTATION: Minimal

AVAILABLE ON GITHUB

https://github.com/CBIIT/NCI-DOE-Collab-Pilot1-Image-Generator-for-Tabular-Data

RELATED Models

RELATED Publications
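
EXAMPLE (simplified sketch, not the repository's implementation): The description above says features are assigned to pixel positions so that similar features end up close together. One way to express that idea is to rank all feature pairs by dissimilarity, rank all pixel pairs by distance, and search for an assignment in which the two rankings agree as closely as possible. The snippet below does this with Euclidean distances and a greedy pairwise-swap search. All function names and data are hypothetical, and the published method and repository code may differ in distance measures, tie handling, and search procedure.

```python
import itertools
import math


def pairwise_ranks(points):
    """Rank every pair (i, j), i < j, by Euclidean distance (0 = closest).

    Ties are broken by pair order because Python's sort is stable.
    """
    pairs = sorted(
        itertools.combinations(range(len(points)), 2),
        key=lambda p: math.dist(points[p[0]], points[p[1]]),
    )
    return {pair: rank for rank, pair in enumerate(pairs)}


def layout_error(feature_ranks, pixel_ranks, assignment):
    """Sum over feature pairs of |feature rank - rank of the pixel pair they occupy|."""
    total = 0
    for (i, j), rank in feature_ranks.items():
        a, b = sorted((assignment[i], assignment[j]))
        total += abs(rank - pixel_ranks[(a, b)])
    return total


def assign_features_to_pixels(columns, n_rows, n_cols):
    """Greedy swap search for a low-error feature-to-pixel assignment."""
    pixels = [(r, c) for r in range(n_rows) for c in range(n_cols)]
    if len(columns) != len(pixels):
        raise ValueError("need exactly one feature per pixel")
    feature_ranks = pairwise_ranks(columns)
    pixel_ranks = pairwise_ranks(pixels)
    assignment = list(range(len(columns)))  # feature k starts at pixel k
    best = layout_error(feature_ranks, pixel_ranks, assignment)
    improved = True
    while improved:  # stops once a full pass finds no improving swap
        improved = False
        for i, j in itertools.combinations(range(len(assignment)), 2):
            assignment[i], assignment[j] = assignment[j], assignment[i]
            err = layout_error(feature_ranks, pixel_ranks, assignment)
            if err < best:
                best, improved = err, True
            else:
                # Swap did not help, so undo it.
                assignment[i], assignment[j] = assignment[j], assignment[i]
    return [pixels[p] for p in assignment], best


# Hypothetical table: each tuple holds one feature's values across three samples.
columns = [
    (1.0, 2.0, 3.0),
    (9.0, 8.0, 7.0),
    (1.1, 2.1, 2.9),
    (9.2, 7.9, 7.1),
]

positions, error = assign_features_to_pixels(columns, n_rows=2, n_cols=2)
print(positions, error)
```

Once an assignment is fixed, each row of the table can be rendered as an image by writing every feature value into its assigned pixel.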

Learning Curves LC

Project: Cellular-Level Pilot

Description: Allows evaluation of a supervised learning model to determine if it can be further improved with more training data.

IMPACT: Helps decide whether collecting more data would be worthwhile and provides a framework for assessing the data scaling behavior of supervised learning predictors.

PRIMARY PUBLICATION: Learning Curves for Drug Response Prediction in Cancer Cell Lines

INPUT DATA TYPE: RNA-Seq

INPUT DATA FORMAT: Tabular

LEVEL OF DOCUMENTATION: Minimal

AVAILABLE ON GITHUB

https://github.com/CBIIT/NCI-DOE-Collab-Pilot1-Learning-Curve

RELATED Models

RELATED Software

RELATED Publications
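
EXAMPLE (illustrative sketch, not taken from the repository): A learning curve records a model's error at several training set sizes. Fitting a simple curve to those points gives a rough estimate of how much the error might drop with more data. The snippet below fits a pure power law, error = a * size^(-b), by least squares in log-log space and extrapolates to a larger training set. The function names and the synthetic data are hypothetical, and a pure power law ignores any irreducible error floor, so the extrapolation is optimistic by construction.

```python
import math


def fit_power_law(sizes, errors):
    """Least-squares fit of error ~= a * size**(-b) in log-log space."""
    xs = [math.log(n) for n in sizes]
    ys = [math.log(e) for e in errors]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    slope = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys)) / sum(
        (x - x_mean) ** 2 for x in xs
    )
    intercept = y_mean - slope * x_mean
    return math.exp(intercept), -slope  # (a, b)


def predict_error(a, b, size):
    """Error predicted by the fitted power law at a given training set size."""
    return a * size ** (-b)


# Synthetic learning curve generated from a = 2.0, b = 0.3 (not real results).
sizes = [1000, 2000, 4000, 8000]
errors = [2.0 * n ** -0.3 for n in sizes]

a, b = fit_power_law(sizes, errors)
# Estimated error reduction from doubling the largest training set.
gain = predict_error(a, b, 8000) - predict_error(a, b, 16000)
print(a, b, gain)
```

A small estimated gain suggests the model is near the flat part of its curve, where collecting more data is unlikely to pay off.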

Multiscale Machine-Learned Modeling Infrastructure MuMMI

Project: ADMIRRAL

Description: Supports very large and multiscale simulations of molecular dynamic interactions between proteins (or their domains) with each other or with cell membranes. 

IMPACT: Produces data, such as the KRas4B Campaign 1 trajectory data, for use in models.

PRIMARY PUBLICATION: Machine Learning–driven Multiscale Modeling Reveals Lipid-dependent Dynamics of RAS Signaling Proteins

INPUT DATA FORMAT: Unspecified

LEVEL OF DOCUMENTATION: Minimal

AVAILABLE ON GITHUB

https://github.com/CBIIT/NCI-DOE-Collab-Pilot2-MuMMI