Showing 13 Results
Showing 1-10 of 13
Model & Software

To view details of each card, click icon

Project: MOSSAIC
Description: Offers an active learning framework for natural language processing of pathology reports to reduce the amount of labelled data required to effectively train a model.
DESCRIPTION:

Offers an active learning framework for natural language processing of pathology reports to reduce the amount of labelled data required to effectively train a model.

IMPACT: Enables rapid annotation of pathology reports via machine learning.
INPUT DATA TYPE: Text
INPUT DATA FORMAT: Tabular
LEVEL OF DOCUMENTATION: Minimal
Project: ATOM
Description: Offers an open source, modular, extensible software pipeline for building and sharing models to advance in silico drug discovery.
DESCRIPTION:

Offers an open source, modular, extensible software pipeline for building and sharing models to advance in silico drug discovery.

IMPACT: Free and open-source chemical property and activity modeling and prediction using machine learning Industry/government/academic drug discovery
INPUT DATA FORMAT: Unspecified
LEVEL OF DOCUMENTATION: Minimal
Description: Identifies the saliency of hidden nodes in autoencoders by ranking hidden nodes in the latent layer of the autoencoder according to their capability of performing a learning task.
DESCRIPTION:

Identifies the saliency of hidden nodes in autoencoders by ranking hidden nodes in the latent layer of the autoencoder according to their capability of performing a learning task.

IMPACT: Explains the unsupervised learning process in autoencoders
INPUT DATA TYPE: Agnostic
INPUT DATA FORMAT: Tabular
LEVEL OF DOCUMENTATION: Minimal
Project:
Description: Improves machine/deep learning models by performing hyperparameter optimization.
DESCRIPTION:

Improves machine/deep learning models by performing hyperparameter optimization.

IMPACT: Enables hyperparameter optimization on machine/deep learning models.
INPUT DATA FORMAT: Unspecified
LEVEL OF DOCUMENTATION: Minimal
Project: ADMIRRAL
Description: Performs “dynamic” sampling where the input distribution can change over time and the sampling adapts itself to the new distribution.
DESCRIPTION:

Performs “dynamic” sampling where the input distribution can change over time and the sampling adapts itself to the new distribution.

IMPACT: Enables machine learning-based adaptive multiscale simulations for cancer biology.
INPUT DATA TYPE: NumPy Arrays
INPUT DATA FORMAT: Unspecified
LEVEL OF DOCUMENTATION: Minimal
Description: Extends the original COXEN method to select genes that are predictive of the efficacies of multiple drugs for building general drug response prediction models that are not specific to a particular drug.
DESCRIPTION:

Extends the original COXEN method to select genes that are predictive of the efficacies of multiple drugs for building general drug response prediction models that are not specific to a particular drug.

IMPACT: Enables building of anti-cancer drug response prediction models using selected genes and drugs.
INPUT DATA TYPE: RNA-Seq, Drug Molecular Descriptors
INPUT DATA FORMAT: Tabular
LEVEL OF DOCUMENTATION: Minimal
Project: MOSSAIC
Description: The Framework for Exploring Scalable Computational Oncology (FrESCO) is a modular deep learning natural language processing (NLP) library for extracting structured information from clinical text documents and classifying information to a given data standards.
DESCRIPTION:

The Framework for Exploring Scalable Computational Oncology (FrESCO) is a modular deep learning natural language processing (NLP) library for extracting structured information from clinical text documents and classifying information to a given data standards.

IMPACT: The FrESCO framework is a computational science tool that enables the automatic extraction of information from dense clinical reports. FrESCO’s modular deep learning natural language processing (NLP) library and associated tools provide the foundation for downstream research tasks and prediction algorithms.
INPUT DATA FORMAT: Unspecified
LEVEL OF DOCUMENTATION: Moderate
Description: Transforms tabular data into images by assigning features to pixel positions so that similar features are close to each other in the image.
DESCRIPTION:

Transforms tabular data into images by assigning features to pixel positions so that similar features are close to each other in the image.

IMPACT: Convolutional neural networks (CNNs) can be built based on the image representations for prediction tasks.
INPUT DATA TYPE: Agnostic
INPUT DATA FORMAT: Tabular
LEVEL OF DOCUMENTATION: Minimal
Description: Allows evaluation of a supervised learning model to determine if it can be further improved with more training data.
DESCRIPTION:

Allows evaluation of a supervised learning model to determine if it can be further improved with more training data.

IMPACT: May help to decide whether it would be worthwhile to collect more data and provide a framework for assessing the data scaling behavior of these predictors.
INPUT DATA TYPE: RNA-Seq
INPUT DATA FORMAT: Tabular
LEVEL OF DOCUMENTATION: Minimal
Project: ADMIRRAL
Description: Supports very large and multiscale simulations of molecular dynamic interactions between proteins (or their domains) with each other or with cell membranes. 
DESCRIPTION:

Supports very large and multiscale simulations of molecular dynamic interactions between proteins (or their domains) with each other or with cell membranes. 

IMPACT: Produces data like KRas4B Campaign 1 Trajectory data for use in models.
INPUT DATA FORMAT: Unspecified
LEVEL OF DOCUMENTATION: Minimal