Documentation

The AI-Driven Molecular Similarity & Visualization Platform app utilizes the trained ChemBERTa model to calculate the similarity between different molecules based on their SMILES representations. This application leverages deep learning to generate embeddings and then calculates pairwise cosine similarity between these embeddings.

What the App Does

This app takes a CSV file containing molecule IDs and SMILES strings, computes ChemBERTa embeddings, and generates a similarity matrix based on cosine similarity between the molecules. The results are presented both as a downloadable CSV file and as a heatmap for easy visualization.

Impact and Use Cases

This tool is essential for chemoinformatics research and drug discovery, where understanding molecular similarity is crucial for tasks such as virtual screening, clustering of molecules, and finding potential lead compounds.

Features