Software

ActivityFinder - Mapping of ChEMBL data to Protein Structures with Atomistic Detail

ActivityFinder is a complex database application which enables the automatic assignment of ChEMBL bioactivity data to protein-ligand complexes given in PDB format. The package will be first provided in January 2026.

AltLocEnumerator - Automatic handling of Alternative Locations (AltLocs)

The AltLocEnumerator automatically handles any Alternative Locations (AltLocs) present in structures from the PDB and enables you to either make an informed decision or use the most probable version of the structure.

ASCONA - Sequence-driven binding site alignment for multimeric proteins

ASCONA is a fully automated command line tool for generating sequence alignments and superpositions of protein binding sites. It is geared to the alignment of alternative protein conformations and especially addresses the tasks of dealing with highly flexible backbone regions, detecting multiple occurrences of a binding site in oligomeric structures, and coping with arbitrary annotation inconsistencies and structural artifacts.

For more information click here.

Conformator - A novel conformer ensemble generator

Conformator is a novel conformer ensemble generator that stands out by its handling of macrocycles, high accuracy, as well as robustness with respect to input formats and molecular geometries. With an extended set of rules for sampling torsion angles, a novel algorithm for macrocycle conformer generation, and a new clustering algorithm for the assembly of conformer ensembles, Conformator generates high-quality conformation ensembles.i

Please find a full description of Conformator in a recent publication [“Conformator: A Novel Method for the Generation of Conformer Ensembles”]

For more information click here

CSFPy - Fingerprint generation considering all structural features

The CSFPy is a Python module for the generation of the Connected Subgraph Fingerprint (CSFP).The user can load molecules by file or SMILES string, generate fingerprints and derive similarity measurements. A pdf file with a detailed description of the module is contained as well.

For more information click here.

DoGSite3 - Binding pocket prediction and site descriptor calculation

DoGSite3 is a grid-based method which uses a Difference of Gaussian filter to detect potential binding pockets - solely based on the 3D structure of the macromolecules (protein and nucleic acid chains with > 50 atoms are considered) - and splits them into subpockets. It is a reimplementation of the DoGSite algorithm with optimized parameters and an optional method to make use of a ligand bias. Global properties, describing the size, shape and chemical features of the predicted (sub)pockets are calculated.

EDIAScorer - Quantifying the electron density fit within structural models

The electron density score for individual atoms (EDIA) quantifies the electron density fit of each atom in a crystallographically resolved structure. Multiple EDIA values can be combined with the help of the power mean to compute EDIA_m, the electron density score for multiple atoms to score a set of atoms such as a ligand, a residue, or an active site.

For more information click here.

FSees - A fragment space exhaustive enumeration system

fragment space exhaustive enumeration system (FSees) is an efficient method for exhaustively enumerating all molecules within a certain molecular subspace. This chemical space is described as a fragment space and constrained by a set of user-defined physicochemical properties. The FSees algorithm uses file-based data structures to overcome the limitation of computer main memory thus allowing to enumerate very large chemical spaces. The resulting chemical library can be used as a starting point for computational lead-finding technologies, like similarity searching, pharmacophore mapping, docking, or virtual screening.

For more information click here.

iRAISE - Inverse screening and structure-based target prediction

iRAISE is an inverse screening tool based on the RAISE (=RApid Index-based Screening Engine) technology. The triangle-descriptor abstraction of proteins and molecules allows fast non-sequential screening of thousands of target structures. Scoring measures applied for selectivity considering the reference ligand and coverages of the pocket and the ligand pose support inter-protein ranking.

For more information click here.

JAMDA - Target preparation and protein-ligand docking

JAMDA is a workflow for preparing a protein target structure for docking and performing a protein-ligand docking. The JAMDA workflow encompasses all necessary steps to get from an unprepared PDB or mmCIF structure and a ligand as SMILES to the final docking poses.

JamdaScorer - Scoring and numerical optimization of protein-ligand poses

JamdaScorer applies the novel JAMDA scoring function as well as the LSL-BFGS optimization algorithm to protein-ligand poses. Note that JamdaScorer is not a docking program, therefore the ligand to be scored must be given with coordinates describing the binding pose relative to the protein.

LifeSoaks - LifeSoaks calculates solvent channels and their corresponding bottleneck radii.

LifeSoaks calculates solvent channels and their corresponding bottleneck radii by constructing a three dimensional Voronoi diagram of the atoms in a crystal unit cell. Channels are given in the form of a ccp4 grid file with a local bottleneck radius annotated to each grid point. ccp4 files can be visualized with molecular viewers such as Chimera or PyMol.

LigandExtractor - Shows and extracts ligands in files from the PDB

LigandExtractor can be used to find all ligands in a file from the PDB. In addition to finding all possible ligands, it annotates any problems or particularities as so-called "skip reasons". All ligands found in PDB files can be specifically extracted, including their conformation. The extracted ligands can then be used in other tools, for example, to define a binding site or analyse their conformation.

MicroMiner - Searching Similar Local 3D Micro-Environments in Protein Structure Databases

MicroMiner searches similar residue 3D micro-environments in protein structures. The tool can analyze structural changes of single mutations by extracting single mutations from a protein structure database at scale.

Mona - An easy-to-use cheminformatics plattform

Mona is an interactive tool that can be used to prepare and visualize large small-molecule datasets. A set centric workflow allows to intuitively handle hundred thousands of molecules. Building upon the robust framework Naomi common cheminformatics tasks such as analysis, filtering and converting of molecular files can be performed with high efficiency.

For more information click here.

mRAISE - Ligand-based virtual screening

mRAISE is a tool for ligand-based virtual screening based on the RAISE (=RApid Index-based Screening Engine) technology. The triangle-descriptor abstraction of small molecules allows fast non-sequential screening of millions of compounds. Calculated alignments are scored using a Gaussian-type scoring function which scores the general shape overlap as well as the compatibility of physicochemical features of aligned molecules.

For more information click here.

NAOMInext - Forward synthesis planning integrated into high throughput docking

NAOMInext is a software tool supporting medicinal chemists during hit-to-lead optimization. Starting from a co-crystallized small fragment, synthetic feasible lead compounds are generated directly within the proteins binding site.

For further information click here

NAOMInova - Geometrical and chemical analysis of atoms surrounding a user-defined structure

NAOMInova is a software tool enabling the geometrical and chemical analysis of atoms surrounding a user-defined structure of interest. Based on a user-selected set of protein structures, substructures can be searched and the distribution of partner points can be visualized.

For more information click here.

PELIKAN - Searching for spatial interaction patterns in sets of protein-ligand complexes

PELIKAN is a software tool enabling rapid searching of spatial interaction patterns in large collections of protein-ligand complexes. Data from protein-ligand complexes is stored in an SQLite database which can be subjected to different search processes.

For more information click here.

Phariety - Pharmacophore Screening

A command line tool for virtual screening with a pharmacophore model as query. Returns all molecules which can be mapped on the pharmacophore query together with a score based on their compatibility to the query model. The returned molecules are superposed on the query. The output file is empty when no input molecule could be mapped on the query. It contains the calculated score of the structure.

PiMine - Protein interface comparisons in collections of protein-protein interfaces

PiMine is a database-driven method for protein interface comparisons in large collections of protein-protein interfaces. It calculates tetrahedrons by using alpha-carbon atoms as vertices connected by edges annotated by corresponding distances. These tetrahedrons are used to find similar atomic arrangments in other protein-protein interfaces contained in a database. A list of similarity scores and, if desired, calculated alignments of query and target interfaces are output at the end.

ReactionViewer - Visualizing generic reaction patterns

ReactionViewer is a command line tool for visualizing generic reaction patterns written as Reaction SMILES, Reaction SMARTS or in the SMIRKS language. Visualization of individual SMILES or SMARTS patterns is also possible. Images can be exported as .pdf, .png or .svg.

REMUS - Alignment-Driven Virtual Screening

REMUS is a software tool for the efficient calculation of 3D shape-based similarity for small molecules. It uses a range of Gaussian functions, a step-limited BFGS optimization algorithm and random starting positions to calculate superimpositions. To address molecular flexibility, starting conformations are sampled using Conformator. REMUS allows the calculation of flexible alignments by adjusting the torsion angles during numerical alignment optimization.

SIENA - A fully-automated protein binding site ensemble generator

SIENA is a software pipeline enabling the fully automated construction of protein structure ensembles from the PDB. Starting with a single query structure, all binding sites with high sequence similarity are extracted from the PDB, aligned, and protonated. SIENA is able to deal with complicated cases like binding sites at protein domain interfaces or within homo-multimeric proteins.

For more information click here.

SiteMine - Binding site similarity search in large collections of protein pockets

SiteMine is an all-in-one database-driven, alignment-providing binding site similarity search tool to tackle the most pressing challenges of binding site comparison. It relies on the TetraScan approach we designed for complex 3D shape-matching applications. In short, tetrahedral search patterns are processed by GeoMine to retrieve binding site matches. All matches proposed are subsequently scored and ranked. The best-scored match is superimposed by SiteMine.

SMARTScompareViewer - Compare and analyze molecular patterns

The graphical SMARTScompareViewer and the accompanying SMARTScompare command line tool allow for in depth SMARTS pattern comparisons and analysis.

SMARTSeditor - Interactive and semi-automatic creation of molecular patterns

The SMARTSeditor is a graphic editing tool for generic chemical patterns. Based on the SMARTS language, chemical patterns can be created and edited interactively, similar to molecule editing in a chemical structure editor. The SMARTSeditor supports editing of given SMARTS or editing of chemical pattern without knowledge of the SMARTS language from scratch. It supports logic combinations as well as atom environment definitions. The generated pattern is converted into a SMARTS string which can be used in any application where the pattern is needed, e.g. for filtering of molecule databases.

For more information click here.

SpaceCompare - analyzes, compares and optimizes ultra-large combinatorial chemical spaces

SpaceCompare has three modes of operation on ultra-large, non-enumerable combinatorial chemical spaces:

calculate the exact overlap of two combinatorial chemical spaces and generate a CSV file containing all their shared products (only functionality in version 1.0)
calculate physicochemical property distributions of all products of a combinatorial chemical space
optimize the distribution of a specified property by removing fragments with high property values

You can use SpaceLight to generate custom topological fragment spaces or access the publicly-available KnowlegeSpace .

Please note that the current version of SpaceCompare cannot be run on commercial fragment spaces from BioSolveIT GmbH.

SpaceLight - Topological similarity search in large combinatorial fragment spaces

SpaceLight is a command line tool for similarity-based virtual screening in large combinatorial fragment spaces. It utilizes well-known fingerprints methods such as the Extended Connectivity Fingerprints (ECFP) and is able to search spaces containing billions or even trillions of compounds within seconds on a standard PC. In addition SpaceLight can be used for scaffold detection and custom fragment spaces generation. A pdf file with a detailed description of the module is contained as well.

Please note that SpaceLight as available from here cannot be used with commercial fragment spaces from BioSolveIT GmbH. If you are interested to do so, please contact BioSolveIT GmbH.

For more information click here.

SpaceMACS - Maximum common substructure search in large combinatorial fragment spaces

SpaceMACS is a command line tool for maximum common connected substructure search and substructure search in general in large combinatorial fragment spaces. Therefore, the tool enables chemical structure search and topological similarity searching. SpaceMACS is able to screen Spaces containing billions or even trillions of compounds in seconds to minutes on a standard PC.

Please note that SpaceMACS as available from here cannot be used with commercial fragment spaces from BioSolveIT GmbH. If you are interested to do so, please contact BioSolveIT GmbH.

For more information click here.

StructureProfiler - StructureProfiler Automatically profile protein structures in PDB format

StructureProfiler assists in automatically profiling protein structures (PDB file format) ranging from model characteristics like R factor over active site features to ligand properties such as molecular geometry, electron density coverage and frequently seen torsion angles.

For more information click here

Synthesia - Full Modification Control over Retrosynthetic Routes for Guided Optimization of Lead Structures

Synthesia is a command-line tool that uses an entire retrosynthetic route as a guide pathway to generate optimized structural analogues of a lead compound without compromising the synthesizability of the structure. The user has the ability to guide the structural modifications in a desired direction by specifying structural constraints.

For information on configuring Synthesia, see the README file that is part of the Synthesia package.

TorsionAnalyzer - Knowledge-based analysis of small molecule conformations

The TorsionAnalyzer is a commandline tool that classifies the torsion angles of rotatable bonds into relaxed, tolerable, and strained according to their likelihood in the CSD. The statistics are taken from the Torsion Library, a pregenerated collection of torsion angle statistics for a hierarchy of SMARTS.

Further information can be found in the most recent publication, as well as the preceding ones.

TorsionPatternMiner - generates statistical information about the pattern matching behavior of a torsion library with respect to a set of compounds.

The TorsionPatternMiner is a commandline tool that mines torsion angles statistics from crystal conformations. It also checks the consistency of the Torsion Library SMARTS hierarchy with an extended SMARTScompare algorithm.

Further information about torsion libraries and their analysis can be found in the most recent publication, as well as the preceding ones.

UNICON - A universal converter, tautomer and conformer generator

UNICON is a command-line tool to cope with common cheminformatics tasks. The functionality of UNICON ranges from file conversion between standard formats SDF, MOL2, SMILES, and PDB via the generation of 2D structure coordinates and 3D structures to the enumeration of tautomeric forms, protonation states and conformer ensembles.

For more information click here.

WarPP - WarPP places water molecules for the active site of a given PDB file.

WarPP places water molecules for the active site of a given PDB file. Based on interaction geometries previously derived from protein crystal structures, potential water positions are generated for free interaction directions (lone pairs of acceptors or hydrogen atoms of donors). Every small molecule present in the PDB structure is used to generate an active site in a 6.5 Å radius of each atom. Within these active sites water molecules are placed. In the corresponding publication, we showed that WarPP places 80% of water molecules within 1.0 Å distance to a crystallographically observed one.

ZBH - Center for Bioinformatics

Available software