文摘
Chemical fragment spaces are combinations of molecular fragments and connection rules. They offer thepossibility to encode an enormously large number of chemical structures in a very compact format. Fragmentspaces are useful both in similarity-based (2D) and structure-based (3D) de novo design applications. Wepresent disconnection and filtering rules leading to several thousand unique, medium size fragments whenapplied to databases of druglike molecules. We evaluate alternative strategies to select subsets of thesefragments, with the aim of maximizing the coverage of known druglike chemical space with a stronglyreduced set of fragments. For these evaluations, we use the Ftrees fragment space method. We assess adiversity-oriented selection method based on maximum common substructures and a method biased towardhigh frequency of occurrence of fragments and find that they are complementary to each other.