Aropha: Leading the Frontier of Polymer Informatics

Aropha is establishing itself as a leader in polymer informatics by advancing methodologies for data integration and curation in polymer research and science. As part of this effort, we have incorporated the HELM repository—a rigorously curated dataset of bio-based monomers—with Aropha’s proprietary library of monomer units commonly employed in both bio-based and synthetic polymer synthesis. This consolidated and standardized resource provides a robust foundation for in-silico polymer design to facilitate the systematic exploration and optimization of polymer architectures with respect to biodegradability and other critical performance metrics. Through this initiative, Aropha is aiming to provide a new paradigm in data-driven polymer informatics to accelerate the development of sustainable materials and driving innovation at the intersection of computational chemistry, materials science, and environmental sustainability.

For a curated list of bio-based and synthetic monomers, please explore the Monomers page.

PolyKnit Package for In-Silico Polymer Structure Synthesis

Our proprietary PolyKnit package is used to create in-silico structures of polymers for a deep computational analysis. This package requires SMILES strings with wildcard asterisks to represent the reactive sites on the monomer building blocks.

If you have the BigSMILES of your polymer structure, you can convert it into SMILES with wildcards using this simple script below.

# Read BigSMILES from a text file
txtBigSMILES = '.../common_monomers.txt'

with open(txtBigSMILES, 'r') as f:
    content = f.read()

lines = content.split('\n')

# Convert BigSMILES into a dictionary mapping cleaned monomer names to SMILES strings with wildcards
bigsmiles_dict = {}
for i in range(3, len(lines)):
    monomer_name = lines[i]
    if monomer_name and monomer_name.startswith('[#'):
        # Remove the '[#' prefix and trailing ']' from the monomer name
        key = monomer_name.replace('[#', '').replace(']', '')
        # Assume the corresponding SMILES string is on the next line; remove spaces and encapsulate with wildcard asterisks
        smiles_str = lines[i + 1].replace(' ', '')
        bigsmiles_dict[key] = f'*{smiles_str}*'

# The dictionary 'bigsmiles_dict' now holds the mapping from monomer names to formatted SMILES strings.

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
.github/workflows		.github/workflows
HELMCoreLibrary		HELMCoreLibrary
Ionis library		Ionis library
docs		docs
monomerLib2.0 library		monomerLib2.0 library
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Aropha: Leading the Frontier of Polymer Informatics

PolyKnit Package for In-Silico Polymer Structure Synthesis

About

Uh oh!

Releases

Packages

License

Aropha/Bio-based-Monomers

Folders and files

Latest commit

History

Repository files navigation

Aropha: Leading the Frontier of Polymer Informatics

PolyKnit Package for In-Silico Polymer Structure Synthesis

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages