ESMC

ESMC is the latest in the ESM family of protein language models, establishing a new frontier in representation learning for protein biology. Trained on billions of evolutionary sequences, it learns representations that reflect a mechanistic reduction of protein structure and function.

Protein Language Model
Version 2026-04

Get Started

Quickstart Guide

1

Install the esm Python package

Python
pip install esm@git+https://github.com/Biohub/esm.git@main
2

Create an API key

3

Connect to the Biohub Platform API

Python
from esm.sdk.forge import ESMCForgeInferenceClient

client = ESMCForgeInferenceClient(model="esmc-6b-2024-12", url="https://biohub.ai", token="<your API token>")
4

Run your inference

Model Details

Model Card

Open in Hugging Face

Version

2026-04

Architecture

Transformer

Supported Modalities

Sequence

Training Data

Up to 6 billion proteins

Intended Use

ESMC is designed for protein science research including structure prediction, function annotation, protein design, and understanding evolutionary relationships between proteins. It can generate novel proteins given partial sequence, structure, or functional constraints.

Limitations & Risks

Outputs should be validated experimentally. The model may generate proteins that are not synthesizable or functional. Not intended for clinical or therapeutic applications without further validation.

This model is released under the MIT License.

Explore the Model

ESM Atlas Data

ESMC Protein Language Model | Biohub Platform