candidate c:
epitope_identifier: [E_i]
origin_program: [Program 2A (assembly), Program 2B (generative)]
variable_heavy_chain_sequence, variable_light_chain_sequence
three_dimensional_complex_model
paratope_protection_mask
scores:
binding: {per_tool_scores, ensemble_score}
developability: {therapeutic_antibody_profiler, aggrescan3d, camsol, stability_proxy}
safety: {cross_reactivity_docking, motif_mimicry}
risk_flags: [String]
probability_of_success: P(success | evidence)
| Tool or step | Input | Output | Parameters and thresholds |
|---|---|---|---|
| BepiPred Epitope identification (sequence-based) | Protein sequence | Per-residue epitope probability score and predicted linear epitope segments |
|
| Ellipro Epitope identification (structure-based clustering) | Protein three-dimensional (3D) structure | Clusters of residues forming linear or conformational epitopes |
|
| Discotope Epitope identification (structure-based scoring) | Protein three-dimensional (3D) structure | Per-residue epitope probability score |
|
| Accessibility and masking analysis (for example solvent accessible surface area (SASA) and glycan masking rules) Contextual epitope filter | Epitope patches on one or more conformational states; optional glycan model and complex partners | Accessibility score per epitope; masking flags (glycan shielding, steric occlusion); exposure frequency across states |
|
| Conservation and escape-risk screen (multiple sequence alignment) Contextual epitope filter | Target sequence set (homologs, variants, or strains) and epitope residue indices | Conservation score per epitope; predicted escape-risk annotation |
|
| Functional site annotation overlay Contextual epitope filter | Known or predicted functional residues (active sites, receptor-binding interfaces, allosteric sites) | Functional relevance score per epitope and rationale notes |
|
| Off-target surface-patch similarity screen (optional) Early cross-reactivity risk screen | Epitope patch descriptors and an off-target surface library | Similarity hits and a pre-design cross-reactivity risk flag |
|
| Molecular dynamics simulation (for example GROMACS, AMBER) Dynamic interrogation (expensive; shortlist only) | Static structure and a shortlist of epitopes or regions | Trajectory; residue flexibility metrics (root mean square fluctuation (RMSF)); dominant conformational states |
|
| Evidence integration and epitope gate Decision step | All epitope scores and annotations (static, contextual, dynamic, functional) | Ranked epitope list; explicit pass or fail decision with reason codes |
|
| Tool or step | Input | Output | Parameters and thresholds |
|---|---|---|---|
| OptMAVEn-2.0 Paratope design (knowledge-based module assembly) | Antigen three-dimensional (3D) structure with epitope residues and a database of variable heavy chain (VH) and variable light chain (VL) modules | Ranked variable heavy chain and variable light chain designs (sequences and assembled structures) with binding energies or scores |
|
| RosettaAntibodyDesign (RAbD) Paratope design (complementarity determining region redesign) | Antigen structure with target epitope region and an antibody framework for complementarity determining region grafting | Ranked designs as Protein Data Bank format (PDB) structures with new complementarity determining region sequences and structures |
|
| De novo generative design (diffusion model or graph neural network (GNN)) Paratope design (generative) | Epitope three-dimensional coordinates and a fixed antibody framework | Large sets of de novo complementarity determining region loop structures and sequences |
|
| Diversity management and clustering Down-selection step (cheap first) | Design set (sequences and structures) from one or more programs | Cluster representatives covering distinct sequences and binding modes; redundancy-reduced set |
|
| Cheap pre-filters (geometry and sequence sanity checks) Down-selection step (cheap first) | Candidate antibody models and frameworks | Filtered set removing obvious steric clashes and out-of-distribution sequences |
|
| Complex modeling and interface scoring Binding evidence production | Candidate antibodies, epitope structures, and optional conformational ensembles | Antibody–antigen complex poses; binding scores; interface plausibility metrics |
|
| AbDesign Constrained refinement (expensive; shortlist only) | Antigen structure and antibody backbone fragments or ensembles | Refined antibody models with optimized interface and stability metrics |
|
| Humanization and immunogenicity screen (optional) Risk-reduction gate | Candidate sequences and germline reference sets; major histocompatibility complex (MHC) class II peptide risk models | Humanization recommendations; immunogenicity risk flags |
|
| Paratope definition artifact (protection mask) Interface bookkeeping | Complex model(s) for each candidate | List of interface residues to protect during Phase 3 optimization (for example during CamSol design mode) |
|
| Paratope gate Decision step | Binding score evidence, diversity statistics, and immunogenicity flags | Pass or fail decision with reason codes and loops back to design if needed |
|
| Tool or step | Input | Output | Parameters and thresholds |
|---|---|---|---|
| Therapeutic Antibody Profiler (TAP) Developability (sequence liabilities) | Variable heavy chain (VH) and variable light chain (VL) sequences | Pass, flag, or fail status against core sequence metrics with residue drivers |
|
| Chemical liability scan Developability (sequence liabilities) | Candidate sequences | Liability sites (for example deamidation, isomerization, oxidation) and suggested mitigations |
|
| Glycosylation motif risk scan Developability (sequence liabilities) | Candidate sequences | Potential glycosylation motifs in variable regions and risk flags |
|
| Self-interaction, polyspecificity, and viscosity risk proxy Developability (formulation and in vivo risk proxies) | Candidate sequences and or structures | Predicted self-interaction or nonspecific binding risk, plus viscosity risk proxies, with flags and drivers |
|
| Aggrescan3D Developability (aggregation risk in three-dimensional context) | Candidate structure in Protein Data Bank format (PDB); optional chain or region selection | Aggregation-prone surface patches and suggested point mutations |
|
| CamSol Developability (solubility prediction and optimization) | Candidate sequences and an optional paratope protection mask from Phase 2 | Solubility scores and ranked mutation suggestions or variant libraries |
|
| Stability proxy (for example thermal stability prediction) Developability (stability) | Candidate sequences and or structures | Stability score proxy and risk flags |
|
| AntiTarget-Dock Safety (cross-reactivity docking screen) | Candidate antibody models and a structural library of high-risk proteins | Predicted off-target binding energies and a docking-based risk score |
|
| Proto-Blast-Search (Basic Local Alignment Search Tool for proteins (BLASTp)) Safety (motif mimicry screen) | Candidate sequences and a reference proteome database | Motif similarity matches and a motif-based risk score contribution |
|
| Risk aggregation and developability gate Decision step | Outputs of all developability and safety screens | Pass or fail decision; prioritized liability list; triggers a fix-and-re-score loop on failure |
|
| Tool or step | Input | Output | Parameters and thresholds |
|---|---|---|---|
| Candidate dossier synthesis Evidence packaging | All candidate artifacts (epitope rationale, complex models, scores, flags) | Per-candidate dossier suitable for review and decision making |
|
| Probability of success model Calibration and prediction | In silico features and prior wet lab outcomes (if any) | Calibrated probability estimates per candidate |
|
| Multi-objective ranking (Pareto selection) Decision support | Probability estimates plus developability and safety evidence | Shortlist based on trade-offs (Pareto-efficient set) and constraints |
|
| Wet lab selection policy with expected information gain Experiment design | Ranked shortlist plus uncertainty estimates | Final candidate set to test: exploit for success and explore for learning |
|
| Measurement model and data quality plan Reliability plan | Assay definitions and operational constraints | Replicate plan; controls; batch randomization; data normalization strategy |
|
| Feedback and updating Learning loop | Wet lab outcomes and metadata | Updated calibration, tool weights, and design priors; potential updates to Phase 0 constraints |
|