Opened 4 years ago
Closed 4 years ago
#4981 closed defect (fixed)
Bad AlphaFold model trimming
| Reported by: | Tristan Croll | Owned by: | Tom Goddard |
|---|---|---|---|
| Priority: | normal | Milestone: | |
| Component: | Input/Output | Version: | |
| Keywords: | Cc: | ||
| Blocked By: | Blocking: | ||
| Notify when closed: | Platform: | all | |
| Project: | ChimeraX |
Description
The following bug report has been submitted:
Platform: Linux-3.10.0-1160.25.1.el7.x86_64-x86_64-with-glibc2.17
ChimeraX Version: 1.3.dev202107240347 (2021-07-24 03:47:26 UTC)
Description
Something going wrong with the model trimming when fetching the AlphaFold prediction for 7ck3 - the trimmed model contains almost nothing (quite a bit of it wrong), while the untrimmed version shows most of the structure is correctly predicted with high confidence.
Log:
> alias preview_toolshed toolshed url https://cxtoolshed-
> preview.rbvi.ucsf.edu; toolshed reload available
> alias production_toolshed toolshed url https://cxtoolshed.rbvi.ucsf.edu;
> toolshed reload available
> isolde shorthand
Initialising ISOLDE-specific command aliases:
Alias Equivalent full command
-------------------------------------------------
st isolde step {arguments}
aw isolde add water {arguments}
awsf isolde add water {arguments} sim false
al isolde add ligand {arguments}
aa isolde add aa $1 sel {arguments}
ht isolde mod his sel {arguments}
so setattr sel atoms occupancy {arguments}
ab isolde adjust bfactors {arguments}
ss isolde sim start sel
rt isolde release torsions sel {arguments}
rd isolde release distances sel {arguments}
pf isolde pepflip sel
cf isolde cisflip sel
cbb color bfactor {arguments}
cbo color byattr occupancy {arguments}
cbc color bychain; color byhet
cs clipper set contourSensitivity {arguments}
UCSF ChimeraX version: 1.3.dev202107240347 (2021-07-24)
© 2016-2021 Regents of the University of California. All rights reserved.
How to cite UCSF ChimeraX
> open 7o4f
7o4f title:
The DYW domain of A. thaliana OTP86 in its active state [more info...]
Chain information for 7o4f #1
---
Chain | Description | UniProt
A B D G | Pentatricopeptide repeat-containing protein At3g63370, chloroplastic
| PP296_ARATH
Non-standard residues in 7o4f #1
---
ZN — zinc ion
7o4f mmCIF Assemblies
---
1| author_defined_assembly
2| author_defined_assembly
> open #1 fromDatabase AlphaFold
Summary of feedback from opening #1 fetched from AlphaFold
---
notes | Fetching compressed AlphaFold Q9M1V3 from
https://alphafold.ebi.ac.uk/files/AF-Q9M1V3-F1-model_v1.cif
7522 atoms, 960 residues, atom bfactor range 24.7 to 98.2
Matchmaker 7o4f, chain A (#1) with AlphaFold Q9M1V3, chain A (#), sequence
alignment score = 654.4
RMSD between 102 pruned atom pairs is 0.619 angstroms; (across all 135 pairs:
4.767)
7522 atoms, 960 residues, atom bfactor range 24.7 to 98.2
Matchmaker 7o4f, chain B (#1) with AlphaFold Q9M1V3, chain B (#), sequence
alignment score = 646.3
RMSD between 103 pruned atom pairs is 0.590 angstroms; (across all 135 pairs:
4.891)
7522 atoms, 960 residues, atom bfactor range 24.7 to 98.2
Matchmaker 7o4f, chain D (#1) with AlphaFold Q9M1V3, chain D (#), sequence
alignment score = 654.4
RMSD between 103 pruned atom pairs is 0.611 angstroms; (across all 135 pairs:
4.759)
7522 atoms, 960 residues, atom bfactor range 24.7 to 98.2
Matchmaker 7o4f, chain G (#1) with AlphaFold Q9M1V3, chain G (#), sequence
alignment score = 647.2
RMSD between 103 pruned atom pairs is 0.592 angstroms; (across all 135 pairs:
4.901)
Chain information for UniProt Q9M1V3 chain A #2.1
---
Chain | Description | UniProt
A | Pentatricopeptide repeat-containing protein At3g63370, chloroplastic |
PP296_ARATH
Chain information for UniProt Q9M1V3 chain B #2.2
---
Chain | Description
B | Pentatricopeptide repeat-containing protein At3g63370, chloroplastic
Chain information for UniProt Q9M1V3 chain D #2.3
---
Chain | Description
D | Pentatricopeptide repeat-containing protein At3g63370, chloroplastic
Chain information for UniProt Q9M1V3 chain G #2.4
---
Chain | Description
G | Pentatricopeptide repeat-containing protein At3g63370, chloroplastic
Opened 4 AlphaFold chain models
> ui tool show Shell
> save 7o4f.pdb #1
Fetching compressed Alphafold Q9M1V3 PAE from
https://alphafold.ebi.ac.uk/files/AF-Q9M1V3-F1-predicted_aligned_error_v1.json
> color byattribute r:isolde_domain #2.4 target ra palette rainbow
[Repeated 1 time(s)]1101 atoms, 135 residues, atom isolde_domain range -1 to 4
> hide #2.1 models
> hide #2.2 models
> hide #2.3 models
> color byattribute r:isolde_domain #2.4 target ra palette rainbow
1101 atoms, 135 residues, atom isolde_domain range -1 to 13
> color byattribute r:isolde_domain #2.4 target ra palette rainbow
1101 atoms, 135 residues, atom isolde_domain range -1 to 23
> color byattribute r:isolde_domain #2.4 target ra palette rainbow
1101 atoms, 135 residues, atom isolde_domain range -1 to 0
> color byattribute r:isolde_domain #2.4 target ra palette rainbow
1101 atoms, 135 residues, atom isolde_domain range -1 to 87
> close #1-2
> open 7cfd
7cfd title:
Drosophila melanogaster Krimper eTud2-AubR15me2 complex [more info...]
Chain information for 7cfd #1
---
Chain | Description | UniProt
A B C D E F G H | FI20010p1 | A1ZAC4_DROME
I J K L M N O Z | Protein aubergine | AUB_DROME
7cfd mmCIF Assemblies
---
1| author_defined_assembly
2| author_defined_assembly
3| author_defined_assembly
4| author_defined_assembly
5| author_defined_assembly
6| author_defined_assembly
7| author_defined_assembly
8| author_defined_assembly
> cartoon
> hide
> open #1 fromDatabase AlphaFold
Summary of feedback from opening #1 fetched from AlphaFold
---
error | Fewer than 3 residues aligned; cannot match 7cfd, chain O with
AlphaFold O76922, chain O
notes | 5902 atoms, 746 residues, atom bfactor range 21.6 to 96.8
Matchmaker 7cfd, chain B (#1) with AlphaFold A1ZAC4, chain B (#), sequence
alignment score = 929
RMSD between 125 pruned atom pairs is 0.984 angstroms; (across all 178 pairs:
2.118)
Fetching compressed AlphaFold O76922 from
https://alphafold.ebi.ac.uk/files/AF-O76922-F1-model_v1.cif
6914 atoms, 866 residues, atom bfactor range 20.2 to 98.8
Matchmaker 7cfd, chain I (#1) with AlphaFold O76922, chain I (#), sequence
alignment score = 48.2
RMSD between 4 pruned atom pairs is 0.867 angstroms; (across all 4 pairs:
0.867)
5902 atoms, 746 residues, atom bfactor range 21.6 to 96.8
Matchmaker 7cfd, chain C (#1) with AlphaFold A1ZAC4, chain C (#), sequence
alignment score = 935
RMSD between 141 pruned atom pairs is 1.018 angstroms; (across all 179 pairs:
1.674)
6914 atoms, 866 residues, atom bfactor range 20.2 to 98.8
Matchmaker 7cfd, chain J (#1) with AlphaFold O76922, chain J (#), sequence
alignment score = 51.8
RMSD between 6 pruned atom pairs is 1.261 angstroms; (across all 7 pairs:
1.648)
6914 atoms, 866 residues, atom bfactor range 20.2 to 98.8
Matchmaker 7cfd, chain K (#1) with AlphaFold O76922, chain K (#), sequence
alignment score = 53
RMSD between 8 pruned atom pairs is 1.551 angstroms; (across all 8 pairs:
1.551)
5902 atoms, 746 residues, atom bfactor range 21.6 to 96.8
Matchmaker 7cfd, chain D (#1) with AlphaFold A1ZAC4, chain D (#), sequence
alignment score = 925.4
RMSD between 136 pruned atom pairs is 0.934 angstroms; (across all 178 pairs:
1.734)
5902 atoms, 746 residues, atom bfactor range 21.6 to 96.8
Matchmaker 7cfd, chain E (#1) with AlphaFold A1ZAC4, chain E (#), sequence
alignment score = 905.6
RMSD between 155 pruned atom pairs is 1.177 angstroms; (across all 180 pairs:
1.888)
6914 atoms, 866 residues, atom bfactor range 20.2 to 98.8
Matchmaker 7cfd, chain L (#1) with AlphaFold O76922, chain L (#), sequence
alignment score = 48.2
RMSD between 4 pruned atom pairs is 1.100 angstroms; (across all 4 pairs:
1.100)
5902 atoms, 746 residues, atom bfactor range 21.6 to 96.8
Matchmaker 7cfd, chain F (#1) with AlphaFold A1ZAC4, chain F (#), sequence
alignment score = 936.2
RMSD between 143 pruned atom pairs is 1.059 angstroms; (across all 182 pairs:
1.673)
5902 atoms, 746 residues, atom bfactor range 21.6 to 96.8
Matchmaker 7cfd, chain A (#1) with AlphaFold A1ZAC4, chain A (#), sequence
alignment score = 931.4
RMSD between 131 pruned atom pairs is 1.015 angstroms; (across all 176 pairs:
1.780)
5902 atoms, 746 residues, atom bfactor range 21.6 to 96.8
Matchmaker 7cfd, chain G (#1) with AlphaFold A1ZAC4, chain G (#), sequence
alignment score = 933.8
RMSD between 147 pruned atom pairs is 1.263 angstroms; (across all 181 pairs:
1.573)
6914 atoms, 866 residues, atom bfactor range 20.2 to 98.8
Matchmaker 7cfd, chain N (#1) with AlphaFold O76922, chain N (#), sequence
alignment score = 50.6
RMSD between 5 pruned atom pairs is 1.368 angstroms; (across all 6 pairs:
1.962)
5902 atoms, 746 residues, atom bfactor range 21.6 to 96.8
Matchmaker 7cfd, chain H (#1) with AlphaFold A1ZAC4, chain H (#), sequence
alignment score = 870.2
RMSD between 151 pruned atom pairs is 1.283 angstroms; (across all 181 pairs:
1.594)
6914 atoms, 866 residues, atom bfactor range 20.2 to 98.8
Matchmaker 7cfd, chain M (#1) with AlphaFold O76922, chain M (#), sequence
alignment score = 50.6
RMSD between 5 pruned atom pairs is 0.710 angstroms; (across all 6 pairs:
1.685)
6914 atoms, 866 residues, atom bfactor range 20.2 to 98.8
Matchmaker 7cfd, chain Z (#1) with AlphaFold O76922, chain Z (#), sequence
alignment score = 47
RMSD between 3 pruned atom pairs is 0.729 angstroms; (across all 3 pairs:
0.729)
6914 atoms, 866 residues, atom bfactor range 20.2 to 98.8
Matchmaker 7cfd, chain O (#1) with AlphaFold O76922, chain O (#), sequence
alignment score = 44.6
Chain information for UniProt A1ZAC4 chain B #2.1
---
Chain | Description
B | FI20010p1
Chain information for UniProt O76922 chain I #2.2
---
Chain | Description
I | Protein aubergine
Chain information for UniProt A1ZAC4 chain C #2.3
---
Chain | Description
C | FI20010p1
Chain information for UniProt O76922 chain J #2.4
---
Chain | Description
J | Protein aubergine
Chain information for UniProt O76922 chain K #2.5
---
Chain | Description
K | Protein aubergine
Chain information for UniProt A1ZAC4 chain D #2.6
---
Chain | Description
D | FI20010p1
Chain information for UniProt A1ZAC4 chain E #2.7
---
Chain | Description
E | FI20010p1
Chain information for UniProt O76922 chain L #2.8
---
Chain | Description
L | Protein aubergine
Chain information for UniProt A1ZAC4 chain F #2.9
---
Chain | Description
F | FI20010p1
Chain information for UniProt A1ZAC4 chain A #2.10
---
Chain | Description | UniProt
A | FI20010p1 | A1ZAC4_DROME
Chain information for UniProt A1ZAC4 chain G #2.11
---
Chain | Description
G | FI20010p1
Chain information for UniProt O76922 chain N #2.12
---
Chain | Description
N | Protein aubergine
Chain information for UniProt A1ZAC4 chain H #2.13
---
Chain | Description
H | FI20010p1
Chain information for UniProt O76922 chain M #2.14
---
Chain | Description
M | Protein aubergine
Chain information for UniProt O76922 chain Z #2.15
---
Chain | Description
Z | Protein aubergine
Chain information for UniProt O76922 chain O #2.16
---
Chain | Description
O | Protein aubergine
Opened 16 AlphaFold chain models
> hide #2.16 models
> show #2.16 models
> select #2.1
1521 atoms, 1562 bonds, 185 residues, 1 model selected
Fetching compressed Alphafold A1ZAC4 PAE from
https://alphafold.ebi.ac.uk/files/AF-A1ZAC4-F1-predicted_aligned_error_v1.json
> color byattribute r:isolde_domain #2.1 target ra palette rainbow
1521 atoms, 185 residues, atom isolde_domain range -1 to 110
> color byattribute r:isolde_domain #2.1 target ra palette rainbow
1521 atoms, 185 residues, atom isolde_domain range -1 to 9
> hide ~sel models
> show #!2 models
> select clear
> color byattribute r:isolde_domain #2.1 target ra palette rainbow
1521 atoms, 185 residues, atom isolde_domain range -1 to 16
> color byattribute r:isolde_domain #2.1 target ra palette rainbow
1521 atoms, 185 residues, atom isolde_domain range -1 to 17
> save 7cfd.pdb #1
> save AF_A1ZAC4_cluster_0.pdb #2.1 selectedOnly true
> save AF_A1ZAC4_cluster_1.pdb #2.1 selectedOnly true
> select clear
> open 7cfb
7cfb title:
Drosophila melanogaster Krimper eTud1 apo structure [more info...]
Chain information for 7cfb #3
---
Chain | Description | UniProt
A B | FI20010p1 | A1ZAC4_DROME
Non-standard residues in 7cfb #3
---
SO4 — sulfate ion
7cfb mmCIF Assemblies
---
1| author_and_software_defined_assembly
2| author_and_software_defined_assembly
> matchmaker #3 to #1
Parameters
---
Chain pairing | bb
Alignment algorithm | Needleman-Wunsch
Similarity matrix | BLOSUM-62
SS fraction | 0.3
Gap open (HH/SS/other) | 18/18/6
Gap extend | 1
SS matrix | | | H | S | O
---|---|---|---
H | 6 | -9 | -6
S | | 6 | -6
O | | | 4
Iteration cutoff | 2
Matchmaker 7cfd, chain H (#1) with 7cfb, chain B (#3), sequence alignment
score = 114.8
RMSD between 83 pruned atom pairs is 1.228 angstroms; (across all 143 pairs:
5.190)
> matchmaker #3 to #2.1
Parameters
---
Chain pairing | bb
Alignment algorithm | Needleman-Wunsch
Similarity matrix | BLOSUM-62
SS fraction | 0.3
Gap open (HH/SS/other) | 18/18/6
Gap extend | 1
SS matrix | | | H | S | O
---|---|---|---
H | 6 | -9 | -6
S | | 6 | -6
O | | | 4
Iteration cutoff | 2
Matchmaker UniProt A1ZAC4 chain B, chain B (#2.1) with 7cfb, chain B (#3),
sequence alignment score = 99.8
RMSD between 76 pruned atom pairs is 1.068 angstroms; (across all 143 pairs:
5.602)
> save 7cfb.pdb #3
> close #1-2#3
> open 7ciw
7ciw title:
Crystal Structure of Agmatine N-Acetyltransferase mutant S171A in complex with
Ac-Agm and CoA [more info...]
Chain information for 7ciw #1
---
Chain | Description | UniProt
A | Agmatine N-acetyltransferase | Q9W469_DROME
Non-standard residues in 7ciw #1
---
COA — coenzyme A
G0R — N-(4-carbamimidamidobutyl)ethanamide
> open #1 fromDatabase AlphaFold
Summary of feedback from opening #1 fetched from AlphaFold
---
notes | Fetching compressed AlphaFold Q9W469 from
https://alphafold.ebi.ac.uk/files/AF-Q9W469-F1-model_v1.cif
1673 atoms, 216 residues, atom bfactor range 40.4 to 98.9
Matchmaker 7ciw, chain A (#1) with AlphaFold Q9W469, chain A (#), sequence
alignment score = 1100.6
RMSD between 208 pruned atom pairs is 0.547 angstroms; (across all 216 pairs:
3.620)
Chain information for UniProt Q9W469 chain A
---
Chain | Description | UniProt
2.1/A | Agmatine N-acetyltransferase | Q9W469_DROME
Opened 1 AlphaFold chain models
> close #1#2
> open 7ck3
7ck3 title:
Crystal structure of Arabidopsis CESA3 catalytic domain [more info...]
Chain information for 7ck3 #1
---
Chain | Description | UniProt
A B | Cellulose synthase A catalytic subunit 3 [UDP-forming],Cellulose
synthase A catalytic subunit 3 [UDP-forming] | CESA3_ARATH
7ck3 mmCIF Assemblies
---
1| author_defined_assembly
2| author_defined_assembly
> open #1 fromDatabase AlphaFold
Summary of feedback from opening #1 fetched from AlphaFold
---
notes | 8430 atoms, 1065 residues, atom bfactor range 23 to 97.8
Matchmaker 7ck3, chain A (#1) with AlphaFold Q941L0, chain A (#), sequence
alignment score = 1592
RMSD between 50 pruned atom pairs is 0.916 angstroms; (across all 98 pairs:
9.484)
8430 atoms, 1065 residues, atom bfactor range 23 to 97.8
Matchmaker 7ck3, chain B (#1) with AlphaFold Q941L0, chain B (#), sequence
alignment score = 1598.3
RMSD between 51 pruned atom pairs is 0.926 angstroms; (across all 98 pairs:
9.558)
Chain information for UniProt Q941L0 chain A #2.1
---
Chain | Description | UniProt
A | Cellulose synthase A catalytic subunit 3 [UDP-forming] | CESA3_ARATH
Chain information for UniProt Q941L0 chain B #2.2
---
Chain | Description
B | Cellulose synthase A catalytic subunit 3 [UDP-forming]
Opened 2 AlphaFold chain models
> open #1 fromDatabase AlphaFold trim false
Summary of feedback from opening #1 fetched from AlphaFold
---
notes | 8430 atoms, 1065 residues, atom bfactor range 23 to 97.8
Matchmaker 7ck3, chain A (#1) with AlphaFold Q941L0, chain A (#), sequence
alignment score = 1890.2
RMSD between 268 pruned atom pairs is 0.836 angstroms; (across all 369 pairs:
5.904)
8430 atoms, 1065 residues, atom bfactor range 23 to 97.8
Matchmaker 7ck3, chain B (#1) with AlphaFold Q941L0, chain B (#), sequence
alignment score = 1903.7
RMSD between 270 pruned atom pairs is 0.856 angstroms; (across all 369 pairs:
5.907)
Chain information for UniProt Q941L0 chain A #3.1
---
Chain | Description | UniProt
A | Cellulose synthase A catalytic subunit 3 [UDP-forming] | CESA3_ARATH
Chain information for UniProt Q941L0 chain B #3.2
---
Chain | Description
B | Cellulose synthase A catalytic subunit 3 [UDP-forming]
Opened 2 AlphaFold chain models
> hide #!2 models
> hide #!3 models
> show #!2 models
> show #2
> hide #2
> show #!3 models
> color bfactor #3
16860 atoms, 2130 residues, atom bfactor range 23 to 97.8
> select #3&@@bfactor>50
13038 atoms, 13380 bonds, 1632 residues, 2 models selected
> select #3&@@bfactor>70
12158 atoms, 12492 bonds, 1518 residues, 2 models selected
> select clear
> hide #!3 models
> show #!3 models
> hide #!2 models
OpenGL version: 3.3.0 NVIDIA 465.19.01
OpenGL renderer: NVIDIA TITAN Xp/PCIe/SSE2
OpenGL vendor: NVIDIA Corporation
Manufacturer: Dell Inc.
Model: Precision T5600
OS: CentOS Linux 7 Core
Architecture: 64bit ELF
Virutal Machine: none
CPU: 32 Intel(R) Xeon(R) CPU E5-2687W 0 @ 3.10GHz
Cache Size: 20480 KB
Memory:
total used free shared buff/cache available
Mem: 62G 13G 35G 306M 13G 48G
Swap: 4.9G 0B 4.9G
Graphics:
03:00.0 VGA compatible controller [0300]: NVIDIA Corporation GP102 [TITAN Xp] [10de:1b02] (rev a1)
Subsystem: NVIDIA Corporation Device [10de:11df]
Kernel driver in use: nvidia
Locale: ('en_GB', 'UTF-8')
PyQt5 5.15.2, Qt 5.15.2
Installed Packages:
alabaster: 0.7.12
appdirs: 1.4.4
Babel: 2.9.1
backcall: 0.2.0
blockdiag: 2.0.1
certifi: 2021.5.30
cftime: 1.5.0
chardet: 4.0.0
ChimeraX-AddCharge: 1.1.4
ChimeraX-AddH: 2.1.10
ChimeraX-AlignmentAlgorithms: 2.0
ChimeraX-AlignmentHdrs: 3.2
ChimeraX-AlignmentMatrices: 2.0
ChimeraX-Alignments: 2.1
ChimeraX-AlphaFold: 1.0
ChimeraX-AltlocExplorer: 1.0
ChimeraX-AmberInfo: 1.0
ChimeraX-Arrays: 1.0
ChimeraX-Atomic: 1.27.1
ChimeraX-AtomicLibrary: 4.0
ChimeraX-AtomSearch: 2.0
ChimeraX-AtomSearchLibrary: 1.0
ChimeraX-AxesPlanes: 2.0
ChimeraX-BasicActions: 1.1
ChimeraX-BILD: 1.0
ChimeraX-BlastProtein: 1.1.1
ChimeraX-BondRot: 2.0
ChimeraX-BugReporter: 1.0
ChimeraX-BuildStructure: 2.5.2
ChimeraX-Bumps: 1.0
ChimeraX-BundleBuilder: 1.1
ChimeraX-ButtonPanel: 1.0
ChimeraX-CageBuilder: 1.0
ChimeraX-CellPack: 1.0
ChimeraX-Centroids: 1.1
ChimeraX-ChemGroup: 2.0
ChimeraX-Clashes: 2.1
ChimeraX-Clipper: 0.17.0
ChimeraX-ColorActions: 1.0
ChimeraX-ColorGlobe: 1.0
ChimeraX-ColorKey: 1.3.2
ChimeraX-CommandLine: 1.1.4
ChimeraX-ConnectStructure: 2.0
ChimeraX-Contacts: 1.0
ChimeraX-Core: 1.3.dev202107240347
ChimeraX-CoreFormats: 1.0
ChimeraX-coulombic: 1.3
ChimeraX-Crosslinks: 1.0
ChimeraX-Crystal: 1.0
ChimeraX-CrystalContacts: 1.0
ChimeraX-DataFormats: 1.2
ChimeraX-Dicom: 1.0
ChimeraX-DistMonitor: 1.1.4
ChimeraX-DistUI: 1.0
ChimeraX-Dssp: 2.0
ChimeraX-EMDB-SFF: 1.0
ChimeraX-ExperimentalCommands: 1.0
ChimeraX-FileHistory: 1.0
ChimeraX-FunctionKey: 1.0
ChimeraX-Geometry: 1.1
ChimeraX-gltf: 1.0
ChimeraX-Graphics: 1.1
ChimeraX-Hbonds: 2.1
ChimeraX-Help: 1.1
ChimeraX-HKCage: 1.3
ChimeraX-IHM: 1.1
ChimeraX-ImageFormats: 1.1
ChimeraX-IMOD: 1.0
ChimeraX-IO: 1.0.1
ChimeraX-ISOLDE: 1.3.dev32
ChimeraX-ItemsInspection: 1.0
ChimeraX-Label: 1.1
ChimeraX-LinuxSupport: 1.0
ChimeraX-ListInfo: 1.1.1
ChimeraX-Log: 1.1.4
ChimeraX-LookingGlass: 1.1
ChimeraX-Maestro: 1.8.1
ChimeraX-Map: 1.1
ChimeraX-MapData: 2.0
ChimeraX-MapEraser: 1.0
ChimeraX-MapFilter: 2.0
ChimeraX-MapFit: 2.0
ChimeraX-MapSeries: 2.1
ChimeraX-Markers: 1.0
ChimeraX-Mask: 1.0
ChimeraX-MatchMaker: 1.2.1
ChimeraX-MDcrds: 2.4
ChimeraX-MedicalToolbar: 1.0.1
ChimeraX-Meeting: 1.0
ChimeraX-MLP: 1.1
ChimeraX-mmCIF: 2.3
ChimeraX-MMTF: 2.1
ChimeraX-Modeller: 1.0.2
ChimeraX-ModelPanel: 1.1
ChimeraX-ModelSeries: 1.0
ChimeraX-Mol2: 2.0
ChimeraX-Morph: 1.0
ChimeraX-MouseModes: 1.1
ChimeraX-Movie: 1.0
ChimeraX-Neuron: 1.0
ChimeraX-Nucleotides: 2.0.2
ChimeraX-OpenCommand: 1.6.2
ChimeraX-PDB: 2.4.4
ChimeraX-PDBBio: 1.0
ChimeraX-PDBLibrary: 1.0.1
ChimeraX-PDBMatrices: 1.0
ChimeraX-Phenix: 0.3
ChimeraX-PickBlobs: 1.0
ChimeraX-Positions: 1.0
ChimeraX-PresetMgr: 1.0.1
ChimeraX-PubChem: 2.1
ChimeraX-ReadPbonds: 1.0
ChimeraX-Registration: 1.1
ChimeraX-RemoteControl: 1.0
ChimeraX-ResidueFit: 1.0
ChimeraX-RestServer: 1.1
ChimeraX-RNALayout: 1.0
ChimeraX-RotamerLibMgr: 2.0
ChimeraX-RotamerLibsDunbrack: 2.0
ChimeraX-RotamerLibsDynameomics: 2.0
ChimeraX-RotamerLibsRichardson: 2.0
ChimeraX-Sample: 0.1
ChimeraX-SaveCommand: 1.4.1
ChimeraX-SchemeMgr: 1.0
ChimeraX-SDF: 2.0
ChimeraX-Segger: 1.0
ChimeraX-Segment: 1.0
ChimeraX-SelInspector: 1.0
ChimeraX-SeqView: 2.4.1
ChimeraX-Shape: 1.0.1
ChimeraX-Shell: 1.0
ChimeraX-Shortcuts: 1.1
ChimeraX-ShowAttr: 1.0
ChimeraX-ShowSequences: 1.0
ChimeraX-SideView: 1.0
ChimeraX-Smiles: 2.1
ChimeraX-SmoothLines: 1.0
ChimeraX-SpaceNavigator: 1.0
ChimeraX-StdCommands: 1.6
ChimeraX-STL: 1.0
ChimeraX-Storm: 1.0
ChimeraX-Struts: 1.0
ChimeraX-Surface: 1.0
ChimeraX-SwapAA: 2.0
ChimeraX-SwapRes: 2.1
ChimeraX-TapeMeasure: 1.0
ChimeraX-Test: 1.0
ChimeraX-Toolbar: 1.1
ChimeraX-ToolshedUtils: 1.2
ChimeraX-Tug: 1.0
ChimeraX-UI: 1.10.1
ChimeraX-uniprot: 2.1
ChimeraX-UnitCell: 1.0
ChimeraX-ViewDockX: 1.0.1
ChimeraX-Vive: 1.1
ChimeraX-VolumeMenu: 1.0
ChimeraX-Voyager: 0.1
ChimeraX-VTK: 1.0
ChimeraX-WavefrontOBJ: 1.0
ChimeraX-WebCam: 1.0
ChimeraX-WebServices: 1.0
ChimeraX-Zone: 1.0
colorama: 0.4.4
comtypes: 1.1.10
cxservices: 1.0
cycler: 0.10.0
Cython: 0.29.23
decorator: 4.4.2
distlib: 0.3.1
distro: 1.5.0
docutils: 0.17.1
filelock: 3.0.12
funcparserlib: 0.3.6
grako: 3.16.5
h5py: 3.3.0
html2text: 2020.1.16
idna: 2.10
ihm: 0.20
imagecodecs: 2021.4.28
imagesize: 1.2.0
ipykernel: 5.5.5
ipython: 7.23.1
ipython-genutils: 0.2.0
jedi: 0.18.0
Jinja2: 2.11.3
jupyter-client: 6.1.12
jupyter-core: 4.7.1
kiwisolver: 1.3.1
line-profiler: 3.3.0
lxml: 4.6.3
lz4: 3.1.3
MarkupSafe: 1.1.1
matplotlib: 3.4.2
matplotlib-inline: 0.1.2
msgpack: 1.0.2
netCDF4: 1.5.6
networkx: 2.6.2
numexpr: 2.7.3
numpy: 1.21.0
numpydoc: 1.1.0
OpenMM: 7.6.0
openvr: 1.16.801
packaging: 21.0
ParmEd: 3.2.0
parso: 0.8.2
pexpect: 4.8.0
pickleshare: 0.7.5
Pillow: 8.2.0
pip: 21.1.1
pkginfo: 1.7.0
prompt-toolkit: 3.0.19
psutil: 5.8.0
ptyprocess: 0.7.0
pycollada: 0.7.1
pydicom: 2.1.2
Pygments: 2.9.0
PyOpenGL: 3.1.5
PyOpenGL-accelerate: 3.1.5
pyparsing: 2.4.7
PyQt5-commercial: 5.15.2
PyQt5-sip: 12.8.1
PyQtWebEngine-commercial: 5.15.2
python-dateutil: 2.8.1
pytz: 2021.1
pyzmq: 22.1.0
qtconsole: 5.1.0
QtPy: 1.9.0
RandomWords: 0.3.0
requests: 2.25.1
scipy: 1.6.3
setuptools: 57.0.0
sfftk-rw: 0.7.0.post1
six: 1.16.0
snowballstemmer: 2.1.0
sortedcontainers: 2.4.0
Sphinx: 4.0.1
sphinxcontrib-applehelp: 1.0.2
sphinxcontrib-blockdiag: 2.0.0
sphinxcontrib-devhelp: 1.0.2
sphinxcontrib-htmlhelp: 2.0.0
sphinxcontrib-jsmath: 1.0.1
sphinxcontrib-qthelp: 1.0.3
sphinxcontrib-serializinghtml: 1.1.5
suds-jurko: 0.6
tables: 3.6.1
tifffile: 2021.4.8
tinyarray: 1.2.3
tornado: 6.1
traitlets: 5.0.5
urllib3: 1.26.6
wcwidth: 0.2.5
webcolors: 1.11.1
wheel: 0.36.2
wheel-filename: 1.3.0
Change History (8)
comment:1 by , 4 years ago
| Component: | Unassigned → Input/Output |
|---|---|
| Owner: | set to |
| Platform: | → all |
| Project: | → ChimeraX |
| Status: | new → assigned |
| Summary: | ChimeraX bug report submission → Bad AlphaFold model trimming |
comment:2 by , 4 years ago
| Resolution: | → fixed |
|---|---|
| Status: | assigned → closed |
comment:3 by , 4 years ago
| Resolution: | fixed |
|---|---|
| Status: | closed → reopened |
Oops! Added the previous comment to the wrong ticket. This problem still is present and likely has to do with a residue numbering issue and I will be looking at it today.
comment:4 by , 4 years ago
PDB entry 7ck3 lists two subsequences of the Uniprot sequence for each chain. I never saw this before, only saw one sequence interval listed. Trimming code will need to be adapted to handle multiple sequence ranges.
From 7ck3.cif:
loop_
_struct_ref_seq.align_id
_struct_ref_seq.ref_id
_struct_ref_seq.pdbx_PDB_id_code
_struct_ref_seq.pdbx_strand_id
_struct_ref_seq.seq_align_beg
_struct_ref_seq.pdbx_seq_align_beg_ins_code
_struct_ref_seq.seq_align_end
_struct_ref_seq.pdbx_seq_align_end_ins_code
_struct_ref_seq.pdbx_db_accession
_struct_ref_seq.db_align_beg
_struct_ref_seq.pdbx_db_align_beg_ins_code
_struct_ref_seq.db_align_end
_struct_ref_seq.pdbx_db_align_end_ins_code
_struct_ref_seq.pdbx_auth_seq_align_beg
_struct_ref_seq.pdbx_auth_seq_align_end
1 1 7CK3 A 1 ? 314 ? Q941L0 317 ? 630 ? 317 630
2 2 7CK3 A 320 ? 429 ? Q941L0 702 ? 811 ? 702 811
3 1 7CK3 B 1 ? 314 ? Q941L0 317 ? 630 ? 317 630
4 2 7CK3 B 320 ? 429 ? Q941L0 702 ? 811 ? 702 811
follow-up: 5 comment:5 by , 4 years ago
Don't want to keep spamming you with new tickets (also buried in preparing for a talk tomorrow), but "open 6w1s; open #1 from alphafold" causes a traceback in my current installation:
Traceback (most recent call last):
File "/usr/libexec/UCSF-ChimeraX-daily/lib/python3.9/site-packages/chimerax/cmd_line/tool.py", line 280, in execute
cmd.run(cmd_text)
File "/usr/libexec/UCSF-ChimeraX-daily/lib/python3.9/site-packages/chimerax/core/commands/cli.py", line 2856, in run
result = ci.function(session, **kw_args)
File "/usr/libexec/UCSF-ChimeraX-daily/lib/python3.9/site-packages/chimerax/open_command/cmd.py", line 118, in cmd_open
models = Command(session, registry=registry).run(provider_cmd_text, log=log)[0]
File "/usr/libexec/UCSF-ChimeraX-daily/lib/python3.9/site-packages/chimerax/core/commands/cli.py", line 2856, in run
result = ci.function(session, **kw_args)
File "/usr/libexec/UCSF-ChimeraX-daily/lib/python3.9/site-packages/chimerax/open_command/cmd.py", line 150, in provider_open
models, status = collated_open(session, database_name, ident,
File "/usr/libexec/UCSF-ChimeraX-daily/lib/python3.9/site-packages/chimerax/open_command/cmd.py", line 433, in collated_open
return remember_data_format()
File "/usr/libexec/UCSF-ChimeraX-daily/lib/python3.9/site-packages/chimerax/open_command/cmd.py", line 404, in remember_data_format
models, status = func(*func_args, **func_kw)
File "/home/tic20/.local/share/ChimeraX/1.3/site-packages/chimerax/alphafold/__init__.py", line 26, in fetch
return fetcher(session, ident, ignore_cache=ignore_cache, **kw)
File "/home/tic20/.local/share/ChimeraX/1.3/site-packages/chimerax/alphafold/fetch_alphafold.py", line 27, in fetch_alphafold
return fetch_alphafold_for_chains(session, chains,
File "/home/tic20/.local/share/ChimeraX/1.3/site-packages/chimerax/alphafold/fetch_alphafold.py", line 86, in fetch_alphafold_for_chains
_rename_chains(alphafold_model, chain)
File "/home/tic20/.local/share/ChimeraX/1.3/site-packages/chimerax/alphafold/fetch_alphafold.py", line 161, in _rename_chains
schains = structure.chains
File "/usr/libexec/UCSF-ChimeraX-daily/lib/python3.9/site-packages/chimerax/core/attributes.py", line 61, in _getattr_
return base.__getattr__(self, attr_name, look_in_class=base)
File "/usr/libexec/UCSF-ChimeraX-daily/lib/python3.9/site-packages/chimerax/core/attributes.py", line 57, in _getattr_
return look_in_class._attr_registration.get_attr(attr_name)
File "/usr/libexec/UCSF-ChimeraX-daily/lib/python3.9/site-packages/chimerax/core/attributes.py", line 83, in get_attr
raise AttributeError("Execution of '%s' object's '%s' property raised AttributeError" % (self.class_.__name__, attr_name)) from None
AttributeError: Execution of 'Structure' object's 'chains' property raised AttributeError
AttributeError: Execution of 'Structure' object's 'chains' property raised AttributeError
File "/usr/libexec/UCSF-ChimeraX-daily/lib/python3.9/site-packages/chimerax/core/attributes.py", line 83, in get_attr
raise AttributeError("Execution of '%s' object's '%s' property raised AttributeError" % (self.class_.__name__, attr_name)) from None
________________________________
From: ChimeraX <ChimeraX-bugs-admin@cgl.ucsf.edu>
Sent: 02 August 2021 19:44
Cc: goddard@cgl.ucsf.edu <goddard@cgl.ucsf.edu>; Tristan Croll <tic20@cam.ac.uk>
Subject: Re: [ChimeraX] #4981: Bad AlphaFold model trimming
#4981: Bad AlphaFold model trimming
------------------------------------+-------------------------
Reporter: Tristan Croll | Owner: Tom Goddard
Type: defect | Status: reopened
Priority: normal | Milestone:
Component: Input/Output | Version:
Resolution: | Keywords:
Blocked By: | Blocking:
Notify when closed: | Platform: all
Project: ChimeraX |
------------------------------------+-------------------------
Comment (by Tom Goddard):
PDB entry 7ck3 lists two subsequences of the Uniprot sequence for each
chain. I never saw this before, only saw one sequence interval listed.
Trimming code will need to be adapted to handle multiple sequence ranges.
From 7ck3.cif:
loop_
_struct_ref_seq.align_id
_struct_ref_seq.ref_id
_struct_ref_seq.pdbx_PDB_id_code
_struct_ref_seq.pdbx_strand_id
_struct_ref_seq.seq_align_beg
_struct_ref_seq.pdbx_seq_align_beg_ins_code
_struct_ref_seq.seq_align_end
_struct_ref_seq.pdbx_seq_align_end_ins_code
_struct_ref_seq.pdbx_db_accession
_struct_ref_seq.db_align_beg
_struct_ref_seq.pdbx_db_align_beg_ins_code
_struct_ref_seq.db_align_end
_struct_ref_seq.pdbx_db_align_end_ins_code
_struct_ref_seq.pdbx_auth_seq_align_beg
_struct_ref_seq.pdbx_auth_seq_align_end
1 1 7CK3 A 1 ? 314 ? Q941L0 317 ? 630 ? 317 630
2 2 7CK3 A 320 ? 429 ? Q941L0 702 ? 811 ? 702 811
3 1 7CK3 B 1 ? 314 ? Q941L0 317 ? 630 ? 317 630
4 2 7CK3 B 320 ? 429 ? Q941L0 702 ? 811 ? 702 811
--
Ticket URL: <https://www.rbvi.ucsf.edu/trac/ChimeraX/ticket/4981#comment:4>
ChimeraX <https://www.rbvi.ucsf.edu/chimerax/>
ChimeraX Issue Tracker
follow-up: 6 comment:6 by , 4 years ago
Thanks! Keep sending the bugs. You help me make this work well. I prefer they are in separate tickets but don’t worry about this one - I will make the new ticket for it.
comment:7 by , 4 years ago
I made separate ticket #4994 for the traceback in comment 5. I was not able to reproduce that error where somehow structure.chains fails.
comment:8 by , 4 years ago
| Resolution: | → fixed |
|---|---|
| Status: | reopened → closed |
Fixed.
PDB 7ck3 chain A has two separate segments with uniprot ids in the mmCIF file. The fetching code was only handling the one uniprot segment per chain. In this file there are two segments specified in the mmCIF struct_ref_seq table, both with the same uniprot id. I guess this is to handle a deleted interval in the sample sequence. But other structures likely have chimeric sequences where two segments have different uniprot ids. The new code will load each segment for a chain as a separate AlphaFold model. So 7ck3 now loads two AlphaFold models for chain A. The fetch "search true" option gets just one longer segment because the sequence search only produces one longer segment.
This was fixed last Tuesday (July 27). Fix is is current daily builds. I am improving alphafold fetch every day, so best to use the most recent version.