Opened 3 years ago

Closed 3 years ago

#8038 closed defect (limitation)

Compressed .cif without .gz suffix

Reported by: kristen.browne@… Owned by: Eric Pettersen
Priority: normal Milestone:
Component: Input/Output Version:
Keywords: Cc: chimera-programmers, phil.cruz@…
Blocked By: Blocking:
Notify when closed: Platform: all
Project: ChimeraX

Description

The following bug report has been submitted:
Platform:        Windows-10-10.0.19044
ChimeraX Version: 1.6.dev202210260016 (2022-10-26 00:16:05 UTC)
Description
This seems to be impacting over 200 entries from 3DPX, mostly these come from the same user.  Every one is a .cif file similar to the attached, likely all being output by the same software package.  

Log:
UCSF ChimeraX version: 1.6.dev202210260016 (2022-10-26)  
© 2016-2022 Regents of the University of California. All rights reserved.  
How to cite UCSF ChimeraX  

> open C:/Users/brownekm/Downloads/2fae-assembly-1.cif

mmCIF parsing error: unexpected data value near line 1  




OpenGL version: 3.3.0 NVIDIA 466.47
OpenGL renderer: NVIDIA GeForce RTX 2080 Ti/PCIe/SSE2
OpenGL vendor: NVIDIA Corporation

Python: 3.9.11
Locale: en_US.cp1252
Qt version: PyQt6 6.3.1, Qt 6.3.1
Qt runtime version: 6.3.2
Qt platform: windows

Manufacturer: HP
Model: HP Z4 G4 Workstation
OS: Microsoft Windows 10 Enterprise (Build 19044)
Memory: 137,220,087,808
MaxProcessMemory: 137,438,953,344
CPU: 20 Intel(R) Core(TM) i9-10900X CPU @ 3.70GHz
OSLanguage: en-US

Installed Packages:
    alabaster: 0.7.12
    appdirs: 1.4.4
    asttokens: 2.0.8
    Babel: 2.10.3
    backcall: 0.2.0
    blockdiag: 3.0.0
    build: 0.8.0
    certifi: 2022.9.24
    cftime: 1.6.2
    charset-normalizer: 2.1.1
    ChimeraX-AddCharge: 1.4
    ChimeraX-AddH: 2.2.1
    ChimeraX-AlignmentAlgorithms: 2.0
    ChimeraX-AlignmentHdrs: 3.3
    ChimeraX-AlignmentMatrices: 2.1
    ChimeraX-Alignments: 2.8
    ChimeraX-AlphaFold: 1.0
    ChimeraX-AltlocExplorer: 1.0.3
    ChimeraX-AmberInfo: 1.0
    ChimeraX-Arrays: 1.0
    ChimeraX-Atomic: 1.43.2
    ChimeraX-AtomicLibrary: 8.0.3
    ChimeraX-AtomSearch: 2.0.1
    ChimeraX-AxesPlanes: 2.3
    ChimeraX-BasicActions: 1.1.2
    ChimeraX-BILD: 1.0
    ChimeraX-BlastProtein: 2.1.2
    ChimeraX-BondRot: 2.0.1
    ChimeraX-BugReporter: 1.0.1
    ChimeraX-BuildStructure: 2.7.1
    ChimeraX-Bumps: 1.0
    ChimeraX-BundleBuilder: 1.2
    ChimeraX-ButtonPanel: 1.0.1
    ChimeraX-CageBuilder: 1.0.1
    ChimeraX-CellPack: 1.0
    ChimeraX-Centroids: 1.3.1
    ChimeraX-ChangeChains: 1.0.2
    ChimeraX-CheckWaters: 1.3
    ChimeraX-ChemGroup: 2.0
    ChimeraX-Clashes: 2.2.4
    ChimeraX-ColorActions: 1.0.3
    ChimeraX-ColorGlobe: 1.0
    ChimeraX-ColorKey: 1.5.2
    ChimeraX-CommandLine: 1.2.5
    ChimeraX-ConnectStructure: 2.0.1
    ChimeraX-Contacts: 1.0.1
    ChimeraX-Core: 1.6.dev202210260016
    ChimeraX-CoreFormats: 1.1
    ChimeraX-coulombic: 1.3.2
    ChimeraX-Crosslinks: 1.0
    ChimeraX-Crystal: 1.0
    ChimeraX-CrystalContacts: 1.0.1
    ChimeraX-DataFormats: 1.2.2
    ChimeraX-Dicom: 1.1
    ChimeraX-DistMonitor: 1.3
    ChimeraX-DockPrep: 1.0
    ChimeraX-Dssp: 2.0
    ChimeraX-EMDB-SFF: 1.0
    ChimeraX-ExperimentalCommands: 1.0
    ChimeraX-FileHistory: 1.0.1
    ChimeraX-FunctionKey: 1.0.1
    ChimeraX-Geometry: 1.2
    ChimeraX-gltf: 1.0
    ChimeraX-Graphics: 1.1
    ChimeraX-Hbonds: 2.4
    ChimeraX-Help: 1.2.1
    ChimeraX-HKCage: 1.3
    ChimeraX-IHM: 1.1
    ChimeraX-ImageFormats: 1.2
    ChimeraX-IMOD: 1.0
    ChimeraX-IO: 1.0.1
    ChimeraX-ItemsInspection: 1.0.1
    ChimeraX-Label: 1.1.7
    ChimeraX-ListInfo: 1.1.1
    ChimeraX-Log: 1.1.5
    ChimeraX-LookingGlass: 1.1
    ChimeraX-Maestro: 1.8.2
    ChimeraX-Map: 1.1.2
    ChimeraX-MapData: 2.0
    ChimeraX-MapEraser: 1.0.1
    ChimeraX-MapFilter: 2.0
    ChimeraX-MapFit: 2.0
    ChimeraX-MapSeries: 2.1.1
    ChimeraX-Markers: 1.0.1
    ChimeraX-Mask: 1.0.1
    ChimeraX-MatchMaker: 2.0.9
    ChimeraX-MDcrds: 2.6
    ChimeraX-MedicalToolbar: 1.0.2
    ChimeraX-Meeting: 1.0.1
    ChimeraX-MLP: 1.1
    ChimeraX-mmCIF: 2.8
    ChimeraX-MMTF: 2.2
    ChimeraX-Modeller: 1.5.6
    ChimeraX-ModelPanel: 1.3.6
    ChimeraX-ModelSeries: 1.0.1
    ChimeraX-Mol2: 2.0
    ChimeraX-Mole: 1.0
    ChimeraX-Morph: 1.0.1
    ChimeraX-MouseModes: 1.1.1
    ChimeraX-Movie: 1.0
    ChimeraX-Neuron: 1.0
    ChimeraX-NIHPresets: 1.1.1
    ChimeraX-Nucleotides: 2.0.3
    ChimeraX-OpenCommand: 1.9.1
    ChimeraX-PDB: 2.6.8
    ChimeraX-PDBBio: 1.0
    ChimeraX-PDBLibrary: 1.0.2
    ChimeraX-PDBMatrices: 1.0
    ChimeraX-PickBlobs: 1.0.1
    ChimeraX-Positions: 1.0
    ChimeraX-PresetMgr: 1.1
    ChimeraX-PubChem: 2.1
    ChimeraX-ReadPbonds: 1.0.1
    ChimeraX-Registration: 1.1.1
    ChimeraX-RemoteControl: 1.0
    ChimeraX-RenderByAttr: 1.0
    ChimeraX-RenumberResidues: 1.1
    ChimeraX-ResidueFit: 1.0.1
    ChimeraX-RestServer: 1.1
    ChimeraX-RNALayout: 1.0
    ChimeraX-RotamerLibMgr: 2.0.1
    ChimeraX-RotamerLibsDunbrack: 2.0
    ChimeraX-RotamerLibsDynameomics: 2.0
    ChimeraX-RotamerLibsRichardson: 2.0
    ChimeraX-SaveCommand: 1.5.1
    ChimeraX-SchemeMgr: 1.0
    ChimeraX-SDF: 2.0
    ChimeraX-Segger: 1.0
    ChimeraX-Segment: 1.0
    ChimeraX-SelInspector: 1.0
    ChimeraX-SeqView: 2.8
    ChimeraX-Shape: 1.0.1
    ChimeraX-Shell: 1.0.1
    ChimeraX-Shortcuts: 1.1.1
    ChimeraX-ShowSequences: 1.0.1
    ChimeraX-SideView: 1.0.1
    ChimeraX-Smiles: 2.1
    ChimeraX-SmoothLines: 1.0
    ChimeraX-SpaceNavigator: 1.0
    ChimeraX-StdCommands: 1.10
    ChimeraX-STL: 1.0
    ChimeraX-Storm: 1.0
    ChimeraX-StructMeasure: 1.1
    ChimeraX-Struts: 1.0.1
    ChimeraX-Surface: 1.0
    ChimeraX-SwapAA: 2.0.1
    ChimeraX-SwapRes: 2.1.3
    ChimeraX-TapeMeasure: 1.0
    ChimeraX-Test: 1.0
    ChimeraX-Toolbar: 1.1.2
    ChimeraX-ToolshedUtils: 1.2.1
    ChimeraX-Tug: 1.0.1
    ChimeraX-UI: 1.25
    ChimeraX-uniprot: 2.2.1
    ChimeraX-UnitCell: 1.0.1
    ChimeraX-ViewDockX: 1.1.3
    ChimeraX-VIPERdb: 1.0
    ChimeraX-Vive: 1.1
    ChimeraX-VolumeMenu: 1.0.1
    ChimeraX-VTK: 1.0
    ChimeraX-WavefrontOBJ: 1.0
    ChimeraX-WebCam: 1.0.1
    ChimeraX-WebServices: 1.1.0
    ChimeraX-Zone: 1.0.1
    colorama: 0.4.5
    comtypes: 1.1.10
    cxservices: 1.2
    cycler: 0.11.0
    Cython: 0.29.32
    debugpy: 1.6.3
    decorator: 5.1.1
    docutils: 0.19
    entrypoints: 0.4
    executing: 1.1.1
    filelock: 3.7.1
    fonttools: 4.38.0
    funcparserlib: 1.0.0
    grako: 3.16.5
    h5py: 3.7.0
    html2text: 2020.1.16
    idna: 3.4
    ihm: 0.33
    imagecodecs: 2022.7.31
    imagesize: 1.4.1
    importlib-metadata: 5.0.0
    ipykernel: 6.15.3
    ipython: 8.4.0
    ipython-genutils: 0.2.0
    jedi: 0.18.1
    Jinja2: 3.1.2
    jupyter-client: 7.3.4
    jupyter-core: 4.11.2
    kiwisolver: 1.4.4
    line-profiler: 3.5.1
    lxml: 4.9.1
    lz4: 4.0.2
    MarkupSafe: 2.1.1
    matplotlib: 3.5.2
    matplotlib-inline: 0.1.6
    msgpack: 1.0.4
    nest-asyncio: 1.5.6
    netCDF4: 1.6.0
    networkx: 2.8.5
    numexpr: 2.8.4
    numpy: 1.23.1
    openvr: 1.23.701
    packaging: 21.3
    ParmEd: 3.4.3
    parso: 0.8.3
    pep517: 0.13.0
    pickleshare: 0.7.5
    Pillow: 9.2.0
    pip: 22.2.2
    pkginfo: 1.8.3
    prompt-toolkit: 3.0.31
    psutil: 5.9.1
    pure-eval: 0.2.2
    pycollada: 0.7.2
    pydicom: 2.3.0
    Pygments: 2.12.0
    PyOpenGL: 3.1.5
    PyOpenGL-accelerate: 3.1.5
    pyparsing: 3.0.9
    PyQt6-commercial: 6.3.1
    PyQt6-Qt6: 6.3.2
    PyQt6-sip: 13.4.0
    PyQt6-WebEngine-commercial: 6.3.1
    PyQt6-WebEngine-Qt6: 6.3.2
    python-dateutil: 2.8.2
    pytz: 2022.5
    pywin32: 303
    pyzmq: 24.0.1
    qtconsole: 5.3.1
    QtPy: 2.2.1
    RandomWords: 0.4.0
    requests: 2.28.1
    scipy: 1.9.0
    setuptools: 65.1.1
    sfftk-rw: 0.7.2
    six: 1.16.0
    snowballstemmer: 2.2.0
    sortedcontainers: 2.4.0
    Sphinx: 5.1.1
    sphinx-autodoc-typehints: 1.19.1
    sphinxcontrib-applehelp: 1.0.2
    sphinxcontrib-blockdiag: 3.0.0
    sphinxcontrib-devhelp: 1.0.2
    sphinxcontrib-htmlhelp: 2.0.0
    sphinxcontrib-jsmath: 1.0.1
    sphinxcontrib-qthelp: 1.0.3
    sphinxcontrib-serializinghtml: 1.1.5
    stack-data: 0.5.1
    tables: 3.7.0
    tifffile: 2022.7.31
    tinyarray: 1.2.4
    tomli: 2.0.1
    tornado: 6.2
    traitlets: 5.3.0
    urllib3: 1.26.12
    wcwidth: 0.2.5
    webcolors: 1.12
    wheel: 0.37.1
    wheel-filename: 1.4.1
    WMI: 1.5.1
    zipp: 3.10.0
File attachment: 2fae-assembly-1.cif

2fae-assembly-1.cif

Attachments (1)

2fae-assembly-1.cif (24.8 KB ) - added by kristen.browne@… 3 years ago.
Added by email2trac

Download all attachments as: .zip

Change History (12)

by kristen.browne@…, 3 years ago

Attachment: 2fae-assembly-1.cif added

Added by email2trac

comment:1 by Eric Pettersen, 3 years ago

Cc: Greg Couch added
Component: UnassignedInput/Output
Owner: set to Eric Pettersen
Platform: all
Project: ChimeraX
Status: newaccepted
Summary: ChimeraX bug report submissionCompressed .cif without .gz suffix

comment:2 by Eric Pettersen, 3 years ago

Cc: chimera-programmers added; Greg Couch removed

Apparently Chimera handled gzipped files even if they lacked the .gz extension. Is this something we want to support in ChimeraX?

in reply to:  4 comment:3 by kristen.browne@…, 3 years ago

This impacts over 220 entries in 3DPX that need to be rerun so from our end it's definitely something that would be desirable.  Let us know what you decide is best and we may have to go through and manually update all of these.

K 

-----Original Message-----
From: ChimeraX <ChimeraX-bugs-admin@cgl.ucsf.edu> 
Sent: Wednesday, November 16, 2022 12:16 PM
Cc: chimera-programmers@cgl.ucsf.edu; gregc@cgl.ucsf.edu; Browne, Kristen (NIH/NIAID) [C] <kristen.browne@nih.gov>; pett@cgl.ucsf.edu
Subject: [EXTERNAL] Re: [ChimeraX] #8038: Compressed .cif without .gz suffix

#8038: Compressed .cif without .gz suffix
---------------------------------------+----------------------
          Reporter:  kristen.browne@...  |      Owner:  pett
              Type:  defect            |     Status:  accepted
          Priority:  normal            |  Milestone:
         Component:  Input/Output      |    Version:
        Resolution:                    |   Keywords:
        Blocked By:                    |   Blocking:
Notify when closed:                    |   Platform:  all
           Project:  ChimeraX          |
---------------------------------------+----------------------
Changes (by pett):

 * cc: Greg Couch (removed)
 * cc: chimera-programmers (added)


Comment:

 Apparently Chimera handled gzipped files even if they lacked the .gz  extension.  Is this something we want to support in ChimeraX?

--
Ticket URL: <https://gcc02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.rbvi.ucsf.edu%2Ftrac%2FChimeraX%2Fticket%2F8038%23comment%3A2&amp;data=05%7C01%7Ckristen.browne%40nih.gov%7Cbb9ad468524d4c4dc23a08dac7f64664%7C14b77578977342d58507251ca2dc2b06%7C0%7C0%7C638042157798451495%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&amp;sdata=FoN7W0PKZ9ax7fU7qpfVabeI%2BbdjFfEn7nDFTqZcdgo%3D&amp;reserved=0>
ChimeraX <https://gcc02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.rbvi.ucsf.edu%2Fchimerax%2F&amp;data=05%7C01%7Ckristen.browne%40nih.gov%7Cbb9ad468524d4c4dc23a08dac7f64664%7C14b77578977342d58507251ca2dc2b06%7C0%7C0%7C638042157798451495%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&amp;sdata=syZWQOv77gKpy25mco7H82mKlwIYnDEclqoWrkDgRHM%3D&amp;reserved=0>
ChimeraX Issue Tracker
CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you recognize the sender and are confident the content is safe.

comment:4 by Eric Pettersen, 3 years ago

FYI, those entries will all work if '.gz' is added to the file name.

comment:5 by Eric Pettersen, 3 years ago

Okay, the '.gz' thing is a lie -- at least for .cif files. The .cif reader requires an uncompressed file for some reason (I'm guessing speed), so currently the file would need to be decompressed before ChimeraX could handle it.

in reply to:  7 comment:6 by kristen.browne@…, 3 years ago

Just adding a note that when Phil unzipped the cif files he still sees the error: mmCIF parsing error: unexpected data value near line 1, so the problem reading these files doesn't end with the compression.

Thanks!

-----Original Message-----
From: ChimeraX <ChimeraX-bugs-admin@cgl.ucsf.edu> 
Sent: Wednesday, November 16, 2022 1:21 PM
Cc: chimera-programmers@cgl.ucsf.edu; Browne, Kristen (NIH/NIAID) [C] <kristen.browne@nih.gov>; pett@cgl.ucsf.edu
Subject: [EXTERNAL] Re: [ChimeraX] #8038: Compressed .cif without .gz suffix

#8038: Compressed .cif without .gz suffix
---------------------------------------+----------------------
          Reporter:  kristen.browne@...  |      Owner:  pett
              Type:  defect            |     Status:  accepted
          Priority:  normal            |  Milestone:
         Component:  Input/Output      |    Version:
        Resolution:                    |   Keywords:
        Blocked By:                    |   Blocking:
Notify when closed:                    |   Platform:  all
           Project:  ChimeraX          |
---------------------------------------+----------------------

Comment (by pett):

 Okay, the '.gz' thing is a lie -- at least for .cif files.  The .cif  reader requires an uncompressed file for some reason (I'm guessing speed),  so currently the file would need to be decompressed before ChimeraX could  handle it.

--
Ticket URL: <https://gcc02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.rbvi.ucsf.edu%2Ftrac%2FChimeraX%2Fticket%2F8038%23comment%3A5&amp;data=05%7C01%7Ckristen.browne%40nih.gov%7C1e6fe4013c834d580e2308dac7ff4725%7C14b77578977342d58507251ca2dc2b06%7C0%7C0%7C638042196475854254%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&amp;sdata=in8gIHkfJXE88sMM3jOh3UMOzFpdypRGZlpLGJI4yLA%3D&amp;reserved=0>
ChimeraX <https://gcc02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.rbvi.ucsf.edu%2Fchimerax%2F&amp;data=05%7C01%7Ckristen.browne%40nih.gov%7C1e6fe4013c834d580e2308dac7ff4725%7C14b77578977342d58507251ca2dc2b06%7C0%7C0%7C638042196475854254%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&amp;sdata=yg2NfnI0vyVrKaxsWiCL%2B8vBB9YA%2Ffx1O5Y9AL0hC80%3D&amp;reserved=0>
ChimeraX Issue Tracker
CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you recognize the sender and are confident the content is safe.

comment:7 by Eric Pettersen, 3 years ago

Was that the particular file attached to this ticket (2fae-assembly-1.cif​)? I can uncompress that file and open it successfully in ChimeraX.

comment:8 by Eric Pettersen, 3 years ago

However, after opening that file I get an error clicking on some of the entries in the symmetry table -- which should affect the NIH3D pipeline at all AFAIK. I'll open a separate ticket for that.

comment:9 by Eric Pettersen, 3 years ago

...shouldn't affect...

comment:10 by Eric Pettersen, 3 years ago

Cc: phil.cruz@… added
Status: acceptedfeedback

The ChimeraX programmers discussed this among ourselves and decided we definitely don't want to support compressed files lacking a compression suffix.

We haven't had any complaints from "normal" users about ChimeraX requiring compressed files to be decompressed manually in some cases, so we lean towards doing nothing about it for now. Given the effort to make it happen smoothly (handle error modes [e.g. out of disk space], prevent display of temporary file names, remove decompressed files on exit) we felt that other implementation goals were more important. So this would mean that you would need to decompress files before handing them off to the pipeline. If this was really problematic for you, we could possibly be prodded into trying to get on-the-fly decompression to work in all cases.

comment:11 by Eric Pettersen, 3 years ago

Resolution: limitation
Status: feedbackclosed

Didn't hear any squawking, so assuming the status quo is okay.

Note: See TracTickets for help on using tickets.