Opened 4 years ago

Closed 4 years ago

#5327 closed defect (fixed)

Blastprotein: Parse Description for PDB, NR blast hits into Title + Species

Reported by: Zach Pearson Owned by: Zach Pearson
Priority: blocker Milestone:
Component: Sequence Version:
Keywords: Cc: Elaine Meng
Blocked By: Blocking:
Notify when closed: Platform: all
Project: ChimeraX

Description

From Elaine:

Another recommendation is to parse the PDB/NR results Description field into Title and Species, similar to what you did with Alphafold results. For both structure hits and sequence-only hits, as far as I can tell, the Description is basically a title but with the species in square brackets at the end. See attachment "nrdescription 2021-09-29.png" ... some of them have "synthetic construct" in the square brackets but I think that would be OK to put in the Species field.

Attachments (1)

nrdescription 2021-09-29.png (285.4 KB ) - added by Zach Pearson 4 years ago.

Download all attachments as: .zip

Change History (4)

by Zach Pearson, 4 years ago

comment:1 by pett, 4 years ago

Component: UnassignedSequence

comment:2 by Elaine Meng, 4 years ago

Summary: Blastprotein: Better parsing of sequence resultsBlastprotein: Parse Description for PDB, NR blast hits into Title + Species

This applies to searching both PDB only and NR (which includes PDB). Basically everything except AlphaFold.

comment:3 by Zach Pearson, 4 years ago

Resolution: fixed
Status: assignedclosed

This commit should do it.

Note: See TracTickets for help on using tickets.