Database Support¶
This module contains features for accessing databases containing protein related data.
Pfam¶
The following functions can be used to search and retrieve Pfam data:
fetchPfamMSA()
- download MSA filessearchPfam()
- search for domain families of a protein
UniProt¶
The following functions and class can be used to search and retrieve UniProt data:
queryUniprot()
- query UniProt and parse the results as a dictionaryUniprotRecord
- a wrapper from UniProt data with functions including parsing PDBssearchUniprot()
- search UniProt and return a UniprotRecord
CATH¶
The following class and its functions can be used to search and retrieve CATH data:
CATHDB
- parse, handle and navigate the tree-like structure of the CATH database
DALI¶
The following class and functions can be used to search and retrieve data using the DALI structure alignment server:
searchDali()
- search for similar structures using DALIDaliRecord
- fetch and handle outputs from DALI searchesdaliFilterMultimers()
- filter DALI results to obtain multimers of a particular size
QuartataWeb¶
The following classes and functions can be used to search and retrieve data using the QuartataWeb structure alignment server:
QuartataWebBrowser
- class based on the Splinter web browser package to search QuartataWebQuartataChemicalRecord
- class to handle the outputs of QuartataWeb searchessearchQuartataWeb()
- perform QuartataWeb searches and return the output in a QuartataChemicalRecord
Gene Ontology Annotation (GOA)¶
The following classes and functions can be used to search and retrieve data from the EBI GOA database:
queryGOA()
- query GOA using a PDB IDGOADictList
- class to handle data from GOA queriesparseOBO()
- parse an OBO file containing the Gene Ontology.parseGAF()
- parse a Gene Association File (GAF)showGoLineage()
- visualize GO treecalcGoOverlap()
- Calculate overlap between GO terms from their distance in the graph
Interpro¶
The following functions can be used to search and retrieve Interpro data:
searchInterpro()
- search for domain families of a protein
BioExcel-CV19¶
The following functions can be used to retrieve BioExcel-CV19 data:
fetchBioexcelPDB()
- fetch PDB files for starting structures for trajectories