Release Notes

v2.5.0 series come with new and improved sequence, structure, and dynamics analysis features. See release notes for details.

How to Cite

Bakan A, Meireles LM, Bahar I ProDy: Protein Dynamics Inferred from Theory and Experiments
Bioinformatics 2011 27(11):1575-1577.

Bakan A, Dutta A, Mao W, Liu Y, Chennubhotla C, Lezon TR, Bahar I Evol and ProDy for Bridging Protein Sequence Evolution and Structural Dynamics
Bioinformatics 2014 30(18):2681-2683.

Zhang S, Krieger JM, Zhang Y, Kaya C, Kaynak B, Mikulska-Ruminska K, Doruker P, Li H, Bahar I ProDy 2.0: Increased scale and scope after 10 years of protein dynamics modelling with Python
Bioinformatics 2021 37(20):3657-3659.

mmCIF File¶

This module defines functions for parsing mmCIF files.

parseMMCIFStream(stream, **kwargs)[source]¶

Returns an AtomGroup and/or a class:.StarDict containing header data parsed from a stream of CIF lines.

Parameters:

Parameters:	stream – Anything that implements the method `readlines` (e.g. `file`, buffer, stdin) title (str) – title of the `AtomGroup` instance, default is the PDB filename or PDB identifier chain (str) – chain identifiers for parsing specific chains, e.g. `chain='A'`, `chain='B'`, `chain='DE'`, by default all chains are parsed segment (str) – segment identifiers for parsing specific chains, e.g. `segment='A'`, `segment='B'`, `segment='DE'`, by default all segment are parsed subset (str) – a predefined keyword to parse subset of atoms, valid keywords are `'calpha'` (`'ca'`), `'backbone'` (`'bb'`), or None (read all atoms), e.g. `subset='bb'` model (int, list) – model index or None (read all models), e.g. `model=10` altloc (str) – if a location indicator is passed, such as `'A'` or `'B'`, only indicated alternate locations will be parsed as the single coordinate set of the AtomGroup, if altloc is set `'all'` then all alternate locations will be parsed and each will be appended as a distinct coordinate set, default is `"A"` unite_chains (bool) – unite chains with the same segment name (auth_asym_id), making chain ids be auth_asym_id instead of label_asym_id. This can be helpful in some cases e.g. alignments, but can cause some problems too. For example, using `buildBiomolecules()` afterwards requires original chain id (label_asym_id). Using biomol=True, inside parseMMCIF is fine. Default is False

stream – Anything that implements the method readlines (e.g. file, buffer, stdin)
title (str) – title of the AtomGroup instance, default is the PDB filename or PDB identifier
chain (str) – chain identifiers for parsing specific chains, e.g. chain='A', chain='B', chain='DE', by default all chains are parsed
segment (str) – segment identifiers for parsing specific chains, e.g. segment='A', segment='B', segment='DE', by default all segment are parsed
subset (str) – a predefined keyword to parse subset of atoms, valid keywords are 'calpha' ('ca'), 'backbone' ('bb'), or None (read all atoms), e.g. subset='bb'
model (int, list) – model index or None (read all models), e.g. model=10
altloc (str) – if a location indicator is passed, such as 'A' or 'B', only indicated alternate locations will be parsed as the single coordinate set of the AtomGroup, if altloc is set 'all' then all alternate locations will be parsed and each will be appended as a distinct coordinate set, default is "A"
unite_chains (bool) – unite chains with the same segment name (auth_asym_id), making chain ids be auth_asym_id instead of label_asym_id. This can be helpful in some cases e.g. alignments, but can cause some problems too. For example, using buildBiomolecules() afterwards requires original chain id (label_asym_id). Using biomol=True, inside parseMMCIF is fine. Default is False

parseMMCIF(pdb, **kwargs)[source]¶

Returns an AtomGroup and/or a StarDict containing header data parsed from an mmCIF file. If not found, the mmCIF file will be downloaded from the PDB. It will be downloaded in uncompressed format regardless of the compressed keyword.

This function extends parseMMCIFStream().

Parameters:	pdb (str) – a PDB identifier or a filename If needed, mmCIF files are downloaded using `fetchPDB()` function.

parseCIF(pdb, **kwargs)¶

This function extends parseMMCIFStream().

Parameters:	pdb (str) – a PDB identifier or a filename If needed, mmCIF files are downloaded using `fetchPDB()` function.

writeMMCIF(filename, atoms, csets=None, autoext=True, **kwargs)[source]¶

Write atoms in MMTF format to a file with name filename and return filename. If filename ends with .gz, a compressed file will be written.

Parameters:	atoms (`Atomic`) – an object with atom and coordinate data csets – coordinate set indices, default is all coordinate sets autoext – when not present, append extension `.cif` to filename header (dict) – header to write too