PDB2MD
Introduction
Using the UK National Grid Service and AMBER Molecular Dynamics software, an automated pipeline
for MD simulations has been implemented. Given a list of PDB codes and a user-specified standard protocol, the pipeline
prepares input files, runs equilibration and production MD jobs,
and returns the results for each, without the need for any user intervention.
Here the PDB2MD engine has been used to run fully solvated MD simulations on a
total of 96 different DNA sequences taken from the PDB/NDB. So far 1-2 ns of MD
has been run on each sequence. This has consumed a total of about 1.8 CPU years and has generated approx. 40Gb of trajectory data.
The outline of the protocol used is as follows:
DNA solvated with TIP3P waters using truncated octahedral PBC and 12 angstrom cut-off. Systems neutralised by addition of potassium counterions. Standard multistep energy minimisation and restrained dynamics protocol used for equilibration. 1-2 ns of production NPT MD using Amber8 PMEMD, snapshots saved every 1 ps.
Basic analysis of the simulations includes PCA analysis to check for equilibration and sampling, and Curves analysis of helical parameters.
Plots for the PCA analysis, two key helical parameters - average X-displacement and average twist, and dials plots for the backbone torsions are included below. Note that in the dials plots, small tick marks on the dial peripheries mark
the crystal structure values.
The simulation data will be deposited and available via the BioSimGrid project.
Results
| PDBID | Type | Sequence | Sim. time | MDPCANAL | Xdisp | Twist | Dials1 | Dials2 |
| 0an8 | A | GGTATACC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 116d | A | CCGTACGTACGG | 4ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 117d | A | GCGTACGTACGC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 118d | A | GTGCGCAC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 119d | B | CGTAGATCTACG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 126d | B | CATGGCCATG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 137d | A | GCGGGCCCGC, | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 158d | B | CCAAGCTTGG, | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 160d | A | CCCGGCCGGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 167d | B | CCATTAATGG | 5ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 172d | A | GAAGCTTC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 180d | Z | CGCACG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 181d | Z | CACGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 189d | A | GGCCGGCC, | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 192d | Z | CCGCGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 194d | B | CGCGTTAACGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 196d | B | CTCTCGAGAG | 5ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 197d | A | GTACGTAC, | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1bd1 | B | CCAGGCCTGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1bdn | B | CGCAAAAATGCG | 5ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1d23 | B | CGATCGATCG | 5ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1d29 | B | CGTGAATTCACG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1d31 | B | CGCAGAATTCGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1d49 | B | CGATTAATCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1d56 | B | CGATATATCG, | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1d65 | B | CGCAAATTTGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1d78 | A | GTGTACAC, | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1d82 | A | GTCTAGAC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1d89 | B | CGCGAAAAAACG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1d8g | B | CCAGTACTGG | 5ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1d93 | A | CTCTAGAG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1d98 | B | CGCAAAAAAGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1dc0 | RH | CATGGGCCCATG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1dcv | B | CCGCTAGCGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1dn6 | A | GGATGGGAG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1dn9 | B | CGCATATATGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1dnz | A | ACCGGCCGGT | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1enn | B | GCGAATTCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1fq2 | B | CGCGAATTCGCG | 5ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1g00 | A | GGCGCC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1hq7 | B | GCAAACGTTTGC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1i0t | Z | CGCGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1ikk | B | CCTTTAAAGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1ilc | B | ACCGAATTCGGT | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1k71 | B | TGGCCTTAAGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1lp7 | B | CGCTTATATGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1m77 | A | CCCGATCGGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1n4e | B | GCTTAATTCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1nab | B | CGATCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1nvn | B | CCGGTACCGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1nvy | B | TCGGTACCGA | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1p4y | B | CCGGCGCCGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1s23 | B | CGCAATTGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1sgs | B | CGCTGGAAATTTCCAGC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 1zna | Z | CGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 206d | B | CGGTGG, | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 220d | A | ACCCGCGGGT, | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 232d | A | AGGCATGCCT | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 243d | A | ACGTACGT | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 249d | B | CGCTCTAGAGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 251d | B | CTCGAG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 253d | B | GCGTACGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 257d | A | GCCGGC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 260d | A | GCACGCGTGC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 279d | Z | GCGCGCGCGC | 0ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 281d | A | GGCATGCC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 287d | B | CGCGATATCGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 295d | A | ATGCGCAT, | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 2ana | A | GGGGCCCC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 2d47 | A | CCCCCGCGGGGG, | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 2d94 | A | GGGCGCCC, | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 307d | B | CAAAGAAAAG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 309d | B | CGACGATCGT | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 317d | A | CCCTAGGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 321d | A | CCGGGCCCGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 330d | B | ACCGCCGGCGCC,-15 | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 331d | Z | GCGCGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 335d | A | GGCAATTGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 348d | A | GACCGCGGTC, | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 362d | Z | TGCGCA | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 382d | A | CCGCCGGCGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 395d | A | GTACGCGTAC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 396d | A | GGCCGCGGCC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 399d | A | CGCCCGCGGGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 3ana | A | GGGATCCC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 414d | A | GGGGCGCCCC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 423d | B | ACCGACGTCGGT | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 425d | B | ACCGGTACCGGT | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 431d | B | GGCCAATTGG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 440d | A | AGGGGCCCCT | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 476d | B | GCGAATTCGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 477d | B | GGCGAATTCGCG | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |
| 9dna | A | GCCCGGGC | 2ns | Analysis | AvgXdisp | AvgTwist | Strand1 | Strand2 |