Sampling Performance of Multiple Independent Molecular Dynamics Simulations of an RNA Aptamer

2020-08-01
Yan, Shuting
Peck, Jason M.
İlgü, Müslüm
Nilsen-Hamilton, Marit
Lamm, Monica H.
Using multiple independent simulations instead of one long simulation has been shown to improve the sampling performance attained with the molecular dynamics (MD) simulation method. However, it is generally not known how long each independent simulation should be, how many independent simulations should be used, or to what extent either of these factors affects the overall sampling performance achieved for a given system. The goal of the present study was to assess the sampling performance of multiple independent MD simulations, where each independent simulation begins from a different initial molecular conformation. For this purpose, we used an RNA aptamer that is 25 nucleotides long as a case study. The initial conformations of the aptamer are derived from six de novo predicted 3D structures. Each of the six de novo predicted structures is energy minimized in solution and equilibrated with MD simulations at high temperature. Ten conformations from each of these six high-temperature equilibration runs are selected as initial conformations for further simulations at ambient temperature. In total, we conducted 60 independent MD simulations, each with a duration of 100 ns, to study the conformation and dynamics of the aptamer. For each group of 10 independent simulations that originated from a particular de novo predicted structure, we evaluated the potential energy distribution of the RNA and used recurrence quantification analysis to examine the sampling of RNA conformational transitions. To assess the impact of starting from different de novo predicted structures, we computed the density of structure projection on principal components to compare the regions sampled by the different groups of 10 independent simulations. The recurrence rate and dependence on initial conformation among the groups were also compared.
We stress the necessity of using different initial configurations as simulation starting points by showing that long simulations from different initial structures suffer from being trapped in different states. Finally, we summarized the sampling efficiency for the complete set of 60 independent simulations and determined regions of under-sampling on the potential energy landscape. The results suggest that conducting multiple independent simulations using a diverse set of de novo predicted structures is a promising approach to achieve sufficient sampling. This approach avoids undesirable outcomes, such as the problem of the RNA aptamer being trapped in a local minimum. For others wishing to conduct multiple independent simulations, the analysis protocol presented in this study is a guide for examining overall sampling and determining whether more simulations are necessary for sufficient sampling.
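
The two sampling diagnostics named in the abstract, recurrence rate and the density of structure projections on principal components, can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes each trajectory frame has already been reduced to a numeric feature vector (for example, aligned atomic coordinates), and it uses plain Euclidean distance with a user-chosen threshold as the recurrence criterion. The function names `recurrence_rate`, `pca_projection`, and `projection_density` are hypothetical and introduced only for illustration.

```python
import numpy as np


def recurrence_rate(features, threshold):
    """Fraction of distinct frame pairs closer than `threshold`.

    `features` is an (n_frames, n_features) array. Euclidean distance is an
    assumption here; the study may use a different structural metric.
    """
    diff = features[:, None, :] - features[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    recurrent = dist < threshold
    # Ignore the diagonal: every frame trivially recurs with itself.
    np.fill_diagonal(recurrent, False)
    n = len(features)
    return recurrent.sum() / (n * (n - 1))


def pca_projection(features, n_components=2):
    """Project frames onto their leading principal components."""
    centered = features - features.mean(axis=0)
    # Rows of vt are principal axes, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


def projection_density(projection, bins=20, extent=None):
    """Normalized 2D histogram of the first two principal components."""
    hist, x_edges, y_edges = np.histogram2d(
        projection[:, 0], projection[:, 1],
        bins=bins, range=extent, density=True,
    )
    return hist, x_edges, y_edges
```

To compare groups of simulations that start from different predicted structures, one option is to compute the principal axes once on the pooled frames and pass the same `extent` to `projection_density` for every group, so that empty bins in one group's histogram line up with populated bins in another and point to regions that group did not sample.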

Citation Formats
S. Yan, J. M. Peck, M. İlgü, M. Nilsen-Hamilton, and M. H. Lamm, “Sampling Performance of Multiple Independent Molecular Dynamics Simulations of an RNA Aptamer,” ACS OMEGA, pp. 20187–20201, 2020, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/37830.