Skip to content

Commit ce4a8f1

Browse files
committed
Address autocorrelation time issue 107
1 parent 0c075c5 commit ce4a8f1

1 file changed

Lines changed: 1 addition & 0 deletions

File tree

paper/basic_training.tex

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -694,6 +694,7 @@ \subsubsection{Production}
694694
Storing data especially frequently can be tempting, but utilizes a great deal of storage space and does not actually provide significant value in most situations.
695695
Particularly, observations made in MD simulations are correlated in time (e.g. see \url{https://github.com/dmzuckerman/Sampling-Uncertainty}) so storing data more frequently than the autocorrelation time results in storage of essentially redundant data.
696696
Thus, storing data more frequently than intervals of the autocorrelation time is generally unnecessary.
697+
Of course, the autocorrelation time is not known \emph{a priori} which can make it necessary to store \emph{some} redundant data.
697698
Disk space may also be a limiting factor that dictates the frequency of storing data, and should at least be considered.
698699
Trajectory snapshots can be particularly large.
699700
However, if there are no disk space limitations it may be best to avoid discarding uncorrelated data so sampling \emph{at} intervals of the autocorrelation time may be appropriate.

0 commit comments

Comments
 (0)