Skip to Main Content

The Graduate Health & Life Sciences Research Library at Georgetown University Medical Center

Research Data Management

Best practices and tools for managing your research data.

Best Practices for File Versioning

Complex research projects will inevitably produce multiple versions of files. Implementing a file versioning strategy at the beginning of your research project will help to avoid confusion amongst collaborators and avoid lost time and effort trying to recover the "right" version of a file. It is also best practice to document all changes made between versions.

Version control with file names

File versioning can be as simple as integrating version numbers into your file naming convention (i.e., v1, v1_2, v2) or by using dates. Avoid using ambiguous terms such as ‘final’ or ‘revision’. In the example below, the type of analysis, publisher, and deadline of the submission is listed, followed by a version number:

MultivariateAnalysis_PLOSSubmission_20171130_v3.csv

Using tools

 

Certain storage and file sharing platforms, such as Google Drive, DropBox and Box, also have built-in version tracking with the ability to get back to earlier file versions.

Keep Original Files Read-Only

Always keep a "read only" version of your raw, unprocessed dataset to protect against unintentional changes. For important files that are the original stem of a project, keeping a read-only file or password protected zipped package helps prevent accidents.