Skip to main content

Research Data Management at Princeton

Data types

Thinking about what kind and how much data your project will generate before you begin will help you prepare for managing your data. In general data fall in to four categories that can affect how it is managed: observational, experimental, simulated, or derived/compiled. In your data management plan, write down a detailed description of how data will be generated or obtained. Think about when, where, and how much data will be produced. Include information on the software that will be used and how the data will be processed (Krier and Strasser).

For more detailed information, see the Data Types section of the DMPTool's Data Management General Guidance, or complete the Research data explained module of the MANTRA Research Data Management Training tutorial.

Formats

The file format you choose can affect who you can share your data with and whether or not your data will be useable in the future. It is best to choose a format that is open and sustainable. It may be necessary to use a proprietary format depending on the equipment or software you are using while you gather and analyze data, but consider converting to an open or sustainable format when you share with collaborators or at the end of the project. In some cases it may be appropriate to preserve the original file format, with either a copy of the software or a note of the software version, along with the open and sustainable format. According to the DMPTool Data Management General Guidance:

Formats likely to be accessible in the future are:

  • Non-proprietary
  • Open, with documented standards
  • In common usage by the research community
  • Using standard character encodings (i.e., ASCII, UTF-8)
  • Uncompressed (space permitting)

Examples of preferred format choices:

  • Image: JPEG, JPG-2000, PNG, TIFF
  • Text: plain text (TXT), HTML, XML, PDF/A
  • Audio: AIFF, WAVE
  • Containers: TAR, GZIP, ZIP
  • Databases: prefer XML or CSV to native binary formats (DMPTool)

For more detailed information on sustainable formats, visit: