Open Ephys Format#

This is the original format used by the Open Ephys GUI. It is designed with redundancy in mind, so that data can be readily recovered even if the GUI crashes during acquisition. However, because data for each electrode is stored in a separate file, it doesn’t scale to high channel count recordings. All files are stored in a single directory, with the file names used to identify the data source.#
Platforms	Windows, Linux, macOS
Built in?	Yes
Key Developers	Josh Siegle, Aarón Cuevas López
Source Code	open-ephys-plugins/open-ephys-data-format

Advantages

Data is stored in blocks with well-defined record markers, meaning data recovery is possible even if files are truncated by a crash.
The file header contains all the information required to load the file.

Limitations

Windows imposes a limit on the number files that can be written to simultaneously, meaning the Open Ephys format is not compatible with very high-channel-count recordings (>8000 channels).
In order to achieve robustness, files contain redundant information, meaning they use extra space and take longer to load.

File organization#

All data files are stored within the same Record Node directory, with a completely flat hierarchy. Files for different experiments have a number appended after an underscore (starting with _2 for the second experiment in a session). Data from different recordings is distinguished by the recording index values within each .spikes, continuous, or .events file.

Each Record Node directory also contains at least one structure.openephys file, an XML file with metadata about the available files, as well as messages.events, a text file containing text events saved by the GUI.

Format details#

Headers#

All headers are 1024 bytes long, and are written as a MATLAB-compatible string with the following fields and values:

format = ‘Open Ephys Data Format’
version = 0.4
header_bytes = 1024
description = ‘(String describing the header)’
date_created = ‘dd-mm-yyyy hhmmss’
channel = ‘(String with channel name)’
channelType = ‘(String describing the channel type)’
sampleRate = (integer sampling rate)
blockLength = 1024
bufferSize = 1024
bitVolts = (floating point value of microvolts/bit for headstage channels, or volts/bit for ADC channels)

For those not using MATLAB, each header entry is on a separate line with the following format:

field names appear after a “header.” prefix
field names and values are separated by “ = “
the value of the field is in plain text, with intended strings enclosed in single quotes (as demonstrated above)
each line is terminated with a semicolon

Continuous#

Continuous data for each channel is stored in a separate .continuous file, identified by the processor ID (e.g. 100), stream name, and channel name (e.g. CH1). After the 1024-byte header, continuous data is organized into “records,” each containing 1024 samples.

Each record is 2070 bytes long, and is terminated by a 10-byte record marker (0 1 2 3 4 5 6 7 8 255).

Events#

Events for each stream are stored in a .events file. Each “record” contains the data for an individual event stored according to the following scheme:

Spikes#

Data from each electrode is saved in a separate file. The filename is derived from the electrode name with spaces removed (e.g., “Electrode1”) and the data stream name.

Each record contains an individual spike event (saved for one or more channels), and is written in the following format:

Since the samples are saved as 16-bit unsigned integers, converting them to microvolts involves subtracting 32768, dividing by the gain, and multiplying by 1000.

Reading data in Python#

(recommended) Create a Session object using the open-ephys-python-tools package. The data format will be automatically detected.
Create a File object using the pyopenephys package.
Use the loadContinuous, loadEvents, or loadSpikes methods from OpenEphys.py in the open-ephys/analysis-tools repository.

Reading data in Matlab#

Use the open-ephys-matlab-tools <open-ephys/open-ephys-matlab-tools>`__ library.