About the project
The echemdb projects aim at standardizing experimental and theoretical 2D or time series data according to the FAIR principles. Ultimately, this approach allows for a seamless comparison of published data with laboratory-derived data and theoretical models.
Key issues for compliance with the FAIR principles are that many research areas lack (i) metadata standards and (ii) published data is mostly inaccessible or not machine-readable. More specifically, research data often stored as CSV usually do not contain information on the units of the axis/columns or contain metadata annotating and describing the data.
To solve these issues, in a first step the authors of echemdb limit their approach to a popular research field of interfacial electrochemistry. In recent decades, the study of the electrochemical properties of well-defined single crystal electrodes by cyclic voltammetry has played a crucial role in the fundamental understanding of more complex three dimensional systems found in more applied research areas or even in application. These materials are very well defined and the measurement principle is also well established within the community.
Standardization
The data standardization approach is described Engstfeld et al., Data Science Journal, 24 (2025) 13.
In brief, to standardize CSV data the authors of echemdb adopt the frictionless datapackage structure. According to frictionless a data package consists of:
- Metadata that describes the structure and contents of the package
- Resources such as data files that form the contents of the package
The Data Package metadata is stored in a “descriptor”.
The echemdb authors augmented the frictionless schema, by adding
- units, allowing for simple unit transformations or data manipulation.
- metadata describing a resource within the package.
The metadata describes for example the electrochemical system, which contains detailed information about the electrodes or the components of the electrolyte. The metadata also contains information on the curation process, i.e., who was the experimentalist, a URL to an entry in an electronic laboratory notebook (ELN), or details on the experimental set-up. The JSON metadata schema is developed as a separate project.
By following this approach, a set of datapackages forms a collection. The entries of such a collection are displayed in different forms on this website based on the available descriptors. The unitpackage Python API provides direct access to the entries of such a collection, enabling more specific filtering, and enabling seamless integration into existing workflows, including locally stored data.
Reusability
In order to improve the reusability of published data, the authors of echemdb created svgdigitizer, a tool allowing for digitizing any kind of published 2D plots from carefully prepared SVG files. This approach has some superior functionalities compared to other tools, for example, allowing to extract units from the axis labels or reconstructing a time axis based on a given scan rate. Modules for specific types of plots, such as the electrochemistry module offers convenience functionality, and allow extracting additional properties such as the reference potential of a potential axis. By providing a set of metadata, the digitized data can directly be stored as a unitpackage.
What’s next
We anticipate developing additional tools which help in generating unitpackages or allow for interacting with unitpackages. The latter comprises common tools for the evaluation of electrochemical data. We also plan to include other electrochemical data from methods other than cyclic voltammetry in our database, including data which have been recorded concomitantly, such as disc electrode currents or mass spectrometry signals.
Contact
The authors of echemdb are from the fields of experimental and theoretical physical chemistry, as well as from computer science and mathematics.
Direct Contact:
- albert.engstfeld@uni-ulm.de (Institute of Electrochemistry, Ulm university, DE)
Ideas and suggestions, tell us more on our discussion board.
Reach individual contributors on the GitHub organization.
Discuss and stay up to date on echemdb.zulipchat.com.
Funding, Support, & Related Projects
Most of the work has been supported by funding from the following institutions:
| Institute of Electrochemistry, Ulm University (DE) | |
| Theory Department, Fritz-Haber Institute, Berlin (DE) | ![]() |
The work is also supported by:
| CRC-1316 | ![]() |
| NFDI4Chem |
Cite
For the individual repositories, refer to the Zenodo DOIs provided in the repositories readme or documentation.
The general concept is described in A.K. Engstfeld, J. M. Hermann, N. Hörmann, & J. Rüth, Data Science Journal, 24 (2025) 13

