About the project
The echemdb projects aim at standardizing experimental and theoretical 3D or time series data according to the FAIR principles. Ultimately, this approach allows for a seamless comparison of published data with laboratory-derived data and theoretical models.
Key issues for compliance with the FAIR principles are that many research areas lack (i) metadata standards and (ii) published data is mostly inaccessible or not machine-readable. More specifically, research data often stored as CSV usually do not contain information on the units of the axis/columns or contain metadata annotating and describing the data.
To solve these issues, in a first step the authors of echemdb limit themselves to a popular research field of electrochemistry. In recent decades, the study of the electrochemical properties of well-defined single crystal electrodes by cyclic voltammetry has played a crucial role in the fundamental understanding of more complex three dimensional systems found in more applied research areas or even in application. These materials are very well defined and the measurement principle is also well established within the community. The validity of our approach to other research areas is illustrated here.
Standardization
To standardize CSV data the authors of echemdb adopt the frictionless datapackage structure. According to frictionless a data package consists of:
- Metadata that describes the structure and contents of the package
- Resources such as data files that form the contents of the package
The Data Package metadata is stored in a “descriptor”.
The echemdb authors augmented the frictionless schema, by adding
- units, allowing for simple unit transformations or data manipulation.
- metadata describing a resource within the package.
The metadata describes for example the electrochemical system, which contains detailed information about the electrodes or the components of the electrolyte. The metadata also contains information on the curation process, i.e., who was the experimentalist, a URL to an entry in an electronic laboratory notebook (ELN), or details on the experimental set-up. The JSON metadata schema is developed as a separate project.
By following this approach, a set of datapackages forms a collection. The entries of such a collection are displayed in different forms on this website based on the available descriptors. A Python API provides direct access to the entries of such a collection, enabling more specific filtering, and enabling seamless integration into existing workflows.
Reusability
In order to improve the reusability of published data, the authors of echemdb created svgdigitizer, a tool allowing for digitizing any kind of published 2D plots from carefully prepared SVG files. This approach has some superior functionalities compared to other tools, for example, allowing to extract units from the axis labels or reconstructing a time axis based on a given scan rate. Modules for specific types of plots, such as the electrochemistry module offers convenience functionality, and allow extracting additional properties such as the reference potential of a potential axis. By providing a set of metadata, the digitized data can directly be stored as a unitpackage.
Contribute
Contributions are always welcome and do not necessarily require programming skills. Please leave us a message if you are interested in contributing to the echemdb.
You could get started by digitizing some published data in your area of research or by extending any of the pages of the echemdb website. If your interest is outside of cyclic voltammograms or electrochemistry, we would also be thrilled to hear about your ideas to extend these projects to other areas.
What’s next
We anticipate developing additional tools which help in generating unitpackages or allow for interacting with unitpackages. The latter comprises common tools for the evaluation of electrochemical data. We also plan to include other electrochemical data from methods other than cyclic voltammetry in our database, including data which have been recorded concomitantly, such as disc electrode currents or mass spectrometry signals.
Contact
The authors of echemdb are from the fields of experimental and theoretical physical chemistry, as well as from computer science and mathematics.
Ideas and suggestions, tell us more on our discussion board.
Reach individual contributors on the GitHub organization.
Discuss and stay up to date on echemdb.zulipchat.com.