Predictive Soil Spectroscopy

Authors

  • José Lucas Safanelli, Woodwell Climate Research Center
  • Robert Minarik, OpenGeoHub Foundation
  • Tomislav Hengl, OpenGeoHub Foundation
  • Jonathan Sanderman, Woodwell Climate Research Center

Published

August 1, 2024

Welcome!

Welcome to our training guide on Predictive Soil Spectroscopy! This material was first published for an in-person workshop held in St. Louis, MO, at the 2023 ACS international meeting.

It is now a live guide that anyone can access and reuse!

Soil spectroscopy, specifically Diffuse Reflectance Spectroscopy, is rapidly becoming a routine tool for soil analysis in academia and in industry.

One of the most popular uses of soil spectroscopy is for the rapid and low-cost estimation of particle size distribution, carbon fractions, and clay minerals.

This guide touches on the basics of soil spectroscopy development, including project design, considerations for building a spectral library, working with large and public spectral libraries, and predictive modeling.

Most of the learning will focus on using the free and open-source R programming language.

This material was updated with R version 4.5, and it is recommended to use RStudio as the graphical user interface.

Prerequisites

This training mostly focuses on tidy programming principles with pipe operators, leveraging tidyverse R packages such as dplyr, tidyr, and ggplot2.
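As a rough illustration of what piped, tidy-style code looks like, here is a minimal sketch. The toy table, its column names, and the absorbance values are all made up for this example and are not taken from the workshop data; only dplyr and tidyr are assumed to be installed.

```r
library(dplyr)
library(tidyr)

# Hypothetical toy spectra: two samples, three wavenumbers (made-up values)
spectra_wide <- tibble(
  sample_id = c("s1", "s2"),
  `4000` = c(0.10, 0.20),
  `3998` = c(0.12, 0.22),
  `3996` = c(0.15, 0.21)
)

# Reshape from wide (one column per wavenumber) to long (one row per reading)
spectra_long <- spectra_wide %>%
  pivot_longer(
    cols = -sample_id,
    names_to = "wavenumber",
    values_to = "absorbance"
  ) %>%
  mutate(wavenumber = as.numeric(wavenumber))

# Summarise per sample: each step feeds the next through the pipe
spectra_summary <- spectra_long %>%
  group_by(sample_id) %>%
  summarise(mean_absorbance = mean(absorbance), .groups = "drop")

print(spectra_summary)
```

Each verb takes a data frame and returns a data frame, which is what lets the steps be chained and read top to bottom. The long format produced here is also the shape that ggplot2 typically expects for plotting.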

The first version of this guide used the mlr3 machine learning framework. However, we decided to switch to the tidymodels ecosystem, as it has a simpler and more user-friendly interface.

Alternatively, we have included a chemometrics chapter where some common tools and algorithms for working with spectral data are introduced. This was possible with the availability of the amazing package mdatools.

We recommend that you keep an eye on this online material, as it may evolve over time and new methods may be incorporated.

If you are interested in getting started in R using tidy packages and principles, we strongly recommend checking the R for Data Science book page:

  • For installing R and RStudio, it is recommended to check the Prerequisites page.
  • Learning how to set up a basic project in RStudio is neatly described in Workflow: projects.
  • We are going to have several demonstrations of data import and wrangling with piped operations, and of plot visualizations with ggplot2.

Other spectral operations, like importing raw spectral files, preprocessing, compression, and modeling, can be done with dedicated libraries, e.g., asdreader, opusreader2, prospectr, resemble, tidymodels, and many others.

Disclaimer

Woodwell Climate Research Center, University of Florida, OpenGeoHub foundation and its suppliers and licensors hereby disclaim all warranties of any kind, express or implied, including, without limitation, the warranties of merchantability, fitness for a particular purpose and non-infringement. Neither Woodwell Climate Research Center, University of Florida, OpenGeoHub foundation nor its suppliers and licensors, makes any warranty that the Website will be error free or that access thereto will be continuous or uninterrupted. You understand that you download from, or otherwise obtain content or services through, the Website at your own discretion and risk.

If you notice an error or outdated information, please submit a correction/pull request or open an issue.

License

This website/book and the attached software are free to use and are licensed under the MIT License. The OSSL training data and models, if not otherwise indicated, are available either under the Creative Commons Attribution 4.0 International CC-BY and/or CC-BY-SA license / Open Data Commons Open Database License (ODbL) v1.0.

Acknowledgments

Soil Spectroscopy for Global Good is organized by Woodwell Climate Research Center, University of Florida, and OpenGeoHub foundation. This project has been funded by the USDA National Institute of Food and Agriculture award #2020-67021-32467.

Citing

José Lucas Safanelli, Robert Minarik, Jonathan Sanderman, and Tomislav Hengl. Predictive Soil Spectroscopy. 2023. Available at: https://soilspectroscopy.github.io/soilspec-workshop/.