The Sedimentary geochemistry and paleoenvironments project phase 2 data release: An open data resource for the study of Earth’s environmental history

Úna C. Farrell, Hunter C. Olson, Maya O. Thompson, Michelle L. Abshire, Oyeleye O. Adeboye, Anne-Sofie C. Ahm, Lewis J. Alcott, Thomas J. Algeo, Ross P. Anderson, Arif H. Ansar, Lucas Pinto Heckert Bastos, Kohen W. Bauer, Brian Beaty, Justin E. Birdwell, Fred T. Bowyer, Jochen J. Brocks, Tessa Brunoir, James F. Busch, Donald E. Canfield, Fabrício A. Caxito, Chao Chang, Meng Cheng, Jean N.R. Clemente, David R. Cordie, Peter W. Crockford, Huan Cui, Celeste M. Cunningham, Tais W. Dahl, Janaina Rodrigues de Paula, Carol M. Dehler, Lucas Del Mouro, Keith Dewing, Dermeval Aparecido do Carmo, Stephen Q. Dornbos, Nadja Drabon, Julie A. Dumoulin, Omabehere Innocent Ejeh, Emily Ellefson, Maya Elrick, Joseph F. Emmings, Bokanda Ekoko Eric, Hao Fang, Gabriella Fazio, Henrique A. Fernandes, Katherine L. French, Robert R. Gaines, Richard M. Gaschnig, Timothy M. Gibson, Geoffrey J. Gilleaudeau, Karin Goldberg, Zheng Gong, Amy P.I. Hagen, Galen P. Halverson, Kalev Hantsoo, Emma R. Haxen, Miles A. Henderson, João P.T.M. Hippertt, Malcolm S.W. Hodgskiss, Paul F. Hoffman, Edward C. Huang, Benjamin W. Johnson, Pavel B. Kabanov, Junyao Kang, C. Brenhin Keller, Brian Kendall, Julien Kimmig, Sara R. Kimmig, Michael A. Kipp, Andrew H. Knoll, Timmu Kreitsmann, Anurag A. Kulkarni, Alexandra Kunert, Marcus Kunzmann, Jiankang Lai, Richard O. Lease, Chao Li, Sen Li, Alex G. Lipp, Yang Liu, David K. Loydell, Xinze Lu, Katie M. Maloney, Kaarel Mänd, Alexie E.G. Millikin, N. Tanner Mills, Kento Motomura, Chiza N. Mwinde, Lyle L. Nelson, Nora M. Nieminski, Brennan O'Connell, Edel O'Sullivan, Juliana Okubo, Jaden K. Olah, Frantz Ossa Ossa, Chadlin M. Ostrander, Kärt Paiste, Camille A. Partin, Egberto Pereira, Shanan E. Peters, Tiffany Playter, Susannah M. Porter, Simon W. Poulton, Sara B. Pruss, Zhen Qiu, Daven P. Quinn, Mariano Remírez, Sebastian Richiano, Sylvain Richoz, Kathryn I. Rico, Samantha R. Ritzer, Zachary Roney, Alan D. Rooney, William C. Rose, Elias J. Rugen, Swapan K. Sahoo, Shane D. Schoepfer, Judith A. Sclafani, Nathan D. Sheldon, Yanan Shen, Graham A. Shields, Pulkit Singh, Arvind Kumar Singh, Sarah P. Slotznick, Emily F. Smith, Haijun Song, Sam C. Spinks, Richard G. Stockey, Justin V. Strauss, Eva E. Stüeken, Zongyuan Sun, Dongjie Tang, Lidya G. Tarhan, Danielle Thomson, Nicholas J. Tosca, Rosalie Tostevin, Chenyi Tu, Maoli N. Vizcaíno, Yuxuan Wang, Changle Wang, Xiaomei Wang, Lucas Veríssimo Warren, Lucy C. Webb, Philip R. Wilby, Christina R. Woltz, Rachel Wood, Yuyang Wu, Xiuqing Yang, Inessa A. Yurchenko, Junpeng Zhang, Jessica H. Whiteside, Benjamin C. Gill, Akshay K. Mehra, Kimberly V. Lau, Noah Planavsky, David T. Johnston, and Erik A. Sperling

Chemical Geology, 2026: https://doi.org/10.1016/j.chemgeo.2026.123311


Abstract:
Geochemical data from sedimentary rocks are the primary source of information regarding Earth’s surface evolution through time, including its air and water envelopes and interactions with life and deep Earth processes. The Sedimentary Geochemistry and Paleoenvironments Project (SGP) is a scientific consortium centered around open data and community-driven development of cyberinfrastructure tools and resources for sedimentary geochemistry and Earth history. Here we describe the SGP Phase 2 data release, which focused on incorporating Paleoproterozoic and Mesoproterozoic (2500–1000 million years ago) data and better accommodating carbonate data. This data release was built through the involvement of >200 researchers worldwide in academia, government, and industry, and provides the largest available public data resource for our user community in the academic fields of geochemistry, sedimentology, tectonics, paleontology, Earth history, and paleoclimate, as well as the petroleum and minerals industries. The dataset now encompasses 126,006 samples and 4,132,371 geochemical analyses. In addition to direct entry by SGP Team Members, we have ingested and incorporated datasets from the Geoscience Australia OZCHEM database, the Alberta Geological Survey, and the Deep-Time Marine Sedimentary Element Database (DM-SED) compilation. This paper details sampling in the Phase 2 dataset with respect to age, geography, lithology, and other geological characteristics, documents access via our search website and API, discusses possible issues and/or biases in the dataset that could impact analyses, describes plans for governance and stewardship of data from Indigenous lands, and serves as the citable reference paper for the data release.

Suggested citation:
Farrell, Ú. C., et al. (2026). The Sedimentary geochemistry and paleoenvironments project phase 2 data release: An open data resource for the study of Earth’s environmental history. Chemical Geology 123311.