This dataset contains supplementary materials for Chapter 4 and Chapter 5 of Yiheng Tao's PhD dissertation (2022). The dissertation’s abstract is provided here:
Carbon capture, utilization, and storage (CCUS) mitigates climate change by capturing carbon dioxide (CO2) emissions from large point sources, or CO2 from the ambient air, and subsequently reusing the captured CO2 or injecting it into deep geological formations for long-term and secure storage. Almost all current decarbonization pathways include large-scale CCUS, on the order of a billion tonnes (Gt) of CO2 captured and stored each year globally starting in 2030, yet the actual deployment has lagged far behind (around 0.04 Gt CO2 was captured in 2021). In this dissertation, I contribute to several aspects of largescale deployment of CCUS by (1) developing and applying efficient numerical models to simulate geological CO2 storage and (2) identifying key policies to address the bottlenecks of overall CCUS deployment. This dissertation concerns the United States, China, and the Belt and Road Initiative (BRI) region through research projects that are consistent with each location’s current development stage of CCUS.
Chapters 2 and 3 contain computational modeling studies. In Chapter 2, I develop a new series of vertical-equilibrium (VE) models in the dual-continuum modeling framework to simulate CO2 injection and migration in fractured geological formations. Those models are shown to be effective and efficient when properties of the formation allow for the VE assumption. In Chapter 3, I apply a VE model to simulate basin-scale CO2 injection in the Junggar Basin of Northwestern China. The results show that current regional emissions of more than 100 million tonnes of CO2 per year can be stored effectively, thereby confirming the great potential of the Junggar Basin for early CCUS deployment.
Chapters 4 and 5 contain policy analyses. In Chapter 4, I propose a dynamic system consisting of new CO2 pipelines and novel Allam-cycle power plants in the Central United States, and examine how government policies, including an extended Section 45Q tax credit, may improve the economic feasibility of this system. Lastly, in Chapter 5, I investigate and quantify CO2 emissions implications of power plant projects associated with the BRI. I also propose a “greenness ratio” to measure the level of environmental sustainability of BRI in the power sector.
This dataset contains all the model output used to generate the figures and data reported in the article "Climate, soil organic layer, and nitrogen jointly drive forest development after fire in the North American boreal zone". The data was generated during spring 2015 using the a modified version of the Ecosystem Demography model version 2, provided as a supplement accompanying the article. The data was generated using the computational resources supported by the PICSciE OIT High Performance Computing Center and Visualization Laboratory at Princeton University. The dataset contains a pdf Readme file which explains in detail how the data can be used. Users are recommended to go through this file before using the data.
O'Neill, Eric; Lark, Tyler; Xie, Yanhua; Basso, Bruno
Collection of the underlying spatially explicit data for Available Land for Cellulosic Biofuel Production: A Supply Chain Centered Comparison. Includes raw biomass yield data and soil carbon sequestration potential data for three types of marginal land for the USA midwest at the field level including field areas. Collection also includes raw land rasters for the three types of marginal land, model parameters for the MILP model used in the study, and results used to generate the figures in the paper.
Griffies, Stephen M; Beadling, Rebecca L; Krasting, John P; Hurlin, William J
This output was produced in coordination with the Southern Ocean Freshwater release model experiments Initiative (SOFIA) and is the Tier 1 experiment where freshwater is delivered in a spatially and temporally uniform pattern at the surface of the ocean at sea surface temperature in a 1-degree latitude band extending from Antarctica’s coastline. The total additional freshwater flux imposed as a monthly freshwater flux entering the ocean is 0.1 Sv. Users are referred to the methods section of Beadling et al. (2022) for additional details on the meltwater implementation in CM4 and ESM4. The datasets in this collection contain model output from the coupled global climate model, CM4, and Earth System Model, ESM4, both developed at the Geophysical Fluid Dynamics Laboratory (GFDL) of the National Oceanic and Atmospheric Administration (NOAA). The ocean_monthly_z and ocean_annual_z output are provided as z depth levels in meters as opposed to the models native hybrid vertical ocean coordinate which consists of z* (quasi-geopotential) coordinates in the upper ocean through the mixed layer, transitioning to isopycnal (referenced to 2000 dbar) in the ocean interior. Please see README for further details.
Kim, Chang-Goo; Ostriker, Eve; Gong, Munan; Kim, Jeong-Gyu
We present the public data release of the TIGRESS (Three-phase Interstellar Medium in Galaxies Resolving Evolution with Star Formation and Supernova Feedback) simulations. This release includes simulations representing the solar neighborhood environment at spatial resolutions of 2 and 4 pc. The original magneto-hydrodynamic simulation data is published along with data products from post-processing, including chemistry, CO emission line, and photoionization (HII regions). Data reading and analysis examples are provided in Python.
Notterman, Daniel A; Schneper, Lisa M; Drake, Amanda; Piyasena, Chinthika
This entry contains the data used in the PLOS ONE publication entitled, "Characteristics of salivary telomere length shortening in preterm infants" by Schneper et al. The objective of the study was to examine the association between gestational age, telomere length (TL) and rate of shortening in newborns. Genomic DNA was isolated from buccal samples of 39 term infants at birth and one year and 32 preterm infants at birth, term-adjusted age (40 weeks post-conception) and age one-year corrected for gestational duration. Telomere length was measured by quantitative real-time PCR. Demographic and clinical data were collected during clinic or research visits and from hospital records. Socioeconomic status was estimated using the deprivation category (DEPCAT) scores derived from the Carstairs score of the subject's postal code.
Chang, Claire H. C.; Lazaridi, Christina; Yeshurun, Yaara; Norman, Kenneth A.; Hasson, Uri
This study examined how the brain dynamically updates event representations by integrating new information over multiple minutes while segregating irrelevant input. A professional writer custom-designed a narrative with two independent storylines, interleaving across minute-long segments (ABAB). In the last (C) part, characters from the two storylines meet and their shared history is revealed. Part C is designed to induce the spontaneous recall of past events, upon the recurrence of narrative motifs from A/B, and to shed new light on them. Our fMRI results showed storyline-specific neural patterns, which were reinstated (i.e. became more active) during storyline transitions. This effect increased along the processing timescale hierarchy, peaking in the default mode network. Similarly, the neural reinstatement of motifs was found during part C. Furthermore, participants showing stronger motif reinstatement performed better in integrating A/B and C events, demonstrating the role of memory reactivation in information integration over intervening irrelevant events.
The dielectric function for "Astrodust" grain material is provided for different assumed values of the dust grain shape (spheroid axis ratio) and porosity (vacuum fraction), and fraction of the interstellar iron present as metallic inclusions. For each case, the dielectric function is obtained by requiring that the grains reproduce the observed infrared opacity, and match to a physically reasonable dielectric function at 1 micron, and extending to X-ray energies. The derived dielectric functions satisfy the Kramers-Kronig relations. Dielectric functions are provided from 1 Angstrom to 5 cm (12.4 keV to 2.59e-5 eV).
For each dielectric function, we also calculate absorption and scattering corss sections for spheroidal grains, for three orientations of the grain relative to incident linearly-polarized light, for wavelengths from the Lyman limit (0.0912 micron) to the microwave (4 cm), and grain "effective radii" a_eff from 3.162A to 5.012 micron.
The Molino suite contains 75,000 galaxy mock catalogs designed to quantify the information content of any cosmological observable for a redshift-space galaxy sample. They are constructed from the Quijote N-body simulations (Villaescusa-Navarro et al. 2020) using the standard Zheng et al. (2007) Halo Occupation Distribution (HOD) model. The fiducial HOD parameters are based on the SDSS high luminosity samples. The suite contains 15,000 mocks at the fiducial cosmology and HOD parameters for covariance matrix estimation. It also includes (500 N-body realizations) x (5 HOD realizations)=2,500 mocks at 24 other parameter values to estimate the derivative of the observable with respect to six cosmological parameters (Omega_m, Omega_b, h, n_s, sigma_8, and M_nu) and five HOD parameters (logMmin, sigma_logM, log M_0, alpha, and log M_1). Using the covariance matrix and derivatives calculated from Molino, one can derive Fisher matrix forecasts on the cosmological parameters marginalized over HOD parameters.
Recent advances in experimental techniques have allowed the simultaneous recordings of
populations of hundreds of neurons, fostering a debate about the nature of the collective
structure of population neural activity. Much of this debate has focused on the
empirical findings of a phase transition in the parameter space of maximum entropy
models describing the measured neural probability distributions, interpreting this phase
transition to indicate a critical tuning of the neural code. Here, we instead focus on the
possibility that this is a first-order phase transition which provides evidence that the
real neural population is in a `structured', collective state. We show that this collective
state is robust to changes in stimulus ensemble and adaptive state. We find that the
pattern of pairwise correlations between neurons has a strength that is well within the
strongly correlated regime and does not require fine tuning, suggesting that this state is
generic for populations of 100+ neurons. We find a clear correspondence between the
emergence of a phase transition, and the emergence of attractor-like structure in the
inferred energy landscape. A collective state in the neural population, in which neural
activity patterns naturally form clusters, provides a consistent interpretation for our
Webb, Michael; Jacobs, William; An, Yaxin; Oliver, Wesley
This distribution compiles thermodynamic and (where available) dynamic properties of short protein sequences as obtained from coarse-grained molecular dynamics simulations. The dataset features 2114 protein sequences with sequence lengths ranging from N=20 up to N=50 amino acids. The simulation and analysis of these sequences is described in "Active learning of the thermodynamics--dynamics tradeoff in protein condensates'' by Yaxin An, Michael A. Webb*, and William M. Jacobs* (https://doi.org/10.48550/arXiv.2306.03696). Of the 2114 protein sequences, 80 are homomeric polypeptides (replicating a single amino acid for N = 20, 30, 40, and 50), 1266 are sourced from version 9.0 of the DisProt database, and the remaining 768 sequences are novel sequences generated during an active learning campaign described in the aforementioned manuscript. The simulations were performed using the LAMMPS molecular dynamics engine. The interactions used for simulation are obtained from R. M. Regy , J. Thompson , Y. C. Kim and J. Mittal , Improved coarse-grained model for studying sequence dependent phase separation of disordered proteins, Protein Sci., 2021, 1371 —1379. Properties included in this distribution include second virial coefficients, pressure-density data, expectation for phase behavior at 300 K, estimated condensed-phase densities at 300 K (if exist), and condensed-phase self-diffusion coefficients at 300 K (if exist).
In our study, we compare the three dimensional (3D) morphologic characteristics of Earth's first reef-building animals (archaeocyath sponges) with those of modern, photosynthetic corals. Within this repository are the 3D image data products for both groups of animals. The archaeocyath images were produced through serial grinding and imaging with the Grinding, Imaging, and Reconstruction Instrument at Princeton University. The images in this repository are the downsampled data products used in our study, and the full resolution (>2TB) image stacks are available upon request from the author. For the coral image data, the computed tomography (CT) images of all samples are included at full resolution. Also included in this repository are the manual and automated outline coordinates of the archaeocyath and coral branches, which can be directly used for morphological study.
Extrapolation -- the ability to make inferences that go beyond the scope of one's experiences -- is a hallmark of human intelligence. By contrast, the generalization exhibited by contemporary neural network algorithms is largely limited to interpolation between data points in their training corpora. In this paper, we consider the challenge of learning representations that support extrapolation. We introduce a novel visual analogy benchmark that allows the graded evaluation of extrapolation as a function of distance from the convex domain defined by the training data. We also introduce a simple technique, context normalization, that encourages representations that emphasize the relations between objects. We find that this technique enables a significant improvement in the ability to extrapolate, considerably outperforming a number of competitive techniques.
The Volumetric Camera Calibration Dataset is used for a camera calibration system. Intersecting laser beams are traversed over a volume in the test domain. At each location, the intersecting beams are imaged by camera1 and camera2. A test object is imaged for evaluation.