Webb, Michael; Jacobs, William; An, Yaxin; Oliver, Wesley
This distribution compiles thermodynamic and (where available) dynamic properties of short protein sequences as obtained from coarse-grained molecular dynamics simulations. The dataset features 2114 protein sequences with sequence lengths ranging from N=20 up to N=50 amino acids. The simulation and analysis of these sequences is described in "Active learning of the thermodynamics--dynamics tradeoff in protein condensates'' by Yaxin An, Michael A. Webb*, and William M. Jacobs* (https://doi.org/10.48550/arXiv.2306.03696). Of the 2114 protein sequences, 80 are homomeric polypeptides (replicating a single amino acid for N = 20, 30, 40, and 50), 1266 are sourced from version 9.0 of the DisProt database, and the remaining 768 sequences are novel sequences generated during an active learning campaign described in the aforementioned manuscript. The simulations were performed using the LAMMPS molecular dynamics engine. The interactions used for simulation are obtained from R. M. Regy , J. Thompson , Y. C. Kim and J. Mittal , Improved coarse-grained model for studying sequence dependent phase separation of disordered proteins, Protein Sci., 2021, 1371 —1379. Properties included in this distribution include second virial coefficients, pressure-density data, expectation for phase behavior at 300 K, estimated condensed-phase densities at 300 K (if exist), and condensed-phase self-diffusion coefficients at 300 K (if exist).
Microscopy images are part of a paper entitled "Structured foraging of soil predators unveils functional responses to bacterial defenses" by Fernando Rossine, Gabriel Vercelli, Corina Tarnita, and Thomas Gregor. For detailed acquisition methods see the paper. Experiments were performed between 2019 and 2020 at Princeton University. Two types of images are provided, macroscopic and microscopic widefiled Images. Macroscopic images all show Petri dishes covered in fluorescent bacteria being consumed by amoebae. Images are shown for D. discoideum, P. violaceum, and A. castellanii. Images depicting drug treatments (Nystatin and Fluorouracil) were obtained using D. discoideum. Images used for the creation of a profile were all taken within 30 minutes of each other. Within each directory numbered images are independent replicates. The raw video directory contains time series for dishes under drug treatments. Each numbered folder is a sequence of photos (taken 30 minutes apart of each other) of a single dish. Microscopic images all show amoebae consuming bacteria on a petri dish. The 45 minute videos show either edge cells (located at the edge of amoebae colonies), or inner cells (located 2.5 millimeters towards the center of the colony, from the edge). Videos are confocal stacks, with bacteria showing in green and amoebae appearing as black holes within the bacterial lawn. As was for the macroscopic images, images are shown for D. discoideum, P. violaceum, and A. castellanii. Images depicting drug treatments (Nystatin and Fluorouracil) were obtained using D. discoideum.
Guo, Xuehui; Pan, Da; Daly, Ryan; Chen, Xi; Walker, John; Tao, Lei; McSpiritt, James; Zondlo, Mark
Gas-phase ammonia (NH3), emitted primarily from agriculture, contributes significantly to reactive nitrogen (Nr) deposition. Excess deposition of Nr to the environment causes acidification, eutrophication, and loss of biodiversity. The exchange of NH3 between land and atmosphere is bidirectional and can be highly heterogenous when underlying vegetation and soil characteristics differ. Direct measurements that assess the spatial heterogeneity of NH3 fluxes are lacking. To this end, we developed and deployed two fast-response, quantum cascade laser-based open-path NH3 sensors to quantify NH3 fluxes at a deciduous forest and an adjacent grassland separated by 700 m in North Carolina, United States from August to November, 2017. The sensors achieved 10 Hz precisions of 0.17 ppbv and 0.23 ppbv in the field, respectively. Eddy covariance calculations showed net deposition of NH3 (-7.3 ng NH3-N m−2 s−1) to the forest canopy and emission (3.2 ng NH3-N m−2 s−1) from the grassland. NH3 fluxes at both locations displayed diurnal patterns with absolute magnitudes largest midday and with smaller peaks in the afternoons. Concurrent biogeochemistry data showed over an order of magnitude higher NH3 emission potentials from green vegetation at the grassland compared to the forest, suggesting a possible explanation for the observed flux differences. Back trajectories originating from the site identified the upwind urban area as the main source region of NH3. Our work highlights the fact that adjacent natural ecosystems sharing the same airshed but different vegetation and biogeochemical conditions may differ remarkably in NH3 exchange. Such heterogeneities should be considered when upscaling point measurements, downscaling modeled fluxes, and evaluating Nr deposition for different natural land use types in the same landscape. Additional in-situ flux measurements accompanied by comprehensive biogeochemical and micrometeorological records over longer periods are needed to fully characterize the temporal variabilities and trends of NH3 fluxes and identify the underlying driving factors.
This distribution compiles numerous physical properties for 2,585 intrinsically disordered proteins (IDPs) obtained by coarse-grained molecular dynamics simulation. This combination comprises "Dataset A" as reported in "Featurization strategies for polymer sequence or composition design by machine learning" by Roshan A. Patel, Carlos H. Borca, and Michael A. Webb (DOI: 10.1039/D1ME00160D). The specific IDP sequences are sourced from version 9.0 of the DisProt database. The simulations were performed using the LAMMPS molecular dynamics engine. The interactions used for simulation are obtained from R. M. Regy , J. Thompson , Y. C. Kim and J. Mittal , Improved coarse-grained model for studying sequence dependent phase separation of disordered proteins, Protein Sci., 2021, 1371 —1379.
The dataset contains the model file for the Global Adjoint Tomography Model 25 (GLAD-M25). The model file contains parameters defined on the spectral-element mesh and is recommend to be used in SPECFEM3D GLOBE for seismic wave simulation at the global scale.
There has been considerable recent interest in the high-pressure behavior of silicon carbide, a potential major constituent of carbon-rich exoplanets. In this work, the atomic-level structure of SiC was determined through in situ X-ray diffraction under laser-driven ramp compression up to 1.5 TPa; stresses more than seven times greater than previous static and shock data. Here we show that the B1-type structure persists over this stress range and we have constrained its equation of state (EOS). Using this data we have determined the first experimentally based mass-radius curves for a hypothetical pure SiC planet. Interior structure models are constructed for planets consisting of a SiC-rich mantle and iron-rich core. Carbide planets are found to be ~10% less dense than corresponding terrestrial planets.
Geyman, Emily C.; Wu, Ziman; Nadeau, Matthew D.; Edmonsond, Stacey; Turner, Andrew; Purkis, Sam J.; Howes, Bolton; Dyer, Blake; Ahm, Anne-Sofie C.; Yao, Nan; Deutsch, Curtis A.; Higgins, John A.; Stolper, Daniel A.; Maloof, Adam C.
Carbonate mud represents one of the most important geochemical archives for reconstructing ancient climatic, environmental, and evolutionary change from the rock record. Mud also represents a major sink in the global carbon cycle. Yet, there remains no consensus about how and where carbonate mud is formed. In this contribution, we present new geochemical data that bear on this problem, including stable isotope and minor and trace element data from carbonate sources in the modern Bahamas such as ooids, corals, foraminifera, and green algae.
This dataset contains input and output files to reproduce the results of the manuscript "Homogeneous ice nucleation in an ab initio machine learning model" by Pablo M. Piaggi, Jack Weis, Athanassios Z. Panagiotopoulos, Pablo G. Debenedetti, and Roberto Car (arXiv preprint https://arxiv.org/abs/2203.01376). In this work, we studied the homogeneous nucleation of ice from supercooled liquid water using a machine learning model trained on ab initio energies and forces. Since nucleation takes place over times much longer than the simulation times that can be afforded using molecular dynamics simulations, we make use of the seeding technique that is based on simulating an ice cluster embedded in liquid water. The key quantity provided by the seeding technique is the size of the critical cluster (i.e., a size such that the cluster has equal probabilities of growing or shrinking at the given supersaturation). Using data from the seeding simulations and the equations of classical nucleation theory we compute nucleation rates that can be compared with experiments.