Link, A. James; Carson, Drew V.; So, Larry; Cheung-Lee, Wai Ling
This entry encompasses the raw NMR spectra used to determine the structure of the lasso peptide achromonodin-1. Within one file are included the five following spectra: COSY, TOCSY, NOESY (150 ms mixing time), NOESY (700 ms mixing time), and C,H HSQC. The file requires Mestrenova software to read. These spectra were used to develop the 3D structure models of achromonodin-1 that are deposited at the protein data bank (PDB) as entry 8SVB.
Mondal, Shanka Subhra; Webb, Taylor; Cohen, Jonathan
A dataset of Raven’s Progressive Matrices (RPM)-like problems using realistically rendered
3D shapes, based on source code from CLEVR (a popular visual-question-answering dataset) (Johnson, J., Hariharan, B., Van Der Maaten, L., Fei-Fei, L., Lawrence Zitnick, C., & Girshick, R. (2017). Clevr: A diagnostic dataset for compositional language and elementary visual reasoning. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2901-2910)).
Physical and biogeochemical variables from the NOAA-GFDL Earth System Model 2M experiments (pre-processed), previously published observation-based datasets, and code to reproduce figures from these datasets, used for the study 'Hydrological cycle amplification reshapes warming-driven oxygen loss in Atlantic Ocean'.
This dataset encompasses two distinct sets of data analyzed in the study, namely Asian American Scholar Forum survey data and Microsoft Academic Graph bibleometrics data:
Yu Xie, Xihong Lin, Ju Li, Qian He, Junming Huang, Caught in the Crossfire: Fears of Chinese-American Scientists, Proceedings of the National Academy of Sciences, in press (2023).
This dataset contains example input files, training data sets and potential files related to the publication "First-principles-based Machine Learning Models for Phase Behavior and Transport Properties of CO2." by Mathur et al (2023). In this work, we developed machine learning models for CO2 based on different exchange-correlation DFT functionals. We assessed their performance on liquid densities, vapor-liquid equilibrium and transport properties.
Numerical data is tabulated for all plots (Figures 2, 3a-b, 4-89, S1, S4a-b,d, S5a-b,d, S6-S156) and included as separate spreadsheets categorized by figure in a .zip file in the Supplementary Material. Error bars in Figure 4 show the spread of data observed for 4 and 5 trials on independent samples for MIL-101 and MOF-235, respectively. Figure 6a shows the average of triplicate filtrate test conversions with error propagated based on this spread. Figures 6b and S165 error bars on rate constants are determined based on propagated conversion uncertainty for independent trials and extracted standard deviations of pseudo-first order rate constants from linearized plots. Error bars on other plots represent propagation of experimental uncertainty on single trials.
This item provides access to all configurations of single-chain nanoparticles analyzed in the manuscript "Sequence Patterning, Morphology, and Dispersity in Single-Chain Nanoparticles: Insights from Simulation and Machine Learning" by Roshan A. Patel, Sophia Colmenares, and Michael A. Webb (DOI: 10.1021/acspolymersau.3c00007). The single-chain nanoparticles derive from 320 unique precursor chains that are distinguished by the fraction of linker beads that decorate a fixed-length polymer backbone and the distribution or blockiness of those linker beads. The data is provided in the form of serialized object using the `pickle' python module. The data was compiled using Python version 3.8.8 and Clang 10.0.0. The Python object loaded from the .pkl file is a nested list, with the first dimension having 7,680 entries for the 7,680 unique single-chain nanoparticles produced in the aforementioned paper. Each of those 7,680 entries is itself a list with 20 entries, representing the 20 different simulation snapshots of the given single-chain nanoparticle. Each of the 20 entries is another list with two entries, with the first being a numpy.ndarray containing the x,y,z coordinates of all the beads comprising the single-chain nanoparticle and the second being a numpy.ndarray with a numerical encoding to indicate whether the beads are backbone (indicated as '0') or linker beads (indicated as '1'). Altogether, this provides 153,600 configurations of single-chain nanoparticles.
Chronic hepatitis B (CHB), caused by hepatitis B virus (HBV), remains a major medical problem. HBV has a high propensity for progressing to chronicity and can result in severe liver disease, including fibrosis, cirrhosis and hepatocellular carcinoma. CHB patients frequently present with viral coinfection, including HIV and hepatitis delta virus. About 10% of chronic HIV carriers are also persistently infected with HBV which can result in more exacerbated liver disease. Mechanistic studies of HBV-induced immune responses and pathogenesis, which could be significantly influenced by HIV infection, have been hampered by the scarcity of immunocompetent animal models. Here, we demonstrate that humanized mice dually engrafted with components of a human immune system and a human liver supported HBV infection, which was partially controlled by human immune cells, as evidenced by lower levels of serum viremia and HBV replication intermediates in the liver. HBV infection resulted in priming and expansion of human HLA-restricted CD8+ T cells, which acquired an activated phenotype. Notably, our dually humanized mice support persistent coinfections with HBV and HIV which opens opportunities for analyzing immune dysregulation during HBV and HIV coinfection and preclinical testing of novel immunotherapeutics.
This dataset contains input files, training data and other files related to the machine learning models developed during the work by Muniz et al. In this work, we construct machine learning models based on the MB-pol many-body model. We find that the training set should include cluster configurations as well as liquid phase configurations in order to accurately represent both liquid and VLE properties. The results attest for the ability of machine learning models to accurately represent many-body potentials and provide an efficient avenue for water simulations.
Understanding the condensed-phase behavior of chiral molecules is important in biology, as well as in a range of technological applications, such as the manufacture of pharmaceuticals. Here, we use molecular dynamics simulations to study a chiral four-site molecular model that exhibits a second-order symmetry-breaking phase transition from a supercritical racemic liquid, into subcritical D-rich and L-rich liquids. We determine the infinite-size critical temperature using the fourth-order Binder cumulant, and we show that the finite-size scaling behavior of the order parameter is compatible with the 3D Ising universality class. We also study the spontaneous D-rich to L-rich transition at a slightly subcritical temperature T ~ 0.985 Tc and our findings indicate that the free energy barrier for this transformation increases with system size as N^2/3 where N is the number of molecules, consistent with a surface-dominated phenomenon. The critical behavior observed herein suggests a mechanism for chirality selection in which a liquid of chiral molecules spontaneously forms a phase enriched in one of the two enantiomers as the temperature is lowered below the critical point. Furthermore, the increasing free energy barrier with system size indicates that fluctuations between the L-rich and D-rich phases are suppressed as the size of the system increases, trapping it in one of the two enantiomerically-enriched phases. Such a process could provide the basis for an alternative explanation for the origin of biological homochirality. We also conjecture the possibility of observing nucleation at subcritical temperatures under the action of a suitable chiral external field.
Piaggi, Pablo M; Gartner, Thomas E; Car, Roberto; Debenedetti, Pablo G
The possible existence of a liquid-liquid critical point in deeply supercooled water has been a subject of debate in part due to the challenges associated with providing definitive experimental evidence. Pioneering work by Mishima and Stanley [Nature 392, 164 (1998) and Phys.~Rev.~Lett. 85, 334 (2000)] sought to shed light on this problem by studying the melting curves of different ice polymorphs and their metastable continuation in the vicinity of the expected location of the liquid-liquid transition and its associated critical point. Based on the continuous or discontinuous changes in slope of the melting curves, Mishima suggested that the liquid-liquid critical point lies between the melting curves of ice III and ice V. Here, we explore this conjecture using molecular dynamics simulations with a purely-predictive machine learning model based on ab initio quantum-mechanical calculations. We study the melting curves of ices III, IV, V, VI, and XIII using this model and find that the melting lines of all the studied ice polymorphs are supercritical and do not intersect the liquid-liquid transition locus. We also find a pronounced, yet continuous, change in slope of the melting lines upon crossing of the locus of maximum compressibility of the liquid. Finally, we analyze critically the literature in light of our findings, and conclude that the scenario in which melting curves are supercritical is favored by the most recent computational and experimental evidence. Thus, although the preponderance of experimental and computational evidence is consistent with the existence of a second critical point in water, the behavior of the melting lines of ice polymorphs does not provide strong evidence in support of this viewpoint, according to our calculations.
Physical and biogeochemical variables from the NOAA-GFDL Earth System Model 2M experiments, and previously published observation-based datasets, used for the study 'Hydrological cycle amplification reshapes warming-driven oxygen loss in Atlantic Ocean'.
The materials include codes and example input / output files for Monte Carlo simulations of lattice chains in the grand canonical ensemble, for determining phase behavior, critical points, and formation of aggregates.
Large-eddy simulations were employed over half-ice and half-water surfaces, with varying surface temperatures, wind speeds, directions, as to test if the atmospheric interaction with the heterogeneous surface can be predicted via a heterogeneity Richardson number. This dataset was used to determine that surface heat fluxes over ice, water, and the aggregate surface seem to be captured reasonably well by the wind direction and the heterogeneity Richardson number, but the mean wind and turbulent kinetic energy (TKE) profiles were not, suggesting that not only the difference in stability between the two surface, but also the individual stabilities over each surface influence the dynamics.
Large-eddy simulations were employed over five different sea ice patterns, with a constant ice fraction, to test if the overlying atmospheric boundary layer (ABL) dynamics and thermodynamics differs. The results of these simulations were used to determine that there were differences in vertical heat flux, momentum flux, and horizontal wind speed, and that more surface information is needed to predict the ABL over the sea ice surface. To see what other surface information is needed, twenty-two landscape metrics were calculated over forty-four different maps at differing resolutions, using the FRAGSTATs program. The results of that analysis are available in a .csv file in this dataset.
This repository contains the raw photon-by-photon single-molecule FRET (smFRET) trajectories, SAXS data, and MD simulation trajectories, multi-sequence alignment, and gel images for the paper titled "Sub-Domain Dynamics Enables Chemical Chain Reactions in Nonribosomal Peptide Synthetases."
O'Neill, Eric; Lark, Tyler; Xie, Yanhua; Basso, Bruno
Collection of the underlying spatially explicit data for Available Land for Cellulosic Biofuel Production: A Supply Chain Centered Comparison. Includes raw biomass yield data and soil carbon sequestration potential data for three types of marginal land for the USA midwest at the field level including field areas. Collection also includes raw land rasters for the three types of marginal land, model parameters for the MILP model used in the study, and results used to generate the figures in the paper.
Griffies, Stephen M; Beadling, Rebecca L; Krasting, John P; Hurlin, William J
This output was produced in coordination with the Southern Ocean Freshwater release model experiments Initiative (SOFIA) and is the Tier 1 experiment where freshwater is delivered in a spatially and temporally uniform pattern at the surface of the ocean at sea surface temperature in a 1-degree latitude band extending from Antarctica’s coastline. The total additional freshwater flux imposed as a monthly freshwater flux entering the ocean is 0.1 Sv. Users are referred to the methods section of Beadling et al. (2022) for additional details on the meltwater implementation in CM4 and ESM4. The datasets in this collection contain model output from the coupled global climate model, CM4, and Earth System Model, ESM4, both developed at the Geophysical Fluid Dynamics Laboratory (GFDL) of the National Oceanic and Atmospheric Administration (NOAA). The ocean_monthly_z and ocean_annual_z output are provided as z depth levels in meters as opposed to the models native hybrid vertical ocean coordinate which consists of z* (quasi-geopotential) coordinates in the upper ocean through the mixed layer, transitioning to isopycnal (referenced to 2000 dbar) in the ocean interior. Please see README for further details.
Notterman, Daniel A; Schneper, Lisa M; Drake, Amanda; Piyasena, Chinthika
This entry contains the data used in the PLOS ONE publication entitled, "Characteristics of salivary telomere length shortening in preterm infants" by Schneper et al. The objective of the study was to examine the association between gestational age, telomere length (TL) and rate of shortening in newborns. Genomic DNA was isolated from buccal samples of 39 term infants at birth and one year and 32 preterm infants at birth, term-adjusted age (40 weeks post-conception) and age one-year corrected for gestational duration. Telomere length was measured by quantitative real-time PCR. Demographic and clinical data were collected during clinic or research visits and from hospital records. Socioeconomic status was estimated using the deprivation category (DEPCAT) scores derived from the Carstairs score of the subject's postal code.