This dataset contains input files, training data and other files related to the machine learning models developed during the work by Muniz et al. In this work, we construct machine learning models based on the MB-pol many-body model. We find that the training set should include cluster configurations as well as liquid phase configurations in order to accurately represent both liquid and VLE properties. The results attest for the ability of machine learning models to accurately represent many-body potentials and provide an efficient avenue for water simulations.
This dataset contains input and output files to reproduce the results of the manuscript "Homogeneous ice nucleation in an ab initio machine learning model" by Pablo M. Piaggi, Jack Weis, Athanassios Z. Panagiotopoulos, Pablo G. Debenedetti, and Roberto Car (arXiv preprint https://arxiv.org/abs/2203.01376). In this work, we studied the homogeneous nucleation of ice from supercooled liquid water using a machine learning model trained on ab initio energies and forces. Since nucleation takes place over times much longer than the simulation times that can be afforded using molecular dynamics simulations, we make use of the seeding technique that is based on simulating an ice cluster embedded in liquid water. The key quantity provided by the seeding technique is the size of the critical cluster (i.e., a size such that the cluster has equal probabilities of growing or shrinking at the given supersaturation). Using data from the seeding simulations and the equations of classical nucleation theory we compute nucleation rates that can be compared with experiments.
This dataset contains all data relevant to a forthcoming publication in which we used molecular simulation methods to study the phase behavior of supercooled water. The dataset contains simulation input and output files, processed data files, and image files used to create all plots in the manuscript. Python analysis scripts are also included, including instructions for how to re-generate all plots in the manuscript.
Piaggi, Pablo M; Gartner, Thomas E; Car, Roberto; Debenedetti, Pablo G
Abstract:
The possible existence of a liquid-liquid critical point in deeply supercooled water has been a subject of debate in part due to the challenges associated with providing definitive experimental evidence. Pioneering work by Mishima and Stanley [Nature 392, 164 (1998) and Phys.~Rev.~Lett. 85, 334 (2000)] sought to shed light on this problem by studying the melting curves of different ice polymorphs and their metastable continuation in the vicinity of the expected location of the liquid-liquid transition and its associated critical point. Based on the continuous or discontinuous changes in slope of the melting curves, Mishima suggested that the liquid-liquid critical point lies between the melting curves of ice III and ice V. Here, we explore this conjecture using molecular dynamics simulations with a purely-predictive machine learning model based on ab initio quantum-mechanical calculations. We study the melting curves of ices III, IV, V, VI, and XIII using this model and find that the melting lines of all the studied ice polymorphs are supercritical and do not intersect the liquid-liquid transition locus. We also find a pronounced, yet continuous, change in slope of the melting lines upon crossing of the locus of maximum compressibility of the liquid. Finally, we analyze critically the literature in light of our findings, and conclude that the scenario in which melting curves are supercritical is favored by the most recent computational and experimental evidence. Thus, although the preponderance of experimental and computational evidence is consistent with the existence of a second critical point in water, the behavior of the melting lines of ice polymorphs does not provide strong evidence in support of this viewpoint, according to our calculations.
Gartner, Thomas III; Zhang, Linfeng; Piaggi, Pablo; Car, Roberto; Panagiotopoulos, Athanassios; Debenedetti, Pablo
Abstract:
This dataset contains all data related to the publication "Signatures of a liquid-liquid transition in an ab initio deep neural network model for water", by Gartner et al., 2020. In this work, we used neural networks to generate a computational model for water using high-accuracy quantum chemistry calculations. Then, we used advanced molecular simulations to demonstrate evidence that suggests this model exhibits a liquid-liquid transition, a phenomenon that can explain many of water's anomalous properties. This dataset contains links to all software used, all data generated as part of this work, as well as scripts to generate and analyze all data and generate the plots reported in the publication.
This dataset comprises of data associated with the publication "Transferability of data-driven, many-body models for CO2 simulations in the vapor and liquid phases", which can be found at https://doi.org/10.1063/5.0080061. The data includes calculations for a Many-Body decomposition, virial coefficient calculations, orientational molecular scan energies, potential energy fields, correlation plots of training and testing data, vapor-liquid equilibrium simulations, liquid density simulations, and solid cell simulations.
Muniz, Maria Carolina; Gartner III, Thomas E.; Riera, Marc; Knight, Christopher; Yue, Shuwen; Paesani, Francesco; Panagiotopoulos, Athanassios Z.
Abstract:
This dataset contains all data (including input files, simulation trajectories as well as other data files and analysis scripts) related to the publication "Vapor-liquid equilibrium of water with the MB-pol many-body potential" by Muniz et al. in preparation (2021). In this work, we assessed the performance of the MB-pol many-body potential with respect to water's vapor-liquid equilibrium properties. Through the use of direct coexistence molecular dynamics, we calculated properties such as coexistence densities, surface tension, vapor pressures and enthalpy of vaporization. We found that MB-pol is able to predict these properties in good agreement with experimental data. The results attest to the chemical accuracy of MB-pol and its large range of application across water's phase diagram.
This dataset is a sequence of laser-induced fluorescence images of a dye injected in a channel flow with canopy-like stainless steel rods simulating a vegetation canopy stand. The data is acquired close to the channel bottom at z/h=0.2, where z is the height referenced to the channel bed and h is the canopy height. The dataset provides spatial distribution of scalar concentration in a plane parallel to the channel bed. The data has been used (but the data itself has not been published or available to the public) in previous work. The references are: Ghannam, K., Poggi, D., Porporato, A., & Katul, G. (2015). The spatio-temporal statistical structure and ergodic behaviour of scalar turbulence within a rod canopy. Boundary-Layer Meteorology,157(3), 447–460. Ghannam, K, Poggi, D., Bou-Zeid, E., Katul, G. (2020). Inverse cascade evidenced by information entropy of passive scalars in submerged canopy flows. Geophysical Research Letters (accepted).
This distribution contains experimentally measured data for the extent of retained enzyme activity post thermal stressing for three distinct enzymes: glucose oxidase, lipase, and horseradish peroxidase. The data is used to form conclusions and develop machine learning models as reported in the publication "Machine Learning on a Robotic Platform for the Design of Polymer-Protein Hybrids" by Matthew Tamasi, Roshan Patel, Carlos Borca, Shashank Kosuri, Heloise Mugnier, Rahul Upadhya, N. Sanjeeva Murthy, Michael Webb*, and Adam Gormley. Details regarding the experimental protocols are reported in the aforementioned paper but are briefly discussed in the README.
Data set for "Film drop production over a wide range of liquid conditions." One .csv file is provided that contains data about the number of film drops produced by bursting bubbles of multiple sizes in various liquid conditions.
Data set corresponding to "NAPS: Integrating pose estimation and tag-based tracking." This dataset contains the corresponding videos, tracking scripts, and SLEAP models along with SLEAP, NAPS, and ArUco tracking results.
Data set for "Ocean emission of microplastic by bursting bubble jet drops." Two .csv files are provided: one for the size of a jet drop carrying microplastic, and another for the amount of microplastic captured by a jet drop.
Pacheco, Diego A; Thiberge, Stephan; Pnevmatikakis, Eftychios; Murthy, Mala
Abstract:
Sensory pathways are typically studied starting at receptor neurons and following postsynaptic neurons into the brain. However, this leads to a bias in analysis of activity towards the earliest layers of processing. Here, we present new methods for volumetric neural imaging with precise across-brain registration, to characterize auditory activity throughout the entire central brain of Drosophila and make comparisons across trials, individuals, and sexes. We discover that auditory activity is present in most central brain regions and in neurons responsive to other modalities. Auditory responses are temporally diverse, but the majority of activity is tuned to courtship song features. Auditory responses are stereotyped across trials and animals in early mechanosensory regions, becoming more variable at higher layers of the putative pathway, and this variability is largely independent of spontaneous movements. This study highlights the power of using an unbiased, brain-wide approach for mapping the functional organization of sensory activity.
Li, Zhongshu; Gallagher, Kevin P.; Mauzerall, Denise L.
Abstract:
The dataset include a list of power projects outside of China that receive Chinese foreign direct investment from 2000 to 2018. Detailed information including project capacity, location, share of Chinese ownership, type of power generating technologies are collected for each power project.
This dataset contains all the data, model and MATLAB codes used to generate the figures and data reported in the article (DOI: 10.1002/2014JD022278). The data was generated during September 2013 and February 2014 using the Ocean-Land-Atmosphere Model also provided with this package. The data was generated using the computational resources supported by the PICSciE OIT High Performance Computing Center and Visualization Laboratory at Princeton University. The dataset contains a pdf Readme file which explains in detail how the data can be used. Users are recommended to go through this file before using the data.
Physical and biogeochemical variables from the NOAA-GFDL Earth System Model 2M experiments, and previously published observation-based datasets, used for the study 'Hydrological cycle amplification reshapes warming-driven oxygen loss in Atlantic Ocean'.
Data set used to train a Deep Potential (DP) model for crystalline and disordered TiO2 phases. Training data contain atomic forces, potential energy, atomic coordinates and cell tensor. Energy and forces were evaluated with the density functional SCAN, as implemented in Quantum-ESPRESSO. Atomic configurations of crystalline systems were generated by random perturbation of atomic positions (0-0.3 A) and cell tensor (1-10%). Amorphous TiO2 was explored by DP molecular dynamics (DPMD) at temperatures in the range 300−2500 K and pressure in the range 0−81 GPa.
Data set used to train a Deep Potential (DP) model for
subcritical and supercritical water. Training data contain atomic forces,
potential energy, atomic coordinates and cell tensor. Energy and forces
were evaluated with the density functional SCAN. Atomic configurations
were extracted from DP molecular dynamics at P = 250 bar and
T = 553, 623, 663, 733 and 823 K. Input files used to train the DP model
are also provided.
These data include 39 structured interview transcripts. Each case is someone who worked at the time for Uber, UberEats, Lyft, and/or Amazon Flex (Amazon’s contractor delivery service). These data were collected between July and September 2019. All but one of the interviews occurred over the phone. My questions are focused on the structure of their gig work jobs and the technology they used at work or expected to use at work in the future. I included a description of the data, the recruitment methods, and the discussion guide in this ReadMe file.
Derrida’s Margins <derridas-margins.princeton.edu> is a website and online research tool for annotations from the Library of Jacques Derrida, housed at Princeton University Library (PUL) <library.princeton.edu>. Jacques Derrida is one of the major figures of twentieth-century thought, and his library--which bears the traces of decades of close reading--represents a major intellectual archive. This project focused on annotations related to Derrida’s landmark 1967 work De la grammatologie (Of Grammatology).
Derrida’s Margins <derridas-margins.princeton.edu> is a website and online research tool for annotations from the Library of Jacques Derrida, housed at Princeton University Library (PUL) <library.princeton.edu>. Jacques Derrida is one of the major figures of twentieth-century thought, and his library--which bears the traces of decades of close reading--represents a major intellectual archive. This project focused on annotations related to Derrida’s landmark 1967 work De la grammatologie (Of Grammatology).
Taylor, Jenny A.; Bratton, Benjamin P.; Sichel, Sophie R.; Blair, Kris M.; Jacobs, Holly M.; DeMeester, Kristen E.; Kuru, Erkin; Gray, Joe; Biboy, Jacob; VanNieuwenhze, Michael S.; Vollmer, Waldemar; Grimes, Catherine L.; Shaevitz, Joshua W.; Salama, Nina R.
Abstract:
Helical cell shape is necessary for efficient stomach colonization by Helicobacter pylori, but the molecular mechanisms for generating helical shape remain unclear. We show that the helical centerline pitch and radius of wild-type H. pylori cells dictate surface curvatures of considerably higher positive and negative Gaussian curvatures than those present in straight- or curved-rod bacteria. Quantitative 3D microscopy analysis of short pulses with either N-acetylmuramic acid or D-alanine metabolic probes showed that cell wall growth is enhanced at both sidewall curvature extremes. Immunofluorescence revealed MreB is most abundant at negative Gaussian curvature, while the bactofilin CcmA is most abundant at positive Gaussian curvature. Strains expressing CcmA variants with altered polymerization properties lose helical shape and associated positive Gaussian curvatures. We thus propose a model where CcmA and MreB promote PG synthesis at positive and negative Gaussian curvatures, respectively, and that this patterning is one mechanism necessary for maintaining helical shape.
Dust and starlight have been modeled for the KINGFISH project galaxies. For each pixel in each galaxy, we estimate: (1) dust surface density; (2) q_PAH, the dust mass fraction in PAHs; (3) distribution of starlight intensities heating the dust; (4) luminosity emitted by the dust; and (5) dust luminosity from regions with high starlight intensity. The modeling is as described in the paper "Modeling Dust and Starlight in Galaxies Observed by Spitzer and Herschel: The KINGFISH Sample", by G. Aniano, B.T. Draine, L.K. Hunt, K. Sandstrom, D. Calzetti, R.C. Kennicutt, D.A, Dale, and 26 other authors, accepted for publication in The Astrophysical Journal.
Does the default mode network (DMN) reconfigure to encode information about the changing environment? This question has proven difficult, because patterns of functional connectivity reflect a mixture of stimulus-induced neural processes, intrinsic neural processes and non-neuronal noise. Here we introduce inter-subject functional correlation (ISFC), which isolates stimulus-dependent inter-regional correlations between brains exposed to the same stimulus. During fMRI, we had subjects listen to a real-life auditory narrative and to temporally scrambled versions of the narrative. We used ISFC to isolate correlation patterns within the DMN that were locked to the processing of each narrative segment and specific to its meaning within the narrative context. The momentary configurations of DMN ISFC were highly replicable across groups. Moreover, DMN coupling strength predicted memory of narrative segments. Thus, ISFC opens new avenues for linking brain network dynamics to stimulus features and behaviour.
This movie shows the dynamical behavior of field lines seeded on one of the stars. We find
a clear cyclical process operating in the magnetosphere. First, field lines from one star can attach to the second star. Second, as the orbit progresses these field lines
develop twist and are expelled outward past the second
star as closed loops. Third, these loops open up to infinity and then reconnect on the far side of the first star
opposite to the second. Fourth, the orbital motion will
bring the second star back into contact with the closed
loops, and they reattach to the second star.
Maingi, R.; Hu, J. S.; Sun, Z.; Tritz, K.; Zuo, G. Z.; Xu, W.; Huang, M.; Meng, X. C.; Canik, J. M.; Diallo, A.; Lunsford, R.; Mansfield, D. K.; Osborne, T. H.; Gong, X. Z.; Wang, Y. F.; Li, Y. Y.
Gan, K.; Ahn, J. -W.; Gray, T. K.; Zweben, S. J.; Fredrickson, E. D.; Scotti, F.; Maingi, R.; Park, J. -K.; Canal, G. P.; Soukhanovskii, V. A.; McLean, A. G.; Wirth, B. D.
Maingi, R.; Canik, J. M.; Bell, R. E.; Boyle, D. P.; Diallo, A.; Kaita, R.; Kaye, S. M.; LeBlanc, B. P.; Sabbagh, S. A.; Scotti, F.; Soukhanovskii, V. A.
Here we publish the data used in paper "Junming Huang, Gavin Cook, and Yu Xie, Large-scale Quantitative Evidence of Media Impact on Public Opinion toward China". This dataset include estimated sentiments on The New York Times on China in eight topics from 1970 to 2019, and a time series of public attitude aggregated from surveys on China.
This setup mimics ice lying above the drainage system. In the experiment, a fluid-filled blister is generated via liquid injection into the interface between a transparent elastic layer and a porous substrate. After injection of liquid, the fluid permeates from the blister through the porous substrate, the blister volume V(t) relaxes exponentially with time. Our lab experiments show that varying the permeability of the porous substrate k significantly impacts the relaxation timescale in the experiments.
Pereira, Talmo D.; Aldarondo, Diego E.; Willmore, Lindsay; Kislin, Mikhail; Wang, Samuel S.-H.; Murthy, Mala; Shaevitz, Joshua W.
Abstract:
Recent work quantifying postural dynamics has attempted to define the repertoire of behaviors performed by an animal. However, a major drawback to these techniques has been their reliance on dimensionality reduction of images which destroys information about which parts of the body are used in each behavior. To address this issue, we introduce a deep learning-based method for pose estimation, LEAP (LEAP Estimates Animal Pose). LEAP automatically predicts the positions of animal body parts using a deep convolutional neural network with as little as 10 frames of labeled data for training. This framework consists of a graphical interface for interactive labeling of body parts and software for training the network and fast prediction on new data (1 hr to train, 185 Hz predictions). We validate LEAP using videos of freely behaving fruit flies (Drosophila melanogaster) and track 32 distinct points on the body to fully describe the pose of the head, body, wings, and legs with an error rate of <3% of the animal's body length. We recapitulate a number of reported findings on insect gait dynamics and show LEAP's applicability as the first step in unsupervised behavioral classification. Finally, we extend the method to more challenging imaging situations (pairs of flies moving on a mesh-like background) and movies from freely moving mice (Mus musculus) where we track the full conformation of the head, body, and limbs.
Kim, Donghoon; Tracy, Sally J.; Smith, Raymond F.; Gleason, Arianna E.; Bolme, Cindy A.; Prakapenka, Vitali B.; Appel, Karen; Speziable, Sergio; Wicks, June K.; Berryman, Eleanor J.; Han, Sirus K.; Schoelmerich, Markus O.; Lee, Hae Ja; Nagler, Bob; Cunningham, Eric F.; Akin, Minta C.; Asimow, Paul D.; Eggert, Jon H.; Duffy, Thomas S.
Abstract:
The behavior of forsterite, Mg2SiO4, under dynamic compression is of fundamental importance for understanding its phase transformations and high-pressure behavior. Here, we have carried out an in situ X-ray diffraction study of laser-shocked poly- and single-crystal forsterite (a-, b-, and c- orientations) from 19 to 122 GPa using the Matter in Extreme Conditions end-station of the Linac Coherent Light Source. Under laser-based shock loading, forsterite does not transform to the high-pressure equilibrium assemblage of MgSiO3 bridgmanite and MgO periclase, as was suggested previously. Instead, we observe forsterite and forsterite III, a metastable polymorph of Mg2SiO4, coexisting in a mixed-phase region from 33 to 75 GPa for both polycrystalline and single-crystal samples. Densities inferred from X-ray diffraction data are consistent with earlier gas-gun shock data. At higher stress, the behavior observed is sample-dependent. Polycrystalline samples undergo amorphization above 79 GPa. For [010]- and [001]-oriented crystals, a mixture of crystalline and amorphous material is observed to 108 GPa, whereas the [100]-oriented crystal adopts an unknown crystal structure at 122 GPa. The Q values of the first two sharp diffraction peaks of amorphous Mg2SiO4 show a similar trend with compression as those observed for MgSiO3 glass in both recent static and laser-compression experiments. Upon release to ambient pressure, all samples retain or revert to forsterite with evidence for amorphous material also present in some cases. This study demonstrates the utility of femtosecond free-electron laser X-ray sources for probing the time evolution of high-pressure silicates through the nanosecond-scale events of shock compression and release.
A subset of the Fermi-LAT public data for use with NPTFit:
https://github.com/bsafdi/NPTFit
The data here is for use with the Jupyter example notebooks provided with the
main code. Details of the files provided are given below. All files are provided
as numpy arrays binned as nside=128 HEALPix maps.
For the full public data, see:
http://fermi.gsfc.nasa.gov/ssc/data/access/
In this publication we provide the LAMMPS example files to reproduce simulations for the manuscript "A Deep Potential model for liquid-vapor equilibrium and cavitation rates of water"
The data provided in this DataSpace consists of sample training data to be used for Fluorescence Reconstruction Microscopy (FRM) testing. We provide a subset of the keratinocyte (10x magnification) dataset used in our paper, in which interested parties may find more complete information about our data collection methods. Matched pairs of phase contrast and fluorescent images are given. The nuclei were stained using Hoechst 33342 and imaged using a standard DAPI filter set.
The data provided in this DataSpace consists of sample training data to be used for Fluorescence Reconstruction Microscopy (FRM) testing. We provide a subset of the MDCK (20x magnification) dataset used in our paper, in which interested parties may find more complete information about our data collection methods. Matched pairs of DIC and fluorescent images are given. The cells stably expressed E-cadherin:RFP which enabled imaging of junctional fluorescence, while the nuclei were stained using Hoechst 33342 and imaged using a standard DAPI filter set.
We provide all the test data and corresponding predictions for our paper, “Practical Fluorescence Reconstruction Microscopy for High-Content Imaging”. Please refer to the Methods section in this paper for experimental details. For each experimental condition, we provide the input transmitted-light images (either phase contrast or DIC), the ground truth fluorescence images, and the output predicted fluorescence images which should reconstruct the ground truth fluorescence images.
Fractures in geological formations may enable migration of environmentally relevant fluids, as in leakage of CO2 through caprocks in geologic carbon sequestration. We investigated geochemically induced alterations of fracture geometry in Indiana Limestone specimens. Experiments were the first of their kind, with periodic high-resolution imaging using X-ray computed tomography (xCT) scanning while maintaining high pore pressure (100 bar). We studied two CO2-acidified brines having the same pH (3.3) and comparable thermodynamic
disequilibrium but different equilibrated pressures of CO2 (PCO2 values of 12 and 77 bar). High-PCO2 brine has a faster calcite dissolution kinetic rate because of the accelerating effect of carbonic acid. Contrary to expectations, dissolution extents were comparable in the two experiments. However, progressive xCT
images revealed extensive channelization for high PCO2, explained by strong positive feedback between ongoing flow and reaction. The pronounced channel increasingly directed flow to a small region of the fracture, which explains why the overall dissolution was lower than expected. Despite this, flow simulations revealed large increases in permeability in the high-PCO2 experiment. This study shows that the permeability evolution of dissolving fractures will be larger for faster-reacting fluids. The overall mechanism is not because more rock dissolves, as would be commonly assumed, but because of accelerated fracture channelization.