Project 90 (Partial Data)

Potterat, John; Muth, Stephen; Goel, Sharad; Salganik, Matthew
Issue date: 2026
Rights:
Creative Commons Attribution 4.0 International (CC BY)
Cite as:
Potterat, John, Muth, Stephen, Goel, Sharad, & Salganik, Matthew. (2026). Project 90 (Partial Data) [Data set]. Version 1. Princeton University. https://doi.org/10.34770/k90z-yf57
@electronic{potterat_john_2026,
  author      = {Potterat, John and
                Muth, Stephen and
                Goel, Sharad and
                Salganik, Matthew},
  title       = {{Project 90 (Partial Data)}},
  version     = 1,
  publisher   = {{Princeton University}},
  year        = 2026,
  url         = {https://doi.org/10.34770/k90z-yf57}
}
Description:

Project 90 was a prospective study of the influence of network structure on the dynamics of HIV transmission in a community of high-risk heterosexuals. The data was collected between 1988 and 1992 in Colorado Springs, CO, and the project was funded by the Centers for Disease Control and Prevention (CDC). For more details of the Project 90 study, please see the references below. Stephen Muth and John Potterat kindly provided the data to Sharad Goel and Matthew Salganik in 2007, and it was later used in their paper, S. Goel and M. J. Salganik (2010) "Assessing respondent-driven sampling" Proceedings of the National Academy of Sciences (PNAS). The release of these data allows others to replicate the analyses of Goel and Salganik. Data Release Included in this release are two tab-separated files, edges.tsv and nodes.tsv, that describe the structure of the Project 90 network and the individual-level attributes of study participants. edges.tsv: Each row indicates an edge in the network, specified by a pair of node ids. Edges represent social, sexual, and/or drug affiliation. Each edge is recorded twice. In other words, if there is an edge between 12 and 15, there is also an edge between 15 and 12. There are 43,288 edges in the file. nodes.tsv: Each row corresponds to a study participant. In addition to node id, the following attributes are listed for each individual: • Race (1 = Native American; 2 = Black; 3 = Asian/Pacific Islander; 4 = White; 5 = Other) • Gender (0 = Male; 1 = Female) • Sex Worker (0 = No; 1 = Yes) • Pimp (0 = No; 1 = Yes) • Sex Work Client (0 = No; 1 = Yes) • Drug Dealer (0 = No; 1 = Yes) • Drug Cook (0 = No; 1 = Yes) • Thief (0 = No; 1 = Yes) • Retired (0 = No; 1 = Yes) • Housewife (0 = No; 1 = Yes) • Disabled (0 = No; 1 = Yes) • Unemployed (0 = No; 1 = Yes) • Homeless (0 = No; 1 = Yes) Missing values are denoted by 'NA' and there are 5,492 individuals in the file. Please direct any questions to Prof. Matthew Salganik. readme: In addition, this text file gives one-way frequencies of all the individual attributes.

Show More