De-identified SFDPH / ZSFG Data
De-identified structured clinical data from the San Francisco Department of Public Health (SFDPH) are now available for direct self-service access alongside our UCSF Health clinical data! The SFDPH data cover Zuckerberg San Francisco General (ZSFG), Laguna Honda Hospital (LHH), the Population Health Division (PHD), Behavioral Health Services (BHS), and ambulatory care areas.
- Patient identities are matched across UCSF and SFDPH
- Data are combined in the OMOP data model, with encounters from both UCSF and SFDPH available in a single database (each source is also available separately)
- As of December 2024, the combined patient population is 7+ million patients:
  - UCSF = 6.3+ million patients
  - SFDPH = 1.2+ million patients
  - Nearly 400K patients are in both systems
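As a rough illustration of what the combined OMOP database makes possible, the sketch below counts patients seen only at UCSF, only at SFDPH, or in both systems. It runs against a tiny in-memory SQLite table with made-up rows, not the real data. `visit_occurrence` and `person_id` are standard OMOP names, but the `source_system` column and its values are hypothetical stand-ins for however the actual database marks where an encounter came from; check the documentation for the real field.

```python
import sqlite3


def build_toy_db():
    """Create an in-memory table shaped loosely like OMOP visit_occurrence.

    The source_system column is a hypothetical illustration, not a
    standard OMOP field, and the rows are invented.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """
        CREATE TABLE visit_occurrence (
            visit_occurrence_id INTEGER PRIMARY KEY,
            person_id INTEGER NOT NULL,
            source_system TEXT NOT NULL
        )
        """
    )
    conn.executemany(
        "INSERT INTO visit_occurrence (person_id, source_system) VALUES (?, ?)",
        [
            (1, "UCSF"),
            (2, "UCSF"),
            (2, "SFDPH"),  # person 2 has encounters in both systems
            (3, "SFDPH"),
            (4, "UCSF"),
            (4, "UCSF"),   # repeat visits must not double-count a patient
        ],
    )
    return conn


def count_patients_by_source(conn):
    """Count distinct patients by which system(s) their encounters come from."""
    row = conn.execute(
        """
        SELECT
            SUM(in_ucsf = 1 AND in_sfdph = 0),
            SUM(in_ucsf = 0 AND in_sfdph = 1),
            SUM(in_ucsf = 1 AND in_sfdph = 1)
        FROM (
            -- One row per patient, flagging each system they appear in
            SELECT
                person_id,
                MAX(source_system = 'UCSF')  AS in_ucsf,
                MAX(source_system = 'SFDPH') AS in_sfdph
            FROM visit_occurrence
            GROUP BY person_id
        )
        """
    ).fetchone()
    return {"ucsf_only": row[0], "sfdph_only": row[1], "both": row[2]}


if __name__ == "__main__":
    print(count_patients_by_source(build_toy_db()))
```

Grouping by `person_id` before counting matters because identities are matched across the two systems: a patient with several encounters, or with encounters in both places, should be counted once.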
How to get started:
- If you don't have access to UCSF de-identified data: Request Data Access for Research
- If you are not at UCSF: Email us at [email protected]
How to get help:
- You can find detailed documentation and training videos on the UCSF Wiki (VPN required)
- You can email us at: [email protected]
To use these data, you don’t need IRB approval, but you will need data programming skills such as SQL, R, or Python.
Also, you do not need to submit a ZSFG Research Protocol Application to use or analyze the data. However, if you plan to formally present or publish results from the SFDPH de-identified data, you will need to complete a brief form and submit the abstract, manuscript, or slides to SFDPH at least thirty days before the results are first presented or published. More information about this requirement is coming soon.