Clinical data at scale with very high performance, in an environment suited to pattern recognition and machine learning. This high-performance compute cluster on AWS offers:
- Access to de-identified structured EHR data; additional data sets coming soon, including de-identified clinical notes and images
- Spark analytics engine that enables fast data queries via Spark SQL, machine learning via Spark MLlib, and R via SparkR
- Data queries through PatientExploreR
Free for the UCSF community
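
To give a feel for the Spark SQL querying mentioned in the list above, here is a minimal sketch in Python (PySpark). It is not the cluster's actual schema or API: the table name `diagnoses`, the columns `patient_id` and `icd10_code`, the sample rows, and the helper `build_count_query` are all hypothetical, chosen only for illustration.

```python
def build_count_query(table: str, group_col: str, min_count: int = 1) -> str:
    """Build a Spark SQL query that counts rows per distinct value of group_col.

    Both names must be plain identifiers, so arbitrary text cannot be
    spliced into the SQL string.
    """
    for name in (table, group_col):
        if not name.isidentifier():
            raise ValueError(f"not a plain identifier: {name!r}")
    return (
        f"SELECT {group_col}, COUNT(*) AS n "
        f"FROM {table} "
        f"GROUP BY {group_col} "
        f"HAVING COUNT(*) >= {int(min_count)} "
        f"ORDER BY n DESC"
    )


def main() -> None:
    # pyspark is imported here so the query builder works without Spark installed.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("ehr-query-sketch").getOrCreate()

    # Tiny in-memory stand-in for a de-identified diagnoses table.
    # These rows are made up; they are not real clinical data.
    rows = [(1, "E11.9"), (2, "I10"), (3, "E11.9")]
    df = spark.createDataFrame(rows, ["patient_id", "icd10_code"])
    df.createOrReplaceTempView("diagnoses")

    # Run the aggregate query through Spark SQL and print the result.
    spark.sql(build_count_query("diagnoses", "icd10_code")).show()
    spark.stop()
```

Calling `main()` on a machine with PySpark installed runs the query against the stand-in table; on a real cluster the temporary view would presumably be replaced by whatever tables the environment exposes.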