A rich collection of datasets made available by various organizations. Data may be public-access or restricted to researchers with appropriate credentials.
Visit the Data Library
A selection of public workspaces contributed by our team, by partners and by community members, that contain example data and preconfigured analysis tools.
Visit the Showcase
Reference genome and accessory files for Hg38 and others, preconfigured in workspaces and ready to use with tools such as the GATK Best Practices worfklows.
Most biomedical datasets are composed of some primary data files (such as genome sequence and variant calls) and varying amounts of associated metadata such as the study participants’ phenotypes, sample identifiers and so on. Terra provides a system of data tables, collectively termed “data model”, that allows you to formally describe the structure of your dataset and keep track of locations of data files in relation to the metadata. This in turn enables you to run analyses efficiently, and keeps the outputs organized so that you can progress through your research project with confidence. Datasets provided through the Data Library come with a full data model defined by the data custodian. Learn more about data models
Each Terra workspace is created with
its own dedicated storage bucket, where analysis outputs are written by default. You can upload data to the workspace bucket and use it for storage.
You can access and run analyses on data stored in buckets managed outside of Terra; this can be either public buckets or private buckets for which you have at least read permissions.
When you share your workspace with a collaborator, the contents of the bucket will also be shared with them. Authorization domains can be used to limit sharing to a specific group of users.