# Storage Options **Deep Lake datasets can be stored locally, or on several cloud storage providers including Deep Lake Storage, AWS S3, Microsoft Azure, and Google Cloud Storage.** Datasets are accessed by choosing the correct prefix for the dataset `path` that is passed to methods such as `deeplake.load(path)`, and `deeplake.empty(path)`. The path prefixes are:

Storage	Path	Notes
Storage Location	Path	Notes
Local	`/local_path`
Deep Lake Storage	`hub://org_id/dataset_name`
Deep Lake Managed DB	`hub://org_id/dataset_name`	Specify `runtime = {"tensor_db": True}` when creating the dataset
AWS S3	`s3://bucket_name/dataset_name`	Dataset can be connected to Deep Lake via Managed Credentials
Microsoft Azure (Gen2 DataLake Only)	`azure://account_name/container_name/dataset_name`	Dataset can be connected to Deep Lake via Managed Credentials
Google Cloud	`gcs://bucket_name/dataset_name`	Dataset can be connected to Deep Lake via Managed Credentials

{% hint style="info" %} Connecting Deep Lake datasets stored in your own cloud via Deep Lake [Managed Credentials](/v3.6.0/storage-and-credentials/managed-credentials.md) is required for accessing enterprise features, and it significantly simplifies dataset access. {% endhint %} ## Authentication for each cloud storage provider: ### Activeloop Storage and Managed Datasets In order to access datasets stored in Deep Lake, or datasets in other clouds that are [managed by Activeloop](/v3.6.0/storage-and-credentials/managed-credentials.md), users must register and authenticate using the steps in the link below: {% content-ref url="/pages/2NW9EhbsMvmxxHjkeqQa" %} [User Authentication](/v3.6.0/storage-and-credentials/user-authentication.md) {% endcontent-ref %} ### AWS S3 Authentication with AWS S3 has 4 options: 1. Use Deep Lake on a machine in the AWS ecosystem that has access to the relevant S3 bucket via [AWS IAM](https://aws.amazon.com/iam/), in which case there is no need to pass credentials in order to access datasets in that bucket. 2. Configure AWS through the cli using `aws configure`. This creates a credentials file on your machine that is automatically access by Deep Lake during authentication. 3. Save the `AWS_ACCESS_KEY_ID` ,`AWS_SECRET_ACCESS_KEY` , and `AWS_SESSION_TOKEN (optional)` in environmental variables of the same name, which are loaded as default credentials if no other credentials are specified. 4. Create a dictionary with the `AWS_ACCESS_KEY_ID` ,`AWS_SECRET_ACCESS_KEY` , and `AWS_SESSION_TOKEN (optional)`, and pass it to Deep Lake using: **Note:** the dictionary keys must be lowercase! ```python deeplake.load('s3:///', creds = { 'aws_access_key_id': , 'aws_secret_access_key': , 'aws_session_token': <'your_aws_session_token'>, # Optional } ) ``` `endpoint_url` can be used for connecting to other object storages supporting S3-like API such as [MinIO](https://github.com/minio/minio), [StorageGrid](https://www.netapp.com/data-storage/storagegrid/) and others. ### Custom Storage with S3 API In order to connect to other object storages supporting S3-like API such as [MinIO](https://github.com/minio/minio), [StorageGrid](https://www.netapp.com/data-storage/storagegrid/) and others, simply add `endpoint_url` the the `creds` dictionary. ```python deeplake.load('s3://...', creds = { 'aws_access_key_id': , 'aws_secret_access_key': , 'aws_session_token': <'your_aws_session_token'>, # Optional 'endpoint_url': 'http://localhost:8888' } ) ``` ### Microsoft Azure Authentication with Microsoft Azure has 4 options: 1. Log in from your machine's CLI using `az login ...`. 2. Save the `AZURE_STORAGE_ACCOUNT`, `AZURE_STORAGE_KEY` , or other credentials in environmental variables of the same name, which are loaded as default credentials if no other credentials are specified. 3. Create a dictionary with the `AWS_ACCESS_KEY_ID` ,`AWS_SECRET_ACCESS_KEY` , and `AWS_SESSION_TOKEN (optional)`, and pass it to Deep Lake using: **Note:** the dictionary keys must be lowercase! ```python deeplake.load('azure:////', creds = { 'account_key': , 'sas_token': , } ) ``` ### Google Cloud Storage Authentication with Google Cloud Storage has 2 options: 1. Create a service account, download the JSON file containing the keys, and then pass that file to the `creds` parameter in `deeplake.load('gcs://.....', creds = 'path_to_keys.json')` . It is also possible to manually pass the information from the JSON file into the `creds` parameter using: `deeplake.load('gcs://.....', creds = {information from the JSON file})` 2. Authenticate through the browser using `deeplake.load('gcs://.....', creds = 'browser')`. This requires that the project credentials are stored on your machine, which happens after `gcloud` is [initialized](https://cloud.google.com/sdk/gcloud/reference/init) and [logged in](https://cloud.google.com/sdk/gcloud/reference/auth) through the CLI. 1. After this step, re-authentication through the browser can be skipped using: `deeplake.load('gcs://.....', creds = 'cache')` --- # Agent Instructions: Querying This Documentation If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question. Perform an HTTP GET request on the current page URL with the `ask` query parameter: ``` GET https://docs-v3.activeloop.ai/v3.6.0/storage-and-credentials/storage-options.md?ask= ``` The question should be specific, self-contained, and written in natural language. The response will contain a direct answer to the question and relevant excerpts and sources from the documentation. Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.