LogoLogo
API ReferenceGitHubSlackService StatusLogin
v3.1.5
v3.1.5
  • Deep Lake Docs
  • List of ML Datasets
  • Quickstart
  • Dataset Visualization
  • Storage & Credentials
    • Storage Options
    • Managed Credentials
      • Enabling CORS
      • Provisioning Role-Based Access
  • API Reference
  • Enterprise Features
    • Querying Datasets
      • Sampling Datasets
    • Performant Dataloader
  • EXAMPLE CODE
  • Getting Started
    • Step 1: Hello World
    • Step 2: Creating Deep Lake Datasets
    • Step 3: Understanding Compression
    • Step 4: Accessing Data
    • Step 5: Visualizing Datasets
    • Step 6: Using Activeloop Storage
    • Step 7: Connecting Deep Lake Datasets to ML Frameworks
    • Step 8: Parallel Computing
    • Step 9: Dataset Version Control
    • Step 10: Dataset Filtering
  • Tutorials (w Colab)
    • Creating Datasets
      • Creating Complex Datasets
      • Creating Object Detection Datasets
      • Creating Time-Series Datasets
      • Creating Datasets with Sequences
      • Creating Video Datasets
    • Training Models
      • Training an Image Classification Model in PyTorch
      • Training Models Using MMDetection
      • Training Models Using PyTorch Lightning
      • Training on AWS SageMaker
      • Training an Object Detection and Segmentation Model in PyTorch
    • Data Processing Using Parallel Computing
  • Playbooks
    • Querying, Training and Editing Datasets with Data Lineage
    • Evaluating Model Performance
    • Training Reproducibility Using Deep Lake and Weights & Biases
    • Working with Videos
  • API Summary
  • How Deep Lake Works
    • Data Layout
    • Version Control and Querying
    • Tensor Relationships
    • Visualizer Integration
    • Shuffling in ds.pytorch()
    • Storage Synchronization
    • How to Contribute
Powered by GitBook
On this page
  • How to Get Started with Activeloop Deep Lake in Under 5 Minutes
  • Version control, query, and train models while streaming your deep-learning datasets from a cloud of your choice.
  • Installing Deep Lake
  • Fetching Your First Deep Lake Dataset
  • Reading Samples From a Deep Lake Dataset
  • Visualizing a Deep Lake Dataset
  • Create your own Datasets

Was this helpful?

Quickstart

A jump-start guide to using Deep Lake.

PreviousDeep Lake DocsNextDataset Visualization

Last updated 2 years ago

Was this helpful?

How to Get Started with Activeloop Deep Lake in Under 5 Minutes

Version control, query, and train models while streaming your deep-learning datasets from a cloud of your choice.

Installing Deep Lake

Deep Lake can be installed through pip. By default, Deep Lake does not install dependencies for audio, video, google-cloud, and other features. .

$ pip3 install deeplake

Fetching Your First Deep Lake Dataset

Let's load the , a rich dataset with many object detections per image. Datasets hosted on Activeloop Platform are typically identified by host organization name followed by the dataset name: activeloop/visdrone-det-train.

import deeplake

dataset_path = 'hub://activeloop/visdrone-det-train'
ds = deeplake.load(dataset_path) # Returns a Deep Lake Dataset but does not download data locally

Reading Samples From a Deep Lake Dataset

Data is not immediately read into memory because Deep Lake operates . You can fetch data by calling the .numpy() or .data() methods:

# Indexing
image = ds.images[0].numpy() # Fetch the first image and return a numpy array

labels = ds.labels[0].data() # Fetch the labels in the first image

boxes = ds.boxes[0].numpy() # Fetch the bounding boxes in the first image

# Slicing
img_list = ds.labels[0:100].numpy(aslist=True) # Fetch 100 labels and store 
                                               # them as a list of numpy arrays

Other metadata such as the mapping between numerical labels and their text counterparts can be accessed using:

labels_list = ds.labels.info['class_names']

Visualizing a Deep Lake Dataset

Deep Lake enables users to visualize and interpret large datasets. The tensor layout for a dataset can be inspected using:

ds.summary()
ds.visualize()

Create your own Datasets

You can perform all of the steps above and more with your own datasets! Please check out the links below to learn more:

The dataset can be , or using an iframe in a Jupyter notebook:

Visualizing datasets in will unlock more features and faster performance compared to visualization in Jupyter notebooks.

Congratulations, you've got Deep Lake working on your local machine

🤓
Details on all installation options are available here
Visdrone dataset
lazily
visualized in Activeloop Platform
Activeloop Platform
Getting Started
Creating Datasets