Download/Installation
omnitools.download
is a one-line utility for rapidly downloading the starter (& similar) datasets. For more about the tools themselves (omnitools.download
and omnitools.upload
), please see the dedicated page.
To download the starter dataset, make sure that omnidata-tooling is installed and then run the full download command which will prompt you to accept the component licenses to proceed:
Run the following: (Estimated download time for [RGB + 1 Task + Masks]: 1 day) (Full dataset [30TB]: 5 days)
# Make sure everything is installed
sudo apt-get install aria2
pip install 'omnidata-tools' # Just to make sure it's installed
# Install the 'debug' subset of the Replica and Taskonomy components of the dataset
omnitools.download rgb normals point_info \
--components replica taskonomy \
--subset debug \
--dest ./omnidata_starter_dataset/ --agree-all
You should see the prompt:
Examples
Here are some other examples:
Download the full Omnidata dataset and agree to licenses
omnitools.download all --components all --subset fullplus \
--dest ./omnidata_starter_dataset/ \
--connections_total 40 --agree
Download Taskonomy only:
omnitools.download all --components taskonomy --subset fullplus \
--dest ./omnidata_starter_dataset/ \
--connections_total 40 --agree
Omnidata but only depth and masks and keep the compressed files
omnitools.download rgb depth mask_valid --components all --subset fullplus \
--dest ./omnidata_starter_dataset/ \
--connections_total 40 --agree
Download meshes for Clevr
omnitools.download mesh --components clevr_simple --subset fullplus \
--dest ./omnidata_starter_dataset/ \
--dest_compressed ./omnidata_starter_dataset_compresssed --keep_compressed True \
--connections_total 40 --agree
Use multiple workers to download Omnidata--this is for worker 7/100, but do a dryrun
omnitools.download all --components all --subset fullplus \
--num_chunk 6 --num_total_chunks 100 \
--dest ./omnidata_starter_dataset/ \
--connections_total 40 --agree --dryrun
...you get the idea :)
Command-line options
omnitools.download
is pretty configurable, and you can choose which comonents/subset/split/tasks to download and extract. The downloader will spawn many workers to then download those compressed files, verify the download against checksums on the server, and unpack them. Here are the available options:
> omnitools.download -h
Citation
If you find the code or models useful, please cite our paper:
@inproceedings{eftekhar2021omnidata,
title={Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets From 3D Scans},
author={Eftekhar, Ainaz and Sax, Alexander and Malik, Jitendra and Zamir, Amir},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
pages={10786--10796},
year={2021}
}