This is a backup of the Tiny Images Dataset, available at:
http://horatio.cs.nyu.edu/mit/tiny/data/index.html
The dataset consists of 79,302,017 images, each a 32x32 color image. The data is stored in large binary files that can be accessed with the included Matlab Tiny Images toolbox. You will need around 400 GB of free disk space to store all the files.
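As a rough sanity check on the storage figure, the raw pixel payload can be estimated from the image count and dimensions given above. This is a minimal sketch that assumes one byte per color channel and no per-image header; neither is stated in this description. The function name is illustrative only.

```python
NUM_IMAGES = 79_302_017          # image count stated in the description
BYTES_PER_IMAGE = 32 * 32 * 3    # assumption: 3 channels, 1 byte per channel


def raw_image_bytes(num_images=NUM_IMAGES):
    """Return the size in bytes of the uncompressed pixel data."""
    return num_images * BYTES_PER_IMAGE


total = raw_image_bytes()
print(total)                     # 243615796224
print(round(total / 2**30, 1))   # 226.9 (binary gigabytes)
```

Under these assumptions the result agrees with the 226.9 GB listed for the image binary under Downloads, which suggests the listed sizes are binary gigabytes.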
In total there are 5 files to download. Three are large binary files containing (i) the images themselves; (ii) their associated metadata (filename, search engine used, ranking, etc.); and (iii) Gist descriptors for each image. The other two are the Matlab toolbox and the index data file, which together let you easily load data from the binaries.
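The included Matlab toolbox is the supported way to read the binaries. For readers without Matlab, the sketch below shows how a single image could be fetched by byte offset in Python. It rests on assumptions not confirmed by this description: that the image binary is a headerless sequence of fixed 3072-byte records, and that each record is laid out in Matlab's column-major order (row varies fastest, then column, then channel). The function names are hypothetical and are not part of the toolbox.

```python
IMAGE_SIDE = 32
CHANNELS = 3
BYTES_PER_IMAGE = IMAGE_SIDE * IMAGE_SIDE * CHANNELS  # 3072, assumed record size


def read_image_record(stream, index):
    """Return the raw bytes of image `index` (0-based) from a binary stream."""
    if index < 0:
        raise IndexError("image index must be non-negative")
    stream.seek(index * BYTES_PER_IMAGE)
    record = stream.read(BYTES_PER_IMAGE)
    if len(record) != BYTES_PER_IMAGE:
        raise IndexError(f"image {index} is past the end of the data")
    return record


def record_to_pixels(record):
    """Convert a record to pixels[row][col] = (r, g, b).

    Assumes column-major storage: offset = row + 32 * col + 1024 * channel.
    """
    plane = IMAGE_SIDE * IMAGE_SIDE
    return [
        [
            tuple(record[c * plane + col * IMAGE_SIDE + row] for c in range(CHANNELS))
            for col in range(IMAGE_SIDE)
        ]
        for row in range(IMAGE_SIDE)
    ]
```

To use it, open tiny_images.bin in binary mode ("rb") and pass the file object as `stream`. Seeking to the record avoids reading the whole file into memory. If the decoded images look transposed or have scrambled colors, the layout assumption is wrong and the toolbox source should be consulted.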
For further instructions on usage, refer to the included index.html file:
http://www.archive.org/download/80-million-tiny-images-1-of-2/index.html
The dataset is separated into two items. You can download all necessary files via the links in the "Downloads" section below. You can also view item 1/2 at the following page:
http://archive.org/details/80-million-tiny-images-1-of-2
Please visit the 80 Million Tiny Images Visual Dictionary to see the dataset in action.
Downloads:
Note that these files are very large and will take a considerable time to download. Please ensure you have sufficient disk space before commencing the download.
1. Image Binary (226.9 GB):
tiny_images.bin
2. Metadata Binary (56.7 GB):
tiny_metadata.bin
3. Gist Binary (113.4 GB):
tinygist80million.bin
4. Index Data (7.0 MB):
tiny_index.mat
5. Matlab Tiny Images toolbox (150 KB):
tiny_code.zip
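Before starting the download, the free space on the target drive can be compared against the listed sizes. The following is a minimal sketch using only the Python standard library. It treats the listed units as binary (GiB, MiB, KiB), which is an assumption, and the helper names are illustrative.

```python
import shutil

GIB, MIB, KIB = 2**30, 2**20, 2**10

# Sizes as listed above; interpreting the units as binary is an assumption.
LISTED_SIZES = {
    "tiny_images.bin": 226.9 * GIB,
    "tiny_metadata.bin": 56.7 * GIB,
    "tinygist80million.bin": 113.4 * GIB,
    "tiny_index.mat": 7.0 * MIB,
    "tiny_code.zip": 150 * KIB,
}


def required_bytes(sizes=LISTED_SIZES):
    """Return the approximate total size of all files in bytes."""
    return int(sum(sizes.values()))


def has_enough_space(directory, needed):
    """Return True if `directory` has at least `needed` bytes free."""
    return shutil.disk_usage(directory).free >= needed
```

Calling `has_enough_space(".", required_bytes())` checks the current directory. The total comes to roughly 397 GiB, in line with the "around 400 GB" figure quoted earlier. Leaving some headroom is sensible because the listed sizes are rounded.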