vp_suite.datasets.physics101
- class Physics101Dataset(split, **dataset_kwargs)
Bases:
vp_suite.base.base_dataset.VPDataset
Dataset class for the Videos of the dataset “Physics 101”, as encountered in “Physics 101: Learning Physical Object Properties from Unlabeled Videos” by Wu et al. (http://phys101.csail.mit.edu/papers/phys101_bmvc.pdf).
Each sequence depicts object-centered physical properties by showing objects of various materials and apperances in different physical scenarios such as sliding down a ramp or bouncing off a flat surface.
- ACTION_SIZE: int = 0
The size of the action vector per frame (If the dataset provides no actions, this value is 0).
- AVAILABLE_CAMERAS = ['Camera_1', 'Camera_2', 'Kinect_RGB_1']
Available cameras/image sources.
- AVAILABLE_SUBSEQ = ['start', 'middle', 'end']
Available (sub-)sequence extraction position identifiers.
- DATASET_FRAME_SHAPE: (<class 'int'>, <class 'int'>, <class 'int'>) = (1080, 1920, 3)
Shape of a single frame in the dataset (height, width, channels).
- DEFAULT_DATA_DIR: pathlib.Path = PosixPath('/home/runner/work/vp-suite/vp-suite/vp-suite-data/data/phys101')
The default save location of the dataset files.
- REFERENCE: str = 'http://phys101.csail.mit.edu/'
The reference (publication) where the original dataset is introduced.
- __init__(split, **dataset_kwargs)
Initializes the dataset loader by determining its split and extracting and processing all dataset attributes from the parameters given in dataset_kwargs.
- Parameters
split (str) – The dataset’s split identifier (i.e. whether it’s a training/validation/test dataset)
**dataset_kwargs (Any) – Optional dataset arguments for image transformation, value_range, splitting etc.
- camera = 'Kinect_RGB_1'
Which camera to use from the dataset.
- download_and_prepare_dataset()
Downloads the specific dataset, prepares it for the video prediction task (if needed) and stores it in a default location in the ‘data/’ folder. Implemented by the derived dataset classes.
- subseq = 'middle'
“start” starts from the first frame, “end” ends at the last frame and “middle” lies exactly in between.
- Type
Where to extract the sequence from
- trainval_test_seed = 1612
The random seed used to separate training/validation and testing data. Value from the ‘Noether Networks’ code
- trainval_to_test_ratio = 0.8
The ratio of files that will be training/validation data (rest will be test data).