I have recently started to learn more about supervised monocular depth estimation. I used the NYU-V2 dataset for it. it is easy to design a torch loader and pre-process the data since the structure of the dataset is quite clear. But in the case of Kitti dataset, it is very confusing. Is it possible to use Kitti for supervised monocular depth estimation? I found a torch loader for kitt here: https://github.com/joseph-zhang/KITTI-TorchLoader however, I don't understand how to use it for depth estimation using the Kitti dataset. the folder structure is quite different!. My plan is to train a simple CNN using a supervised mono depth approach.
Is it possible to use Kitti dataset for supervised monocular depth estimation?
734 views Asked by PNF At
2
There are 2 answers
3
Ivan
On
The repository states that the dense depth map are completions of the lidar ray maps and projected and aligned with the raw KITTI dataset.
Andreas Geiger et al., Vision meets Robotics: The KITTI Dataset
Looking at the dev toolkit for KITTI, the get_depth function receives as an argument the camera id of the camera the Velodyne points are projected onto. This function is called here the dataloader with cam=self.cam which is set as an attribute to the Kittiloader instance.
In other words, you can choose on which camera the Velodyne points and depth completion is performed. By default cam is set to 2, which means cam_2, the left camera view.
Related Questions in PYTHON
- How to store a date/time in sqlite (or something similar to a date)
- Instagrapi recently showing HTTPError and UnknownError
- How to Retrieve Data from an MySQL Database and Display it in a GUI?
- How to create a regular expression to partition a string that terminates in either ": 45" or ",", without the ": "
- Python Geopandas unable to convert latitude longitude to points
- Influence of Unused FFN on Model Accuracy in PyTorch
- Seeking Python Libraries for Removing Extraneous Characters and Spaces in Text
- Writes to child subprocess.Popen.stdin don't work from within process group?
- Conda has two different python binarys (python and python3) with the same version for a single environment. Why?
- Problem with add new attribute in table with BOTO3 on python
- Can't install packages in python conda environment
- Setting diagonal of a matrix to zero
- List of numbers converted to list of strings to iterate over it. But receiving TypeError messages
- Basic Python Question: Shortening If Statements
- Python and regex, can't understand why some words are left out of the match
Related Questions in PYTORCH
- Influence of Unused FFN on Model Accuracy in PyTorch
- Conda CMAKE CXX Compiler error while compiling Pytorch
- Which library can replace causal_conv1d in machine learning programming?
- yolo v5 export to torchscript: how to generate constants.pkl
- Pytorch distribute process across nodes and gpu
- My ICNN doesn't seem to work for any n_hidden
- a problem for save and load a pytorch model
- The meaning of an out_channel in nn.Conv2d pytorch
- config QConfig in pytorch QAT
- Can't load the saved model in PyTorch
- How can I convert a flax.linen.Module to a torch.nn.Module?
- Snuffle in PyTorch Dataloader
- Cuda out of Memory but I have no free space
- Can not load scripted model using torch::jit::load
- Should I train my model with a set of pictures as one input data or I need to crop to small one using Pytorch
Related Questions in PYTORCH-DATALOADER
- Is there a way to check the output from a Data Loader will match the input to a model?
- Pytorch how to use num_worker>0 for Dataloader when using multiple gpus
- Adding sliding window dimension to data causes error: "Expected 3D or 4D (batch mode) tensor ..."
- Experiencing memory consumption Increase while experimenting with custom architecture in PyTorch
- TypeError: RandomIoUCrop() requires input sample to contain tensor or PIL images and bounding boxes. Sample can also contain masks
- DataLoader Causing RAM Crashes in PyTorch Federated Learning: Solutions?
- Is it better to store CUDA or CPU tensors that are loaded by torch DataLoader?
- Handling large PyTorch datasets in Google Colab
- Loading image bounding boxes outputs equal size error
- Pytorch Text classification example - error while training, some issues with indexing
- Issues with the creation of dataloader using NeighborLoader/ HGTLoader with Heterogenous Graph
- Can't apply same transform to image and mask for data-augmentation
- Python machine learning pytorch test/train epoch results problem
- Reading files asynchronously hangs within Pytorch Dataset when DataLoader.num_workers > 1
- Can't create a NeighborLoader for HeteroData: 'EdgeStorage' object has no attribute 'num_nodes'
Related Questions in KITTI
- KITTI dataset: ground truth labels (bird's eye view) match after an image generation?
- How Can I create geo-referenced PCD map with KITTI Dataset Ground Truth Data
- Kitti dataset get rgb data for each point in points clouds from corresponding image
- ICP to trajectory estimation, transformation matrix different than groundtruth (point clouds)
- How to label my Lidar data based on Kitti format?
- 2D camera and 3D lidar calibration (kitti format)
- In KITTI dataset, why is it required to divide np.asarray(PIL.Image.open(img_path)) by 256 to get groundtruth depth map?
- Extract the baseline and focal length using KITTI 2015 stereo dataset
- TypeError: transpose() received an invalid combination of arguments - got (tuple), but expected one of: (int dim0, int dim1) or (name dim0, name dim1)
- Please explain Bird's eye view in the KITTI test set performance evaluation table in the VoxelNet paper
- Instance annotations in KITTI-360 2D instacne datasets
- How are the addtional values in KITTI Projection Matrix calculated?
- What is the definition of depth in the task of depth completion - KITTI dataset
- How do we project from camera to lidar coordinate when both of the sensors share same coordinate systems?
- Convert point cloud from pointcloud2 (rosbag) to bin (KITTI)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
I think it is plausible since the KITTI dataset contains depth maps with the corresponding raw LiDaR scans and RGB images (left-image, right-image and depth map) (KITTI). I don't know how exactly the github repo works but the dataset/dataloader should be in a similar format. However, taking a look on the repo files, I think you need only to install the library and then pass as input the root_path of your dataset and the pytorch image transformations.