- ScanNet++ (98 video clips with 32 frames each): TAE
- NYU-Depth V2: OPW<=0.37
- NYU-Depth V2: AbsRel<=0.045 [test: new layout]
- NYU-Depth V2 (640×480): AbsRel<=0.058 [currently no longer up to date]
- DA-2K (mostly 1500×2000): Acc (%)>=86
- UnrealStereo4K (3840×2160): AbsRel<=0.04
- MVS-Synth (1920×1080): AbsRel<=0.06
- HRSD (1920×1080): AbsRel<=0.08
- Middlebury2021 (1920×1080): SqRel<=0.5
- Hybrid with 7×7 synthetic light field views✖️: LPIPS😍 (no data)
- Hybrid with 7×7 synthetic light field views✖️: PSNR😞>=32dB
- Appendix 1: Rules for qualifying models for the rankings (to do)
- Appendix 2: Metrics selection for the rankings (to do)
- Appendix 3: List of all research papers from the above rankings
RK | Model Links: Venue Repository |
TAE ↓ {Input fr.} DAV |
---|---|---|
1 | Depth Any Video |
2.1 {MF} |
2 | DepthCrafter |
2.2 {MF} |
3 | ChronoDepth |
2.3 {MF} |
4 | NVDS |
3.7 {4} |
RK | Model Links: Venue Repository |
OPW ↓ {Input fr.} FD |
OPW ↓ {Input fr.} NVDS+ |
OPW ↓ {Input fr.} NVDS |
---|---|---|---|---|
1 | FutureDepth |
0.303 {4} | - | - |
2 | NVDS+ |
- | 0.339 {4} | - |
3 | NVDS |
0.364 {4} | - | 0.364 {4} |
RK | Model | AbsRel ↓ {Input fr.} |
Training dataset |
Official repository |
Practical model |
Vapour- Synth |
---|---|---|---|---|---|---|
1 | ZoeDepth +PFR=128 ENH: |
0.0388 {1} |
ENH: UnrealStereo4K |
ENH: |
- | - |
RK | Model | AbsRel ↓ {Input fr.} |
Training dataset |
Official repository |
Practical model |
VapourSynth |
---|---|---|---|---|---|---|
1 | ZoeDepth +PFR=128 ENH: |
0.0589 {1} |
ENH: MVS-Synth |
ENH: |
- | - |
RK | Model | AbsRel ↓ {Input fr.} |
Training dataset |
Official repository |
Practical model |
VapourSynth |
---|---|---|---|---|---|---|
1 | DPT-B + R + AL ENH: |
0.074 {1} |
ENH: HRSD |
ENH: - |
- | - |
RK | Model | SqRel ↓ {Input fr.} |
Training dataset |
Official repository |
Practical model |
VapourSynth |
---|---|---|---|---|---|---|
1 | LeReS-GBDMF ENH: |
0.444 {1} |
ENH: HR-WSI |
ENH: |
- | - |
RK | Model | PSNR ↑ {Input fr.} |
Training dataset |
Official repository |
Practical model |
VapourSynth |
---|---|---|---|---|---|---|
1 | LFVRT MDE: DPT Backbone: ViT |
32.66 {3+1D} |
GoPro & TAMULF | MDE: |
- | - |
📝 Note: The above ranking includes only one model, as the other methods are image-based and don't have any temporal information making them unsuitable for light field video reconstruction from monocular video.