Network | Layer | Setting | Output |
---|
Feature extraction | Layer0_1 | 3×3,32 | H×W×32 | Layer0_2 | 1×1,32 | H×W×32 | Layer1_x | | H×W×32 | Layer2_x | (4 pairs) | H×W×64 | Layer3_x | | H×W×128 | Attention mode | Channel, spatial | H×W×128 | Layer4 | 1×1,32 | H×W×32 | Cost volume | Cascade | H×W×D×64 | 3DCNN | 3DLayer0 | | H×W×D×32 | 3DLayer1 | | H×W×D×32 | 3DStack1_1 | | H×W×D×64 | 3DStack1_2 | | H×W×D×64 | 3DStack1_3 | | H×W×D×64 | Network | Layer | Parameter | Output | 3DCNN | 3DStack1_4 | | H×W×D×32 | 3DStack2_1 | | H×W×D×64 | 3DStack2_2 | | H×W×D×64 | 3DStack2_3 | | H×W×D×64 | 3DStack2_4 | | H×W×D×32 | 3DStack3_1 | | H×W×D×64 | 3DStack3_2 | | H×W×D×64 | 3DStack3_3 | | H×W×D×64 | 3DStack3_4 | | H×W×D×32 | Classify | | H×W×D×1 | Disparity regression | | Upsampling | H×W×D | | Regression | H×W |
|