SRFlow3DNet: Monocular Facial Scene Flow with Optical Flow Branch Regularization

Zhang Jialin; Li Dong

doi:10.12052/gdutxb.260010

Zhang Jialin, Li Dong. SRFlow3DNet: monocular facial scene flow with optical flow branch regularizationJ. Journal of Guangdong University of Technology. DOI: 10.12052/gdutxb.260010

Citation:

Zhang Jialin, Li Dong. SRFlow3DNet: monocular facial scene flow with optical flow branch regularizationJ. Journal of Guangdong University of Technology. DOI: 10.12052/gdutxb.260010

Citation:

Zhang Jialin, Li Dong. SRFlow3DNet: monocular facial scene flow with optical flow branch regularizationJ. Journal of Guangdong University of Technology. DOI: 10.12052/gdutxb.260010

SRFlow3DNet: Monocular Facial Scene Flow with Optical Flow Branch Regularization

Graphical Abstract

Abstract

Abstract

Existing scene flow datasets are mainly designed for general or autonomous driving scenarios, lacking high-resolution, geometrically consistent facial motion data, which limits effective modeling of 3D facial motion. In this research, a monocular RGB (Red, Green and Blue) facial scene flow estimation model is studied , aiming to accurately predict per-pixel 3D motion in facial regions from consecutive image sequences. To this end, the SRFlow3D dataset (Splatting Rasterization Flow 3D) is proposed, which leverages dynamic 3D Gaussian representations with splatting-based rasterization to simultaneously render RGB images, optical flow, and depth-related supervision data. Based on the SRFlow3D dataset, the SRFlow3DNet (Splatting Rasterization Guided Flow 3D Network) is proposed, which jointly predicts optical flow and depth variation along the viewing direction under monocular RGB input to obtain per-pixel 3D scene flow, and introduces optical flow branch regularization to enhance geometric and temporal consistency of non-rigid facial motion. Experimental results show that, compared with existing scene flow estimation methods, SRFlow3DNet reduces the optical flow metric End-Point Error (EPE) from 0.4984 to 0.3768 and the scene flow metric 3D End-Point Error (EPE3D) from 1.0826 to 0.4308, achieving significant performance gains in the monocular RGB facial scene flow estimation task.

FullText(HTML)

References (25)

Cited By

Turn off MathJax

Article Contents

SRFlow3DNet: Monocular Facial Scene Flow with Optical Flow Branch Regularization

Abstract

Catalog

Export File

Citation

Format

Content