Thank you for your great work on ViewCrafter!
We are currently trying to evaluate ViewCrafter on the RealEstate10K dataset as a baseline in our experiments. Specifically, we aim to generate novel views starting from a single image, following a trajectory of target camera poses provided in the dataset.
However, we encountered an issue:
RealEstate10K provides 4×4 camera extrinsic matrices (RT), while the infer.py script in ViewCrafter seems to take rotation angles (Euler or similar) as input.
Could you kindly clarify:
How can we convert the RT matrices to the input format expected by infer.py?
Or is there an existing interface/script in the codebase that directly supports RT input?
We would greatly appreciate your guidance, as we would like to include ViewCrafter in our comparison fairly and correctly.
Thank you for your great work on ViewCrafter!
We are currently trying to evaluate ViewCrafter on the RealEstate10K dataset as a baseline in our experiments. Specifically, we aim to generate novel views starting from a single image, following a trajectory of target camera poses provided in the dataset.
However, we encountered an issue:
RealEstate10K provides 4×4 camera extrinsic matrices (RT), while the infer.py script in ViewCrafter seems to take rotation angles (Euler or similar) as input.
Could you kindly clarify:
How can we convert the RT matrices to the input format expected by infer.py?
Or is there an existing interface/script in the codebase that directly supports RT input?
We would greatly appreciate your guidance, as we would like to include ViewCrafter in our comparison fairly and correctly.