Add color handling to VideoEncoder GPU #1125

Dan-Flores · 2025-12-11T06:18:30Z

This PR resolves this TODO:
// TODO-VideoEncoder: Enable configuration of color properties, similar to FFmpeg.

Support is added for the following parameters:

Color spaces: BT.601, BT.709, BT.2020
Color ranges: tv (limited), pc (full)

As a result, 6 ColorConversionMatrices are stored and used by the NPP color conversion functions. To my understanding, these are the most commonly used or most recent color space parameters (docs for AVColorSpace).

The testing caveats:

FFmpeg 4 and 6 fail the frame equality assertions in test_nvenc_against_ffmpeg_cli. The test only passes when using non-default color range and colorspace (specifically, not limited range and BT601). Since these tests pass on FFmpeg 7 and 8, I suspect there is some issue in FFmpeg 4 and 6. I've opened Enable color parameters in NVENC test on FFmpeg 4 and 6 #1140 to track this issue.
The av1_nvenc test is disabled on FFmpeg 4, as the codec is not implemented.


NicolasHug · 2025-12-11T12:35:25Z

src/torchcodec/_core/CudaDeviceInterface.cpp

-    {-0.148f, -0.291f, 0.439f, 128.0f},
-    // V = 0.439*R - 0.368*G - 0.071*B + 128 (BT.601 coefficients)
-    {0.439f, -0.368f, -0.071f, 128.0f}};
+// RGB to YUV conversion matrices to use in NPP color conversion functions


Can you share how these were derived? What were the original values that were used as reference?

These follow the pattern described in the note in CUDACommon.cpp. I can add a comment referencing that note here.

torchcodec/src/torchcodec/_core/CUDACommon.cpp

Lines 43 to 44 in ee8ce04

// Color space and color range

// ---------------------------

The note is about YUV -> RGB so it's not 100% targeted to what the matrices are doing. But yes, add a ref to that note, it's still useful.

You asked me offline whether we should update the note to explain limited range: yes, we should :)
There's a TODO in the note for that, but I never had the chance to do it - and frankly I forgot the underlying logic lol. If you'd like to give it a go, please do it - in a follow-up PR.
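For readers following the derivation question, here is a minimal sketch of one standard way to build RGB to YUV matrices for 8-bit data from the luma coefficients (Kr, Kb) of each standard. It is not confirmed in this thread that the PR's matrices were produced this way. The function name `rgb_to_yuv_matrix` and the `LUMA_COEFFS` table are hypothetical. The BT.601 limited-range U and V rows it produces agree to three decimals with the coefficients in the removed lines above (0.439, -0.368, -0.071).

```python
# Luma coefficients (Kr, Kb) published in the respective ITU-R recommendations.
LUMA_COEFFS = {
    "bt601": (0.299, 0.114),
    "bt709": (0.2126, 0.0722),
    "bt2020": (0.2627, 0.0593),
}


def rgb_to_yuv_matrix(color_space, full_range):
    """Return a 3x4 matrix: rows are Y, U, V; columns are R, G, B, offset."""
    kr, kb = LUMA_COEFFS[color_space]
    kg = 1.0 - kr - kb

    # Full-range rows: Y is a weighted sum, U and V are scaled color
    # differences (B - Y) and (R - Y) normalized to the range [-0.5, 0.5].
    y = [kr, kg, kb]
    u = [-kr / (2 * (1 - kb)), -kg / (2 * (1 - kb)), 0.5]
    v = [0.5, -kg / (2 * (1 - kr)), -kb / (2 * (1 - kr))]

    if full_range:
        y_scale, c_scale, y_offset = 1.0, 1.0, 0.0
    else:
        # Limited ("tv") range: Y spans 16..235 and chroma spans 16..240.
        y_scale, c_scale, y_offset = 219 / 255, 224 / 255, 16.0

    return [
        [c * y_scale for c in y] + [y_offset],
        [c * c_scale for c in u] + [128.0],
        [c * c_scale for c in v] + [128.0],
    ]


# One matrix per (color space, color range) pair: 3 x 2 = 6 in total.
matrices = {
    (cs, rng): rgb_to_yuv_matrix(cs, full_range=(rng == "pc"))
    for cs in LUMA_COEFFS
    for rng in ("tv", "pc")
}
```

The limited-range case differs from the full-range one only by the 219/255 and 224/255 scale factors and the Y offset of 16, which is the part the note's TODO still has to explain.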

NicolasHug · 2025-12-11T12:43:54Z

test/test_encoders.py

+            assert encoder_metadata["color_range"] == ffmpeg_metadata["color_range"]
+            assert encoder_metadata["color_space"] == ffmpeg_metadata["color_space"]


We'll want to be stricter here:

Suggested change

-            assert encoder_metadata["color_range"] == ffmpeg_metadata["color_range"]
-            assert encoder_metadata["color_space"] == ffmpeg_metadata["color_space"]
+            assert encoder_metadata["color_range"] == ffmpeg_metadata["color_range"] == color_range
+            assert encoder_metadata["color_space"] == ffmpeg_metadata["color_space"] == color_space
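To illustrate why the chained form is stricter, here is a small sketch with made-up metadata dicts. The helper names `loose_match` and `strict_match` are hypothetical and not part of the test suite. The point is that the two outputs can agree with each other while both ignore the requested value, which only the chained comparison catches.

```python
def loose_match(encoder_md, ffmpeg_md, key):
    # Only checks that the two outputs agree with each other.
    return encoder_md[key] == ffmpeg_md[key]


def strict_match(encoder_md, ffmpeg_md, key, expected):
    # Python evaluates a == b == c as (a == b) and (b == c),
    # so this also checks agreement with the requested value.
    return encoder_md[key] == ffmpeg_md[key] == expected


# Hypothetical values: both outputs report "tv" although "pc" was requested.
encoder_metadata = {"color_range": "tv", "color_space": "bt709"}
ffmpeg_metadata = {"color_range": "tv", "color_space": "bt709"}

loose_ok = loose_match(encoder_metadata, ffmpeg_metadata, "color_range")
strict_ok = strict_match(encoder_metadata, ffmpeg_metadata, "color_range", "pc")
```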

NicolasHug · 2025-12-11T12:45:35Z

test/test_encoders.py

-            assert encoder_metadata["pix_fmt"] == "yuv420p"
-            assert ffmpeg_metadata["pix_fmt"] == "yuv420p"


Looks like we're not asserting pix_fmt anymore, which makes it hard to verify that the changes in this PR are correct. IIRC, passing NV12 actually resulted in a yuv420p format at the end. We should try to understand why that was the case. It may not add a lot of value to support both nv12 and yuv420p as parameter values if they're both the same thing (and if they both end up being yuv420 anyway).

I suspect the resulting format depends on the codec implementation. After adding this assertion back, I observed that the deprecated pixel format yuvj420p is set when pc (full) color range is used with h264_nvenc and hevc_nvenc, but not with av1_nvenc.
I can incorporate pixel formats into my benchmarking PR, to see whether using nv12 brings any performance benefit.

The new assertions you added are good. But I personally still do not understand why passing NV12 actually ends up being reported as yuv420.
Is it actually still NV12, and it's just FFmpeg that can't tell the difference? Or is it indeed yuv420?

Until we get a good understanding on that, I think we should refrain from allowing pixel_format with CUDA encoding. We have a surprising behavior that we cannot explain right now: passing NV12 leads to yuv420. If we're surprised, our users will be surprised too, and we won't have a good explanation to give them. It is safer to simply not expose this functionality just yet, and let them rely on the default behavior.
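As background for the NV12 versus yuv420p question, here is a minimal sketch of how the two layouts relate. Both carry 8-bit 4:2:0 samples: yuv420p stores separate Y, U, and V planes, while NV12 stores a Y plane followed by a single plane of interleaved U and V samples. The helper names are hypothetical, and this does not by itself explain what FFmpeg reports for the encoded stream. It only shows that the two formats hold the same sample values in a different memory arrangement.

```python
def yuv420p_to_nv12(y, u, v):
    """Repack planar Y, U, V planes into NV12: a Y plane plus an interleaved UV plane."""
    if len(u) != len(v):
        raise ValueError("U and V planes must have the same size")
    uv = bytearray()
    for u_sample, v_sample in zip(u, v):
        uv += bytes((u_sample, v_sample))
    return bytes(y), bytes(uv)


def nv12_to_yuv420p(y, uv):
    """Split the interleaved UV plane back into separate U and V planes."""
    return bytes(y), bytes(uv[0::2]), bytes(uv[1::2])


# A made-up 4x2 frame: 8 luma samples, and 2x1 chroma samples per plane.
y_plane = bytes(range(8))
u_plane = bytes([100, 101])
v_plane = bytes([200, 201])

nv12_y, nv12_uv = yuv420p_to_nv12(y_plane, u_plane, v_plane)
round_trip = nv12_to_yuv420p(nv12_y, nv12_uv)
```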


Dan-Flores · 2025-12-18T14:48:30Z

src/torchcodec/_core/Encoder.cpp


  if (videoStreamOptions.pixelFormat.has_value()) {
+    // TODO-VideoEncoder: Enable pixel formats to be set by user
+    // and handled with the appropriate NPP function on GPU.


I moved this TODO from setupHardwareFrameContextForEncoding to here in initializeEncoder to centralize pixel_format handling.

The behavior is unchanged: if the pixel_format argument is used while frames are on GPU, an error is raised.
The default usage of nv12 is moved into initializeEncoder as well.

Are we raising an error for pixel_format on GPU because of what Nicolas mentioned below, that passing NV12 leads to yuv420?

Essentially yes. Because we do not understand the codec's behavior yet, we do not want the user to set or expect a pixel format.

Dan-Flores added 4 commits December 11, 2025 05:06

only 2 pixfmts, enable 6 color param combos

cd5f8aa

Merge branch 'main' of https://github.com/meta-pytorch/torchcodec int…

939240b

…o gpu_pix_fmts

comments

b538d13

comments2

0480f1f

meta-cla bot added the CLA Signed label Dec 11, 2025

NicolasHug reviewed Dec 11, 2025


Dan-Flores added 8 commits December 12, 2025 05:44

adjust test, fix pixel format checks

351a55d

keep plumbing, only use nv12

ffcf872

error sooner on gpu on any pixel format

a671f31

skip non-default color params on 4+6, skip av1 gpu on 4

a9ad8e0

Merge branch 'main' of https://github.com/meta-pytorch/torchcodec int…

261549d

…o gpu_pix_fmts

remove unused option

f4d777c

reduce diff

da3a6d7

add TODO, liink issue

1dc7690

Dan-Flores changed the title ~~Add pixel formats and color handling to VideoEncoder GPU~~ Add color handling to VideoEncoder GPU Dec 17, 2025

Dan-Flores added 2 commits December 17, 2025 15:31

restore None test case

ce86d61

reuse codecContext color params, no hardcoded defaults

daf2fda

Dan-Flores marked this pull request as ready for review December 18, 2025 14:48

Dan-Flores commented Dec 18, 2025


