Show conversations instead of decoded text in the completions table by qgallouedec · Pull Request #5309 · huggingface/trl

qgallouedec · 2026-03-19T02:40:44Z

Two related fixes to make the completions table more readable, especially for tool-calling and multi-turn setups.

Changes

Log conversations instead of decoded strings (WDYT?)

Previously, prompts and completions were decoded with batch_decode (flat strings) before being logged:

Now the raw conversation lists are logged directly and passed to print_prompt_completions_sample (which already knows how to render them turn-by-turn):

Tradeoff: the rendered system prompt (including tool definitions injected by the chat template) is no longer visible in the table, since we show the original messages rather than the fully-templated string. The gain in readability for multi-turn and tool-calling conversations outweighs this.

And for VLMs:

What do you guys think? What's the most useful?

Fix: render `tool_calls`

Tool-calling turns were silently blank (the assistant message has tool_calls instead of content), which the old code ignored. Now renders as name(arg=value, ...):

Note

Medium Risk
Changes the shape/serialization of logged prompt/completion data (strings -> message lists) and alters how multimodal/tool-call content is rendered, which may affect downstream logging/analysis expecting decoded text.

Overview
Completions logging now records full chat conversations instead of decoded strings in GRPOTrainer and RLOOTrainer, improving readability for multi-turn and tool-calling rollouts.

The completions table/console rendering is updated to display tool_calls and VLM blocks (showing text blocks and a [IMAGE] placeholder), and a new _strip_images_from_messages helper removes non-serializable PIL objects before writing parquet / logging tables. Tests are updated to assert the new tool-call rendering output.

^{Reviewed by Cursor Bugbot for commit 00a5fd0. Bugbot is set up for automated code reviews on this repo. Configure here.}

HuggingFaceDocBuilderDev · 2026-03-19T02:44:23Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

cursor · 2026-03-19T02:46:44Z

-        self._logs["completion"].extend(gather_object(completions_text))
+        # Log prompts and completions
+        self._logs["prompt"].extend(gather_object(prompts))
+        self._logs["completion"].extend(gather_object(completions))


Logging conversation lists breaks drop_duplicates on unhashable type

High Severity

Previously self._logs["prompt"] contained flat strings from batch_decode, but now it stores conversation lists (list of dicts). Downstream, when log_unique_prompts=True, df.drop_duplicates(subset=["prompt"]) is called on these values. Lists are unhashable in Python, so this will raise a TypeError and crash the logging step. This affects both grpo_trainer.py and rloo_trainer.py.

Additional Locations (1)

trl/trainer/rloo_trainer.py#L1318-L1321

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 30a67b60a8

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

…ages

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

There are 2 total unresolved issues (including 1 from previous review).

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit 00a5fd0. Configure here.}

cursor · 2026-04-04T03:44:06Z

-                        axis=1,
-                        copy=False,
-                    )
+                        images.append([logging_backend.Image(image) for image in image_list])


Removed null guard crashes on None image lists

High Severity

Removing the if image_list: guard in the image logging loop causes a TypeError. In VLM batches, self._logs["images"] can contain None entries for prompts without images. Iterating over these None values raises TypeError: 'NoneType' object is not iterable, crashing logging for mixed-image batches.

^{Reviewed by Cursor Bugbot for commit 00a5fd0. Configure here.}

AmineDiro · 2026-04-04T15:03:41Z

I think showing decoded text is better but I terminal based debugging can be limited especially for very long outputs. IMHO, we'll need to separate the debug info from how we vizualize it. We can decide to use UI based debugging (using trackio for example) or build specific TUI for the debug info. wdyt @qgallouedec ?

qgallouedec added 2 commits March 19, 2026 02:35

Show conversations (not decoded text) in the completions table

de62232

style

90bcb30

qgallouedec requested review from AmineDiro, albertvillanova and lewtun March 19, 2026 02:41

Merge branch 'main' into log-conversations-and-tool-calls

30a67b6

cursor Bot reviewed Mar 19, 2026

View reviewed changes

chatgpt-codex-connector Bot reviewed Mar 19, 2026

View reviewed changes

Comment thread trl/trainer/grpo_trainer.py

Add _strip_images_from_messages utility to clean image data from mess…

9987042

…ages

cursor Bot reviewed Mar 19, 2026

View reviewed changes

Comment thread trl/trainer/rloo_trainer.py Outdated

Merge branch 'main' into log-conversations-and-tool-calls

dc5c026

cursor Bot reviewed Mar 25, 2026

View reviewed changes

Comment thread trl/trainer/utils.py

qgallouedec and others added 4 commits March 25, 2026 15:56

revert

359426d

Strip images from completion logs in RLOOTrainer

b906865

consistency

d02242d

Merge branch 'main' into log-conversations-and-tool-calls

f0f766f

cursor Bot reviewed Mar 27, 2026

View reviewed changes

Comment thread trl/trainer/rloo_trainer.py Outdated

qgallouedec commented Apr 2, 2026

View reviewed changes

Comment thread trl/trainer/rloo_trainer.py Outdated

qgallouedec added 2 commits April 2, 2026 15:47

Apply suggestions from code review

16a822d

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

Merge branch 'main' into log-conversations-and-tool-calls

00a5fd0

qgallouedec marked this pull request as draft April 4, 2026 03:34

cursor Bot reviewed Apr 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Show conversations instead of decoded text in the completions table#5309

Show conversations instead of decoded text in the completions table#5309
qgallouedec wants to merge 11 commits intomainfrom
log-conversations-and-tool-calls

qgallouedec commented Mar 19, 2026 •

edited by cursor Bot

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Mar 19, 2026

Uh oh!

cursor Bot Mar 19, 2026

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor Bot left a comment

Uh oh!

cursor Bot Apr 4, 2026

Uh oh!

AmineDiro commented Apr 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

qgallouedec commented Mar 19, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Log conversations instead of decoded strings (WDYT?)

Fix: render tool_calls

Uh oh!

HuggingFaceDocBuilderDev commented Mar 19, 2026

Uh oh!

cursor Bot Mar 19, 2026

Choose a reason for hiding this comment

Logging conversation lists breaks drop_duplicates on unhashable type

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor Bot Apr 4, 2026

Choose a reason for hiding this comment

Removed null guard crashes on None image lists

Uh oh!

AmineDiro commented Apr 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

qgallouedec commented Mar 19, 2026 •

edited by cursor Bot

Loading

Fix: render `tool_calls`

Logging conversation lists breaks `drop_duplicates` on unhashable type