Question about sampling 256 samples for C4 dataset evaluation #65

@nku-ligl

I have a question: why do we draw 256 samples of length seqlen when evaluating on the C4 dataset? I am referring specifically to the get_c4 and get_c4_new functions in datautils.py.
The paper mentions that 128 samples are drawn from C4 as the calibration dataset, but I could not find an explanation in the paper of why 256 samples are used for the C4 perplexity (PPL) evaluation.
Should we evaluate on the full C4 dataset instead? Could anyone familiar with this explain? Thanks a lot!
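
To make the question concrete, here is a minimal sketch of the kind of sampling I mean. It is not the actual code from datautils.py: the helper name sample_eval_windows, the fixed seed, and the toy token list are all assumptions made for illustration. It draws a fixed number of contiguous windows of seqlen tokens from one long token sequence, so the evaluation only ever sees nsamples * seqlen tokens rather than the whole dataset.

```python
import random


def sample_eval_windows(token_ids, nsamples=256, seqlen=2048, seed=0):
    """Hypothetical helper: draw `nsamples` contiguous windows of `seqlen` tokens.

    `token_ids` stands in for a tokenized evaluation split; the real
    datautils.py may select its evaluation segments differently.
    """
    if len(token_ids) < seqlen:
        raise ValueError("token sequence is shorter than seqlen")
    rng = random.Random(seed)  # fixed seed so the evaluation subset is reproducible
    windows = []
    for _ in range(nsamples):
        # randint is inclusive on both ends, so the last valid start is len - seqlen
        start = rng.randint(0, len(token_ids) - seqlen)
        windows.append(token_ids[start:start + seqlen])
    return windows


# Toy stand-in for a tokenized corpus (assumption, not real C4 data).
tokens = list(range(10_000))
windows = sample_eval_windows(tokens, nsamples=256, seqlen=128)
print(len(windows), len(windows[0]))  # prints "256 128"
```

With a fixed seed the same 256 windows are selected on every run, which would keep PPL numbers comparable between runs, but it still leaves open whether such a subset is representative of the full dataset.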
