MLLM eval pitfalls #1

@sam-motamed

Very cool work! I am wondering how robust MLLM evals are, given that even the best models make lots of mistakes when videos are physically implausible.

as reference;
