Video Recordings

Disentangling 20 years of confusion: quo vadis, human evaluation?

Speaker:

David M. Howcroft (Heriot-Watt University, Edinburgh)

Abstract:

Human assessment remains the most trusted form of evaluation in natural language generation, but there is huge variation in terms of both what is assessed and how it is assessed. We recently surveyed 20 years of publications in the NLG community to better understand this variation and conclude that we need to work together to develop clear standards for human evaluations.

Length:

00:49:16

Date:

29/03/2021

Images:

Attachments: (video, slides, etc.)

download

107.7 MB

1121 downloads

admin

Video Recordings

Institute of Formal and Applied Linguistics

Disentangling 20 years of confusion: quo vadis, human evaluation?