This quiz throws at you 10 football articles, more
precisely match reports. Your task is to determine whether they were
written by a human journalist or generated by machine.
The results will aid my master's thesis on conditional text
generation. Basically this website serves as a human judgement
A couple of things to be aware of:
- Titles are normalized both in generated
and real articles. Each title has format team1 ? team2
where "?" is an inequality sign, denoting who won
the reported match.
- Images may or may not be relevant to the article - they
are selected by machine also for real articles. Sometimes
images may not be there at all.
- Articles (both real and generated) are cropped after a few
sentences, so don't be surprised if an article is unfinished.
The number of sentences displayed is random (between 2 and 10)
because I want to measure, for each article, how the ability to
fool the reader decreases with length.
- At the bottom of some articles there are normalized
match reports, such as:
Match ends, Salford City 2, Bromley 1.
Second Half ends, Salford City 2, Bromley 1.
Goal! Salford City 2, Bromley 1. Nathan Pond (Salford City).
Note that they present events in reverse-chronological order
and that they occur both in real and fake texts.
Generated articles are mostly grammatically correct, so
instead try to look for logical incoherences. For instance,
if the article states that there was a goalless draw and
later that someone has scored, it's clearly generated.
If you know anything about football, try to incorporate
this knowledge :) If an article states that someone has
scored, but you know they are a goalkeeper, it's probably
fake (unless it's Rogério Ceni).
After each article you will get feedback whether your
answer was correct or not.
A pre-trained Transformer
with 355M trainable parameters finetuned on BBC Sport match
reports for 16000 iterations.
Feel free to take the test multiple times if you want to improve
your sensitivity to fake news :)