"Note that, even if Grover fails to detect a given piece as fake, our findings suggest that releasing many such articles taken together would be relatively easy to spot. Thus, if a source of Neural Fake News disseminates a large number of articles, Grover will be increasingly capable of spotting these articles as malicious."
My interpretation is that it would, since if an adversary released a lot of rejection-sampled articles and you retrained Grover on them, then Grover could tag them all as machine-generated.
(2) Remove the 92 articles that Grover detects (with 92% accuracy rate)
(3) Choose the best of the remaining 8 articles