
(1) Generate 100 fake news articles

(2) Remove the roughly 92 articles that Grover detects (at its 92% accuracy rate)

(3) Choose the best of the remaining 8 articles
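The three steps above amount to rejection sampling against the detector. Here is a minimal sketch of that loop, not anything from the Grover paper or codebase: the generator, the detector, and the "quality" score are all hypothetical toy stand-ins, and the 92% figure is treated as an independent per-article detection probability, which is an assumption.

```python
import random

# Figure quoted in the thread; ASSUMED here to be an independent
# per-article probability that the detector flags a generated article.
DETECTION_RATE = 0.92


def make_toy_generator(seed=0):
    """Hypothetical stand-in for a text generator.

    Each 'article' is a dict with a random detector score and a random
    quality score; no real text or real model is involved.
    """
    rng = random.Random(seed)
    counter = 0

    def generate():
        nonlocal counter
        counter += 1
        return {
            "id": counter,
            "detector_score": rng.random(),
            "quality": rng.random(),
        }

    return generate


def toy_is_detected(article):
    """Hypothetical detector: flags an article when its score is below the rate."""
    return article["detector_score"] < DETECTION_RATE


def rejection_sample(generate, is_detected, quality, n=100):
    """Step 1: generate n candidates. Step 2: drop the detected ones.
    Step 3: return the highest-quality survivor (None if none survive)."""
    candidates = [generate() for _ in range(n)]
    survivors = [a for a in candidates if not is_detected(a)]
    best = max(survivors, key=quality, default=None)
    return best, survivors


best, survivors = rejection_sample(
    make_toy_generator(), toy_is_detected, lambda a: a["quality"]
)

# Back-of-the-envelope numbers under the independence assumption:
expected_survivors = 100 * (1 - DETECTION_RATE)  # about 8 of 100
p_at_least_one = 1 - DETECTION_RATE ** 100       # chance that any article slips through
```

Under these assumptions, the attacker only needs one survivor, so the chance of getting at least one undetected article out of 100 is very close to 1 even though each individual article is usually caught.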



"Note that, even if Grover fails to detect a given piece as fake, our findings suggest that releasing many such articles taken together would be relatively easy to spot. Thus, if a source of Neural Fake News disseminates a large number of articles, Grover will be increasingly capable of spotting these articles as malicious."


That doesn't solve the exploit above.


My interpretation is that it would, since if an adversary released a lot of rejection-sampled articles and you retrained Grover on them, then Grover could tag them all as machine-generated.


Of course, to retrain Grover, you'd have to already know that they were machine-generated.


The generated news also needs to look like it was written by a human and be readable by the average human.

So you need a step that filters out obvious junk, maybe by eyeballing or with another net.

It is possible that the only way to beat Grover is with rubbish, such as contorted but valid grammar or overuse of synonyms.



