Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Not sure why you put those things in quotes, that's kind of strange.

That aside, the training isn't blind, it's guided, and it's likely they use verified correct sources of info to train for some things, like medical diagnoses.



I can help with "verified correct sources", have a look at "Language Models are Few-Shot Learners" section 2.2 [1].

You may also be interested in Apendix A in the same document: "Details of Common Crawl Filtering"

[1] https://arxiv.org/pdf/2005.14165.pdf




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: