If you are doing a ham/spam type classification, then you won't need the alien language. I am almost a total tech novice and I was able to do well with just some bash scripts. Of course the docs will teach you about better ways to train the system, if you are interested in going from 98% correct classification to 99.5% correct.
No doubt, I'll be combing through all of the CRM114 information on the website. Is there anything that is not referenced there that will be of use?