Ah sorry. So there are three coloured buttons. When you hold one, the site takes a series of photos from your webcam and assigns them to that "class". Then it trains and starts classifying your video input live.
It's a pretty neat way of creating a reasonable training set of 3 classes.
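If it helps to picture the flow without running the demo, here's a rough sketch in Python. To be clear, this is not how the site is implemented: the `ButtonTrainer` class, the colour names, the tiny two-number "frames" and the nearest-centroid classifier are all my own stand-ins, just to show the hold-to-label, then train, then classify loop.

```python
import math
from collections import defaultdict


class ButtonTrainer:
    """Toy stand-in for the demo: one class per coloured button (hypothetical)."""

    def __init__(self):
        self.samples = defaultdict(list)  # class label -> list of feature vectors
        self.centroids = {}               # class label -> mean feature vector

    def hold_button(self, label, frames):
        # While a button is held, every captured frame is filed under its class.
        self.samples[label].extend(frames)

    def train(self):
        # Assumption: a nearest-centroid model, so "training" is just averaging
        # the frames collected for each class.
        for label, frames in self.samples.items():
            dims = len(frames[0])
            self.centroids[label] = [
                sum(frame[i] for frame in frames) / len(frames) for i in range(dims)
            ]

    def classify(self, frame):
        # Pick the class whose centroid is closest to the incoming frame.
        return min(
            self.centroids,
            key=lambda label: math.dist(frame, self.centroids[label]),
        )


trainer = ButtonTrainer()
# Placeholder labels and made-up feature vectors standing in for webcam photos.
trainer.hold_button("red", [[0.9, 0.1], [0.8, 0.2]])
trainer.hold_button("green", [[0.1, 0.9], [0.2, 0.8]])
trainer.hold_button("blue", [[0.5, 0.5], [0.6, 0.4]])
trainer.train()

print(trainer.classify([0.85, 0.15]))  # prints "red"
```

In the real thing the frames would be webcam images and `classify` would run on every new frame of the live video, but the shape of the loop should be about the same.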
I can't run the demo here (browser not capable enough, and no camera) and I'm getting really curious what this is about.