They are seeing 60k downloads of 6GB models per day, which works out to about 33Gbps of sustained bandwidth (assuming no burstiness in when people visit, which is a poor assumption).
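A quick back-of-envelope check of that figure (the 60k/day and 6GB numbers come from the thread; the rest is just unit conversion):

```python
# Back-of-envelope: sustained bandwidth for 60k downloads of a 6GB model per day.
downloads_per_day = 60_000
model_size_gb = 6  # gigabytes per download

seconds_per_day = 24 * 60 * 60  # 86,400

total_gigabits_per_day = downloads_per_day * model_size_gb * 8  # GB -> Gb
sustained_gbps = total_gigabits_per_day / seconds_per_day

print(f"{sustained_gbps:.1f} Gbps")  # ~33.3 Gbps, if traffic were perfectly even
```

Real traffic peaks well above that average, which is why the "no burstiness" caveat matters.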
That is starting to get out of the range of what is easily available. 10Gbps circuits are commodities (I had one at my desk at my last job), but 100Gbps circuits are still pretty pricey. And it's not necessarily trivial to get that kind of throughput out of a file server out of the box; this bandwidth is in the range of CPU <-> video card transfers, not disk <-> CPU or CPU <-> network. Some tweaking is for sure going to be necessary if you are self-hosting this, and now you're tuning network parameters and writing a custom file server instead of writing your game.
The cloud here is making something possible that otherwise wouldn't have been, which is pretty cool. Being able to go from zero infrastructure to 30Gbps of file serving without lifting a finger is somewhat impressive... but with that fast iteration time comes the entity that did all the work wanting its cut. That seems fair to me, though perhaps not economically viable. Such is life.
Wait... you mean the actual computation is running client-side in the browser? I didn't even open this "game", but I assumed the high cost was because there is a separate GPT-2 running on a GPU for each and every user.
They send it to Google Colab, which lets you run the model on a powerful server for free. 6GB of download, even at this crazy bandwidth cost, is still cheap versus provisioning that kind of beast of a server themselves for 60k people each day. I remember when I tried it, it used 10GB of memory, which was crazy!
You'd save on transferring the bytes around, but now you would have to self-host Jupyter and the GPUs it uses. That is going to be even more expensive than IP transit, because now you need enough 12GB GPUs in your datacenter to serve 60,000 sessions a day.
Like I said before, this is one of those things that wouldn't exist without the Cloud. If you run things on your user's computers, you have to send them a lot of bits. If you run things on your own computers, you're spared that bandwidth, but now have to have enough "computers" to satisfy your users. It's simply something that's not super cheap to run these days.
I will admit that it is surprising that Google <-> Google traffic is billed at the normal egress rates, but the reasoning does make sense -- a 30Gbps flow is nothing to sneeze at. That is using some tangible resources.
No, the models are running in Colab - but in the end-user's account - so each 'run' costs the lab 6GB of internet egress when the model is downloaded from GCS to the Colab VM.
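To put a rough number on that egress: the per-GB price below is an assumption for illustration (GCP internet egress has historically been on the order of $0.08–0.12/GB depending on tier and destination), while the run count and model size come from the thread.

```python
# Rough daily egress bill, assuming every run re-downloads the full 6GB model.
runs_per_day = 60_000
model_size_gb = 6
egress_price_per_gb = 0.12  # USD; assumed list-price tier, not a quoted figure

daily_egress_gb = runs_per_day * model_size_gb          # 360,000 GB/day
daily_cost_usd = daily_egress_gb * egress_price_per_gb  # ~$43,200/day at this rate

print(f"{daily_egress_gb:,} GB/day -> ${daily_cost_usd:,.0f}/day")
```

Even if the real negotiated rate is much lower, the order of magnitude explains why a lab would care who pays for each download.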