Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Twitter is supposedly not providing the data

Didn't he get access to the firehose, which is, AFAIK, -ALL- the data? (not the 50GB he tweeted about but supposedly a 2TB/day stream?)



The firehose is all the posts.

It's not all the views. And 'daily active users' are mainly viewing, not posting.

Which makes me wonder why the argument about 'the firehose' and 'bots' matters.


Perhaps because it's not actually about the bots, it's about buyer's remorse and trying to get out of the transaction via some excuse.


But why would the analysis care about bots that are viewing? They aren't creating disinformation by viewing (maybe by "liking" though). Bots-as-followers are benign. The "bot problem" is bots being used to post disinfo at scale to cram the entire space with propaganda, that is: to quote Steve Banon "flood the zone with shit". Which is what bots and troll farms have done successfully over the past 7 years.


Engagement signals are what drive ‘the algorithm’.

Bots that create fake engagement signals (fake clicks, fake media views, fake DM engagement, as well as fake replies and likes) can manipulate what content gets promoted onto real users’ feeds.

From a business point of view, the problem is that if people can manipulate and promote messages on Twitter by using fake engagement, they are not paying Twitter to promote tweets. As well, I guess, as undermining regular users’ confidence that the things that show up in ‘trending topics’ really are ‘trending’.

But bots posting propaganda - if they’re any good at it - might, weirdly be a good thing for Twitter’s numbers, because they create controversial posts that drive engagement into which Twitter can sell advertising.

Looking at the firehose tells you very little about any of that though. There could be a million bots who post lorem ipsum tweets at a rate of hundreds a second and they would all show up in the firehose, but if twitter’s algorithm means none of those tweets are ever shown to a human, does it matter?

Musk’s whole schtick with the bot numbers has just been a thorough demonstration that he has barely even thought about Twitter’s business or what a social network even is.


> But why would the analysis care about bots that are viewing?

Because the business issue is about advertising reach. Bots aren't real advertising targets.


IIRC "the firehose" is around 10%


No, you're thinking of the "decahose." The "Firehose" is the whole thing.

https://developer.twitter.com/en/docs/twitter-api/enterprise...

> The Decahose delivers a 10% random sample of the realtime Twitter Firehose through a streaming connection.


Shouldn’t it be called the Decihose, then? The name is confusing.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: