Issues for large (many channels) bots

night · October 8, 2014, 12:09am

This would be extremely nice. If you must disconnect us, possibly make higher rate limits that do just that. Error messages allow us to script responses to react to those message, and rate limit on our own ends.

brildum · October 8, 2014, 1:36am

I thought I’d just share that I’ve added the following tasks to our queue:

Create API to expose log of moderator actions
Make it easier to bots to operate (regarding rate-limits. This is vague because it will require some serious thought as to the best way to accomplish this. Additionally, there will probably be discussions on here at some point to help us make decisions)
Timeouts should not override longer timeouts
/mods command can exceed max IRC message length
Group chat /ban does not work

Keep in mind that these are on the queue but have not been prioritized or scheduled yet, so I can’t comment regarding when to expect any of these changes.

I’ll also add that we hope to start writing at least one blog post related to the chat system each month. Discussing what we’ve done and what we plan on doing, both from a technical and users perspective.

BatedUrGonnaDie · October 8, 2014, 9:21am

Just a small concern for bots that have different names for each channel. You talked about adding connection limits, but if you have a bot that you can customize the name for each channel based on what the user wants you need to open a new connection. I’m curious as to how this would be possible if you implemented connection limits per IP besides just spreading them out over IP’s.

brildum · October 8, 2014, 3:31pm

The rate-limit could be sufficiently high to allow it to not be an issue. Whether connection limits will ever be a reality, I cannot say. It is something we may want to add, to avoid someone trying to DOS our servers by opening too many connections.

expertsonline · October 8, 2014, 4:20pm

I love this idea! The Mods aspect should be easy to implement as the permissions module is already present. A much higher limit/no limit in channels the bot is modded would be a welcome addition and help reduce the amount of resources bot developers need to spend on working around these limits with additional connections and proxies.

However, the way elevated mod limits are currently implemented would also need to change. Currently, if 30 messages are sent to the modded channel and one message is sent to an un-modded channel, then the bot gets IP banned as it counts as 31 messages on an un-modded channel. This forces bot developers to restrict each connection to one channel.

Knoober · October 11, 2014, 8:03pm

I am also struggling with the limits on my bot (knoobot.com). I have all messages/commands/joins funneling through a processor for each that fires off at specific intervals to prevent the ban limit from being reached. Needless to say, that isn’t very scaleable.

@brildum - how soon do you think Twitch would be able to start the bot white-list application process?

For tracking moderators, Knoobot listens for the MODE +o and -o and maintains a local list per channel.
@brildum would it be possible to maintain a list in the kraken API?
Something like: api.twitch.tv/kraken/chat/moderators ?

cerebellum42 · October 11, 2014, 9:48pm

@Knoober there are two existing solutions to this problem. You can call http://tmi.twitch.tv/group/user/channel/chatters which includes the list of moderators for a channel (but also a list of all other chatters in that channel) or you can send .mods to the channel that you are in and simply listen to the “The moderators of this room…” message.

One note about the latter Method: You need to be in TWITCHCLIENT 3 mode to be able to identify which channel the message with the moderators is sent to. Otherwise it will be PRIVMSG’d to your bot directly instead of to the channel.

Knoober · October 14, 2014, 1:36am

@cerebellum42 thanks for the info. I think the chatters will work better since it is one less command to send to the twitch irc server. JSON objects are easier to work with as well.

mca64 · October 14, 2014, 2:27pm

little off topic

@brildum i made twitch chat directly into a game (StarCraft: Brood War) but for some users from Korea they get disconnected after few minutes. For rest all works stable. Any suggestion?

brildum · October 14, 2014, 4:15pm

My guess would be some connection issues between our servers in San Francisco and those users’ ISP, since its happening to a subset of users from a very specific location. If the round-trip time between USA and Korea is long enough and the chat room is active enough, they may cross the 20 message buffer on the server and get disconnected.

mcuniverse2014 · January 11, 2015, 8:29pm

You can skip the rate limit by connecting directly with the IRC server. Using some script servers like mIRC will cause the rate limit because there is a pause somewhere in the sending of the data.

If you can constantly stream data to the IRC server, there is no rate limit. This is how these bots are flooding channels. The only time you will see them drop is when there is a major packet loss from there end. This is how they create view bots.

To control those bots, you need a verification of actual viewers versus actual people connected to the Twitch IRC API. This is a very stupid design from my point of view. So one IP address can create thousands of users. Please fix this twitch!!! Allow only a limited amount of accounts from one IP address and you will stop a significant majority of view botters.

But in order to control the chat through moderation, you have to have a separate program connected to the IRC server that would handle all the chat commands coming through the channel. The twitch chat servers can’t handle all the API’s running then moderators trying to do there job.

IMO, if Twitch would limit IP’s, moderators would have it extremely easy

Sunspots · January 12, 2015, 3:05am

Where are you getting this information?
I doubt the chat has anything to do with viewbotting.
Stream views and the chat should be two completely separate systems.

Ninja101 · January 12, 2015, 3:30am

Chat users by themselves don’t affect the view count at all

mcuniverse2014 · January 12, 2015, 7:20am

I have been programming for over 30 years and I have been working with the twitch IRC servers for over a year. Not that I do anything like what I am talking about. But it is there and I know about it. Twitch can check my IP logs on me if they want. I have nothing to hide, but I do know how they are doing the view botters.

mcuniverse2014 · January 12, 2015, 7:28am

Yes they do, that is the only way to do the view count. Think about this a second. What type of computer would you need to load 1000+ API’s at once just to load the Java API for chat, Not much really. All you would need is the memory.

Now how good of a computer would you have to have in order to load 1000’s of API’s for the video, that is IF the view count was based upon actual video users? Move over Watson, I have a 1000 computers in my bedroom. Not to be an ass about it, but it would physically not be possible. You could hijack a 1000 computers, log into their twitch account, load up the API for the video, and view a certain channel. The chances of this are probably smaller than the previous one.

So the way twitch is using the view counter, grabbing it from IRC users in your channel, seems more like what is happening. Please a twitch programmer notice this and correct me if I am wrong. I am trying to combat these assholes that are ruining a good thing.

george · January 12, 2015, 10:13am

IRC users have no affect on a channel’s viewer or views count. You have to be actively requesting the video feed to be counted as a viewer.

mcuniverse2014 · January 12, 2015, 10:40am

Question then: Why when I logon through an external IRC client, I can see all the viewers in the chat channel, but in the Web Browser API chat the count is different?

I only see a few of those users. Even waiting 5 minutes and refreshing the list in web browser does not show the total number of users that are in the IRC channel. Does not matter how long I wait, The IRC user count is a lot higher than what is shown in the chat list, and the count in IRC is close to viewer count.

Fugiman · January 13, 2015, 12:17am

Because the web client filters out “anonymous” irc users.

Concurrent viewers is based off of who is downloading the video. It has no relation to chat.

Brandon · February 28, 2015, 11:06pm

night:

In addition, there’s also been random complications when running bots large scale, more recently (as stated in my other post) is the server-side 20 messages queue. It’s extremely easy to fill these 20 messages, and the slightest lag spikes (like a networking dropout) cause a massive disconnection issue (and with the number of connections bots have, reconnecting to hundreds of connections that just disconnected is like booting up a bot on cold start – it takes forever due to rate limits). If you’d like to see how buggy that queue is, try joining the top 20 channels on Twitch with an IRC client without TWITCHCLIENT 2+ passed and you’ll get insta-disconnected (here’s an example of what you can paste into a telnet window to test with as of 11:45AM EST: NICK justinfan43545345JOIN #starladder1JOIN #nightblue3JOIN #wcs_europeJ - Pastebin.com). Getting people who could invest time to help debug these issues would be extremely helpful.

So how are people handling this? Just spreading out large channels evenly? Are there any cases that are so extreme that a single channel would require its own connection for receiving? My local dev server is struggling hard to even join 3 of the top 20 channels right now, sometimes not even 1 of them, and this is with TC3. I can join all 20 when I push this code to my Heroku production environment with TC3. I suspect that my local connection (likely the router) is responsible for this when developing locally, but it seems pretty insane that I’m being dropped after 1 or 2 channels. I’ll troubleshoot that on my own, but my original question is still valid.

night · February 28, 2015, 11:59pm

I would imagine that with 1 or 2 channels you will not run into this case I mention. The reason the top 20 channels causes you to be disconnected is due to the NAMES replies being incredibly long. Additionally, the JOINs and PARTs of large channels can contribute to this issue as they are sent out in batches every ~20 seconds. With TC3, you really shouldn’t have any issue with channels unless you’re in multiple (more than 1 or 2) fast moving ones on the same connection.