BeBot - An Anarchy Online and Age Of Conan chat automaton

Archive => Anarchy Online Archive => BeBot 0.4 support => Topic started by: Dockwagon on July 31, 2007, 08:27:50 pm

Title: Problem accessing fc webpages
Post by: Dockwagon on July 31, 2007, 08:27:50 pm
I've been struggling a bit with whois_update, and I'm not sure what to do anymore.

On the server with BeBot:

user@machine:~/bot$ wget http://www.anarchy-online.com/org/stats/d/1/name/7745538/basicstats.xml
--20:01:16--  http://www.anarchy-online.com/org/stats/d/1/name/7745538/basicstats.xml
           => `basicstats.xml'
Resolving www.anarchy-online.com... 216.74.158.92
Connecting to www.anarchy-online.com|216.74.158.92|:80... failed: Connection refused.
user@machine:~/bot$ wget www.vg.no
--20:11:37--  http://www.vg.no/
           => `index.html'
Resolving www.vg.no... 193.69.165.21
Connecting to www.vg.no|193.69.165.21|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: 181.306 (177K) [text/html]

100%[====================>] 181.306      121.77K/s             

20:11:40 (121.43 KB/s) - `index.html' saved [181306/181306]

On another server I have access to, but can't use:

anothermachine:~$ wget http://www.anarchy-online.com/org/stats/d/1/name/7745538/basicstats.xml
--20:01:50--  http://www.anarchy-online.com/org/stats/d/1/name/7745538/basicstats.xml
           => `basicstats.xml'
Connecting to www.anarchy-online.com:0... connected!
HTTP request sent, awaiting response... 200 OK
Length: 78,092 [text/xml]

    0K .......... .......... .......... .......... .......... 65% @  84.18 KB/s
   50K .......... .......... ......                          100% @ 175.08 KB/s

20:01:51 (102.50 KB/s) - `basicstats.xml' saved [78092/78092]

I've been trying to figure out how to use a proxy server with PHP and BeBot, without much success. Any suggestions?
FC suggests that I clear the cookies in my web browser, which isn't very helpful.
Title: Re: Problem accessing fc webpages
Post by: Alreadythere on July 31, 2007, 08:41:46 pm
It seems like someone at FC is blocking the IPs of bot users, most likely because of the potentially high number of HTTP queries the whois lookups can produce each day.

Same for me, I can't access www.anarchy-online.com at all, regardless of the way I try to connect - firefox, wget, lynx, telnet, whatever. Always ends in a connection refused right away, even though those clients aren't blocked by any firewall on my side.
Title: Re: Problem accessing fc webpages
Post by: Dockwagon on July 31, 2007, 08:45:50 pm
Hm, that is not good.

Whole org takes a hit when they do stunts like this.

But is there any way to get in touch with FC and set up some servers that mirror the user database?

Then they would only have 1-10 bots crawling their website, and the bandwidth/CPU used would be paid for by the users willing to mirror. I can mirror, no problem.
Title: Re: Problem accessing fc webpages
Post by: Dockwagon on July 31, 2007, 08:51:24 pm
Hm, does PHP have something like what Python calls cPickle?

Because crawling the DB and saving the data we need as a gzipped cPickle would be easier on bandwidth.

Title: Re: Problem accessing fc webpages
Post by: Alreadythere on July 31, 2007, 08:57:35 pm
No clue how cPickle relates to the problem - I haven't worked with python yet, but serialization doesn't seem to be our problem, and that seems to be the only thing cPickle offers.

Though any program mirroring the FC site doesn't have to be written in PHP, any language works. Only reason I use php for the whois update script is the easy way it accesses DBs, the fact that I got some experience with php and that bebot is written in php.
Title: Re: Problem accessing fc webpages
Post by: Dockwagon on July 31, 2007, 09:13:12 pm
Can you help me make fsockopen use a proxy then? My PHP-fu is weak :/
I've tried the example from http://www.ziguras.com/php/using-fsockopen-to-connect-to-remote-servers but can't get it to work properly. Any help would be appreciated :-)

I tested some anonymous proxies from http://www.proxy4free.com/page1.html, and wgetting www.anarchy-online.com worked after: export http_proxy=http://url:port

cPickle is just for storing and reading objects. Objects can be data structures, functions, etc. It's a quick and easy way to do it in Python. I just thought that distributing a dump of the data we need for whois from the XML could save bandwidth if we spread it via mirrors.
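
A minimal sketch of the gzipped-pickle idea, written in Python since that is where cPickle comes from (in Python 3 it is simply the pickle module). The function names, the file name and the sample record are made up for illustration; nothing here is part of BeBot. On the PHP side, serialize() and unserialize() are the rough equivalent.

```python
import gzip
import pickle


def dump_whois(entries, path):
    """Write a dict of whois entries to a gzip-compressed pickle file."""
    with gzip.open(path, "wb") as fh:
        pickle.dump(entries, fh, protocol=pickle.HIGHEST_PROTOCOL)


def load_whois(path):
    """Read the entries back. Only unpickle files from a source you trust."""
    with gzip.open(path, "rb") as fh:
        return pickle.load(fh)


if __name__ == "__main__":
    import os
    import tempfile

    # Hypothetical record layout, not the actual BeBot whois table.
    sample = {"Examplechar": {"level": 220, "org": "Example Org"}}
    path = os.path.join(tempfile.mkdtemp(), "whois.pkl.gz")
    dump_whois(sample, path)
    print(load_whois(path) == sample)
```

Note that unpickling data from an untrusted mirror can run arbitrary code, so a plain text or JSON dump is probably the safer format for a public mirror.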
Title: Re: Problem accessing fc webpages
Post by: Alreadythere on July 31, 2007, 10:46:44 pm
Never tried to get fsockopen to work through proxies. Besides, that would just shift the problem, not solve it.
Title: Re: Problem accessing fc webpages
Post by: Alreadythere on July 31, 2007, 11:04:36 pm
Just tested the proxy example on the page you posted, and it worked with my private squid proxy.
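
For comparison, routing a request through an HTTP proxy can be sketched in Python with the standard library. This is not the fsockopen code from the linked page and not anything from Bot.php; the function names and the proxy addresses are hypothetical, and the random pick only illustrates the "list of proxies with random selection" idea from this thread.

```python
import random
import urllib.request


def pick_proxy(proxies, rng=random):
    """Pick one proxy URL at random from a non-empty list."""
    return rng.choice(proxies)


def make_opener(proxy_url):
    """Build an opener that sends plain-HTTP requests through proxy_url."""
    handler = urllib.request.ProxyHandler({"http": proxy_url})
    return urllib.request.build_opener(handler)


def fetch(url, proxies, timeout=10):
    """Fetch url through a randomly chosen proxy and return the raw bytes."""
    opener = make_opener(pick_proxy(proxies))
    with opener.open(url, timeout=timeout) as resp:
        return resp.read()


if __name__ == "__main__":
    # Placeholder addresses; replace with proxies you are allowed to use.
    proxy_list = ["http://proxy1.example:3128", "http://proxy2.example:8080"]
    print(pick_proxy(proxy_list))
```

As noted earlier in the thread, a proxy only moves the traffic to another IP; it does not reduce the number of queries hitting the site.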
Title: Re: Problem accessing fc webpages
Post by: Dockwagon on July 31, 2007, 11:24:13 pm
Proves that me and php == incompetence.

Did you include it into bebot's Sources/Bot.php ?

It seems straightforward, but somehow I get it wrong. If you've included it in Bot.php, is it possible to get a copy? :-)

Making a list of lots of different anonymous proxies with random selection is something I think I can do myself.
If I get it working, I could open the table that stores character info to the world with read-only access.
Title: Re: Problem accessing fc webpages
Post by: Alreadythere on July 31, 2007, 11:44:12 pm
I tried it with my proxy (just the code snippet from the linked page). With the few random proxies from your list that I tested, it timed out and I never got any result. Either those expect some login or they are seriously lagged.

If we do some central updating solution, a text listing, or even just a diff of the entries changed since the day before, would be best I think. Then just zip or gzip it to reduce total bandwidth usage.
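
The daily-diff idea could look roughly like this. It is only a sketch under assumptions: snapshots are taken to be dicts keyed by character name, JSON is picked as the text format, and all function names are invented for the example.

```python
import gzip
import json


def diff_snapshots(old, new):
    """Return entries that were added or changed, plus names that vanished."""
    changed = {name: rec for name, rec in new.items() if old.get(name) != rec}
    removed = sorted(name for name in old if name not in new)
    return {"changed": changed, "removed": removed}


def pack(diff):
    """Serialize the diff as JSON text and gzip it for distribution."""
    return gzip.compress(json.dumps(diff, sort_keys=True).encode("utf-8"))


def unpack(blob):
    """Reverse of pack()."""
    return json.loads(gzip.decompress(blob).decode("utf-8"))


def apply_diff(old, diff):
    """Bring a copy of yesterday's snapshot up to date with a diff."""
    merged = dict(old)
    merged.update(diff["changed"])
    for name in diff["removed"]:
        merged.pop(name, None)
    return merged
```

A mirror would publish one packed diff per day, and each bot would apply it to its local table instead of querying the XML files itself.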
Title: Re: Problem accessing fc webpages
Post by: Dockwagon on August 01, 2007, 12:24:09 am
You need to use the proxies that are listed as anonymous.

If it were possible to get in touch with FC and work out a solution: Tuzzy downloaded all the XML files a year or so ago, and she reports that they came to roughly 590 MB.
Looking at the whois table in BeBot 0.4, not much information is stored, and I can understand FC wanting to stop what can be considered a DoS attack. As I understand it, whois_cache, which is now included in BeBot, tries to cache every member of any org that someone has done a !whois on. This would grow quite a bit :-)

I haven't done much DB replication in MySQL (i.e. none), but if it were possible to let MySQL handle the replication, could that be an idea? Or maybe the config/security issues are too great? A diff seems like a very simple solution too :-)

Or make a centralized !whois DB, a bit like what Vhab has done with items.

The key for a permanent solution I think would be to get in touch with fc and make an acceptable solution with them.
Title: Re: Problem accessing fc webpages
Post by: Wolfbiter on August 01, 2007, 04:10:05 am
My cache was up to ~120k players at one point, and of those 25k were orgless. That made the update script take a long time to finish (and query the XML files ~26k times during that time). One way to limit the amount would be to clear the cache of orgless people now and then, since they account for most of the XML queries in a large cache.

Another thing to do when you have many bots running on the same machine would be to set $noupdate=true on every lookup, so a bot never fetches a new entry just because the timestamp is old, only for people who don't exist in the cache yet. That way a large cache causes no problems: a slow update script (not sure how long mine took towards the end, but I think it was around 3-4h at ~80k players) no longer leads to the bots fetching a fresh lookup every time someone does a whois because the cache isn't up to date. (That could also be solved by setting a much higher expire time.)
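
The lookup behaviour described above can be sketched as follows. This is an illustration in Python, not BeBot's actual PHP code: only the noupdate flag comes from the post, while the cache layout, the fetch callback and the max_age default are assumptions.

```python
import time


def lookup(name, cache, fetch, max_age=86400, noupdate=False, now=None):
    """Return whois data for name, querying the site only when needed.

    cache maps name -> {"data": ..., "updated": unix_timestamp}.
    fetch(name) is whatever actually queries the remote site.
    """
    now = time.time() if now is None else now
    entry = cache.get(name)
    if entry is not None:
        fresh = now - entry["updated"] < max_age
        # With noupdate set, a stale entry is still served from the cache
        # and refreshing is left to the separate update script.
        if fresh or noupdate:
            return entry["data"]
    data = fetch(name)
    cache[name] = {"data": data, "updated": now}
    return data
```

With noupdate set, the only remote queries a bot makes are for characters it has never seen, so the query count no longer depends on how far behind the update script is.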

I was never banned from FC, and it was just two weeks ago or so when I cleared my cache. So either they changed something since then, or your bots do an unrealistic amount of connections towards funcom and fixing that sounds like a better idea than to try and go around it.

A central whois database exists already.. it's called www.anarchy-online.com... Any off-box/lan database will be slower which is the purpose of the local cache.
Title: Re: Problem accessing fc webpages
Post by: Ebag333 on August 01, 2007, 08:21:53 am
Quote from: Wolfbiter
I was never banned from FC, and it was just two weeks ago or so when I cleared my cache. So either they changed something since then, or your bots do an unrealistic amount of connections towards funcom and fixing that sounds like a better idea than to try and go around it.

In my case I'm pretty sure it was due to a bug in the flexible security plugin causing it to spam Anarchy-Online looking for whois info on alts of toons that were invalid.

Hitting FC's servers every 2ish seconds (plus regular whois lookups) would do it for me real quick, especially considering I use multiple bots. :)

A couple of workarounds would be to use proxies, or to try to parse the HTML from Auno's or Vhab's sites.  Vhab's HTML is already clean enough that it should be fairly easy to parse into an XML-ish format.
Title: Re: Problem accessing fc webpages
Post by: Alreadythere on August 01, 2007, 08:31:39 am
Quote from: Wolfbiter
My cache was up to ~120k players at one point, and of those 25k were orgless. That made the update script take a long time to finish (and query the XML files ~26k times during that time). One way to limit the amount would be to clear the cache of orgless people now and then, since they account for most of the XML queries in a large cache.

I've got 42k unorged people in my cache, which created that many queries - guess FC didn't like it anymore. They blocked me pretty suddenly as of Friday last week.

One other way to lower the load is to update only orged people. There's no need to delete the unorged ones; the most current version of the script simply doesn't try to update them anymore by default.
For me that will result in about 1500 queries, and with a higher delay those won't create much load on the FC site anymore.
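
Roughly, "update only orged people, with a delay" could look like this. It is a Python sketch, not the actual whois update script: the cache layout, the fetch callback and the 2-second default delay are all assumptions.

```python
import time


def refresh_orged(cache, fetch, delay=2.0, sleep=time.sleep):
    """Refresh only characters that belong to an org, pausing between queries.

    cache maps name -> record dict with an "org" key (empty/None = unorged).
    Unorged entries are left in place and simply not queried.
    Returns the number of queries made.
    """
    targets = [name for name, rec in cache.items() if rec.get("org")]
    for i, name in enumerate(targets):
        if i:
            sleep(delay)  # throttle so the remote site is not hammered
        cache[name] = fetch(name)
    return len(targets)
```

The sleep function is passed in only so the throttling can be exercised without actually waiting.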
Title: Re: Problem accessing fc webpages
Post by: Vhab on August 01, 2007, 09:40:46 am
All this blocking sounds a bit strange though.
I've run nightly mirrors for quite some time and I haven't been blocked yet.
Auno seems to be fine as well.
Though maybe they looked at it case by case, as it runs from the box helpbot is on.
Title: Re: Problem accessing fc webpages
Post by: Alreadythere on August 01, 2007, 09:56:48 am
Quote from: Vhab
All this blocking sounds a bit strange though.

It's definitely a block; if I do telnet www.anarchy-online.com 80 I get connection refused right away.
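
The same telnet check can be scripted. The sketch below (Python, with a made-up function name) tells an immediate "connection refused" apart from a timeout, which is the difference between the wget output in the first post and the fsockopen warning later in the thread.

```python
import socket


def probe(host, port=80, timeout=5.0):
    """Try a TCP connect and report how it ended."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        # The remote end (or a firewall in front of it) actively rejected us.
        return "refused"
    except socket.timeout:
        # Nothing answered within the timeout.
        return "timeout"
    except OSError as exc:
        # DNS failure, unreachable network and similar.
        return "error: %s" % exc


if __name__ == "__main__":
    print(probe("www.anarchy-online.com", 80))
```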

Quote from: Vhab
Auno seems to be fine as well.

I think Auno is blocked too; the site shows "Last update: Thu, 26 Jul 2007 10:30:03 +0000" on the character info page, which is also the last day I could connect to the site.
Title: Re: Problem accessing fc webpages
Post by: Ebag333 on August 01, 2007, 06:35:18 pm
http://forums.anarchy-online.com/showthread.php?p=4820341

Post your outrage. :)
Title: Re: Problem accessing fc webpages
Post by: Dockwagon on August 01, 2007, 07:07:42 pm
No need to troll or flame, methinks. Either they implemented some anti-DoS measures without knowing the consequences, or they don't want us to use the bandwidth.

But, I dream about a shell on their forums db server and provoke blocked ip, would be fun :-)
Title: Re: Problem accessing fc webpages
Post by: Vhab on August 01, 2007, 07:21:06 pm
Highly likely, but they're feeling the wrath of Vhab either way :D
Title: Re: Problem accessing fc webpages
Post by: Vhab on August 01, 2007, 09:13:54 pm
Quote from: Famine
The issues relating to the “helpbot” are not intentional as far as we can tell at this juncture. This might be some type of automated blocking in relation to new hardware or to the current server moves. We are currently looking into this and will report any findings.

Quote from: Silirrion
Ok folks, firstly please drop the conspiracy theories and personal attacks. They are uncalled for and are jumping to conclusions. We have never taken active action against any community run service, and we wouldn't unless it was because that service was causing an issue with the performance of the game in general.

No-one has been banned specifically, or targeted in that way.

What has happened is that we have had to close down access to certain sections of the servers due to an attempted Denial of Service Attack. This is partly through automated monitoring, and partly the actions of our server team when they recognized that something was amiss. We will always act in the best interests of the game performance, and this is no different. It is a shame that the actions of a malicious minority can affect ordinary players, but this action was required to prevent the problem causing issues with the live dimensions.

We are looking at ways to help prevent this particular form of attack.

We are also looking into additional solutions that will allow the community to have access to the database feeds again.

I recommend those that currently have been blacklisted attempt to contact famine on irc.
Also remember to mention your IP, as without it they can't do anything.
Title: Re: Problem accessing fc webpages
Post by: Alreadythere on August 01, 2007, 09:29:15 pm
That thread got quite some attention.
Title: Re: Problem accessing fc webpages
Post by: Vhab on August 01, 2007, 09:37:40 pm
Quote from: Alreadythere
That thread got quite some attention.

If I want to get FC's attention I bring out the zerg :D
Title: Re: Problem accessing fc webpages
Post by: Temar on August 01, 2007, 11:27:41 pm
My bots are working again.
Title: Re: Problem accessing fc webpages
Post by: Newsworthy on August 06, 2007, 06:53:33 pm
My bot doesn't want to update its rosters now... I restart it to force a roster update, and it starts up with "Roster update underway Stand By" etc...

1 min later, bot logs off. Relogs 30 secs later, and says Roster update not scheduled.

PHP error log:
[06-Aug-2007 17:07:42] PHP Warning:  fsockopen(): unable to connect to www.anarchy-online.com:80 (A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
) in C:\Bebot\Sources\Bot.php on line 1075


Is this related to the issue at hand here?

Title: Re: Problem accessing fc webpages
Post by: Ebag333 on August 06, 2007, 10:37:31 pm
What version are you using?

It's always worth trying the SVN version.
Title: Re: Problem accessing fc webpages
Post by: Temar on August 06, 2007, 11:19:13 pm
That error is from Funcom blocking us. I get it a lot too, but my bot doesn't crash.
Title: Re: Problem accessing fc webpages
Post by: Newsworthy on August 07, 2007, 06:52:30 am
Using version 4.0. Will reinstall all features and see if that fixes it. Thanks :)
Title: Re: Problem accessing fc webpages
Post by: Newsworthy on August 08, 2007, 02:39:24 am
Did not fix :(
Title: Re: Problem accessing fc webpages
Post by: Ebag333 on August 08, 2007, 06:09:42 am
Can you go to http://www.anarchy-online.com via IE (or another browser) on the machine your bot is running on?
Title: Re: Problem accessing fc webpages
Post by: Malosar on August 09, 2007, 09:42:45 pm
I still can't access their webpage, and thus my bot won't update whois or rosters. The longer this goes on, the more useless my bot becomes. Any word as to an alternative solution? Or are they once again ignoring the bot community and not talking with our esteemed devs to come up with an alternative?
Title: Re: Problem accessing fc webpages
Post by: Dracutza on August 09, 2007, 10:43:13 pm
Quote from: Newsworthy
My bot doesn't want to update its rosters now... I restart it to force roster update, starts up "Roster update underway Stand By" etc...

1 min later, bot logs off. Relogs 30 secs later, and says Roster update not scheduled.

Same thing here... only my problem just started today.
Title: Re: Problem accessing fc webpages
Post by: Blueeagle on August 10, 2007, 12:26:18 am
Quote from: Dracutza
Same thing here... only my problem just started today.

Was this perhaps during the server down-time?
Title: Re: Problem accessing fc webpages
Post by: Nytridr on August 10, 2007, 01:40:01 am
That problem is the same one mentioned over here:

http://bebot.link/index.php/topic,935.msg6803/topicseen.html#new

Basically, just give FC time to fix the hamster. I think the hamster is a bit tired from the move. Let FC juice him up a bit and we will all be set.
Title: Re: Problem accessing fc webpages
Post by: Malosar on August 10, 2007, 12:05:49 pm
Ah OK, I'm not really up to date with what FC is doing overall, as I don't play anymore. Since their website is back up, I'll check my bot today and see if all is OK.