<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Scott Yang's Playground &#187; alexa</title>
	<atom:link href="http://scott.yang.id.au/tag/alexa/feed/" rel="self" type="application/rss+xml" />
	<link>http://scott.yang.id.au</link>
	<description>Faith, Technology and Randomness in Life, According to Scott</description>
	<lastBuildDate>Thu, 09 Feb 2012 09:01:12 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Top 500 Sites in Australia According to Alexa</title>
		<link>http://scott.yang.id.au/2009/05/top-500-sites-in-australia-according-to-alexa/</link>
		<comments>http://scott.yang.id.au/2009/05/top-500-sites-in-australia-according-to-alexa/#comments</comments>
		<pubDate>Fri, 29 May 2009 04:10:08 +0000</pubDate>
		<dc:creator>scotty</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[alexa]]></category>
		<category><![CDATA[Australia]]></category>

		<guid isPermaLink="false">http://scott.yang.id.au/2009/05/top-500-sites-in-australia-according-to-alexa/</guid>
		<description><![CDATA[I needed to analyse a list of the top sites in Australia, and the following Python script helped me get the site name and domain name from Alexa with minimum effort: #!/usr/bin/env python import re, urllib r = r'&#60;a href="/siteinfo/(.*?)" &#62;&#60;strong&#62;(.*?)&#60;/strong&#62;'; for i in range(25): u = 'http://www.alexa.com/topsites/countries;%d/AU' % i for x, m in enumerate(re.findall(r, [...]]]></description>
			<content:encoded><![CDATA[<p>I needed to analyse a list of the top sites in Australia, and the following Python script helped me get the site name and domain name <a href="http://www.alexa.com/topsites/countries/AU">from Alexa</a> with minimum effort:</p>
<pre class="code">
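#!/usr/bin/env python3
import re
import urllib.request

# One listing row: group 1 is the domain, group 2 is the site name.
r = r'&lt;a  href="/siteinfo/(.*?)"  &gt;&lt;strong&gt;(.*?)&lt;/strong&gt;'
# 25 pages of 20 sites each cover the top 500.
for i in range(25):
    u = 'http://www.alexa.com/topsites/countries;%d/AU' % i
    html = urllib.request.urlopen(u).read().decode('utf-8', 'replace')
    for x, m in enumerate(re.findall(r, html)):
        print('%d. %s (%s)' % (x + 1 + i * 20, m[1], m[0].strip()))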
</pre>
<p>YMMV. Considering how Alexa has been trying to obfuscate its HTML pages to prevent scraping, I won&#8217;t be surprised if this script stops working tomorrow&#8230;</p>
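<p>To check the parsing without hitting Alexa at all, here is a minimal offline sketch of the same idea in Python 3. The <code>parse_page</code> helper, the <code>PAGE_SIZE</code> constant and the sample markup are all made up for illustration; the markup only imitates the shape the regular expression expects and is not real Alexa output.</p>
<pre class="code">
```python
import re

# Same pattern as the script above: group 1 is the domain, group 2 the site name.
ROW = re.compile(r'&lt;a  href="/siteinfo/(.*?)"  &gt;&lt;strong&gt;(.*?)&lt;/strong&gt;')
PAGE_SIZE = 20  # sites per listing page, as implied by "i * 20" in the script


def parse_page(html, page):
    """Return (rank, name, domain) tuples for one listing page (hypothetical helper)."""
    return [(page * PAGE_SIZE + x + 1, name, domain.strip())
            for x, (domain, name) in enumerate(ROW.findall(html))]


# Made-up sample rows shaped like what the pattern expects; not real Alexa output.
SAMPLE = (
    '&lt;li&gt;&lt;a  href="/siteinfo/example.com.au "  &gt;&lt;strong&gt;Example One&lt;/strong&gt;&lt;/a&gt;&lt;/li&gt;'
    '&lt;li&gt;&lt;a  href="/siteinfo/example.org"  &gt;&lt;strong&gt;Example Two&lt;/strong&gt;&lt;/a&gt;&lt;/li&gt;'
)

for rank, name, domain in parse_page(SAMPLE, page=2):
    print('%d. %s (%s)' % (rank, name, domain))
```
</pre>
<p>With <code>page=2</code> the two sample rows come out as ranks 41 and 42, since each earlier page contributes 20 entries. Keeping the parsing in its own function means only the pattern needs touching when the markup changes.</p>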
]]></content:encoded>
			<wfw:commentRss>http://scott.yang.id.au/2009/05/top-500-sites-in-australia-according-to-alexa/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Alexa Updates Its Ranking System</title>
		<link>http://scott.yang.id.au/2008/04/alexa-updates-its-ranking-system/</link>
		<comments>http://scott.yang.id.au/2008/04/alexa-updates-its-ranking-system/#comments</comments>
		<pubDate>Thu, 17 Apr 2008 02:27:54 +0000</pubDate>
		<dc:creator>scotty</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[alexa]]></category>
		<category><![CDATA[compete]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[quantcast]]></category>
		<category><![CDATA[techcrunch]]></category>

		<guid isPermaLink="false">http://scott.yang.id.au/2008/04/alexa-updates-its-ranking-system/</guid>
		<description><![CDATA[TechCrunch: Alexa overhauls ranking system, which now uses multiple sources rather than just its toolbar data. I actually noticed the change in Alexa ranking before I spotted the post at TC. Good move, I think: at least Alexa is now in a position to compete with Compete and Quantcast, which are almost useless for [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.techcrunch.com/2008/04/16/alexa-overhauls-ranking-system/">TechCrunch: Alexa overhauls ranking system</a>, which now uses multiple sources rather than just its toolbar data. I actually <a href="http://twitter.com/scottyang/statuses/790758129">noticed the change in Alexa ranking</a> before I spotted the post at TC. Good move, I think: at least Alexa is now in a position to compete with <a href="http://www.compete.com/">Compete</a> and <a href="http://www.quantcast.com/">Quantcast</a>, which are almost useless for sites outside the <del>centre of the universe</del> United States. <a href="http://www.ozbargain.com.au/">OzBargain</a> has jumped from 90,000+ to 49,000+ on Alexa, but still shows insufficient data on Compete and Quantcast.</p>
]]></content:encoded>
			<wfw:commentRss>http://scott.yang.id.au/2008/04/alexa-updates-its-ranking-system/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

