<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: iTunes AppStore scraping &#8211; decoding the browse URL</title>
	<atom:link href="http://www.paulhinks.com/blog/2009/05/11/itunes-appstore-scraping-decoding-the-browse-url/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.paulhinks.com/blog/2009/05/11/itunes-appstore-scraping-decoding-the-browse-url/</link>
	<description>Politics, Technology, Food</description>
	<lastBuildDate>Tue, 09 Feb 2010 12:00:14 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: paul</title>
		<link>http://www.paulhinks.com/blog/2009/05/11/itunes-appstore-scraping-decoding-the-browse-url/comment-page-1/#comment-20</link>
		<dc:creator>paul</dc:creator>
		<pubDate>Thu, 02 Jul 2009 02:47:42 +0000</pubDate>
		<guid isPermaLink="false">http://www.paulhinks.com/?p=457#comment-20</guid>
		<description>Hi Cyrille - yes both your points are correct. The 3500 was a finger slip (sorry) and the paging either stopped working or never worked. It just looked so logical - why else is that &#039;/1&#039; at the end there? There is a further posting about an alternative approach - see &lt;a href=&quot;http://www.paulhinks.com/2009/06/15/appstore-scraping-the-front-door-method/&quot; rel=&quot;nofollow&quot;&gt;here&lt;/a&gt;. You&#039;ll still need to use this method if you want to associate categories with an app as well as a genre though so it&#039;s not a total loss. Good luck!.
--Paul</description>
		<content:encoded><![CDATA[<p>Hi Cyrille &#8211; yes both your points are correct. The 3500 was a finger slip (sorry) and the paging either stopped working or never worked. It just looked so logical &#8211; why else is that &#8216;/1&#8242; at the end there? There is a further posting about an alternative approach &#8211; see <a href="http://www.paulhinks.com/2009/06/15/appstore-scraping-the-front-door-method/" rel="nofollow">here</a>. You&#8217;ll still need to use this method if you want to associate categories with an app as well as a genre though so it&#8217;s not a total loss. Good luck!.<br />
&#8211;Paul</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Cyrillus</title>
		<link>http://www.paulhinks.com/blog/2009/05/11/itunes-appstore-scraping-decoding-the-browse-url/comment-page-1/#comment-21</link>
		<dc:creator>Cyrillus</dc:creator>
		<pubDate>Wed, 01 Jul 2009 19:47:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.paulhinks.com/?p=457#comment-21</guid>
		<description>Hi Paul,
It seems that the page browsing does not work -- here&#039;s the output of my in-project AppStore crawler :
Got category Business (#6000)
  Downloading page 1... done, got 2500 items
  Downloading page 2... done, got 2500 items (2500 duplicates)
  Downloading page 3... done, got 2500 items (2500 duplicates)
  Downloading page 4... done, got 2500 items (2500 duplicates)
  Downloading page 5... done, got 2500 items (2500 duplicates)

And it goes on and on... Could it be that Apple disabled the page browsing ? (Also, the limit is set to 2500 and not 3500 as you say in the article)</description>
		<content:encoded><![CDATA[<p>Hi Paul,<br />
It seems that the page browsing does not work &#8212; here&#8217;s the output of my in-project AppStore crawler :<br />
Got category Business (#6000)<br />
  Downloading page 1&#8230; done, got 2500 items<br />
  Downloading page 2&#8230; done, got 2500 items (2500 duplicates)<br />
  Downloading page 3&#8230; done, got 2500 items (2500 duplicates)<br />
  Downloading page 4&#8230; done, got 2500 items (2500 duplicates)<br />
  Downloading page 5&#8230; done, got 2500 items (2500 duplicates)</p>
<p>And it goes on and on&#8230; Could it be that Apple disabled the page browsing ? (Also, the limit is set to 2500 and not 3500 as you say in the article)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: paul</title>
		<link>http://www.paulhinks.com/blog/2009/05/11/itunes-appstore-scraping-decoding-the-browse-url/comment-page-1/#comment-19</link>
		<dc:creator>paul</dc:creator>
		<pubDate>Tue, 09 Jun 2009 20:07:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.paulhinks.com/?p=457#comment-19</guid>
		<description>Hi Tim - Yes I have confirmed that you do get duplicates, for just the reason you say - there are several apps that appear in multiple categories. They do have the same ID though so it&#039;s no big deal. And if it&#039;s important to track category as well as genre it&#039;s the only way to find it because the apps don&#039;t link back to all sub-categories (at least as far as I can see).</description>
		<content:encoded><![CDATA[<p>Hi Tim &#8211; Yes I have confirmed that you do get duplicates, for just the reason you say &#8211; there are several apps that appear in multiple categories. They do have the same ID though so it&#8217;s no big deal. And if it&#8217;s important to track category as well as genre it&#8217;s the only way to find it because the apps don&#8217;t link back to all sub-categories (at least as far as I can see).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tim</title>
		<link>http://www.paulhinks.com/blog/2009/05/11/itunes-appstore-scraping-decoding-the-browse-url/comment-page-1/#comment-18</link>
		<dc:creator>Tim</dc:creator>
		<pubDate>Tue, 09 Jun 2009 18:56:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.paulhinks.com/?p=457#comment-18</guid>
		<description>I suppose these could produce duplicates. I haven&#039;t gone through the XML files to check that but if you go to the browse function in iTunes (which is was we&#039;re doing with curl I suppose), go to Books and find &quot;ABC Book&quot;. But the genre for this one is actually &quot;Education&quot;.
If you now go to the &quot;Education&quot; category, you&#039;ll find it there too.
There is no reason for the ID to be different though, so it should be easy to check.</description>
		<content:encoded><![CDATA[<p>I suppose these could produce duplicates. I haven&#8217;t gone through the XML files to check that but if you go to the browse function in iTunes (which is was we&#8217;re doing with curl I suppose), go to Books and find &#8220;ABC Book&#8221;. But the genre for this one is actually &#8220;Education&#8221;.<br />
If you now go to the &#8220;Education&#8221; category, you&#8217;ll find it there too.<br />
There is no reason for the ID to be different though, so it should be easy to check.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: paul</title>
		<link>http://www.paulhinks.com/blog/2009/05/11/itunes-appstore-scraping-decoding-the-browse-url/comment-page-1/#comment-16</link>
		<dc:creator>paul</dc:creator>
		<pubDate>Tue, 12 May 2009 16:38:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.paulhinks.com/?p=457#comment-16</guid>
		<description>Hi Peter - glad they&#039;re useful. I know there&#039;s not much information out there so I wanted to record everything I found. I&#039;ll continue to update as I find out more.

Cheers.

--Paul</description>
		<content:encoded><![CDATA[<p>Hi Peter &#8211; glad they&#8217;re useful. I know there&#8217;s not much information out there so I wanted to record everything I found. I&#8217;ll continue to update as I find out more.</p>
<p>Cheers.</p>
<p>&#8211;Paul</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Peter B</title>
		<link>http://www.paulhinks.com/blog/2009/05/11/itunes-appstore-scraping-decoding-the-browse-url/comment-page-1/#comment-17</link>
		<dc:creator>Peter B</dc:creator>
		<pubDate>Tue, 12 May 2009 07:02:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.paulhinks.com/?p=457#comment-17</guid>
		<description>Paul,
I&#039;m finding your iTunes appstore articles incredibly useful for a project I&#039;m currently working on. Many thanks for sharing your findings in this area.
Cheers, Peter.</description>
		<content:encoded><![CDATA[<p>Paul,<br />
I&#8217;m finding your iTunes appstore articles incredibly useful for a project I&#8217;m currently working on. Many thanks for sharing your findings in this area.<br />
Cheers, Peter.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
